LOCUS       BC000391                1056 bp    mRNA    linear   HUM 06-NOV-2003
DEFINITION  Homo sapiens nth endonuclease III-like 1 (E. coli), mRNA (cDNA
            clone IMAGE:2821314), partial cds.
ACCESSION   BC000391
VERSION     BC000391.2
KEYWORDS    .
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1056)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 1056)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (15-NOV-2000) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Nov 6, 2003 this sequence version replaced BC000391.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: DCTD/DTP
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 1 Row: e Column: 24.
FEATURES             Location/Qualifiers
     source          1..1056
                     /db_xref="H-InvDB:HIT000029580"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:2821314"
                     /tissue_type="Lung, small cell carcinoma"
                     /clone_lib="NIH_MGC_7"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            <1..1056
                     /gene="NTHL1"
                     /gene_synonym="NTH1"
                     /gene_synonym="OCTS3"
                     /db_xref="GeneID:4913"
                     /db_xref="MIM:602656"
     CDS             <1..918
                     /gene="NTHL1"
                     /gene_synonym="NTH1"
                     /gene_synonym="OCTS3"
                     /codon_start=1
                     /product="NTHL1 protein"
                     /protein_id="AAH00391.2"
                     /db_xref="GeneID:4913"
                     /db_xref="MIM:602656"
                     /translation="GMTALSARMLTRSRSLGPGAGPRGCREEPGPLRRREAAAEARKS
                     HSPVKRPRKAQRLRVAYEGSDSEKGEGAEPLKVPVWEPQDWQQQLVNIRAMRNKKDAP
                     VDHLGTEHCYDSSAPPKVRRYQVLLSLMLSSQTKDQVTAGAMQRLRARGLTVDSILQT
                     DDATLGKLIYPVGFWRSKVKYIKQTSAILQQHYGGDIPASVAELVALPGVGPKMAHLA
                     MAVAWGTVSGIAVDTHVHRIANRLRWTKKATKSPEETRAALEEWLPRELWHEINGLLV
                     GFGQQTCLPVHPRCHACLNQALCPAAQGL"
     misc_feature    391..840
                     /gene="NTHL1"
                     /gene_synonym="NTH1"
                     /gene_synonym="OCTS3"
                     /note="ENDO3c; Region: endonuclease III"
                     /db_xref="CDD:smart00478"
     misc_feature    844..906
                     /gene="NTHL1"
                     /gene_synonym="NTH1"
                     /gene_synonym="OCTS3"
                     /note="FES; Region: iron-sulpphur binding domain in
                     DNA-(apurinic or apyrimidinic site) lyase (subfamily of
                     ENDO3)"
                     /db_xref="CDD:smart00525"
BASE COUNT          239 a          300 c          352 g          165 t
ORIGIN      
        1 ggcatgaccg ccttgagcgc gaggatgctg acccggagcc ggagcctggg acccggggct
       61 gggccgcggg ggtgtaggga ggagcccggg cctctccgga gaagagaggc tgcagcagaa
      121 gcgaggaaaa gccacagccc cgtgaagcgt ccgcggaaag cacagagact gcgtgtggcc
      181 tatgagggct cggacagtga gaaaggtgag ggggctgagc ccctcaaggt gccagtctgg
      241 gagccccagg actggcagca acagctggtc aacatccgtg ccatgaggaa caaaaaggat
      301 gcacctgtgg accatctggg gactgagcac tgctatgact ccagtgcccc cccaaaggta
      361 cgcaggtacc aggtgctgct gtcactgatg ctctccagcc aaaccaaaga ccaggtgacg
      421 gcgggcgcca tgcagcgact gcgggcgcgg ggcctgacgg tggacagcat cctgcagaca
      481 gatgatgcca cgctgggcaa gctcatctac cccgtcggtt tctggaggag caaggtgaaa
      541 tacatcaagc agaccagcgc catcctgcag cagcactacg gtggggacat cccagcctct
      601 gtggccgagc tggtggcgct gccgggtgtt gggcccaaga tggcacacct ggctatggct
      661 gtggcctggg gcactgtgtc aggcattgca gtggacacgc atgtgcacag aatcgccaac
      721 aggctgaggt ggaccaagaa ggcaaccaag tccccagagg agacccgcgc cgccctggag
      781 gagtggctgc ctagggagct gtggcacgag atcaatggac tcttggtggg cttcggccag
      841 cagacctgtc tgcctgtgca ccctcgctgc cacgcctgcc tcaaccaagc cctctgcccg
      901 gccgcccagg gtctctgatg gccgcatggc tctggccgag gtgccgctgt ggccaccgtc
      961 tgtgaagtgg ctttacgctt caggaagcca cgcctgttga ataaagcttt ggtgtgtttg
     1021 cagatggaaa aaaaaaaaaa aaaaaaaaaa aaaaaa
//