LOCUS       BC037557                3165 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens thymine-DNA glycosylase, mRNA (cDNA clone MGC:40342
            IMAGE:5184772), complete cds.
ACCESSION   BC037557
VERSION     BC037557.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3165)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 3165)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (13-SEP-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Life Technologies, Inc.
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D., McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 64 Row: g Column: 12
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 59853161.
FEATURES             Location/Qualifiers
     source          1..3165
                     /db_xref="H-InvDB:HIT000051934"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:40342 IMAGE:5184772"
                     /tissue_type="Colon, Kidney, Stomach, adult, whole pooled"
                     /clone_lib="NIH_MGC_116"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..3165
                     /gene="TDG"
                     /db_xref="GeneID:6996"
                     /db_xref="HGNC:HGNC:11700"
                     /db_xref="MIM:601423"
     CDS             177..1409
                     /gene="TDG"
                     /codon_start=1
                     /product="thymine-DNA glycosylase"
                     /protein_id="AAH37557.1"
                     /db_xref="GeneID:6996"
                     /db_xref="HGNC:HGNC:11700"
                     /db_xref="MIM:601423"
                     /translation="MEAENAGSYSLQQAQAFYTFPFQQLMAEAPNMAVVNEQQMPEEV
                     PAPAPAQEPVQEAPKGRKRKPRTTEPKQPVEPKKPVESKKSGKSAKSKEKQEKITDTF
                     KVKRKVDRFNGVSEAELLTKTLPDILTFNLDIVIIGINPGLMAAYKGHHYPGPGNHFW
                     KCLFMSGLSEVQLNHMDDHTLPGKYGIGFTNMVERTTPGSKDLSSKEFREGGRILVQK
                     LQKYQPRIAVFNGKCIYEIFSKEVFGVKVKNLEFGLQPHKIPDTETLCYGMPSSSARC
                     AQFPRAQDKVHYYIKLKDLRDQLKGIERNMDVQEVQYTFDLQLAQEDAKKMAVKEEKY
                     DPGYEAAYGGAYGENPCSSEPCGFSSNGLIESVELRGESAFSGIPNGQWMTQSFTDQI
                     PSFSNHCGTQEQEEESHA"
BASE COUNT         1011 a          550 c          673 g          931 t
ORIGIN      
        1 gggggacagt agaagcctgg aggaggagct tgagtccagc cactgtctgg gtactgccag
       61 ccatcgggcc caggtctctg gggttgtctt accgcagtga gtaccacgcg gtactacaga
      121 gaccggctgc ccgtgtgccc ggcaggtgga gccgcccgca tcagcggcct cggggaatgg
      181 aagcggagaa cgcgggcagc tattcccttc agcaagctca agctttttat acgtttccat
      241 ttcaacaact gatggctgaa gctcctaata tggcagttgt gaatgaacag caaatgccag
      301 aagaagttcc agccccagct cctgctcagg aaccagtgca agaggctcca aaaggaagaa
      361 aaagaaaacc cagaacaaca gaaccaaaac aaccagtgga acccaaaaaa cctgttgagt
      421 caaaaaaatc tggcaagtct gcaaaatcaa aagaaaaaca agaaaaaatt acagacacat
      481 ttaaagtaaa aagaaaagta gaccgtttta atggtgtttc agaagctgaa cttctgacca
      541 agactctccc cgatattttg accttcaatc tggacattgt cattattggc ataaacccgg
      601 gactaatggc tgcttacaaa gggcatcatt accctggacc tggaaaccat ttttggaagt
      661 gtttgtttat gtcagggctc agtgaggtcc agctgaacca tatggatgat cacactctac
      721 cagggaagta tggtattgga tttaccaaca tggtggaaag gaccacgccc ggcagcaaag
      781 atctctccag taaagaattt cgtgaaggag gacgtattct agtacagaaa ttacagaaat
      841 atcagccacg aatagcagtg tttaatggaa aatgtattta tgaaattttt agtaaagaag
      901 tttttggagt aaaggttaag aacttggaat ttgggcttca gccccataag attccagaca
      961 cagaaactct ctgctatggt atgccatcat ccagtgcaag atgtgctcag tttcctcgag
     1021 cccaagacaa agttcattac tacataaaac tgaaggactt aagagatcag ttgaaaggca
     1081 ttgaacgaaa tatggacgtt caagaggtgc aatatacatt tgacctacag cttgcccaag
     1141 aggatgcaaa gaagatggct gttaaggaag aaaaatatga tccaggttat gaggcagcat
     1201 atggtggtgc ttacggagaa aatccatgca gcagtgaacc ttgtggcttc tcttcaaatg
     1261 ggctaattga gagcgtggag ttaagaggag aatcagcttt cagtggcatt cctaatgggc
     1321 agtggatgac ccagtcattt acagaccaaa ttccttcctt tagtaatcac tgtggaacac
     1381 aagaacagga agaagaaagc catgcttaag aatggtgctt ctcagctctg cttaaatgct
     1441 gcagttttaa tgcagttgtc aacaagtaga acctcagttt gctaactgaa gtgttttatt
     1501 agtattttac tctagtggtg taattgtaat gtagaacagt tgtgtggtag tgtgaaccgt
     1561 atgaacctaa gtagtttgga agaaaaagta gggtttttgt atactagctt ttgtatttga
     1621 attaattatc attccagctt tttatatact atatttcatt tatgaagaaa ttgattttct
     1681 tttgggagtc acttttaatc tgtaatttta aaatacaagt ctgaatattt atagttgatt
     1741 cttaactgtg cataaaccta gatataccat tatccctttt atacctaaga agggcatgct
     1801 aataattacc actgtcaaag aggcaaaggt gttgattttt gtatatgaag ttaagcctca
     1861 gtggagtctc atttgttagt ttttagtggt aactaagggt aaactcaggg ttccctgagc
     1921 tatatgcaca ctcagacctc tttgctttac cagtggtgtt tgtgagttgc tcagtagtaa
     1981 aaactggcct tacctgacag agccctggct ttgacctgct cagccctgtg tgttaatcct
     2041 ctagtagcca attaactact ctggggtggc aggttccaga gaatgcagta gaccttttgc
     2101 cactcatctg tgttttactt gagacatgta aatatgatag ggaaggaact gaatttctcc
     2161 attcatattt ataaccattc tagttttatc ttccttggct ttaagagtgt gccatggaaa
     2221 gtgataagaa atgaacttct aggctaagca aaaagatgct ggagatattt gatactctca
     2281 tttaaactgg tgctttatgt acatgagatg tactaaaata agtaatatag aatttttctt
     2341 gctaggtaaa tccagtaagc caataatttt aaagattctt tatctgcatc attgctgttt
     2401 gttactataa attaaatgaa cctcatggaa aggttgaggt gtataccttt gtgattttct
     2461 aatgagtttt ccatggtgct acaaataatc cagactacca ggtctggtag atattaaagc
     2521 tgggtactaa gaaatgttat ttgcatcctc tcagttactc ctgaatattc tgatttcata
     2581 cgtacccagg gagcatgctg ttttgtcaat caatataaaa tatttatgag gtctccccca
     2641 cccccaggag gttatatgat tgctcttctc tttataataa gagaaacaaa ttcttattgt
     2701 gaatcttaac atgcttttta gctgtggcta tgatggattt tattttttcc taggtcaagc
     2761 tgtgtaaaag tcatttatgt tatttaaatg atgtactgta ctgctgttta catggacgtt
     2821 ttgtgcgggt gctttgaagt gccttgcatc agggattagg agcaattaaa ttattttttc
     2881 acgggactgt gtaaagcatg taactaggta ttgctttggt atataactat tgtagcttta
     2941 caagagattg ttttatttga atggggaaaa taccctttaa attatgacgg acatccacta
     3001 gagatgggtt tgaggatttt ccaagcgtgt aataatgatg tttttcctaa catgacagat
     3061 gagtagtaaa tgttgatata tcctataaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
     3121 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaa
//