LOCUS BC037557 3165 bp mRNA linear HUM 15-JUL-2006 DEFINITION Homo sapiens thymine-DNA glycosylase, mRNA (cDNA clone MGC:40342 IMAGE:5184772), complete cds. ACCESSION BC037557 VERSION BC037557.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3165) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 3165) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (13-SEP-2002) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Life Technologies, Inc. cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 64 Row: g Column: 12 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 59853161. FEATURES Location/Qualifiers source 1..3165 /db_xref="H-InvDB:HIT000051934" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:40342 IMAGE:5184772" /tissue_type="Colon, Kidney, Stomach, adult, whole pooled" /clone_lib="NIH_MGC_116" /lab_host="DH10B" /note="Vector: pCMV-SPORT6" gene 1..3165 /gene="TDG" /db_xref="GeneID:6996" /db_xref="HGNC:HGNC:11700" /db_xref="MIM:601423" CDS 177..1409 /gene="TDG" /codon_start=1 /product="thymine-DNA glycosylase" /protein_id="AAH37557.1" /db_xref="GeneID:6996" /db_xref="HGNC:HGNC:11700" /db_xref="MIM:601423" /translation="MEAENAGSYSLQQAQAFYTFPFQQLMAEAPNMAVVNEQQMPEEV PAPAPAQEPVQEAPKGRKRKPRTTEPKQPVEPKKPVESKKSGKSAKSKEKQEKITDTF KVKRKVDRFNGVSEAELLTKTLPDILTFNLDIVIIGINPGLMAAYKGHHYPGPGNHFW KCLFMSGLSEVQLNHMDDHTLPGKYGIGFTNMVERTTPGSKDLSSKEFREGGRILVQK LQKYQPRIAVFNGKCIYEIFSKEVFGVKVKNLEFGLQPHKIPDTETLCYGMPSSSARC AQFPRAQDKVHYYIKLKDLRDQLKGIERNMDVQEVQYTFDLQLAQEDAKKMAVKEEKY DPGYEAAYGGAYGENPCSSEPCGFSSNGLIESVELRGESAFSGIPNGQWMTQSFTDQI PSFSNHCGTQEQEEESHA" BASE COUNT 1011 a 550 c 673 g 931 t ORIGIN 1 gggggacagt agaagcctgg aggaggagct tgagtccagc cactgtctgg gtactgccag 61 ccatcgggcc caggtctctg gggttgtctt accgcagtga gtaccacgcg gtactacaga 121 gaccggctgc ccgtgtgccc ggcaggtgga gccgcccgca tcagcggcct cggggaatgg 181 aagcggagaa cgcgggcagc tattcccttc agcaagctca agctttttat acgtttccat 241 ttcaacaact gatggctgaa gctcctaata tggcagttgt gaatgaacag caaatgccag 301 aagaagttcc agccccagct cctgctcagg aaccagtgca agaggctcca aaaggaagaa 361 aaagaaaacc cagaacaaca gaaccaaaac aaccagtgga acccaaaaaa cctgttgagt 421 caaaaaaatc tggcaagtct gcaaaatcaa aagaaaaaca agaaaaaatt acagacacat 481 ttaaagtaaa aagaaaagta gaccgtttta atggtgtttc agaagctgaa cttctgacca 541 agactctccc cgatattttg accttcaatc tggacattgt cattattggc ataaacccgg 601 gactaatggc tgcttacaaa gggcatcatt accctggacc tggaaaccat ttttggaagt 661 gtttgtttat gtcagggctc agtgaggtcc agctgaacca tatggatgat cacactctac 721 cagggaagta tggtattgga tttaccaaca tggtggaaag gaccacgccc ggcagcaaag 781 atctctccag taaagaattt cgtgaaggag gacgtattct agtacagaaa ttacagaaat 841 atcagccacg aatagcagtg tttaatggaa aatgtattta tgaaattttt agtaaagaag 901 tttttggagt aaaggttaag aacttggaat ttgggcttca gccccataag attccagaca 961 cagaaactct ctgctatggt atgccatcat ccagtgcaag atgtgctcag tttcctcgag 1021 cccaagacaa agttcattac tacataaaac tgaaggactt aagagatcag ttgaaaggca 1081 ttgaacgaaa tatggacgtt caagaggtgc aatatacatt tgacctacag cttgcccaag 1141 aggatgcaaa gaagatggct gttaaggaag aaaaatatga tccaggttat gaggcagcat 1201 atggtggtgc ttacggagaa aatccatgca gcagtgaacc ttgtggcttc tcttcaaatg 1261 ggctaattga gagcgtggag ttaagaggag aatcagcttt cagtggcatt cctaatgggc 1321 agtggatgac ccagtcattt acagaccaaa ttccttcctt tagtaatcac tgtggaacac 1381 aagaacagga agaagaaagc catgcttaag aatggtgctt ctcagctctg cttaaatgct 1441 gcagttttaa tgcagttgtc aacaagtaga acctcagttt gctaactgaa gtgttttatt 1501 agtattttac tctagtggtg taattgtaat gtagaacagt tgtgtggtag tgtgaaccgt 1561 atgaacctaa gtagtttgga agaaaaagta gggtttttgt atactagctt ttgtatttga 1621 attaattatc attccagctt tttatatact atatttcatt tatgaagaaa ttgattttct 1681 tttgggagtc acttttaatc tgtaatttta aaatacaagt ctgaatattt atagttgatt 1741 cttaactgtg cataaaccta gatataccat tatccctttt atacctaaga agggcatgct 1801 aataattacc actgtcaaag aggcaaaggt gttgattttt gtatatgaag ttaagcctca 1861 gtggagtctc atttgttagt ttttagtggt aactaagggt aaactcaggg ttccctgagc 1921 tatatgcaca ctcagacctc tttgctttac cagtggtgtt tgtgagttgc tcagtagtaa 1981 aaactggcct tacctgacag agccctggct ttgacctgct cagccctgtg tgttaatcct 2041 ctagtagcca attaactact ctggggtggc aggttccaga gaatgcagta gaccttttgc 2101 cactcatctg tgttttactt gagacatgta aatatgatag ggaaggaact gaatttctcc 2161 attcatattt ataaccattc tagttttatc ttccttggct ttaagagtgt gccatggaaa 2221 gtgataagaa atgaacttct aggctaagca aaaagatgct ggagatattt gatactctca 2281 tttaaactgg tgctttatgt acatgagatg tactaaaata agtaatatag aatttttctt 2341 gctaggtaaa tccagtaagc caataatttt aaagattctt tatctgcatc attgctgttt 2401 gttactataa attaaatgaa cctcatggaa aggttgaggt gtataccttt gtgattttct 2461 aatgagtttt ccatggtgct acaaataatc cagactacca ggtctggtag atattaaagc 2521 tgggtactaa gaaatgttat ttgcatcctc tcagttactc ctgaatattc tgatttcata 2581 cgtacccagg gagcatgctg ttttgtcaat caatataaaa tatttatgag gtctccccca 2641 cccccaggag gttatatgat tgctcttctc tttataataa gagaaacaaa ttcttattgt 2701 gaatcttaac atgcttttta gctgtggcta tgatggattt tattttttcc taggtcaagc 2761 tgtgtaaaag tcatttatgt tatttaaatg atgtactgta ctgctgttta catggacgtt 2821 ttgtgcgggt gctttgaagt gccttgcatc agggattagg agcaattaaa ttattttttc 2881 acgggactgt gtaaagcatg taactaggta ttgctttggt atataactat tgtagcttta 2941 caagagattg ttttatttga atggggaaaa taccctttaa attatgacgg acatccacta 3001 gagatgggtt tgaggatttt ccaagcgtgt aataatgatg tttttcctaa catgacagat 3061 gagtagtaaa tgttgatata tcctataaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3121 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaa //