LOCUS       BC043617                4294 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens DNA (cytosine-5-)-methyltransferase 3 alpha, mRNA
            (cDNA clone MGC:50948 IMAGE:6150112), complete cds.
ACCESSION   BC043617
VERSION     BC043617.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4294)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 4294)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (09-JAN-2003) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC/DCTD/DTP
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Genome Sequence Centre,
            BC Cancer Agency, Vancouver, BC, Canada
            info@bcgsc.bc.ca
            Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson
            Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen
            Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel
            Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave
            Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth
            Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy
            Liao, Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR
            Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang,
            Jacquie Schein, Asim Siddiqui,Steven Jones, Rob Holt, Marco Marra.
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            http://image.llnl.gov
            Series: IRAK Plate: 89 Row: o Column: 19
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi:
            28559066.
FEATURES             Location/Qualifiers
     source          1..4294
                     /db_xref="H-InvDB:HIT000052841"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:50948 IMAGE:6150112"
                     /tissue_type="Skin, melanotic melanoma."
                     /clone_lib="NIH_MGC_72"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..4294
                     /gene="DNMT3A"
                     /gene_synonym="DNMT3A2"
                     /gene_synonym="M.HsaIIIA"
                     /db_xref="GeneID:1788"
                     /db_xref="HGNC:HGNC:2978"
                     /db_xref="MIM:602769"
     CDS             238..2976
                     /gene="DNMT3A"
                     /gene_synonym="DNMT3A2"
                     /gene_synonym="M.HsaIIIA"
                     /codon_start=1
                     /product="DNA (cytosine-5-)-methyltransferase 3 alpha"
                     /protein_id="AAH43617.1"
                     /db_xref="GeneID:1788"
                     /db_xref="HGNC:HGNC:2978"
                     /db_xref="MIM:602769"
                     /translation="MPAMPSSGPGDTSSSAAEREEDRKDGEEQEEPRGKEERQEPSTT
                     ARKVGRPGRKRKHPPVESGDTPKDPAVISKSPSMAQDSGASELLPNGDLEKRSEPQPE
                     EGSPAGGQKGGAPAEGEGAAETLPEASRAVENGCCTPKEGRGAPAEAGKEQKETNIES
                     MKMEGSRGRLRGGLGWESSLRQRPMPRLTFQAGDPYYISKRKRDEWLARWKREAEKKA
                     KVIAGMNAVEENQGPGESQKVEEASPPAVQQPTDPASPTVATTPEPVGSDAGDKNATK
                     AGDDEPEYEDGRGFGIGELVWGKLRGFSWWPGRIVSWWMTGRSRAAEGTRWVMWFGDG
                     KFSVVCVEKLMPLSSFCSAFHQATYNKQPMYRKAIYEVLQVASSRAGKLFPVCHDSDE
                     SDTAKAVEVQNKPMIEWALGGFQPSGPKGLEPPEEEKNPYKEVYTDMWVEPEAAAYAP
                     PPPAKKPRKSTAEKPKVKEIIDERTRERLVYEVRQKCRNIEDICISCGSLNVTLEHPL
                     FVGGMCQNCKNCFLECAYQYDDDGYQSYCTICCGGREVLMCGNNNCCRCFCVECVDLL
                     VGPGAAQAAIKEDPWNCYMCGHKGTYGLLRRREDWPSRLQMFFANNHDQEFDPPKVYP
                     PVPAEKRKPIRVLSLFDGIATGLLVLKDLGIQVDRYIASEVCEDSITVGMVRHQGKIM
                     YVGDVRSVTQKHIQEWGPFDLVIGGSPCNDLSIVNPARKGLYEGTGRLFFEFYRLLHD
                     ARPKEGDDRPFFWLFENVVAMGVSDKRDISRFLESNPVMIDAKEVSAAHRARYFWGNL
                     PGMNRPLASTVNDKLELQECLEHGRIAKFSKVRTITTRSNSIKQGKDQHFPVFMNEKE
                     DILWCTEMERVFGFPVHYTDVSNMSRLARQRLLGRSWSVPVIRHLFAPLKEYFACV"
BASE COUNT     1078 a   1105 c   1264 g    847 t
ORIGIN
        1 gacgcggcgc cgcggcacca gggcgcgcag ccgggccggc ccgaccccac cggccatacg
       61 gtggagccat cgaagccccc acccacaggc tgacagaggc accgttcacc agagggctca
      121 acaccgggat ctatgtttaa gttttaactc tcgcctccaa agaccacgat aattccttcc
      181 ccaaagccca gcagcccccc agccccgcgc agccccagcc tgcctcccgg cgcccagatg
      241 cccgccatgc cctccagcgg ccccggggac accagcagct ctgctgcgga gcgggaggag
      301 gaccgaaagg acggagagga gcaggaggag ccgcgtggca aggaggagcg ccaagagccc
      361 agcaccacgg cacggaaggt ggggcggcct gggaggaagc gcaagcaccc cccggtggaa
      421 agcggtgaca cgccaaagga ccctgcggtg atctccaagt ccccatccat ggcccaggac
      481 tcaggcgcct cagagctatt acccaatggg gacttggaga agcggagtga gccccagcca
      541 gaggagggga gccctgctgg ggggcagaag ggcggggccc cagcagaggg agagggtgca
      601 gctgagaccc tgcctgaagc ctcaagagca gtggaaaatg gctgctgcac ccccaaggag
      661 ggccgaggag cccctgcaga agcgggcaaa gaacagaagg agaccaacat cgaatccatg
      721 aaaatggagg gctcccgggg ccggctgcgg ggtggcttgg gctgggagtc cagcctccgt
      781 cagcggccca tgccgaggct caccttccag gcgggggacc cctactacat cagcaagcgc
      841 aagcgggacg agtggctggc acgctggaaa agggaggctg agaagaaagc caaggtcatt
      901 gcaggaatga atgctgtgga agaaaaccag gggcccgggg agtctcagaa ggtggaggag
      961 gccagccctc ctgctgtgca gcagcccact gaccccgcat cccccactgt ggctaccacg
     1021 cctgagcccg tggggtccga tgctggggac aagaatgcca ccaaagcagg cgatgacgag
     1081 ccagagtacg aggacggccg gggctttggc attggggagc tggtgtgggg gaaactgcgg
     1141 ggcttctcct ggtggccagg ccgcattgtg tcttggtgga tgacgggccg gagccgagca
     1201 gctgaaggca cccgctgggt catgtggttc ggagacggca aattctcagt ggtgtgtgtt
     1261 gagaagctga tgccgctgag ctcgttttgc agtgcgttcc accaggccac gtacaacaag
     1321 cagcccatgt accgcaaagc catctacgag gtcctgcagg tggccagcag ccgcgcgggg
     1381 aagctgttcc cggtgtgcca cgacagcgat gagagtgaca ctgccaaggc cgtggaggtg
     1441 cagaacaagc ccatgattga atgggccctg gggggcttcc agccttctgg ccctaagggc
     1501 ctggagccac cagaagaaga gaagaatccc tacaaagaag tgtacacgga catgtgggtg
     1561 gaacctgagg cagctgccta cgcaccacct ccaccagcca aaaagccccg gaagagcaca
     1621 gcggagaagc ccaaggtcaa ggagattatt gatgagcgca caagagagcg gctggtgtac
     1681 gaggtgcggc agaagtgccg gaacattgag gacatctgca tctcctgtgg gagcctcaat
     1741 gttaccctgg aacaccccct cttcgttgga ggaatgtgcc aaaactgcaa gaactgcttt
     1801 ctggagtgtg cgtaccagta cgacgacgac ggctaccagt cctactgcac catctgctgt
     1861 gggggccgtg aggtgctcat gtgcggaaac aacaactgct gcaggtgctt ttgcgtggag
     1921 tgtgtggacc tcttggtggg gccgggggct gcccaggcag ccattaagga agacccctgg
     1981 aactgctaca tgtgcgggca caagggtacc tacgggctgc tgcggcggcg agaggactgg
     2041 ccctcccggc tccagatgtt cttcgctaat aaccacgacc aggaatttga ccctccaaag
     2101 gtttacccac ctgtcccagc tgagaagagg aagcccatcc gggtgctgtc tctctttgat
     2161 ggaatcgcta cagggctcct ggtgctgaag gacttgggca ttcaggtgga ccgctacatt
     2221 gcctcggagg tgtgtgagga ctccatcacg gtgggcatgg tgcggcacca ggggaagatc
     2281 atgtacgtcg gggacgtccg cagcgtcaca cagaagcata tccaggagtg gggcccattc
     2341 gatctggtga ttgggggcag tccctgcaat gacctctcca tcgtcaaccc tgctcgcaag
     2401 ggcctctacg agggcactgg ccggctcttc tttgagttct accgcctcct gcatgatgcg
     2461 cggcccaagg agggagatga tcgccccttc ttctggctct ttgagaatgt ggtggccatg
     2521 ggcgttagtg acaagaggga catctcgcga tttctcgagt ccaaccctgt gatgattgat
     2581 gccaaagaag tgtcagctgc acacagggcc cgctacttct ggggtaacct tcccggtatg
     2641 aacaggccgt tggcatccac tgtgaatgat aagctggagc tgcaggagtg tctggagcat
     2701 ggcaggatag ccaagttcag caaagtgagg accattacta cgaggtcaaa ctccataaag
     2761 cagggcaaag accagcattt tcctgtcttc atgaatgaga aagaggacat cttatggtgc
     2821 actgaaatgg aaagggtatt tggtttccca gtccactata ctgacgtctc caacatgagc
     2881 cgcttggcga ggcagagact gctgggccgg tcatggagcg tgccagtcat ccgccacctc
     2941 ttcgctccgc tgaaggagta ttttgcgtgt gtgtaaggga catgggggca aactgaggta
     3001 gcgacacaaa gttaaacaaa caaacaaaaa acacaaaaca taataaaaca ccaagaacat
     3061 gaggatggag agaagtatca gcacccagaa gagaaaaagg aatttaaaac aaaaaccaca
     3121 gaggcggaaa taccggaggg ctttgccttg cgaaaagggt tggacatcat ctcctgattt
     3181 ttcaatgtta ttcttcagtc ctatttaaaa acaaaaccaa gctcccttcc cttcctcccc
     3241 cttccctttt ttttcggtca gaccttttat tttctactct tttcagaggg gttttctgtt
     3301 tgtttgggtt ttgtttcttg ctgtgactga aacaagaagg ttattgcagc aaaaatcagt
     3361 aacaaaaaat agtaacaata ccttgcagag gaaaggtggg agagaggaaa aaaaggaaat
     3421 tctatagaaa tctatatatt gggttgtttt tttttttgtt ttttgttttt ttttttgggt
     3481 tttttttttt actatatatc ttttttttgt tgtctctagc ctgatcagat aggagcacaa
     3541 gcaggggacg gaaagagaga gacactcagg cggcagcatt ccctcccagc cactgagctg
     3601 tcgtgccagc accattcctg gtcacgcaaa acagaaccca gttagcagca gggagacgag
     3661 aacaccacac aagacatttt tctacagtat ttcaggtgcc taccacacag gaaaccttga
     3721 agaaaatcag tttctagaag ccgctgttac ctcttgttta cagtttatat atatatgata
     3781 gatatgagat atatatataa aaggtactgt taactactgt acaacccgac ttcataatgg
     3841 tgctttcaaa cagcgagatg agtaaaaaca tcagcttcca cgttgccttc tgcgcaaagg
     3901 gtttcaccaa ggatggagaa agggagacag cttgcagatg gcgcgttctc acggtgggct
     3961 cttccccttg gtttgtaacg aagtgaagga ggagaacttg ggagccaggt tctccctgcc
     4021 aaaaaggggg ctagatgagg tggtcgggcc cgtggacagc tgagagtggg attcatccag
     4081 actcatgcaa taaccctttg attgttttct aaaaggagac tccctcggca agatggcaga
     4141 gggtacggag tcttcaggcc cagtttctca ctttagccaa ttcgagggct ccttgtggtg
     4201 ggatcagaac taatccagag tgtgggaaag tgacagtcaa aaccccacct ggagcaaata
     4261 aaaaaacata caaaacgtaa aaaaaaaaaa aaaa
//