LOCUS BC018718 3963 bp mRNA linear HUM 23-JUN-2005
DEFINITION Homo sapiens euchromatic histone-lysine N-methyltransferase 2, mRNA
(cDNA clone MGC:31802 IMAGE:4123580), complete cds.
ACCESSION BC018718
VERSION BC018718.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 3963)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 3963)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (07-DEC-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Genome Sequence Centre,
BC Cancer Agency, Vancouver, BC, Canada
info@bcgsc.bc.ca
Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson
Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen
Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel
Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave
Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth
Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao,
Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR
Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang,
Jacquie Schein, Asim Siddiqui, Steven Jones, Rob Holt, Marco Marra.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 40 Row: c Column: 10.
FEATURES Location/Qualifiers
source 1..3963
/db_xref="H-InvDB:HIT000038418_04"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:31802 IMAGE:4123580"
/tissue_type="Muscle, rhabdomyosarcoma"
/clone_lib="NIH_MGC_17"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..3963
/gene="EHMT2"
/gene_synonym="G9A"
/gene_synonym="NG36"
/gene_synonym="NG36/G9a"
/db_xref="GeneID:10919"
/db_xref="MIM:604599"
CDS 616..3621
/gene="EHMT2"
/gene_synonym="G9A"
/gene_synonym="NG36"
/gene_synonym="NG36/G9a"
/codon_start=1
/product="EHMT2 protein"
/protein_id="AAH18718.1"
/db_xref="GeneID:10919"
/db_xref="MIM:604599"
/translation="MSDDVHSLGKVTSDLAKRRKLNSGGGLSEELGSARRSGEVTLTK
GDPGSLEEWETVVGDDFSLYYDSYSVDERVDSDSKSEVEALTEQLSEEEEEEEEEEEE
EEEEEEEEEEEEDEESGNQSDRSGSSGRRKAKKKWRKDSPWVKPSRKRRKREPPRAKE
PRGVNGVGSSGPSEYMEVPLGSLELPSEGTLSPNHAGVSNDTSSLETERGFEELPLCS
CRMEAPKIDRISERAGHKCMATESVDGELSGCNAAILKRETMRPSSRVALMVLCETHR
ARMVKHHCCPGCGYFCTAGTFLECHPDFRVAHRFHKACVSQLNGMVFCPHCGEDASEA
QEVTIPRGDGVTPPAGTAAPAPPPLSQDVPGRADTSQPSARMRGHGEPRRPPCDPLAD
TIDSSGPSLTLPNGGCLSAVGLPLGPGREALEKALVIQESERRKKLRFHPRQLYLSVK
QGELQKVILMLLDNLDPNFQSDQQSKRTPLHAAAQKGSVEICHVLLQAGANINAVDKQ
QRTPLMEAVVNNHLEVARYMVQRGGCVYSKEEDGSTCLHHAAKIGNLEMVSLLLSTGQ
VDVNAQDSGGWTPIIWAAEHKHIEVIRMLLTRGADVTLTDNEENICLHWASFTGSAAI
AEVLLNARCDLHAVNYHGDTPLHIAARESYHDCVLLFLSRGANPELRNKEGDTAWDLT
PERSDVWFALQLNRKLRLGVGNRAIRTEKIICRDVARGYENVPIPCVNGVDGEPCPED
YKYISENCETSTMNIDRNITHLQHCTCVDDCSSSNCLCGQLSIRCWYDKDGRLLQEFN
KIEPPLIFECNQACSCWRNCKNRVVQSGIKVRLQLYRTAKMGWGVRALQTIPQGTFIC
EYVGELISDAEADVREDDSYLFDLDNKDGEVYCIDARYYGNISRFINHLCDPNIIPVR
VFMLHQDLRFPRIAFFSSRDIRTGEELGFDYGDRFWDIKSKYFTCQCGSEKCKHSAEA
IALEQSRLARLDPHPELLPELGSLPPVNT"
BASE COUNT 873 a 1180 c 1207 g 703 t
ORIGIN
1 cagggagctg cagcggcggc ggccgccgag ggggaggccc ccgctgagat gggggcgctg
61 ctgctggaga aggaaaccag aggagccacc gagagagttc atggctcttt gggggacacc
121 cctcgtagtg aagaaaccct gcccaaggcc aaccccgact ccctggagcc tgctggcccc
181 tcatctccag cctctgtcac tgtcactgtt ggtgatgagg gggctgacac ccctgtaggg
241 gctacaccac tcattgggga tgaatctgag aatcttgagg gagatgggga cctccgtggg
301 ggccggatcc tgctgggcca tgccacaaag tcattcccct cttcccccag caaggggggt
361 tcctgtccta gccgggccaa gatgtcaatg acaggggcgg gaaaatcacc tccatctgtc
421 cagagtttgg ctatgaggct actgagtatg ccaggagccc agggagctgc agcagcaggg
481 tctgaacccc ctccagccac cacgagccca gagggacagc ccaaggtcca ccgagcccgc
541 aaaaccatgt ccaaaccagg aaatggacag cccccggtcc ctgagaagcg gccccctgaa
601 atacagcatt tccgcatgag tgatgatgtc cactcactgg gaaaggtgac ctcagatctg
661 gccaaaagga ggaagctgaa ctcaggaggt ggcctgtcgg aggagttagg ttctgcccgg
721 cgttcaggag aagtgaccct gacgaaaggg gaccccgggt ccctggagga gtgggagacg
781 gtggtgggtg atgacttcag tctctactat gattcctact ctgtggatga gcgcgtggac
841 tccgacagca agtctgaagt tgaagctcta actgaacaac taagtgaaga ggaggaggag
901 gaagaggagg aagaagaaga agaggaagag gaggaggaag aggaagaaga agaggaagat
961 gaggagtcag ggaatcagtc agataggagt ggttccagtg gccggcgcaa ggccaagaag
1021 aaatggcgaa aagacagccc atgggtgaag ccgtctcgga aacggcgcaa gcgggagcct
1081 ccgcgggcca aggagccacg aggagtgaat ggtgtgggct cctcaggccc cagtgagtac
1141 atggaggtcc ctctggggtc cctggagctg cccagcgagg ggaccctctc ccccaaccac
1201 gctggggtgt ccaatgacac atcttcgctg gagacagagc gagggtttga ggagttgccc
1261 ctgtgcagct gccgcatgga ggcacccaag attgaccgca tcagcgagag ggcggggcac
1321 aagtgcatgg ccactgagag tgtggacgga gagctgtcag gctgcaatgc cgccatcctc
1381 aagcgggaga ccatgaggcc atccagccgt gtggccctga tggtgctctg tgagacccac
1441 cgcgcccgca tggtcaaaca ccactgctgc ccgggctgcg gctacttctg cacggcgggc
1501 accttcctgg agtgccaccc tgacttccgt gtggcccacc gcttccacaa ggcctgtgtg
1561 tctcagctga atgggatggt cttctgtccc cactgtgggg aggatgcttc tgaagctcaa
1621 gaggtgacca tcccccgggg tgacggggtg accccaccgg ccggcactgc agctcctgca
1681 cccccacccc tgtcccagga tgtccccggg agagcagaca cttctcagcc cagtgcccgg
1741 atgcgagggc atggggaacc ccggcgcccg ccctgcgatc ccctggctga caccattgac
1801 agctcagggc cctccctgac cctgcccaat gggggctgcc tttcagccgt ggggctgcca
1861 ctggggccag gccgggaggc cctggaaaag gccctggtca tccaggagtc agagaggcgg
1921 aagaagctcc gtttccaccc tcggcagttg tacctgtccg tgaagcaggg cgagctgcag
1981 aaggtgatcc tgatgctgtt ggacaacctg gaccccaact tccagagcga ccagcagagc
2041 aagcgcacgc ccctgcatgc agccgcccag aagggctccg tggagatctg ccatgtgctg
2101 ctgcaggctg gagccaacat aaatgcagtg gacaaacagc agcggacgcc actgatggag
2161 gccgtggtga acaaccacct ggaggtagcc cgttacatgg tgcagcgtgg tggctgtgtc
2221 tatagcaagg aggaggacgg ttccacctgc ctccaccacg cagccaaaat cgggaacttg
2281 gagatggtca gcctgctgct gagcacagga caggtggacg tcaacgccca ggacagtggg
2341 gggtggacgc ccatcatctg ggctgcagag cacaagcaca tcgaggtgat ccgcatgcta
2401 ctgacgcggg gcgccgacgt caccctcact gacaacgagg agaacatctg cctgcactgg
2461 gcctccttca cgggcagcgc cgccatcgcc gaagtccttc tgaatgcgcg ctgtgacctc
2521 catgctgtca actaccatgg ggacaccccc ctgcacatcg cagctcggga gagctaccat
2581 gactgcgtgc tgttattcct gtcacgtggg gccaaccctg agctgcggaa caaagagggg
2641 gacacagcat gggacctgac tcccgagcgc tccgacgtgt ggtttgcgct tcaactcaac
2701 cgcaagctcc gacttggggt gggaaatcgg gccatccgca cagagaagat catctgccgg
2761 gacgtggctc ggggctatga gaacgtgccc attccctgtg tcaacggtgt ggatggggag
2821 ccctgccctg aggattacaa gtacatctca gagaactgcg agacgtccac catgaacatc
2881 gatcgcaaca tcacccacct gcagcactgc acgtgtgtgg acgactgctc tagctccaac
2941 tgcctgtgcg gccagctcag catccggtgc tggtatgaca aggatgggcg attgctccag
3001 gaatttaaca agattgagcc tccgctgatt ttcgagtgta accaggcgtg ctcatgctgg
3061 agaaactgca agaaccgggt cgtacagagt ggcatcaagg tgcggctaca gctctaccga
3121 acagccaaga tgggctgggg ggtccgcgcc ctgcagacca tcccacaggg gaccttcatc
3181 tgcgagtatg tcggggagct gatctctgat gctgaggctg atgtgagaga ggatgattct
3241 tacctcttcg acttagacaa caaggatgga gaggtgtact gcatagatgc ccgttactat
3301 ggcaacatca gccgcttcat caaccacctg tgtgacccca acatcattcc cgtccgggtc
3361 ttcatgctgc accaagacct gcgatttcca cgcatcgcct tcttcagttc ccgagacatc
3421 cggactgggg aggagctagg gtttgactat ggcgaccgct tctgggacat caaaagcaaa
3481 tatttcacct gccaatgtgg ctctgagaag tgcaagcact cagccgaagc cattgccctg
3541 gagcagagcc gtctggcccg cctggaccca caccctgagc tgctgcccga gctcggctcc
3601 ctgccccctg tcaacacatg agaacggacc acaccctctc tccccagcat ggatggccac
3661 agctcagccg cctcctctgc caccagctgc tcgcagccca tgcctggggg tgctgccatc
3721 ttctctcccc accacccttt cacacattcc tgaccagaga tcccagccag gccctggagg
3781 tctgacagcc cctccctccc agagctggtt cctccctggg agggcaactt cagggctggc
3841 caccccccgt gttccccatc ctcagttgaa gtttgatgaa ttgaagtcgg gcctctatgc
3901 caactggttc cttttgttct caataaatgt tgggtttggt aaaaaaaaaa aaaaaaaaaa
3961 aaa
//