LOCUS BC049367 3845 bp mRNA linear HUM 19-MAR-2009
DEFINITION Homo sapiens SET and MYND domain containing 2, mRNA (cDNA clone
MGC:57151 IMAGE:4826617), complete cds.
ACCESSION BC049367
VERSION BC049367.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 3845)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 3845)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (28-MAR-2003) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: Miklos Palkovits, M.D., Ph.D.
cDNA Library Preparation: Michael J. Brownstein (NHGRI) & Shiraki
Toshiyuki and Piero Carninci (RIKEN)
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Institute for Systems Biology
http://www.systemsbiology.org
contact: amadan@systemsbiology.org
Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 106 Row: e Column: 7
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 9910273
The stop codon of the CDS annotated on this record is located > 55
bases upstream of a splice junction, and therefore the mRNA is
predicted to be subject to nonsense-mediated mRNA decay (NMD).
FEATURES Location/Qualifiers
source 1..3845
/db_xref="H-InvDB:HIT000099072"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:57151 IMAGE:4826617"
/tissue_type="Testis"
/clone_lib="NIH_MGC_97"
/lab_host="DH10B"
/note="Vector: pBluescriptR"
gene 1..3845
/gene="SMYD2"
/gene_synonym="HSKM-B"
/gene_synonym="KMT3C"
/gene_synonym="ZMYND14"
/db_xref="GeneID:56950"
/db_xref="HGNC:HGNC:20982"
/db_xref="MIM:610663"
CDS 154..1269
/gene="SMYD2"
/gene_synonym="HSKM-B"
/gene_synonym="KMT3C"
/gene_synonym="ZMYND14"
/codon_start=1
/product="SMYD2 protein"
/protein_id="AAH49367.2"
/db_xref="GeneID:56950"
/db_xref="HGNC:HGNC:20982"
/db_xref="MIM:610663"
/translation="MRAEGLGGLERFCSPGKGRGLRALQPFQVGDLLFSCPAYAYVLT
VNERGNHCEYCFTRKEGLSKCGRCKQAFYCNVECQKEDWPMHKLECSPMVVFGENWNP
SETVRLTARILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFY
SKHLEFPDSDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYK
GTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKDKDKAK
VEIRKLSDPPKAEAIRDMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSSVFED
SNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYR"
BASE COUNT 1004 a 919 c 958 g 964 t
ORIGIN
1 gggcgccctc ccttccgggg agcggggagc cgccgccgcg tccgccgggc ggctcccacc
61 ccgccccccg cagctctagg tgacgcgtct ccaataacag ctcgccggga gccgcagctc
121 gggcacagcc ggcggccgcg ccccgccgcc accatgaggg ccgagggcct cggcggcctg
181 gagcgcttct gcagcccggg caaaggccgg gggctgcggg ctctgcagcc cttccaggtg
241 ggggacttgc tgttctcctg cccggcctat gcctacgtgc tcacggtcaa cgagcggggc
301 aaccactgcg agtactgctt caccaggaaa gaaggattgt ccaaatgtgg aagatgcaag
361 caggcatttt actgcaatgt ggagtgtcag aaagaagatt ggcccatgca caagctggaa
421 tgttctccca tggttgtttt tggggaaaac tggaatccct cggagactgt aagactaaca
481 gcaaggattc tggccaaaca gaaaatccac ccagagagaa caccttcgga aaaattgtta
541 gctgtgaagg agtttgaatc acatctggat aagttagaca atgagaagaa ggatttgatt
601 cagagtgaca tagctgctct ccatcacttt tactccaagc atctcgaatt ccctgacagt
661 gatagcctcg tagtactctt tgcacaggtt aactgtaatg gcttcacaat tgaagatgaa
721 gaactttctc atttgggatc agcgatattt cctgatgttg cattgatgaa tcatagctgt
781 tgccccaatg tcattgtgac ctacaaaggg accctggcag aagtcagagc tgtacaggaa
841 atcaagccgg gagaggaggt ttttaccagc tatattgatc tcctgtaccc aacggaagat
901 agaaatgacc ggttaagaga ttcttatttc tttacctgtg agtgccagga gtgtaccacc
961 aaggacaagg ataaggccaa ggtggaaatc cggaagctca gcgatccccc aaaggcagaa
1021 gccatccgag acatggtcag atatgcacgc aacgtcattg aagagttccg gagggccaag
1081 cactataaat cccctagtga gctgctggag atctgcgagc tcagccagga gaagatgagc
1141 tctgtgtttg aggacagtaa cgtgtacatg ttgcacatga tgtaccaggc catgggtgtc
1201 tgcttgtaca tgcaggactg ggaaggagcc ctgcaatatg gacagaaaat cattaagccc
1261 tacaggtgat tgcagaggct gttctaatca tccacggagt ggaatttggt tgtttagaat
1321 gtatacgata gtgtcatctt cctgcccatt gctgagactt gcctagcgca gctgcctctt
1381 ctctgcaccc acatctggag gtgctggtgc agggcactgc tcccgagtca ggcctgttct
1441 caccaccatc ccaaggagac aggttgaggt tatgcagcgg atactggggc aagatccctc
1501 ccaggtccat ccagggccat atccatagta gttaactcag gctagttttg agaagccact
1561 agtttgcttc ttttcccacc cactgcttcc aagtttaatt actaagtggt ggtctcagaa
1621 accctggttt tcatcattac tcacaccatc cccttctttt gctttttcaa attacagccc
1681 cagtttgcaa atggggaagc tctgggacca gtgggttgac cgtggttatc taatctcccc
1741 accccaggtt tggaaaaaag aaacaaaaat tgctttttaa tttgtagtaa taatgaatat
1801 tagccctgga gagatttcct gcctgagagg gttactaaac ctaaaagtgg gtgacagacg
1861 gaagttgtgg ctttgtcctc tgactcggac tagacacttc agggctgagg tggcctagat
1921 gggatcactg gagctctttt taaaggcact gatattatgc atttacaggg tacgtccaac
1981 accaagtggg ccaagtgaat cagggtcgct gggcattaga cccagaaatc tgtgttctgg
2041 aagcatccca ggtgtttctc aggagtttga gaagccctca gtctccttcc acttcggaga
2101 ttagggtctg tgaaggttgt gtctggtctg aatgtccact gcccactcca ccctctactt
2161 accccactgt gccctgatta gccagcaggg ggagcccaaa ggaagccagc tcttaaaatc
2221 tgctcttgga aacagaattt gttaaattgg ggataaactc aaaccttcct tttctcctag
2281 agtgtgtgag gaagtgcact ggcctcatgc gcagttctct gtgtgagatc tgtagggaga
2341 gaggagcaga cagacagcat cccagccctg tggttctccc agcagggcct tgctctgcta
2401 gacccatctt tatctgcgtg agcccctccc ggaagcccag ctctggtctt ctgtggggcc
2461 gtgggtccca gggctgcctc ctacccagga atacagctag aacatgtctt tttggactga
2521 ctttcaggag acacagggtg gaacagccta gcctttaaat aaacacctct tacacacatg
2581 taaacacaca cactggctca cccaccagct gcctctgggc cctgttctta gggaaatggc
2641 tgtttcccag gcagacccca tcagtcagtc ttcaggtaca gccggagaga acacttgctg
2701 tatctaaagc ccctgctgga caatggggaa agaacatcca gaagacacag gaagtgttct
2761 gacttcagag cacttaactg ttttgctaaa ccaacccata tcatagtgtg acagaaagct
2821 ggggtcttaa gacacattgg ggtgggtggg ggttgggtgg gtggctgtat gtctgagtac
2881 ctaaggtgat aggcctaatt gctgccttca taaatgggac atttacttca caagctgttt
2941 tcccagggtc ttcctctggg tatgtctgaa atataaaaat ctggactggg attgaagatt
3001 gtgtttacaa atgcttttga ataggatttt ctcctgcagt tgttacgtag cttttcagaa
3061 acacacaaac tacaaataat gaacaacatc tgcaatgatt cggcagggtg gcagcatcca
3121 cgctctccac ccaaaccctg gtgggatttg gagaggccgc tggtgggcag aggttggccc
3181 taagcatggc agcctccggc ttactgcacc cagcctgtgg ggcggctcag tagccgctga
3241 catggtggcc tgttgtctct tctcttgttc tagtaagcac tatcctttgt actccctcaa
3301 cgtggcctcc atgtggttga agctagggag actctacatg ggcctggaac acaaagccgc
3361 aggggagaaa gccctgaaga aggccattgc aatcatggaa gtagctcacg gcaaagatca
3421 tccatatatt tctgagatca aacaggaaat tgaaagccac tgaaactatg cagcatttca
3481 gttttcattt aaacacttag ttcagaaacc ttaaaggatt tgaatatttc aaattgcaca
3541 cgtcactcca gcatctctgt aaaataattg gaatgaaaat acttcttgca cttaaacact
3601 gcacatgccg tactttgagg ttagtctgaa tcttgaactt taataccaaa ttaattttga
3661 atgcttttgt ttcctaagag ataatggcat ggtttcatat gttatacttt ggacagacag
3721 agttttaaaa atggaattat tttttctttc atgcctcttg taatgttctg aacaaacttg
3781 aatgatgaaa gtattaaaga gatatcagta aaaaaaaaaa aaaaaaaaag aaaaaaaaaa
3841 aaaaa
//