LOCUS       BC049367                3845 bp    mRNA    linear   HUM 19-MAR-2009
DEFINITION  Homo sapiens SET and MYND domain containing 2, mRNA (cDNA clone
            MGC:57151 IMAGE:4826617), complete cds.
ACCESSION   BC049367
VERSION     BC049367.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3845)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 3845)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (28-MAR-2003) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Miklos Palkovits, M.D., Ph.D.
            cDNA Library Preparation: Michael J. Brownstein (NHGRI) & Shiraki
            Toshiyuki and Piero Carninci (RIKEN)
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
            Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 106 Row: e Column: 7
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 9910273
            The stop codon of the CDS annotated on this record is located > 55
            bases upstream of a splice junction, and therefore the mRNA is
            predicted to be subject to nonsense-mediated mRNA decay (NMD).
FEATURES             Location/Qualifiers
     source          1..3845
                     /db_xref="H-InvDB:HIT000099072"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:57151 IMAGE:4826617"
                     /tissue_type="Testis"
                     /clone_lib="NIH_MGC_97"
                     /lab_host="DH10B"
                     /note="Vector: pBluescriptR"
     gene            1..3845
                     /gene="SMYD2"
                     /gene_synonym="HSKM-B"
                     /gene_synonym="KMT3C"
                     /gene_synonym="ZMYND14"
                     /db_xref="GeneID:56950"
                     /db_xref="HGNC:HGNC:20982"
                     /db_xref="MIM:610663"
     CDS             154..1269
                     /gene="SMYD2"
                     /gene_synonym="HSKM-B"
                     /gene_synonym="KMT3C"
                     /gene_synonym="ZMYND14"
                     /codon_start=1
                     /product="SMYD2 protein"
                     /protein_id="AAH49367.2"
                     /db_xref="GeneID:56950"
                     /db_xref="HGNC:HGNC:20982"
                     /db_xref="MIM:610663"
                     /translation="MRAEGLGGLERFCSPGKGRGLRALQPFQVGDLLFSCPAYAYVLT
                     VNERGNHCEYCFTRKEGLSKCGRCKQAFYCNVECQKEDWPMHKLECSPMVVFGENWNP
                     SETVRLTARILAKQKIHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFY
                     SKHLEFPDSDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYK
                     GTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKDKDKAK
                     VEIRKLSDPPKAEAIRDMVRYARNVIEEFRRAKHYKSPSELLEICELSQEKMSSVFED
                     SNVYMLHMMYQAMGVCLYMQDWEGALQYGQKIIKPYR"
BASE COUNT     1004 a    919 c    958 g    964 t
ORIGIN
        1 gggcgccctc ccttccgggg agcggggagc cgccgccgcg tccgccgggc ggctcccacc
       61 ccgccccccg cagctctagg tgacgcgtct ccaataacag ctcgccggga gccgcagctc
      121 gggcacagcc ggcggccgcg ccccgccgcc accatgaggg ccgagggcct cggcggcctg
      181 gagcgcttct gcagcccggg caaaggccgg gggctgcggg ctctgcagcc cttccaggtg
      241 ggggacttgc tgttctcctg cccggcctat gcctacgtgc tcacggtcaa cgagcggggc
      301 aaccactgcg agtactgctt caccaggaaa gaaggattgt ccaaatgtgg aagatgcaag
      361 caggcatttt actgcaatgt ggagtgtcag aaagaagatt ggcccatgca caagctggaa
      421 tgttctccca tggttgtttt tggggaaaac tggaatccct cggagactgt aagactaaca
      481 gcaaggattc tggccaaaca gaaaatccac ccagagagaa caccttcgga aaaattgtta
      541 gctgtgaagg agtttgaatc acatctggat aagttagaca atgagaagaa ggatttgatt
      601 cagagtgaca tagctgctct ccatcacttt tactccaagc atctcgaatt ccctgacagt
      661 gatagcctcg tagtactctt tgcacaggtt aactgtaatg gcttcacaat tgaagatgaa
      721 gaactttctc atttgggatc agcgatattt cctgatgttg cattgatgaa tcatagctgt
      781 tgccccaatg tcattgtgac ctacaaaggg accctggcag aagtcagagc tgtacaggaa
      841 atcaagccgg gagaggaggt ttttaccagc tatattgatc tcctgtaccc aacggaagat
      901 agaaatgacc ggttaagaga ttcttatttc tttacctgtg agtgccagga gtgtaccacc
      961 aaggacaagg ataaggccaa ggtggaaatc cggaagctca gcgatccccc aaaggcagaa
     1021 gccatccgag acatggtcag atatgcacgc aacgtcattg aagagttccg gagggccaag
     1081 cactataaat cccctagtga gctgctggag atctgcgagc tcagccagga gaagatgagc
     1141 tctgtgtttg aggacagtaa cgtgtacatg ttgcacatga tgtaccaggc catgggtgtc
     1201 tgcttgtaca tgcaggactg ggaaggagcc ctgcaatatg gacagaaaat cattaagccc
     1261 tacaggtgat tgcagaggct gttctaatca tccacggagt ggaatttggt tgtttagaat
     1321 gtatacgata gtgtcatctt cctgcccatt gctgagactt gcctagcgca gctgcctctt
     1381 ctctgcaccc acatctggag gtgctggtgc agggcactgc tcccgagtca ggcctgttct
     1441 caccaccatc ccaaggagac aggttgaggt tatgcagcgg atactggggc aagatccctc
     1501 ccaggtccat ccagggccat atccatagta gttaactcag gctagttttg agaagccact
     1561 agtttgcttc ttttcccacc cactgcttcc aagtttaatt actaagtggt ggtctcagaa
     1621 accctggttt tcatcattac tcacaccatc cccttctttt gctttttcaa attacagccc
     1681 cagtttgcaa atggggaagc tctgggacca gtgggttgac cgtggttatc taatctcccc
     1741 accccaggtt tggaaaaaag aaacaaaaat tgctttttaa tttgtagtaa taatgaatat
     1801 tagccctgga gagatttcct gcctgagagg gttactaaac ctaaaagtgg gtgacagacg
     1861 gaagttgtgg ctttgtcctc tgactcggac tagacacttc agggctgagg tggcctagat
     1921 gggatcactg gagctctttt taaaggcact gatattatgc atttacaggg tacgtccaac
     1981 accaagtggg ccaagtgaat cagggtcgct gggcattaga cccagaaatc tgtgttctgg
     2041 aagcatccca ggtgtttctc aggagtttga gaagccctca gtctccttcc acttcggaga
     2101 ttagggtctg tgaaggttgt gtctggtctg aatgtccact gcccactcca ccctctactt
     2161 accccactgt gccctgatta gccagcaggg ggagcccaaa ggaagccagc tcttaaaatc
     2221 tgctcttgga aacagaattt gttaaattgg ggataaactc aaaccttcct tttctcctag
     2281 agtgtgtgag gaagtgcact ggcctcatgc gcagttctct gtgtgagatc tgtagggaga
     2341 gaggagcaga cagacagcat cccagccctg tggttctccc agcagggcct tgctctgcta
     2401 gacccatctt tatctgcgtg agcccctccc ggaagcccag ctctggtctt ctgtggggcc
     2461 gtgggtccca gggctgcctc ctacccagga atacagctag aacatgtctt tttggactga
     2521 ctttcaggag acacagggtg gaacagccta gcctttaaat aaacacctct tacacacatg
     2581 taaacacaca cactggctca cccaccagct gcctctgggc cctgttctta gggaaatggc
     2641 tgtttcccag gcagacccca tcagtcagtc ttcaggtaca gccggagaga acacttgctg
     2701 tatctaaagc ccctgctgga caatggggaa agaacatcca gaagacacag gaagtgttct
     2761 gacttcagag cacttaactg ttttgctaaa ccaacccata tcatagtgtg acagaaagct
     2821 ggggtcttaa gacacattgg ggtgggtggg ggttgggtgg gtggctgtat gtctgagtac
     2881 ctaaggtgat aggcctaatt gctgccttca taaatgggac atttacttca caagctgttt
     2941 tcccagggtc ttcctctggg tatgtctgaa atataaaaat ctggactggg attgaagatt
     3001 gtgtttacaa atgcttttga ataggatttt ctcctgcagt tgttacgtag cttttcagaa
     3061 acacacaaac tacaaataat gaacaacatc tgcaatgatt cggcagggtg gcagcatcca
     3121 cgctctccac ccaaaccctg gtgggatttg gagaggccgc tggtgggcag aggttggccc
     3181 taagcatggc agcctccggc ttactgcacc cagcctgtgg ggcggctcag tagccgctga
     3241 catggtggcc tgttgtctct tctcttgttc tagtaagcac tatcctttgt actccctcaa
     3301 cgtggcctcc atgtggttga agctagggag actctacatg ggcctggaac acaaagccgc
     3361 aggggagaaa gccctgaaga aggccattgc aatcatggaa gtagctcacg gcaaagatca
     3421 tccatatatt tctgagatca aacaggaaat tgaaagccac tgaaactatg cagcatttca
     3481 gttttcattt aaacacttag ttcagaaacc ttaaaggatt tgaatatttc aaattgcaca
     3541 cgtcactcca gcatctctgt aaaataattg gaatgaaaat acttcttgca cttaaacact
     3601 gcacatgccg tactttgagg ttagtctgaa tcttgaactt taataccaaa ttaattttga
     3661 atgcttttgt ttcctaagag ataatggcat ggtttcatat gttatacttt ggacagacag
     3721 agttttaaaa atggaattat tttttctttc atgcctcttg taatgttctg aacaaacttg
     3781 aatgatgaaa gtattaaaga gatatcagta aaaaaaaaaa aaaaaaaaag aaaaaaaaaa
     3841 aaaaa
//