LOCUS BC021252 3268 bp mRNA linear HUM 06-OCT-2003 DEFINITION Homo sapiens sex comb on midleg homolog 1 (Drosophila), mRNA (cDNA clone MGC:29533 IMAGE:3355251), complete cds. ACCESSION BC021252 VERSION BC021252.2 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3268) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 3268) AUTHORS Strausberg,R. TITLE Direct Submission JOURNAL Submitted (14-JAN-2002) National Institutes of Health, Mammalian Gene Collection (MGC), Cancer Genomics Office, National Cancer Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Aug 19, 2003 this sequence version replaced BC021252.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 39 Row: a Column: 13 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 6912641. FEATURES Location/Qualifiers source 1..3268 /db_xref="H-InvDB:HIT000039171" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:29533 IMAGE:3355251" /tissue_type="Eye, retinoblastoma" /clone_lib="NIH_MGC_16" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..3268 /gene="SCMH1" /gene_synonym="Scml3" /db_xref="GeneID:22955" CDS 453..2252 /gene="SCMH1" /gene_synonym="Scml3" /codon_start=1 /product="SCMH1 protein" /protein_id="AAH21252.1" /db_xref="GeneID:22955" /translation="MKLEAQDPRNTTSTCIATVVGLTGARLRLRLDGSDNKNDFWRLV DSAEIQPIGNCEKNGGMLQPPLGFRLNASSWPMFLLKTLNGAEMAPIRIFHKEPPSPS HNFFKMGMKLEAVDRKNPHFICPATIGEVRGSEVLVTFDGWRGAFDYWCRFDSRDIFP VGWCSLTGDNLQPPGTKVVIPKNPYPASDVNTEKPSIHSSTKTVLEHQPGQRGRKPGK KRGRTPKTLISHPISAPSKTAEPLKFPKKRGPKPGSKRKPRTLLNPPPASPTTSTPEP DTSTVPQDAATIPSSAMQAPTVCIYLNKNGSTGPHLDKKKVQQLPDHFGPARASVVLQ QAVQACIDCAYHQKTVFSFLKQGHGGEVISAVFDREQHTLNLPAVNSITYVLRFLEKL CHNLRSDNLFGNQPFTQTHLSLTAIEYSHSHDRYLPGETFVLGNSLARSLEPHSDSMD SASNPTNLVSTSQRHRPLLSSCGLPPSTASAVRRLCSRGVLKGSNERRDMESFWKLNR SPGSDRYLESRDASRLSGRDPSSWTVEDVMQFVREADPQLGPHADLFRKHEIDGKALL LLRSDMMMKYMGLKLGPALKLSYHIDRLKQGKF" misc_feature 669..974 /gene="SCMH1" /gene_synonym="Scml3" /note="MBT; Region: Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2" /db_xref="CDD:smart00561" misc_feature 2040..2237 /gene="SCMH1" /gene_synonym="Scml3" /note="SAM; Region: SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as a protein interaction module through its ability to homo- and heterooligomerise with other SAM domains" /db_xref="CDD:pfam00536" BASE COUNT 820 a 937 c 802 g 709 t ORIGIN 1 atgcagccta atgcaagcag ggggcatatt tcagcccttt tggtgttaca gcaactcttt 61 tagccacgcc gttcagcaga tcccgagttc ttgtcctgca accaggaaga atgaggtacg 121 cagacaagtg gagggtgagc aagatgaaga ggtgctgtat tgagcaatgg aacagctcag 181 agaagactca cagtgccagg gcgtcccagt gagtgttcag cccctagcag gaaggatagc 241 tcctctctgc agtgatagac tggagtgatg ttagaaaaca caaatatggt cacctatcag 301 agtctgcatc ccaatatcaa gaagctgctg acatcctgga tctaggtcat tttacctggg 361 acaaatacct aaaagaaaca tgttcagtcc cagcgcctgt ccattgcttc aagcagtcct 421 acacacctcc aagcaacgag ttcaagatca gtatgaaatt ggaagcacag gaccccagga 481 acaccacatc cacctgtatt gccacagtag ttggactgac aggtgcccgc cttcgcctgc 541 gccttgatgg gagcgacaac aaaaatgact tctggcggct ggttgactca gctgaaatcc 601 agcctattgg gaactgtgaa aagaatgggg gtatgctaca gccacctctt ggatttcggc 661 tgaatgcgtc ttcttggccc atgttccttt tgaagacgct aaatggagca gagatggctc 721 ccatcaggat tttccacaag gagccaccat cgccttccca caacttcttc aaaatgggaa 781 tgaagctaga agctgtggac aggaagaacc ctcatttcat ttgcccagcc actattgggg 841 aggttcgggg ctcagaggtg cttgtcactt ttgatgggtg gcgaggggcc tttgactact 901 ggtgccgctt cgactcccga gacatcttcc ctgtgggctg gtgttccttg actggagaca 961 acctgcagcc tcctggcacc aaagttgtga ttccaaagaa tccctatcct gcctccgatg 1021 tgaatactga gaagcccagc atccacagca gcaccaaaac tgtcttggaa catcaaccag 1081 ggcagagggg gcgtaaacca ggaaagaagc ggggccggac acccaagacc ctaatttccc 1141 atcccatctc tgccccatcc aagacagctg aacctttgaa attcccaaag aagagaggtc 1201 ccaaacctgg cagcaagagg aaacctcgga ctttgctgaa cccaccacct gcctcaccaa 1261 caaccagcac tcctgaaccg gataccagca ctgtacccca ggatgctgcc accatcccca 1321 gctcagccat gcaggcccca acagtttgta tctacttgaa caagaatggc agcacaggcc 1381 cccacttaga taagaagaag gtccagcaac tccctgacca ttttggacca gcccgtgcct 1441 ctgtggtgtt gcagcaggct gtccaggcct gtatcgactg tgcttatcac cagaaaaccg 1501 tcttcagctt cctcaagcaa ggccatggtg gtgaggttat ctcagccgtg tttgaccggg 1561 aacagcatac cctcaacctc ccagcagtca acagcatcac ctacgtcctc cgcttcctgg 1621 agaaactctg ccacaacctt cgtagtgaca atctgtttgg caaccagccc tttacacaga 1681 ctcacttgtc actcactgcc atagagtaca gccacagcca cgacaggtac ctaccaggtg 1741 aaacctttgt cctggggaat agtctggccc gctccttgga accacactca gactcaatgg 1801 actctgcctc aaatcccacc aaccttgtca gcacctccca aaggcaccgg cccttgcttt 1861 catcctgtgg cctcccacca agcactgcct cagctgtgcg caggctatgc tccaggggag 1921 tgttaaaagg atcaaatgaa agaagggata tggaatcatt ttggaaacta aatcgttccc 1981 cagggtcgga ccgatacctg gagagccgcg atgcctctcg actgagtggc cgggacccct 2041 cctcatggac agtcgaggat gtgatgcagt ttgtccggga agctgatcct cagcttggac 2101 cccacgctga cctgtttcgc aaacacgaga tcgatggcaa ggccctgctg ctgctgcgca 2161 gtgacatgat gatgaagtac atgggcctga agctggggcc tgcactcaag ctctcctacc 2221 acattgaccg gctgaagcag ggcaagttct gaaccaggag aggcagccta gacaaccaag 2281 tggcagcagg tgggggcatt cttctaggaa tgaggggcat cagcccaccc caggcacctc 2341 agtggggttc cgggccacct caggactcca agaggctgtg tggagccacc actcctagcc 2401 acagctgcca tgataagtcc ttccatgaag gactgaggag ggagagtggg ggtccagggc 2461 tggtgctgct cttccctcag ctctgccggg gctctaaggt ccctctattt atttctcaac 2521 cctggctggc ctctcaccag gagtttaggc tgaatgcctt ccacgtgatg gaggaaaagg 2581 ccaactctgt cctggtcttg ctgtggcacc ccatcgcccc acagctcgta ccttctcacc 2641 agattcccct gaatccaaac tcgtggtgca aacctctacc ttttttacaa aaagatctta 2701 ttgttaattt attgtttctg gcacttgggc aaaccctgta gttaatactc ctcccacact 2761 agacactggg tttcaggagg agggagactg ccctgctttg gtcccagaga ggccctctgc 2821 agataggcgt ggcccctctt cagaggacac taccctaggg cactttctct ttgaggtgga 2881 gagacccata aagccttgac cacatcactc catatgggga ggagaaggat ccctgtcacc 2941 ttctcctctc ttcacggggc ccttttgcag ccctaggcct catctgtggg aagggagtcc 3001 ctggcttata ctgcccccac cacagctcct tgccctggcc agaactgctg tcgaagaaaa 3061 tcaggccgga aggccaagaa ggcgctaagg gggatgggag ggcaggtttt ccaggctgga 3121 gtcggttcca cccactcgcc tgtccacagg cttccttgta agcaagtcag cagcacagct 3181 actcacgctg ccatctggac ttattttatg tcaatctgtt tataaataaa aaccaatata 3241 gataaaaaaa aaaaaaaaaa aaaaaaaa //