LOCUS BC024033 2132 bp mRNA linear HUM 11-SEP-2007 DEFINITION Homo sapiens SIX homeobox 2, mRNA (cDNA clone MGC:4249 IMAGE:3027992), complete cds. ACCESSION BC024033 VERSION BC024033.2 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2132) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2132) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (07-FEB-2002) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Aug 20, 2003 this sequence version replaced BC024033.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Institute for Systems Biology http://www.systemsbiology.org contact: amadan@systemsbiology.org Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 10 Row: a Column: 24 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 34147624. FEATURES Location/Qualifiers source 1..2132 /db_xref="H-InvDB:HIT000039733" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:4249 IMAGE:3027992" /tissue_type="Muscle, rhabdomyosarcoma" /clone_lib="NIH_MGC_17" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..2132 /gene="SIX2" /db_xref="GeneID:10736" /db_xref="HGNC:HGNC:10888" /db_xref="MIM:604994" CDS 274..1149 /gene="SIX2" /codon_start=1 /product="SIX homeobox 2" /protein_id="AAH24033.1" /db_xref="GeneID:10736" /db_xref="HGNC:HGNC:10888" /db_xref="MIM:604994" /translation="MSMLPTFGFTQEQVACVCEVLQQGGNIERLGRFLWSLPACEHLH KNESVLKAKAVVAFHRGNFRELYKILESHQFSPHNHAKLQQLWLKAHYIEAEKLRGRP LGAVGKYRVRRKFPLPRSIWDGEETSYCFKEKSRSVLREWYAHNPYPSPREKRELTEA TGLTTTQVSNWFKNRRQRDRAAEAKERENNENSNSNSHNPLNGSGKSVLGSSEDEKTP SGTPDHSSSSPALLLSPPPPGLPSLHSLGHPPGPSAVPVPVPGGGGADPLQHHHGLQD SILNPMSANLVDLGS" BASE COUNT 379 a 703 c 604 g 446 t ORIGIN 1 cgcggagcat gcggagcggc gccccgggcg gcccccgggc ttgggcgagg gcttgggcca 61 ggcgcgcggg ccgttggggt tcggagcttc gtgggacccg cggccggcgc ggggacgtac 121 ggcagtgact cggggctcac cgggggccag tgccgggcca gggggccagc cccgcccgcg 181 tctcggcccg gacggcccgg cgaggaagct cccatgcggg accgcgcggc ccggtgaggg 241 cgcgcgcggg cgggcgggga cgcagccggc accatgtcca tgctgcccac cttcggcttc 301 acgcaggagc aagtggcgtg cgtgtgcgag gtgctgcagc agggcggcaa catcgagcgg 361 ctgggccgct tcctgtggtc gctgcccgcc tgcgagcacc ttcacaagaa tgaaagcgtg 421 ctcaaggcca aggccgtggt ggccttccac cgcggcaact tccgcgagct ctacaagatc 481 ctggagagcc accagttctc gccgcacaac cacgccaagc tgcagcagct gtggctcaag 541 gcacactaca tcgaggcgga gaagctgcgc ggccgacccc tgggcgccgt gggcaaatac 601 cgcgtgcgcc gcaaattccc gctgccgcgc tccatctggg acggcgagga gaccagctac 661 tgcttcaagg aaaagagtcg cagcgtgctg cgcgagtggt acgcgcacaa cccctaccct 721 tcaccccgcg agaagcgtga gctgacggag gccacgggcc tcaccaccac acaggtcagc 781 aactggttca agaaccggcg gcagcgcgac cgggcggccg aggccaagga aagggagaac 841 aacgagaact ccaattctaa cagccacaac ccgctgaatg gcagcggcaa gtcggtgtta 901 ggcagctcgg aggatgagaa gactccatcg gggacgccag accactcatc atccagcccc 961 gcactgctcc tcagcccgcc gccccctggg ctgccgtccc tgcacagcct gggccaccct 1021 ccgggcccca gcgcagtgcc agtgccggtg ccaggcggag gtggagcgga cccactgcaa 1081 caccaccatg gcctgcagga ctccatcctc aaccccatgt cagccaacct cgtggacctg 1141 ggctcctaga acccatttgc cttgatgagc ttgccttttg tgacttgaca ctggggacgt 1201 ggagtggcgg tgtccagggg cgccccgccc ctgcggcccc accaggtact gaaagacccg 1261 caggctgagc gggtagaaca gccgggtagg gcagatagct gtctatgttg gttcttgttt 1321 gggatttatt ttcaacaagt tacttttagg atccttttgg ggctggagac tgagtcttga 1381 accacagaag ggaataaatt atacaccact gtcattctct ctctccctct gtctcttcct 1441 tttaccctct cttgtcttgc cttttccccc tttcctcttc ctttcccttc cttctctttt 1501 cttttttctg ctttctgtct ttctccctct ccttgtattg ctttccttct agatttctag 1561 cttgccaccg ttcattctct ccttctgtct ctccctttct ctctccttct ctgtttctcc 1621 tctcttctct cctgccagtc tcttgtactc tgtgtcctgg tccctccgta tgtacccctg 1681 tctttctcct cctgactggt ggtctatctg cccctacctc tggccctcgc tttaccggag 1741 tagggggtgg gagagggaag aggagagaaa agacagggac tttgaaccta ggccatctcc 1801 tgaggccttt tccctcgccc atgtgggtca gtgggagctg caggtgtcag cttttcgtct 1861 agtaacttaa gtgagagaga aagggcagcg ccacagaagc ccctaaacgc cgcctcgtca 1921 tacgcccctc ctccttctct cttggcgagg ccccgccaca ccgcgctctt cctcccggga 1981 ctgtgactac agcgctcccg gctgagcgcg ccccccgagc cgccgacttg ccgtctcccc 2041 gtaatgccct catgtgaatg ttcttcggga aatatttctg cttttatttt ataataaaat 2101 tagaaatcat aaataaaaaa aaaaaaaaaa aa //