LOCUS BC024033 2132 bp mRNA linear HUM 11-SEP-2007
DEFINITION Homo sapiens SIX homeobox 2, mRNA (cDNA clone MGC:4249
IMAGE:3027992), complete cds.
ACCESSION BC024033
VERSION BC024033.2
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2132)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2132)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (07-FEB-2002) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Aug 20, 2003 this sequence version replaced BC024033.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Institute for Systems Biology
http://www.systemsbiology.org
contact: amadan@systemsbiology.org
Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 10 Row: a Column: 24
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 34147624.
FEATURES Location/Qualifiers
source 1..2132
/db_xref="H-InvDB:HIT000039733"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:4249 IMAGE:3027992"
/tissue_type="Muscle, rhabdomyosarcoma"
/clone_lib="NIH_MGC_17"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..2132
/gene="SIX2"
/db_xref="GeneID:10736"
/db_xref="HGNC:HGNC:10888"
/db_xref="MIM:604994"
CDS 274..1149
/gene="SIX2"
/codon_start=1
/product="SIX homeobox 2"
/protein_id="AAH24033.1"
/db_xref="GeneID:10736"
/db_xref="HGNC:HGNC:10888"
/db_xref="MIM:604994"
/translation="MSMLPTFGFTQEQVACVCEVLQQGGNIERLGRFLWSLPACEHLH
KNESVLKAKAVVAFHRGNFRELYKILESHQFSPHNHAKLQQLWLKAHYIEAEKLRGRP
LGAVGKYRVRRKFPLPRSIWDGEETSYCFKEKSRSVLREWYAHNPYPSPREKRELTEA
TGLTTTQVSNWFKNRRQRDRAAEAKERENNENSNSNSHNPLNGSGKSVLGSSEDEKTP
SGTPDHSSSSPALLLSPPPPGLPSLHSLGHPPGPSAVPVPVPGGGGADPLQHHHGLQD
SILNPMSANLVDLGS"
BASE COUNT 379 a 703 c 604 g 446 t
ORIGIN
1 cgcggagcat gcggagcggc gccccgggcg gcccccgggc ttgggcgagg gcttgggcca
61 ggcgcgcggg ccgttggggt tcggagcttc gtgggacccg cggccggcgc ggggacgtac
121 ggcagtgact cggggctcac cgggggccag tgccgggcca gggggccagc cccgcccgcg
181 tctcggcccg gacggcccgg cgaggaagct cccatgcggg accgcgcggc ccggtgaggg
241 cgcgcgcggg cgggcgggga cgcagccggc accatgtcca tgctgcccac cttcggcttc
301 acgcaggagc aagtggcgtg cgtgtgcgag gtgctgcagc agggcggcaa catcgagcgg
361 ctgggccgct tcctgtggtc gctgcccgcc tgcgagcacc ttcacaagaa tgaaagcgtg
421 ctcaaggcca aggccgtggt ggccttccac cgcggcaact tccgcgagct ctacaagatc
481 ctggagagcc accagttctc gccgcacaac cacgccaagc tgcagcagct gtggctcaag
541 gcacactaca tcgaggcgga gaagctgcgc ggccgacccc tgggcgccgt gggcaaatac
601 cgcgtgcgcc gcaaattccc gctgccgcgc tccatctggg acggcgagga gaccagctac
661 tgcttcaagg aaaagagtcg cagcgtgctg cgcgagtggt acgcgcacaa cccctaccct
721 tcaccccgcg agaagcgtga gctgacggag gccacgggcc tcaccaccac acaggtcagc
781 aactggttca agaaccggcg gcagcgcgac cgggcggccg aggccaagga aagggagaac
841 aacgagaact ccaattctaa cagccacaac ccgctgaatg gcagcggcaa gtcggtgtta
901 ggcagctcgg aggatgagaa gactccatcg gggacgccag accactcatc atccagcccc
961 gcactgctcc tcagcccgcc gccccctggg ctgccgtccc tgcacagcct gggccaccct
1021 ccgggcccca gcgcagtgcc agtgccggtg ccaggcggag gtggagcgga cccactgcaa
1081 caccaccatg gcctgcagga ctccatcctc aaccccatgt cagccaacct cgtggacctg
1141 ggctcctaga acccatttgc cttgatgagc ttgccttttg tgacttgaca ctggggacgt
1201 ggagtggcgg tgtccagggg cgccccgccc ctgcggcccc accaggtact gaaagacccg
1261 caggctgagc gggtagaaca gccgggtagg gcagatagct gtctatgttg gttcttgttt
1321 gggatttatt ttcaacaagt tacttttagg atccttttgg ggctggagac tgagtcttga
1381 accacagaag ggaataaatt atacaccact gtcattctct ctctccctct gtctcttcct
1441 tttaccctct cttgtcttgc cttttccccc tttcctcttc ctttcccttc cttctctttt
1501 cttttttctg ctttctgtct ttctccctct ccttgtattg ctttccttct agatttctag
1561 cttgccaccg ttcattctct ccttctgtct ctccctttct ctctccttct ctgtttctcc
1621 tctcttctct cctgccagtc tcttgtactc tgtgtcctgg tccctccgta tgtacccctg
1681 tctttctcct cctgactggt ggtctatctg cccctacctc tggccctcgc tttaccggag
1741 tagggggtgg gagagggaag aggagagaaa agacagggac tttgaaccta ggccatctcc
1801 tgaggccttt tccctcgccc atgtgggtca gtgggagctg caggtgtcag cttttcgtct
1861 agtaacttaa gtgagagaga aagggcagcg ccacagaagc ccctaaacgc cgcctcgtca
1921 tacgcccctc ctccttctct cttggcgagg ccccgccaca ccgcgctctt cctcccggga
1981 ctgtgactac agcgctcccg gctgagcgcg ccccccgagc cgccgacttg ccgtctcccc
2041 gtaatgccct catgtgaatg ttcttcggga aatatttctg cttttatttt ataataaaat
2101 tagaaatcat aaataaaaaa aaaaaaaaaa aa
//