LOCUS       BC031797                3069 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens SRY (sex determining region Y)-box 8, mRNA (cDNA clone
            MGC:24837 IMAGE:4937883), complete cds.
ACCESSION   BC031797
VERSION     BC031797.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3069)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 3069)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (06-JUN-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: David N. Louis, M.D.
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Baylor College of Medicine Human Genome
            Sequencing Center
            Center code: BCM-HGSC
            Web site: http://www.hgsc.bcm.tmc.edu/cdna/
            Contact: amg@bcm.tmc.edu
            Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
            Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
            A.N., Gibbs, R.A.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 29 Row: o Column: 6
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 30179902.
FEATURES             Location/Qualifiers
     source          1..3069
                     /db_xref="H-InvDB:HIT000041372"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:24837 IMAGE:4937883"
                     /tissue_type="Brain, anaplastic oligodendroglioma with
                     1p/19q loss"
                     /clone_lib="NCI_CGAP_Brn67"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..3069
                     /gene="SOX8"
                     /gene_synonym="MGC24837"
                     /db_xref="GeneID:30812"
                     /db_xref="HGNC:HGNC:11203"
                     /db_xref="MIM:605923"
     CDS             116..1456
                     /gene="SOX8"
                     /gene_synonym="MGC24837"
                     /codon_start=1
                     /product="SRY (sex determining region Y)-box 8"
                     /protein_id="AAH31797.1"
                     /db_xref="GeneID:30812"
                     /db_xref="HGNC:HGNC:11203"
                     /db_xref="MIM:605923"
                     /translation="MLDMSEARSQPPCSPSGTASSMSHVEDSDSDAPPSPAGSEGLGR
                     AGVAVGGARGDPAEAADERFPACIRDAVSQVLKGYDWSLVPMPVRGGGGGALKAKPHV
                     KRPMNAFMVWAQAARRKLADQYPHLHNAELSKTLGKLWRLLSESEKRPFVEEAERLRV
                     QHKKDHPDYKYQPRRRKSAKAGHSDSDSGAELGPHPGGGAVYKAEAGLGDGHHHGDHT
                     GQTHGPPTPPTTPKTELQQAGAKPELKLEGRRPVDSGRQNIDFSNVDISELSSEVMGT
                     MDAFDVHEFDQYLPLGGPAPPEPGQAYGGAYFHAGASPVWAHKSAPSASASPTETGPP
                     RPHIKTEQPSPGHYGDQPRGSPDYGSCSGQSSATPAAPAGPFAGSQGDYGDLQASSYY
                     GAYPGYAPGLYQYPCFHSPRRPYASPLLNGLALPPAHSPTSHWDQPVYTTLTRP"
BASE COUNT          574 a          997 c          948 g          550 t
ORIGIN      
        1 ggcgagggtc ggggccaccg cgcggcgacc tcgggtcccg gagcgaccgc agggcagccc
       61 cgggcgccgg ccccggtgcg cgtctcctgt gcgcgcccct ccgcgcgcgg ccccgatgct
      121 ggacatgagc gaggcccgct cccagccgcc ctgcagcccg tccggcaccg ccagctccat
      181 gtcgcacgtg gaggactcgg actcggacgc gccgccgtct cccgccggct ccgagggcct
      241 gggccgcgcg ggggtcgcgg tggggggcgc ccggggcgac ccggcggagg cggcggacga
      301 gcgcttcccg gcctgcatcc gcgacgccgt gtcgcaggtg ctcaagggct acgactggag
      361 tctggtgccc atgccggtgc gcggcggcgg cggcggcgcg ctcaaagcca agccgcatgt
      421 gaagcggccc atgaacgcat tcatggtgtg ggcgcaggcg gcgcgccgca agctggccga
      481 ccagtacccg cacctgcaca acgccgagct cagcaagacg ctgggcaagc tgtggcgctt
      541 gctgagcgag agcgagaagc ggcccttcgt ggaggaggca gagcgccttc gcgtgcagca
      601 caagaaggac caccccgact acaagtacca gccacggcgc aggaagagcg ccaaagccgg
      661 ccacagcgac tccgactcgg gcgcggagct gggaccccac cctggcggcg gtgccgtgta
      721 caaggctgaa gcagggcttg gagatgggca ccaccatggc gaccacacag ggcagaccca
      781 cgggccgccc accccgccca ccacccccaa gacggagctg cagcaggcgg gcgccaagcc
      841 ggagctgaag ctggagggac gccggccggt ggacagcggg cgccagaaca tcgacttcag
      901 caacgtggac atctcggagc tcagcagcga ggtcatgggc accatggacg ccttcgacgt
      961 ccacgagttc gaccagtacc tgcccctggg cggccccgcc ccacccgagc cgggccaggc
     1021 ctacgggggc gcctacttcc acgccggggc gtcccccgtg tgggcccaca agagtgcccc
     1081 gtcggcctcc gcgtcgccca ccgagacggg tcccccacgg ccgcacatca agacggagca
     1141 gccgagcccc ggccactacg gcgaccagcc ccgaggctcg cccgactacg gttcctgcag
     1201 cggccagtcc agcgccaccc cggccgcccc cgccggcccc ttcgccggct cacagggcga
     1261 ctatggcgac ctgcaggcct ccagctacta tggtgcctac cctggctacg cacccggcct
     1321 ctaccagtac ccctgcttcc actcgccgcg ccggccctac gcctcacccc tgctcaacgg
     1381 cctggccctg ccgcccgccc acagccccac cagtcactgg gaccagccgg tgtacaccac
     1441 cctgaccagg ccctgagggc ccagccgcgg ggagggactc gcaggcgtca gggggcagcc
     1501 ttgtcccggc ccagtgtgtg tgaccggggc gggaggggcc ccagtggctg agctccaagt
     1561 gcctgctgaa gtctgcaggg aaacacgctt gctgcccgtg gccctcggcc tccagatggc
     1621 cacacctctg ccgacgacgg accagctccc tctcccttct atctttcttt ttgaggtggt
     1681 gggattattc cacaaagaag ggctgccgtt tggtccctct tctgtgagga ctggcggcac
     1741 cagcaccttc gctttgcatc tcggtagagg agaaacggca gcacagccca aggaccaaag
     1801 gagggggtgg caggggcctt gcagggcgct gtgaggtcca ggccggtctt ggcgccgaga
     1861 gcccctgcac tcaaggccac attccctcga caacggctgc acgggctgtc cgggatccgg
     1921 ggtgtctgtc cgcagactgg gatgagtcta ctcgagcatc tccgggacct gcctgtcaga
     1981 tctgaggtgt ctccttgctg gcagagtgcg ctcacgcgag ggctggctgt gatgaacaca
     2041 tctctctttt atttttatgt ttttgataat ttttattttt gaagcttaaa tgtgtttctt
     2101 ctgaaagctg ttaaagatgt atttatgttc tgtgttattt tatctttaat taatgaggta
     2161 attcgggcaa agagtagaat ttaagacaaa acggaagctg ggaagcttcc cttgagggca
     2221 ggcaggaggt ggagttgcag ctgttggccg gcatcacgtt gctcgttgct cggcttatgg
     2281 gaggccgccc tggagggccc ggaggtccca aggtccctgg gaggactggg cccctcatgc
     2341 ctcgagcttg gcaaccgaaa acccgaggga ggagaaggga cctgccttgt gacatctctg
     2401 atcaggttgg ggtgccccag cacccagtac tagtttgggg tttgggaagc aggactccgt
     2461 ccctgtcccc gactgtgcca cgtggtagga cacataggac acaggaattc ctgggtcctt
     2521 gcccatgact gtgccatgtg gtaggacaca ggacacagga attcctggaa agtggtggct
     2581 tcagaagtga tcttggctcg caggcaccag tgccacctac caagctgtga aactaaacct
     2641 tctccactaa acgtcgttag ggcctcagtt ctagacgagt catacctgat tcacctgcac
     2701 tgcttccccc gtgtgctgag catagagcat acaatagcgc ctacttcacg gaaacttgtg
     2761 cctttaaact ttgtaaactt aaacacagcc gagaagttgc ttctttgtac tttttctact
     2821 tttcctactt ttttgtagaa aaaaaagata atgcctctgc ttctatttct ctgggggtgg
     2881 gggtgggggc cgggagccgt cgcagacccg tttcatgcag cgtctccctc ggcaccgcgt
     2941 tcggaggacg caccctcact cccctgctgc cttcactcct ttctgaccaa gcaacgctaa
     3001 cttttgtaca gatcgatttg ataaaattaa acaaagtgct ttttatggaa aaaaaaaaaa
     3061 aaaaaaaaa
//