LOCUS       BC047064                2671 bp    mRNA    linear   HUM 21-JUL-2005
DEFINITION  Homo sapiens SRY (sex determining region Y)-box 6, mRNA (cDNA clone
            MGC:48285 IMAGE:5269193), complete cds.
ACCESSION   BC047064
VERSION     BC047064.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2671)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2671)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (19-FEB-2003) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Miklos Palkovits, M.D., Ph.D.
            cDNA Library Preparation: Michael J. Brownstein (NHGRI) &  Shiraki
            Toshiyuki and Piero Carninci (RIKEN)
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Baylor College of Medicine Human Genome
            Sequencing Center
            Center code: BCM-HGSC
            Web site: http://www.hgsc.bcm.tmc.edu/cdna/
            Contact: amg@bcm.tmc.edu
            Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
            Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
            A.N., Gibbs, R.A.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 91 Row: b Column: 8
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 15277412.
FEATURES             Location/Qualifiers
     source          1..2671
                     /db_xref="H-InvDB:HIT000053079"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:48285 IMAGE:5269193"
                     /tissue_type="Testis"
                     /clone_lib="NIH_MGC_97"
                     /lab_host="DH10B"
                     /note="Vector: pBluescriptR"
     gene            1..2671
                     /gene="SOX6"
                     /gene_synonym="HSSOX6"
                     /db_xref="GeneID:55553"
                     /db_xref="MIM:607257"
     CDS             69..2555
                     /gene="SOX6"
                     /gene_synonym="HSSOX6"
                     /codon_start=1
                     /product="SOX6 protein"
                     /protein_id="AAH47064.2"
                     /db_xref="GeneID:55553"
                     /db_xref="MIM:607257"
                     /translation="MSSKQATSPFACAADGEDAMTQDLTSREKEEGSDQHVASHLPLH
                     PIMHNKPHSEELPTLVSTIQQDADWDSVLSSQQRMESENNKLCSLYSFRNTSTSPHKP
                     DEGSRDREIMTSVTFGTPERRKGSLADVVDTLKQKKLEEMTRTEQEDSSCMEKLLSKD
                     WKEKMERLNTSELLGEIKGTPESLAEKERQLSTMITQLISLREQLLAAHDEQKKLAAS
                     QIEKQRQQMDLARQQQEQIARQQQQLLQQQHKINLLQQQIQVQGHMPPLMIPIFPHDQ
                     RTLAAAAAAQQGFLFPPGITYKPGDNYPVQFIPSTMAAAAASGLSPLQLQKGHVSHPQ
                     INQRLKGLSDRFGRNLDTFEHGGGHSYNHKQIEQLYAAQLASMQVSPGAKMPSTPQPP
                     NTAGTVSPTGIKNEKRGTSPVTQVKDEAAAQPLNLSSRPKTAEPVKSPTSPTQNLFPA
                     SKTSPVNLPNKSSIPSPIGGSLGRGSSLDILSSLNSPALFGDQDTVMKAIQEARKMRE
                     QIQREQQQQQPHGVDGKLSSINNMGLNSCRNEKERTRFENLGPQLTGKSNEDGKLGPG
                     VIDLTRPEDAEGSKAMNGSAAKLQQYYCWPTGGATVAEARVYRDARGRASSEPHIKRP
                     MNAFMVWAKDERRKILQAFPDMHNSNISKILGSRWKSMSNQEKQPYYEEQARLSKIHL
                     EKYPNYKYKPRPKRTCIVDGKKLRIGEYKQLMRSRRQEMRQFFTVGQQPQIPITTGTG
                     VVYPGAITMATTTPSPQMTSDCSSTSASPEPSLPVIQSTYGMKTDGGSLAGNEMINGE
                     DEMEMYDDYEDDPKSDYSSENEAPEAVSAN"
BASE COUNT          841 a          676 c          634 g          520 t
ORIGIN      
        1 gttaggagag accattgggg ataggaaaga tgaaaggtca tggtgagctt caaggacatg
       61 aaagaagaat gtcttccaag caagccacct ctccatttgc ctgtgcagct gatggagagg
      121 atgcaatgac ccaggattta acctcaaggg aaaaggaaga gggcagtgat caacatgtgg
      181 cctcccatct gcctctgcac cccataatgc acaacaaacc tcactctgag gagctaccaa
      241 cacttgtcag taccattcaa caagatgctg actgggacag cgttctgtca tctcagcaaa
      301 gaatggaatc agagaataat aagttatgtt ccctatattc cttccgaaat acctctacct
      361 caccacataa gcctgacgaa gggagtcggg accgtgagat aatgaccagt gttacttttg
      421 gaaccccaga gcgccgcaaa gggagtcttg ccgatgtggt ggacacactg aaacagaaga
      481 agcttgagga aatgactcgg actgaacaag aggattcctc ctgcatggaa aaactacttt
      541 caaaagattg gaaggaaaaa atggaaagac taaataccag tgaacttctt ggagaaatta
      601 aaggtacacc tgagagcctg gcagaaaaag aacggcagct ctccaccatg attacccagc
      661 tgatcagttt acgggagcag ctactggcag cgcatgatga acagaaaaaa ctggcagcgt
      721 cacaaattga gaaacaacgg cagcaaatgg accttgctcg ccaacagcaa gaacagattg
      781 cgagacaaca gcagcaactt ctgcaacagc agcacaaaat taatctcctg cagcaacaga
      841 tccaggttca gggtcacatg cctccgctca tgatcccaat ttttccacat gaccagcgga
      901 ccctggcagc agctgctgct gcccaacagg gattcctctt cccccctgga ataacataca
      961 aaccaggtga taactacccc gtacagttca ttccatcaac aatggcagct gctgctgctt
     1021 ctggactcag ccctttacag ctccagaagg gtcatgtctc ccacccacaa attaaccaaa
     1081 ggctaaaggg cctaagtgac cgttttggca ggaatttgga cacctttgaa catggtggtg
     1141 gccactctta caaccacaaa cagattgagc agctctatgc cgctcagctg gccagcatgc
     1201 aggtgtcacc tggagcaaag atgccatcaa ctccacagcc accaaacaca gcagggacgg
     1261 tctcacctac tgggataaaa aatgaaaaga gagggaccag ccctgtaact caagttaagg
     1321 atgaagcagc agcacagcct ctgaatctct catcccgacc caagacagca gagcctgtaa
     1381 agtccccaac gtctcccacc cagaacctct tcccagccag caaaaccagc cctgtcaatc
     1441 tgccaaacaa aagcagcatc cctagcccca ttggaggaag cctgggaaga ggatcctctt
     1501 tagatatcct atctagtctc aactcccctg ccctttttgg ggatcaggat acagtgatga
     1561 aagccattca ggaggcgcgg aagatgcgag agcagatcca gcgggagcaa cagcagcaac
     1621 agccacatgg tgttgacggg aaactgtcct ccataaataa tatggggctg aacagctgca
     1681 ggaatgaaaa ggaaagaacg cgctttgaga atttggggcc ccagttaacg ggaaagtcaa
     1741 atgaagatgg aaaactgggc ccaggtgtca tcgaccttac tcggccagaa gatgcagagg
     1801 gaagtaaagc aatgaatggc tctgcagcta aactacagca gtattattgt tggccaacag
     1861 gaggtgccac tgtggctgaa gcacgagtct acagggacgc ccgcggccgt gccagcagcg
     1921 agccacacat taagcgacca atgaatgcat tcatggtttg ggcaaaggat gagaggagaa
     1981 aaatccttca ggccttcccc gacatgcata actccaacat tagcaaaatc ttaggatctc
     2041 gctggaaatc aatgtccaac caggagaagc aaccttatta tgaagagcag gcccggctaa
     2101 gcaagatcca cttagagaag tacccaaact ataaatacaa accccgaccg aaacgcacct
     2161 gcattgttga tggcaaaaag cttcggattg gggagtataa gcaactgatg aggtctcgga
     2221 gacaggagat gaggcagttc tttactgtgg ggcaacagcc tcagattcca atcaccacag
     2281 gaacaggtgt tgtgtatcct ggtgctatca ctatggcaac taccacacca tcgcctcaga
     2341 tgacatctga ctgctctagc acctcggcca gcccggagcc cagcctcccg gtcatccaga
     2401 gcacttatgg tatgaagaca gatggcggaa gcctagctgg aaatgaaatg atcaatggag
     2461 aggatgaaat ggaaatgtat gatgactatg aagatgaccc caaatcagac tatagcagtg
     2521 aaaatgaagc cccggaggct gtcagtgcca actgaggagt ttttgtttgc tgaattaaag
     2581 tactctgaca tttcaccccc ctccccaaca aagagttatt aaagagcccg catgcatttg
     2641 tggctccaca attacaaaaa aaaaaaaaaa a
//