LOCUS       BC060773                2814 bp    mRNA    linear   HUM 28-JUL-2005
DEFINITION  Homo sapiens SRY (sex determining region Y)-box 5, mRNA (cDNA clone
            MGC:71528 IMAGE:30343519), complete cds.
ACCESSION   BC060773
VERSION     BC060773.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2814)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2814)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (03-NOV-2003) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Dr. Stefan Hansson
            cDNA Library Preparation: Michael Brownstein /  Ted Usdin
            Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA  94305
            Web site:       http://www-shgc.stanford.edu
            Contact:  (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
            R. M.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 133 Row: f Column: 2
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 30061560.
FEATURES             Location/Qualifiers
     source          1..2814
                     /db_xref="H-InvDB:HIT000260075"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:71528 IMAGE:30343519"
                     /tissue_type="Placenta, normal"
                     /clone_lib="NIH_MGC_147"
                     /lab_host="DH10B"
                     /note="Vector: pBluescriptR"
     gene            1..2814
                     /gene="SOX5"
                     /gene_synonym="L-SOX5"
                     /gene_synonym="MGC35153"
                     /db_xref="GeneID:6660"
                     /db_xref="MIM:604975"
     CDS             451..2379
                     /gene="SOX5"
                     /gene_synonym="L-SOX5"
                     /gene_synonym="MGC35153"
                     /codon_start=1
                     /product="SOX5 protein"
                     /protein_id="AAH60773.1"
                     /db_xref="GeneID:6660"
                     /db_xref="MIM:604975"
                     /translation="MSSKRPASPYGEADGEVAMVTSRQKVEEEESDGLPAFHLPLHVS
                     FPNKPHSEEFQPVSLLTQETCGHRTPTSQHNTMEVDGNKVMSSFAPHNSSTSPQKAEE
                     GGRQSGESLSSTALGTPERRKGSLADVVDTLKQRKMEELIKNEPEETPSIEKLLSKDW
                     KDKLLAMGSGNFGEIKGTPESLAEKERQLMGMINQLTSLREQLLAAHDEQKKLAASQI
                     EKQRQQMELAKQQQEQIARQQQQLLQQQHKINLLQQQIQVQGQLPPLMIPVFPPDQRT
                     LAAAAQQGFLLPPGFSYKAGCSDPYPVQLIPTTMAAAAAATPGLGPLQLQQLYAAQLA
                     AMQVSPGGKLPGIPQGNLGAAVSPTSIHTDKSTNSPPPKSKEKTTLESLTQQLAVKQN
                     EEGKFSHAMMDFNLSGDSDGSAGVSESRIYRESRGRGSNEPHIKRPMNAFMVWAKDER
                     RKILQAFPDMHNSNISKILGSRWKAMTNLEKQPYYEEQARLSKQHLEKYPDYKYKPRP
                     KRTCLVDGKKLRIGEYKAIMRNRRQEMRQYFNVGQQAQIPIATAGVVYPGAIAMAGMP
                     SPHLPSEHSSVSSSPEPGMPVIQSTYGVKGEEPHIKEEIQAEDINGEIYDEYDEEEDD
                     PDVDYGSDSENHIAGQAN"
BASE COUNT          861 a          677 c          680 g          596 t
ORIGIN      
        1 gagagtgaaa aaggcgagcc accaaaaccc atctccagtc tcctcccggg ggcccccagc
       61 ccgcctctgt gccactttgc atcccacgcc ggaggaggca ttaacgagac cgggtaaggc
      121 tttttaaacg gtccaaggtg tagagccata cttcaggagg atcctcagaa gttttggaca
      181 agcctcccca aatgtggcag gtgctgtgct ggccattggt gacccaaaga tgatgaaaaa
      241 tatgttcctg cccacaagga gttagcgacc tactgggctt tcctcttgct gatgacatga
      301 ttcctgtttg aatctgttga caagattctg aaagctgaac agagaattct ggcactgcac
      361 tgggtaggaa aaagcatttc aagaaataga taatatcaag gacatcagga caccgggagt
      421 gggagagatt ggactgggag actcagcagg atgtcttcca agcgaccagc ctctccgtat
      481 ggggaagcag atggagaggt agccatggtg acaagcagac agaaagtgga agaagaggag
      541 agtgacgggc tcccagcctt tcaccttccc ttgcatgtga gttttcccaa caagcctcac
      601 tctgaggaat ttcagccagt ttctctgctg acgcaagaga cttgtggcca taggactccc
      661 acttctcagc acaatacaat ggaagttgat ggcaataaag ttatgtcttc atttgcccca
      721 cacaactcat ctacctcacc tcagaaggca gaagaaggtg ggcgacagag tggcgagtcc
      781 ttgtctagta cagccctggg aactcctgaa cggcgcaagg gcagtttagc tgatgttgtt
      841 gacaccttga agcagaggaa aatggaagag ctcatcaaaa acgagccgga agaaaccccc
      901 agtattgaaa aactactctc aaaggactgg aaagacaagc ttcttgcaat gggatcgggg
      961 aactttggcg aaataaaagg gactcccgag agcttagctg agaaagaaag gcaactcatg
     1021 ggtatgatca accagctgac cagcctccga gagcagctgt tggctgccca cgatgagcag
     1081 aagaaactag ctgcctctca gattgagaaa cagcgtcagc aaatggagct ggccaagcag
     1141 caacaagaac aaattgcaag acagcagcag cagcttctac agcaacaaca caaaatcaat
     1201 ttgctccagc aacagatcca ggttcaaggt cagctgccgc cattaatgat tcccgtattc
     1261 cctcctgatc aacggacact ggctgcagct gcccagcaag gattcctcct ccctccaggc
     1321 ttcagctata aggctggatg tagtgaccct taccctgttc agctgatccc aactaccatg
     1381 gcagctgctg ccgcagcaac accaggctta ggcccactcc aactgcagca gttatatgct
     1441 gcccagctag ctgcaatgca ggtatctcca ggagggaagc tgccaggcat accccaaggc
     1501 aaccttggtg ctgctgtatc tcctaccagc attcacacag acaagagcac aaacagccca
     1561 ccacccaaaa gcaaggaaaa aacaacactg gagagtctga ctcagcaact ggcagttaaa
     1621 cagaatgaag aaggaaaatt tagccatgca atgatggatt tcaatctgag tggagattct
     1681 gatggaagtg ctggagtctc agagtcaaga atttataggg aatcccgagg gcgtggtagc
     1741 aatgaacccc acataaagcg tccaatgaat gccttcatgg tgtgggctaa agatgaacgg
     1801 agaaagatcc ttcaagcctt tcctgacatg cacaactcca acatcagcaa gatattggga
     1861 tctcgctgga aagctatgac aaacctagag aaacagccat attatgagga gcaagcccgt
     1921 ctcagcaagc agcacctgga gaagtaccct gactataagt acaagcccag gccaaagcgc
     1981 acctgcctgg tggatggcaa aaagctgcgc attggtgaat acaaggcaat catgcgcaac
     2041 aggcggcagg aaatgcggca gtacttcaat gttgggcaac aagcacagat ccccattgcc
     2101 actgctggtg ttgtgtaccc tggagccatc gccatggctg ggatgccctc ccctcacctg
     2161 ccctcggagc actcaagcgt gtctagcagc ccagagcctg ggatgcctgt tatccagagc
     2221 acttacggtg tgaaaggaga ggagccacat atcaaagaag agatacaggc cgaggacatc
     2281 aatggagaaa tttatgatga gtacgacgag gaagaggatg atccagatgt agattatggg
     2341 agtgacagtg aaaaccatat tgcaggacaa gccaactgat aagggtcaaa agattgttgt
     2401 gaccttagga cttaaagaag ccctaactgg ttcatcctta ccagtggcca agcacattaa
     2461 ctttctcata cactgactgt tactttaact gttagtctta aatagttggg acatcagctg
     2521 actaatagac ctcagcctca aaaggcttgg aaagaaaaaa caaatacaac aagcaaacaa
     2581 caatatcaac aacaagagat tgaaataagc tatgggtaaa ataatgccag taattcagct
     2641 gctacatcca agcactgaag tcttacccgt caactttttt tttttttaaa taaactttat
     2701 ggctgtttgt tctacaatgt tctagaaatt ctcactcagg tacacagtgc caacaagtgg
     2761 cttgtgaatg tgttttgttg ttttgtgcta caatttttaa aaaaaaaaaa aaaa
//