LOCUS       BC071947                2255 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens SRY (sex determining region Y)-box 7, mRNA (cDNA clone
            MGC:88634 IMAGE:6459570), complete cds.
ACCESSION   BC071947
VERSION     BC071947.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2255)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2255)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (01-JUN-2004) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: DCTD/DTP/Gazdar
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA  94305
            Web site:       http://www-shgc.stanford.edu
            Contact:  (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
            R. M.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 58 Row: n Column: 22
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 30581119.
FEATURES             Location/Qualifiers
     source          1..2255
                     /db_xref="H-InvDB:HIT000264754"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:88634 IMAGE:6459570"
                     /tissue_type="Lung, large cell carcinoma"
                     /clone_lib="NIH_MGC_18"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..2255
                     /gene="SOX7"
                     /gene_synonym="MGC10895"
                     /db_xref="GeneID:83595"
                     /db_xref="HGNC:HGNC:18196"
     CDS             68..1234
                     /gene="SOX7"
                     /gene_synonym="MGC10895"
                     /codon_start=1
                     /product="SRY (sex determining region Y)-box 7"
                     /protein_id="AAH71947.1"
                     /db_xref="GeneID:83595"
                     /db_xref="HGNC:HGNC:18196"
                     /translation="MASLLGAYPWPEGLECPALDAELSDGQSPPAVPRPPGDKGSESR
                     IRRPMNAFMVWAKDERKRLAVQNPDLHNAELSKMLGKSWKALTLSQKRPYVDEAERLR
                     LQHMQDYPNYKYRPRRKKQAKRLCKRVDPGFLLSSLSRDQNALPEKRSGSRGALGEKE
                     DRGEYSPGTALPSLRGCYHEGPAGGGGGGTPSSVDTYPYGLPTPPEMSPLDVLEPEQT
                     FFSSPCQEEHGHPRRIPHLPGHPYSPEYAPSPLHCSHPLGSLALGQSPGVSMMSPVPG
                     CPPSPAYYSPATYHPLHSNLQAHLGQLSPPPEHPGFDALDQLSQVELLGDMDRNEFDQ
                     YLNTPGHPDSATGAMALSGHVPVSQVTPTGPTETSLISVLADATATYYNSYSVS"
BASE COUNT          491 a          710 c          606 g          448 t
ORIGIN      
        1 ccgacccgtg cgagggccag gtccgcgcct gccccgccag gcgaagcgag gcgacccgcg
       61 tgcggccatg gcttcgctgc tgggagccta cccttggccc gagggtctcg agtgcccggc
      121 cctggacgcc gagctgtcgg atggacaatc gccgccggcc gtcccccggc ccccggggga
      181 caagggctcc gagagccgta tccggcggcc catgaacgcc ttcatggttt gggccaagga
      241 cgagaggaaa cggctggcag tgcagaaccc ggacctgcac aacgccgagc tcagcaagat
      301 gctgggaaag tcgtggaagg cgctgacgct gtcccagaag aggccgtacg tggacgaggc
      361 ggagcggctg cgcctgcagc acatgcagga ctaccccaac tacaagtacc ggccgcgcag
      421 gaagaagcag gccaagcggc tgtgcaagcg cgtggacccg ggcttccttc tgagctccct
      481 ctcccgggac cagaacgccc tgccggagaa gagaagcggc agccgggggg cgctggggga
      541 gaaggaggac aggggtgagt actcccccgg cactgccctg cccagcctcc ggggctgcta
      601 ccacgagggg ccggctggtg gtggcggcgg cggcaccccg agcagtgtgg acacgtaccc
      661 gtacgggctg cccacacctc ctgaaatgtc tcccctggac gtgctggagc cggagcagac
      721 cttcttctcc tccccctgcc aggaggagca tggccatccc cgccgcatcc cccacctgcc
      781 agggcacccg tactcaccgg agtacgcccc aagccctctc cactgtagcc accccctggg
      841 ctccctggcc cttggccagt cccccggcgt ctccatgatg tcccctgtac ccggctgtcc
      901 cccatctcct gcctattact ccccggccac ctaccaccca ctccactcca acctccaagc
      961 ccacctgggc cagctctccc cgcctcctga gcaccctggc ttcgacgccc tggatcaact
     1021 gagccaggtg gaactcctgg gggacatgga tcgcaatgaa ttcgaccagt atttgaacac
     1081 tcctggccac ccagactccg ccacaggggc catggccctc agtgggcatg ttccggtctc
     1141 ccaggtgaca ccaacgggtc ccacagagac cagcctcatc tccgtcctgg ctgatgccac
     1201 ggccacgtac tacaacagct acagtgtgtc atagagctgg aggcgccccg tccggtcagc
     1261 cctcgcgccc tctccttcct gtgccttgag tggcagagga gccgtccagc cacaccagct
     1321 ttcctcccac cgctcagggc agggaggtct gaactgcggc cccagagcct ttggcctaag
     1381 ctggactctc cttatccgag tgccgcctct atccccttcc ccacgttcca gcccctgcag
     1441 cccacatttt aagtatattc cttcaagtga gttttcctcc agcccctgag agttgctgtc
     1501 tcccagtgga atgttcactg acgtcttttc ttggtagcca tcatcgaaac taatgggggg
     1561 acagacttga tagccaaggt cccttctggt ccagttttct gatttagggt tctctcaaga
     1621 ttaataaagg aagatgggga aatttgactc attaatgagc tcgctaacct acgatctggt
     1681 gataattttg tgtgcacagc ccaaggacca cgaggctttc tgcactttct gcaccccctt
     1741 ccaaagtgac cacaaaattt caaagggact catacaattt gagaaaaaac agtcaacctg
     1801 atttgagaaa ttaaccagta tggctaacta tatcacagaa aatgggattg agttaaaact
     1861 attttatttt aaatatacat tttaaagcag ttcttttttt ttgttaattt gtttattata
     1921 cacacacttc aagagaatat gcacagtcta ggccgggcac ggtggctcac gcctgtaatc
     1981 ccagcacttt gggaggccga ggcatgtgga tcacctgagg tcaggagttt gagaccagcc
     2041 tagacaacat ggtgaaacct tgtctctatg aaaaatacaa aatttgctgg gagtggtggt
     2101 gcatgcctgt aatcccagct acttggaagg ctgaggcagg agaatgtctt gaacctagga
     2161 ggtggaggtt gcagtgagct gagattgcac cattgcactc cagcctgtgc aacaagagtg
     2221 aaactccatt tcaagaaaaa aaaaaaaaaa aaaaa
//