LOCUS       BC071947                2255 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens SRY (sex determining region Y)-box 7, mRNA (cDNA clone
            MGC:88634 IMAGE:6459570), complete cds.
ACCESSION   BC071947
VERSION     BC071947.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2255)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2255)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (01-JUN-2004) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: DCTD/DTP/Gazdar
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA 94305
            Web site: http://www-shgc.stanford.edu
            Contact: (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
            R. M.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 58 Row: n Column: 22
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 30581119.
FEATURES             Location/Qualifiers
     source          1..2255
                     /db_xref="H-InvDB:HIT000264754"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:88634 IMAGE:6459570"
                     /tissue_type="Lung, large cell carcinoma"
                     /clone_lib="NIH_MGC_18"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..2255
                     /gene="SOX7"
                     /gene_synonym="MGC10895"
                     /db_xref="GeneID:83595"
                     /db_xref="HGNC:HGNC:18196"
     CDS             68..1234
                     /gene="SOX7"
                     /gene_synonym="MGC10895"
                     /codon_start=1
                     /product="SRY (sex determining region Y)-box 7"
                     /protein_id="AAH71947.1"
                     /db_xref="GeneID:83595"
                     /db_xref="HGNC:HGNC:18196"
                     /translation="MASLLGAYPWPEGLECPALDAELSDGQSPPAVPRPPGDKGSESR
                     IRRPMNAFMVWAKDERKRLAVQNPDLHNAELSKMLGKSWKALTLSQKRPYVDEAERLR
                     LQHMQDYPNYKYRPRRKKQAKRLCKRVDPGFLLSSLSRDQNALPEKRSGSRGALGEKE
                     DRGEYSPGTALPSLRGCYHEGPAGGGGGGTPSSVDTYPYGLPTPPEMSPLDVLEPEQT
                     FFSSPCQEEHGHPRRIPHLPGHPYSPEYAPSPLHCSHPLGSLALGQSPGVSMMSPVPG
                     CPPSPAYYSPATYHPLHSNLQAHLGQLSPPPEHPGFDALDQLSQVELLGDMDRNEFDQ
                     YLNTPGHPDSATGAMALSGHVPVSQVTPTGPTETSLISVLADATATYYNSYSVS"
BASE COUNT      491 a    710 c    606 g    448 t
ORIGIN
        1 ccgacccgtg cgagggccag gtccgcgcct gccccgccag gcgaagcgag gcgacccgcg
       61 tgcggccatg gcttcgctgc tgggagccta cccttggccc gagggtctcg agtgcccggc
      121 cctggacgcc gagctgtcgg atggacaatc gccgccggcc gtcccccggc ccccggggga
      181 caagggctcc gagagccgta tccggcggcc catgaacgcc ttcatggttt gggccaagga
      241 cgagaggaaa cggctggcag tgcagaaccc ggacctgcac aacgccgagc tcagcaagat
      301 gctgggaaag tcgtggaagg cgctgacgct gtcccagaag aggccgtacg tggacgaggc
      361 ggagcggctg cgcctgcagc acatgcagga ctaccccaac tacaagtacc ggccgcgcag
      421 gaagaagcag gccaagcggc tgtgcaagcg cgtggacccg ggcttccttc tgagctccct
      481 ctcccgggac cagaacgccc tgccggagaa gagaagcggc agccgggggg cgctggggga
      541 gaaggaggac aggggtgagt actcccccgg cactgccctg cccagcctcc ggggctgcta
      601 ccacgagggg ccggctggtg gtggcggcgg cggcaccccg agcagtgtgg acacgtaccc
      661 gtacgggctg cccacacctc ctgaaatgtc tcccctggac gtgctggagc cggagcagac
      721 cttcttctcc tccccctgcc aggaggagca tggccatccc cgccgcatcc cccacctgcc
      781 agggcacccg tactcaccgg agtacgcccc aagccctctc cactgtagcc accccctggg
      841 ctccctggcc cttggccagt cccccggcgt ctccatgatg tcccctgtac ccggctgtcc
      901 cccatctcct gcctattact ccccggccac ctaccaccca ctccactcca acctccaagc
      961 ccacctgggc cagctctccc cgcctcctga gcaccctggc ttcgacgccc tggatcaact
     1021 gagccaggtg gaactcctgg gggacatgga tcgcaatgaa ttcgaccagt atttgaacac
     1081 tcctggccac ccagactccg ccacaggggc catggccctc agtgggcatg ttccggtctc
     1141 ccaggtgaca ccaacgggtc ccacagagac cagcctcatc tccgtcctgg ctgatgccac
     1201 ggccacgtac tacaacagct acagtgtgtc atagagctgg aggcgccccg tccggtcagc
     1261 cctcgcgccc tctccttcct gtgccttgag tggcagagga gccgtccagc cacaccagct
     1321 ttcctcccac cgctcagggc agggaggtct gaactgcggc cccagagcct ttggcctaag
     1381 ctggactctc cttatccgag tgccgcctct atccccttcc ccacgttcca gcccctgcag
     1441 cccacatttt aagtatattc cttcaagtga gttttcctcc agcccctgag agttgctgtc
     1501 tcccagtgga atgttcactg acgtcttttc ttggtagcca tcatcgaaac taatgggggg
     1561 acagacttga tagccaaggt cccttctggt ccagttttct gatttagggt tctctcaaga
     1621 ttaataaagg aagatgggga aatttgactc attaatgagc tcgctaacct acgatctggt
     1681 gataattttg tgtgcacagc ccaaggacca cgaggctttc tgcactttct gcaccccctt
     1741 ccaaagtgac cacaaaattt caaagggact catacaattt gagaaaaaac agtcaacctg
     1801 atttgagaaa ttaaccagta tggctaacta tatcacagaa aatgggattg agttaaaact
     1861 attttatttt aaatatacat tttaaagcag ttcttttttt ttgttaattt gtttattata
     1921 cacacacttc aagagaatat gcacagtcta ggccgggcac ggtggctcac gcctgtaatc
     1981 ccagcacttt gggaggccga ggcatgtgga tcacctgagg tcaggagttt gagaccagcc
     2041 tagacaacat ggtgaaacct tgtctctatg aaaaatacaa aatttgctgg gagtggtggt
     2101 gcatgcctgt aatcccagct acttggaagg ctgaggcagg agaatgtctt gaacctagga
     2161 ggtggaggtt gcagtgagct gagattgcac cattgcactc cagcctgtgc aacaagagtg
     2221 aaactccatt tcaagaaaaa aaaaaaaaaa aaaaa
//