LOCUS       BC031797                3069 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens SRY (sex determining region Y)-box 8, mRNA (cDNA clone
            MGC:24837 IMAGE:4937883), complete cds.
ACCESSION   BC031797
VERSION     BC031797.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3069)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 3069)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (06-JUN-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: David N. Louis, M.D.
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Baylor College of Medicine Human Genome
            Sequencing Center
            Center code: BCM-HGSC
            Web site: http://www.hgsc.bcm.tmc.edu/cdna/
            Contact: amg@bcm.tmc.edu
            Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
            Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
            A.N., Gibbs, R.A.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 29 Row: o Column: 6
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 30179902.
FEATURES             Location/Qualifiers
     source          1..3069
                     /db_xref="H-InvDB:HIT000041372"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:24837 IMAGE:4937883"
                     /tissue_type="Brain, anaplastic oligodendroglioma with
                     1p/19q loss"
                     /clone_lib="NCI_CGAP_Brn67"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..3069
                     /gene="SOX8"
                     /gene_synonym="MGC24837"
                     /db_xref="GeneID:30812"
                     /db_xref="HGNC:HGNC:11203"
                     /db_xref="MIM:605923"
     CDS             116..1456
                     /gene="SOX8"
                     /gene_synonym="MGC24837"
                     /codon_start=1
                     /product="SRY (sex determining region Y)-box 8"
                     /protein_id="AAH31797.1"
                     /db_xref="GeneID:30812"
                     /db_xref="HGNC:HGNC:11203"
                     /db_xref="MIM:605923"
                     /translation="MLDMSEARSQPPCSPSGTASSMSHVEDSDSDAPPSPAGSEGLGR
                     AGVAVGGARGDPAEAADERFPACIRDAVSQVLKGYDWSLVPMPVRGGGGGALKAKPHV
                     KRPMNAFMVWAQAARRKLADQYPHLHNAELSKTLGKLWRLLSESEKRPFVEEAERLRV
                     QHKKDHPDYKYQPRRRKSAKAGHSDSDSGAELGPHPGGGAVYKAEAGLGDGHHHGDHT
                     GQTHGPPTPPTTPKTELQQAGAKPELKLEGRRPVDSGRQNIDFSNVDISELSSEVMGT
                     MDAFDVHEFDQYLPLGGPAPPEPGQAYGGAYFHAGASPVWAHKSAPSASASPTETGPP
                     RPHIKTEQPSPGHYGDQPRGSPDYGSCSGQSSATPAAPAGPFAGSQGDYGDLQASSYY
                     GAYPGYAPGLYQYPCFHSPRRPYASPLLNGLALPPAHSPTSHWDQPVYTTLTRP"
BASE COUNT      574 a    997 c    948 g    550 t
ORIGIN
        1 ggcgagggtc ggggccaccg cgcggcgacc tcgggtcccg gagcgaccgc agggcagccc
       61 cgggcgccgg ccccggtgcg cgtctcctgt gcgcgcccct ccgcgcgcgg ccccgatgct
      121 ggacatgagc gaggcccgct cccagccgcc ctgcagcccg tccggcaccg ccagctccat
      181 gtcgcacgtg gaggactcgg actcggacgc gccgccgtct cccgccggct ccgagggcct
      241 gggccgcgcg ggggtcgcgg tggggggcgc ccggggcgac ccggcggagg cggcggacga
      301 gcgcttcccg gcctgcatcc gcgacgccgt gtcgcaggtg ctcaagggct acgactggag
      361 tctggtgccc atgccggtgc gcggcggcgg cggcggcgcg ctcaaagcca agccgcatgt
      421 gaagcggccc atgaacgcat tcatggtgtg ggcgcaggcg gcgcgccgca agctggccga
      481 ccagtacccg cacctgcaca acgccgagct cagcaagacg ctgggcaagc tgtggcgctt
      541 gctgagcgag agcgagaagc ggcccttcgt ggaggaggca gagcgccttc gcgtgcagca
      601 caagaaggac caccccgact acaagtacca gccacggcgc aggaagagcg ccaaagccgg
      661 ccacagcgac tccgactcgg gcgcggagct gggaccccac cctggcggcg gtgccgtgta
      721 caaggctgaa gcagggcttg gagatgggca ccaccatggc gaccacacag ggcagaccca
      781 cgggccgccc accccgccca ccacccccaa gacggagctg cagcaggcgg gcgccaagcc
      841 ggagctgaag ctggagggac gccggccggt ggacagcggg cgccagaaca tcgacttcag
      901 caacgtggac atctcggagc tcagcagcga ggtcatgggc accatggacg ccttcgacgt
      961 ccacgagttc gaccagtacc tgcccctggg cggccccgcc ccacccgagc cgggccaggc
     1021 ctacgggggc gcctacttcc acgccggggc gtcccccgtg tgggcccaca agagtgcccc
     1081 gtcggcctcc gcgtcgccca ccgagacggg tcccccacgg ccgcacatca agacggagca
     1141 gccgagcccc ggccactacg gcgaccagcc ccgaggctcg cccgactacg gttcctgcag
     1201 cggccagtcc agcgccaccc cggccgcccc cgccggcccc ttcgccggct cacagggcga
     1261 ctatggcgac ctgcaggcct ccagctacta tggtgcctac cctggctacg cacccggcct
     1321 ctaccagtac ccctgcttcc actcgccgcg ccggccctac gcctcacccc tgctcaacgg
     1381 cctggccctg ccgcccgccc acagccccac cagtcactgg gaccagccgg tgtacaccac
     1441 cctgaccagg ccctgagggc ccagccgcgg ggagggactc gcaggcgtca gggggcagcc
     1501 ttgtcccggc ccagtgtgtg tgaccggggc gggaggggcc ccagtggctg agctccaagt
     1561 gcctgctgaa gtctgcaggg aaacacgctt gctgcccgtg gccctcggcc tccagatggc
     1621 cacacctctg ccgacgacgg accagctccc tctcccttct atctttcttt ttgaggtggt
     1681 gggattattc cacaaagaag ggctgccgtt tggtccctct tctgtgagga ctggcggcac
     1741 cagcaccttc gctttgcatc tcggtagagg agaaacggca gcacagccca aggaccaaag
     1801 gagggggtgg caggggcctt gcagggcgct gtgaggtcca ggccggtctt ggcgccgaga
     1861 gcccctgcac tcaaggccac attccctcga caacggctgc acgggctgtc cgggatccgg
     1921 ggtgtctgtc cgcagactgg gatgagtcta ctcgagcatc tccgggacct gcctgtcaga
     1981 tctgaggtgt ctccttgctg gcagagtgcg ctcacgcgag ggctggctgt gatgaacaca
     2041 tctctctttt atttttatgt ttttgataat ttttattttt gaagcttaaa tgtgtttctt
     2101 ctgaaagctg ttaaagatgt atttatgttc tgtgttattt tatctttaat taatgaggta
     2161 attcgggcaa agagtagaat ttaagacaaa acggaagctg ggaagcttcc cttgagggca
     2221 ggcaggaggt ggagttgcag ctgttggccg gcatcacgtt gctcgttgct cggcttatgg
     2281 gaggccgccc tggagggccc ggaggtccca aggtccctgg gaggactggg cccctcatgc
     2341 ctcgagcttg gcaaccgaaa acccgaggga ggagaaggga cctgccttgt gacatctctg
     2401 atcaggttgg ggtgccccag cacccagtac tagtttgggg tttgggaagc aggactccgt
     2461 ccctgtcccc gactgtgcca cgtggtagga cacataggac acaggaattc ctgggtcctt
     2521 gcccatgact gtgccatgtg gtaggacaca ggacacagga attcctggaa agtggtggct
     2581 tcagaagtga tcttggctcg caggcaccag tgccacctac caagctgtga aactaaacct
     2641 tctccactaa acgtcgttag ggcctcagtt ctagacgagt catacctgat tcacctgcac
     2701 tgcttccccc gtgtgctgag catagagcat acaatagcgc ctacttcacg gaaacttgtg
     2761 cctttaaact ttgtaaactt aaacacagcc gagaagttgc ttctttgtac tttttctact
     2821 tttcctactt ttttgtagaa aaaaaagata atgcctctgc ttctatttct ctgggggtgg
     2881 gggtgggggc cgggagccgt cgcagacccg tttcatgcag cgtctccctc ggcaccgcgt
     2941 tcggaggacg caccctcact cccctgctgc cttcactcct ttctgaccaa gcaacgctaa
     3001 cttttgtaca gatcgatttg ataaaattaa acaaagtgct ttttatggaa aaaaaaaaaa
     3061 aaaaaaaaa
//