LOCUS BC047665 1494 bp mRNA linear HUM 30-SEP-2003 DEFINITION Homo sapiens SRY (sex determining region Y)-box 5, mRNA (cDNA clone IMAGE:5743782), partial cds. ACCESSION BC047665 VERSION BC047665.2 KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1494) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 1494) AUTHORS Strausberg,R. TITLE Direct Submission JOURNAL Submitted (03-MAR-2003) National Institutes of Health, Mammalian Gene Collection (MGC), Cancer Genomics Office, National Cancer Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Sep 16, 2003 this sequence version replaced BC047665.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Life Technologies, Inc. cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Sequencing Group at the Stanford Human Genome Center, Stanford University School of Medicine, Stanford, CA 94305 Web site: http://www-shgc.stanford.edu Contact: (Dickson, Mark) mcd@paxil.stanford.edu Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers, R. M. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 98 Row: d Column: 18 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 23308712. FEATURES Location/Qualifiers source 1..1494 /db_xref="H-InvDB:HIT000053189" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="IMAGE:5743782" /tissue_type="Brain, adult medulla" /clone_lib="NIH_MGC_119" /lab_host="DH10B" /note="Vector: pCMV-SPORT6" gene <1..1494 /gene="SOX5" /gene_synonym="L-SOX5" /gene_synonym="MGC35153" /db_xref="GeneID:6660" /db_xref="MIM:604975" CDS <1..1132 /gene="SOX5" /gene_synonym="L-SOX5" /gene_synonym="MGC35153" /codon_start=2 /product="SOX5 protein" /protein_id="AAH47665.2" /db_xref="GeneID:6660" /db_xref="MIM:604975" /translation="HDEVAQPLNLSAKPKTSDGKSPTSPTSPHMPALRINSGAGPLKA SVPAALASPSARVSTIGYLNDHDAVTKAIQEARQMKEQLRREQQVLDGKVAVVNSLGL NNCRTEKEKTTLESLTQQLAVKQNEEGKFSHAMMDFNLSGDSDGSAGVSESRIYRESR GRGSNEPHIKRPMNAFMVWAKDERRKILQAFPDMHNSNISKILGSRWKAMTNLEKQPY YEEQARLSKQHLEKYPDYKYKPRPKRTCLVDGKKLRIGEYKAIMRNRRQEMRQYFNVG QQAQIPIATAGVVYPGAIAMAGMPSPHLPSEHSSVSSSPEPGMPVIQSTYGVKGEEPH IKEEIQAEDINGEIYDEYDEEEDDPDVDYGSDSENHIAGQAN" misc_feature 506..694 /gene="SOX5" /gene_synonym="L-SOX5" /gene_synonym="MGC35153" /note="HMG_box; Region: HMG (high mobility group) box" /db_xref="CDD:pfam00505" BASE COUNT 496 a 338 c 346 g 314 t ORIGIN 1 gcatgatgaa gtggcacagc cactgaacct atcagctaaa cccaagacct ctgatggcaa 61 atcacccaca tcacccacct ctccccatat gccagctctg agaataaaca gtggggcagg 121 ccccctcaaa gcctctgtcc cagcagcgtt agctagtcct tcagccagag ttagcacaat 181 aggttactta aatgaccatg atgctgtcac caaggcaatc caagaagctc ggcaaatgaa 241 ggagcaactc cgacgggaac aacaggtgct tgatgggaag gtggctgttg tgaatagtct 301 gggtctcaat aactgccgaa cagaaaagga aaaaacaaca ctggagagtc tgactcagca 361 actggcagtt aaacagaatg aagaaggaaa atttagccat gcaatgatgg atttcaatct 421 gagtggagat tctgatggaa gtgctggagt ctcagagtca agaatttata gggaatcccg 481 agggcgtggt agcaatgaac cccacataaa gcgtccaatg aatgccttca tggtgtgggc 541 taaagatgaa cggagaaaga tccttcaagc ctttcctgac atgcacaact ccaacatcag 601 caagatattg ggatctcgct ggaaagctat gacaaaccta gagaaacagc catattatga 661 ggagcaagcc cgtctcagca agcagcacct ggagaagtac cctgactata agtacaagcc 721 caggccaaag cgcacctgcc tggtggatgg caaaaagctg cgcattggtg aatacaaggc 781 aatcatgcgc aacaggcggc aggaaatgcg gcagtacttc aatgttgggc aacaagcaca 841 gatccccatt gccactgctg gtgttgtgta ccctggagcc atcgccatgg ctgggatgcc 901 ctcccctcac ctgccctcgg agcactcaag cgtgtctagc agcccagagc ctgggatgcc 961 tgttatccag agcacttacg gtgtgaaagg agaggagcca catatcaaag aagagataca 1021 ggccgaggac atcaatggag aaatttatga tgagtacgac gaggaagagg atgatccaga 1081 tgtagattat gggagtgaca gtgaaaacca tattgcagga caagccaact gataagggtc 1141 aaaagattgt tgtgacctta ggacttaaag aagccctaac tggttcatcc ttaccagtgg 1201 ccaagcacat taactttctc atacactgac tgttacttta actgttagtc ttaaatagtt 1261 gggacatcag ctgactaata gacctcagcc tcaaaaggct tggaaagaaa aaacaaatac 1321 aacaagcaaa caacaatatc aacaacaaga gattgaaata agctatgggt aaaataatgc 1381 cagtaattca gctgctacat ccaagcactg aagtcttacc cgtcaacttt tttttttttt 1441 taaataaact ttatggctgt ttgttctaca aaaaaaaaaa aaaaaaaaaa aaaa //