LOCUS BC060773 2814 bp mRNA linear HUM 28-JUL-2005 DEFINITION Homo sapiens SRY (sex determining region Y)-box 5, mRNA (cDNA clone MGC:71528 IMAGE:30343519), complete cds. ACCESSION BC060773 VERSION BC060773.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2814) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2814) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (03-NOV-2003) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Dr. Stefan Hansson cDNA Library Preparation: Michael Brownstein / Ted Usdin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Sequencing Group at the Stanford Human Genome Center, Stanford University School of Medicine, Stanford, CA 94305 Web site: http://www-shgc.stanford.edu Contact: (Dickson, Mark) mcd@paxil.stanford.edu Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers, R. M. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 133 Row: f Column: 2 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 30061560. FEATURES Location/Qualifiers source 1..2814 /db_xref="H-InvDB:HIT000260075" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:71528 IMAGE:30343519" /tissue_type="Placenta, normal" /clone_lib="NIH_MGC_147" /lab_host="DH10B" /note="Vector: pBluescriptR" gene 1..2814 /gene="SOX5" /gene_synonym="L-SOX5" /gene_synonym="MGC35153" /db_xref="GeneID:6660" /db_xref="MIM:604975" CDS 451..2379 /gene="SOX5" /gene_synonym="L-SOX5" /gene_synonym="MGC35153" /codon_start=1 /product="SOX5 protein" /protein_id="AAH60773.1" /db_xref="GeneID:6660" /db_xref="MIM:604975" /translation="MSSKRPASPYGEADGEVAMVTSRQKVEEEESDGLPAFHLPLHVS FPNKPHSEEFQPVSLLTQETCGHRTPTSQHNTMEVDGNKVMSSFAPHNSSTSPQKAEE GGRQSGESLSSTALGTPERRKGSLADVVDTLKQRKMEELIKNEPEETPSIEKLLSKDW KDKLLAMGSGNFGEIKGTPESLAEKERQLMGMINQLTSLREQLLAAHDEQKKLAASQI EKQRQQMELAKQQQEQIARQQQQLLQQQHKINLLQQQIQVQGQLPPLMIPVFPPDQRT LAAAAQQGFLLPPGFSYKAGCSDPYPVQLIPTTMAAAAAATPGLGPLQLQQLYAAQLA AMQVSPGGKLPGIPQGNLGAAVSPTSIHTDKSTNSPPPKSKEKTTLESLTQQLAVKQN EEGKFSHAMMDFNLSGDSDGSAGVSESRIYRESRGRGSNEPHIKRPMNAFMVWAKDER RKILQAFPDMHNSNISKILGSRWKAMTNLEKQPYYEEQARLSKQHLEKYPDYKYKPRP KRTCLVDGKKLRIGEYKAIMRNRRQEMRQYFNVGQQAQIPIATAGVVYPGAIAMAGMP SPHLPSEHSSVSSSPEPGMPVIQSTYGVKGEEPHIKEEIQAEDINGEIYDEYDEEEDD PDVDYGSDSENHIAGQAN" BASE COUNT 861 a 677 c 680 g 596 t ORIGIN 1 gagagtgaaa aaggcgagcc accaaaaccc atctccagtc tcctcccggg ggcccccagc 61 ccgcctctgt gccactttgc atcccacgcc ggaggaggca ttaacgagac cgggtaaggc 121 tttttaaacg gtccaaggtg tagagccata cttcaggagg atcctcagaa gttttggaca 181 agcctcccca aatgtggcag gtgctgtgct ggccattggt gacccaaaga tgatgaaaaa 241 tatgttcctg cccacaagga gttagcgacc tactgggctt tcctcttgct gatgacatga 301 ttcctgtttg aatctgttga caagattctg aaagctgaac agagaattct ggcactgcac 361 tgggtaggaa aaagcatttc aagaaataga taatatcaag gacatcagga caccgggagt 421 gggagagatt ggactgggag actcagcagg atgtcttcca agcgaccagc ctctccgtat 481 ggggaagcag atggagaggt agccatggtg acaagcagac agaaagtgga agaagaggag 541 agtgacgggc tcccagcctt tcaccttccc ttgcatgtga gttttcccaa caagcctcac 601 tctgaggaat ttcagccagt ttctctgctg acgcaagaga cttgtggcca taggactccc 661 acttctcagc acaatacaat ggaagttgat ggcaataaag ttatgtcttc atttgcccca 721 cacaactcat ctacctcacc tcagaaggca gaagaaggtg ggcgacagag tggcgagtcc 781 ttgtctagta cagccctggg aactcctgaa cggcgcaagg gcagtttagc tgatgttgtt 841 gacaccttga agcagaggaa aatggaagag ctcatcaaaa acgagccgga agaaaccccc 901 agtattgaaa aactactctc aaaggactgg aaagacaagc ttcttgcaat gggatcgggg 961 aactttggcg aaataaaagg gactcccgag agcttagctg agaaagaaag gcaactcatg 1021 ggtatgatca accagctgac cagcctccga gagcagctgt tggctgccca cgatgagcag 1081 aagaaactag ctgcctctca gattgagaaa cagcgtcagc aaatggagct ggccaagcag 1141 caacaagaac aaattgcaag acagcagcag cagcttctac agcaacaaca caaaatcaat 1201 ttgctccagc aacagatcca ggttcaaggt cagctgccgc cattaatgat tcccgtattc 1261 cctcctgatc aacggacact ggctgcagct gcccagcaag gattcctcct ccctccaggc 1321 ttcagctata aggctggatg tagtgaccct taccctgttc agctgatccc aactaccatg 1381 gcagctgctg ccgcagcaac accaggctta ggcccactcc aactgcagca gttatatgct 1441 gcccagctag ctgcaatgca ggtatctcca ggagggaagc tgccaggcat accccaaggc 1501 aaccttggtg ctgctgtatc tcctaccagc attcacacag acaagagcac aaacagccca 1561 ccacccaaaa gcaaggaaaa aacaacactg gagagtctga ctcagcaact ggcagttaaa 1621 cagaatgaag aaggaaaatt tagccatgca atgatggatt tcaatctgag tggagattct 1681 gatggaagtg ctggagtctc agagtcaaga atttataggg aatcccgagg gcgtggtagc 1741 aatgaacccc acataaagcg tccaatgaat gccttcatgg tgtgggctaa agatgaacgg 1801 agaaagatcc ttcaagcctt tcctgacatg cacaactcca acatcagcaa gatattggga 1861 tctcgctgga aagctatgac aaacctagag aaacagccat attatgagga gcaagcccgt 1921 ctcagcaagc agcacctgga gaagtaccct gactataagt acaagcccag gccaaagcgc 1981 acctgcctgg tggatggcaa aaagctgcgc attggtgaat acaaggcaat catgcgcaac 2041 aggcggcagg aaatgcggca gtacttcaat gttgggcaac aagcacagat ccccattgcc 2101 actgctggtg ttgtgtaccc tggagccatc gccatggctg ggatgccctc ccctcacctg 2161 ccctcggagc actcaagcgt gtctagcagc ccagagcctg ggatgcctgt tatccagagc 2221 acttacggtg tgaaaggaga ggagccacat atcaaagaag agatacaggc cgaggacatc 2281 aatggagaaa tttatgatga gtacgacgag gaagaggatg atccagatgt agattatggg 2341 agtgacagtg aaaaccatat tgcaggacaa gccaactgat aagggtcaaa agattgttgt 2401 gaccttagga cttaaagaag ccctaactgg ttcatcctta ccagtggcca agcacattaa 2461 ctttctcata cactgactgt tactttaact gttagtctta aatagttggg acatcagctg 2521 actaatagac ctcagcctca aaaggcttgg aaagaaaaaa caaatacaac aagcaaacaa 2581 caatatcaac aacaagagat tgaaataagc tatgggtaaa ataatgccag taattcagct 2641 gctacatcca agcactgaag tcttacccgt caactttttt tttttttaaa taaactttat 2701 ggctgtttgt tctacaatgt tctagaaatt ctcactcagg tacacagtgc caacaagtgg 2761 cttgtgaatg tgttttgttg ttttgtgcta caatttttaa aaaaaaaaaa aaaa //