LOCUS BC029220 1496 bp mRNA linear HUM 30-SEP-2003
DEFINITION Homo sapiens SRY (sex determining region Y)-box 5, mRNA (cDNA clone
IMAGE:5169548), partial cds.
ACCESSION BC029220
VERSION BC029220.2
KEYWORDS .
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 1496)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 1496)
AUTHORS Strausberg,R.
TITLE Direct Submission
JOURNAL Submitted (01-MAY-2002) National Institutes of Health, Mammalian
Gene Collection (MGC), Cancer Genomics Office, National Cancer
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Sep 16, 2003 this sequence version replaced BC029220.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: Life Technologies, Inc.
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Baylor College of Medicine Human Genome
Sequencing Center
Center code: BCM-HGSC
Web site: http://www.hgsc.bcm.tmc.edu/cdna/
Contact: amg@bcm.tmc.edu
Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
A.N., Gibbs, R.A.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 51 Row: c Column: 10.
FEATURES Location/Qualifiers
source 1..1496
/db_xref="H-InvDB:HIT000040707"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="IMAGE:5169548"
/tissue_type="Brain, adult medulla"
/clone_lib="NIH_MGC_119"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene <1..1496
/gene="SOX5"
/gene_synonym="L-SOX5"
/gene_synonym="MGC35153"
/db_xref="GeneID:6660"
/db_xref="MIM:604975"
CDS <1..1132
/gene="SOX5"
/gene_synonym="L-SOX5"
/gene_synonym="MGC35153"
/codon_start=2
/product="SOX5 protein"
/protein_id="AAH29220.2"
/db_xref="GeneID:6660"
/db_xref="MIM:604975"
/translation="HDEVAQPLNLSAKPKTSDGKSPTSPTSPHMPALRINSGAGPLKA
SVPAALASPSARVSTIGYLNDHDAVTKAIQEARQMKEQLRREQQVLDGKVAVVNSLGL
NNCRTEKEKTTLESLTQQLAVKQNEEGKFSHAMMDFNLSGDSDGSAGVSESRIYRESR
GRGSNEPHIKRPMNAFMVWAKDERRKILQAFPDMHNSNISKILGSRWKAMTNLEKQPY
YEEQARLSKQHLEKYPDYKYKPRPKRTCLVDGKKLRIGEYKAIMRNRRQEMRQYFNVG
QQAQIPIATAGVVYPGAIAMAGMPSPHLPSEHSSVSSSPEPGMPVIQSTYGVKGEEPH
IKEEIQAEDINGEIYDEYDEEEDDPDVDYGSDSENHIAGQAN"
misc_feature 506..694
/gene="SOX5"
/gene_synonym="L-SOX5"
/gene_synonym="MGC35153"
/note="HMG_box; Region: HMG (high mobility group) box"
/db_xref="CDD:pfam00505"
BASE COUNT 496 a 337 c 348 g 315 t
ORIGIN
1 gcatgatgaa gtggcacagc cactgaacct atcagctaaa cccaagacct ctgatggcaa
61 atcacccaca tcacccacct ctccccatat gccagctctg agaataaaca gtggggcagg
121 ccccctcaaa gcctctgtcc cagcagcgtt agctagtcct tcagccagag ttagcacaat
181 aggttactta aatgaccatg atgctgtcac caaggcaatc caagaagctc ggcaaatgaa
241 ggagcaactc cgacgggaac aacaggtgct tgatgggaag gtggctgttg tgaatagtct
301 gggtctcaat aactgccgaa cagaaaagga aaaaacaaca ctggagagtc tgactcagca
361 actggcagtt aaacagaatg aagaaggaaa atttagccat gcaatgatgg atttcaatct
421 gagtggagat tctgatggaa gtgctggagt ctcagagtca agaatttata gggaatcccg
481 agggcgtggt agcaatgaac cccacataaa gcgtccaatg aatgccttca tggtgtgggc
541 taaagatgaa cggagaaaga tccttcaagc ctttcctgac atgcacaact ccaacatcag
601 caagatattg ggatctcgct ggaaagctat gacaaaccta gagaaacagc catattatga
661 ggagcaagcc cgtctcagca agcagcacct ggagaagtac cctgactata agtacaagcc
721 caggccaaag cgcacctgcc tggtggatgg caaaaagctg cgcattggtg aatacaaggc
781 aatcatgcgc aacaggcggc aggaaatgcg gcagtacttc aatgttgggc aacaagcaca
841 gatccccatt gccactgctg gtgttgtgta ccctggagcc atcgccatgg ctgggatgcc
901 ctcccctcac ctgccctcgg agcactcaag cgtgtctagc agcccagagc ctgggatgcc
961 tgttatccag agcacttacg gtgtgaaagg agaggagcca catatcaaag aagagataca
1021 ggccgaggac atcaatggag aaatttatga tgagtacgac gaggaagagg atgatccaga
1081 tgtagattat gggagtgaca gtgaaaacca tattgcagga caagccaact gataagggtc
1141 aaaagattgt tgtgacctta ggacttaaag aagccctaac tggttcatcc ttaccagtgg
1201 ccaagcacat taactttctc atacactgac tgttacttta actgttagtc ttaaatagtt
1261 gggacatcag ctgactaata gacctcagcc tcaaaaggct tggaaagaaa aaacaaatac
1321 aacaagcaaa caacaatatc aacaacaaga gattgaaata agctatgggt aaaataatgc
1381 cagtaattca gctgctacat ccaagcactg aagtcttacc cgtcaacttt tttttttttt
1441 ttaaataaac tttatggctg tttgttctaa aaaaaaaaaa aaaaaaaaaa aaaagg
//