LOCUS BC047665 1494 bp mRNA linear HUM 30-SEP-2003
DEFINITION Homo sapiens SRY (sex determining region Y)-box 5, mRNA (cDNA clone
IMAGE:5743782), partial cds.
ACCESSION BC047665
VERSION BC047665.2
KEYWORDS .
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 1494)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 1494)
AUTHORS Strausberg,R.
TITLE Direct Submission
JOURNAL Submitted (03-MAR-2003) National Institutes of Health, Mammalian
Gene Collection (MGC), Cancer Genomics Office, National Cancer
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Sep 16, 2003 this sequence version replaced BC047665.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: Life Technologies, Inc.
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Sequencing Group at the Stanford Human Genome
Center, Stanford University School of Medicine, Stanford, CA 94305
Web site: http://www-shgc.stanford.edu
Contact: (Dickson, Mark) mcd@paxil.stanford.edu
Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
R. M.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 98 Row: d Column: 18
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 23308712.
FEATURES Location/Qualifiers
source 1..1494
/db_xref="H-InvDB:HIT000053189"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="IMAGE:5743782"
/tissue_type="Brain, adult medulla"
/clone_lib="NIH_MGC_119"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene <1..1494
/gene="SOX5"
/gene_synonym="L-SOX5"
/gene_synonym="MGC35153"
/db_xref="GeneID:6660"
/db_xref="MIM:604975"
CDS <1..1132
/gene="SOX5"
/gene_synonym="L-SOX5"
/gene_synonym="MGC35153"
/codon_start=2
/product="SOX5 protein"
/protein_id="AAH47665.2"
/db_xref="GeneID:6660"
/db_xref="MIM:604975"
/translation="HDEVAQPLNLSAKPKTSDGKSPTSPTSPHMPALRINSGAGPLKA
SVPAALASPSARVSTIGYLNDHDAVTKAIQEARQMKEQLRREQQVLDGKVAVVNSLGL
NNCRTEKEKTTLESLTQQLAVKQNEEGKFSHAMMDFNLSGDSDGSAGVSESRIYRESR
GRGSNEPHIKRPMNAFMVWAKDERRKILQAFPDMHNSNISKILGSRWKAMTNLEKQPY
YEEQARLSKQHLEKYPDYKYKPRPKRTCLVDGKKLRIGEYKAIMRNRRQEMRQYFNVG
QQAQIPIATAGVVYPGAIAMAGMPSPHLPSEHSSVSSSPEPGMPVIQSTYGVKGEEPH
IKEEIQAEDINGEIYDEYDEEEDDPDVDYGSDSENHIAGQAN"
misc_feature 506..694
/gene="SOX5"
/gene_synonym="L-SOX5"
/gene_synonym="MGC35153"
/note="HMG_box; Region: HMG (high mobility group) box"
/db_xref="CDD:pfam00505"
BASE COUNT 496 a 338 c 346 g 314 t
ORIGIN
1 gcatgatgaa gtggcacagc cactgaacct atcagctaaa cccaagacct ctgatggcaa
61 atcacccaca tcacccacct ctccccatat gccagctctg agaataaaca gtggggcagg
121 ccccctcaaa gcctctgtcc cagcagcgtt agctagtcct tcagccagag ttagcacaat
181 aggttactta aatgaccatg atgctgtcac caaggcaatc caagaagctc ggcaaatgaa
241 ggagcaactc cgacgggaac aacaggtgct tgatgggaag gtggctgttg tgaatagtct
301 gggtctcaat aactgccgaa cagaaaagga aaaaacaaca ctggagagtc tgactcagca
361 actggcagtt aaacagaatg aagaaggaaa atttagccat gcaatgatgg atttcaatct
421 gagtggagat tctgatggaa gtgctggagt ctcagagtca agaatttata gggaatcccg
481 agggcgtggt agcaatgaac cccacataaa gcgtccaatg aatgccttca tggtgtgggc
541 taaagatgaa cggagaaaga tccttcaagc ctttcctgac atgcacaact ccaacatcag
601 caagatattg ggatctcgct ggaaagctat gacaaaccta gagaaacagc catattatga
661 ggagcaagcc cgtctcagca agcagcacct ggagaagtac cctgactata agtacaagcc
721 caggccaaag cgcacctgcc tggtggatgg caaaaagctg cgcattggtg aatacaaggc
781 aatcatgcgc aacaggcggc aggaaatgcg gcagtacttc aatgttgggc aacaagcaca
841 gatccccatt gccactgctg gtgttgtgta ccctggagcc atcgccatgg ctgggatgcc
901 ctcccctcac ctgccctcgg agcactcaag cgtgtctagc agcccagagc ctgggatgcc
961 tgttatccag agcacttacg gtgtgaaagg agaggagcca catatcaaag aagagataca
1021 ggccgaggac atcaatggag aaatttatga tgagtacgac gaggaagagg atgatccaga
1081 tgtagattat gggagtgaca gtgaaaacca tattgcagga caagccaact gataagggtc
1141 aaaagattgt tgtgacctta ggacttaaag aagccctaac tggttcatcc ttaccagtgg
1201 ccaagcacat taactttctc atacactgac tgttacttta actgttagtc ttaaatagtt
1261 gggacatcag ctgactaata gacctcagcc tcaaaaggct tggaaagaaa aaacaaatac
1321 aacaagcaaa caacaatatc aacaacaaga gattgaaata agctatgggt aaaataatgc
1381 cagtaattca gctgctacat ccaagcactg aagtcttacc cgtcaacttt tttttttttt
1441 taaataaact ttatggctgt ttgttctaca aaaaaaaaaa aaaaaaaaaa aaaa
//