LOCUS BC013923 1172 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens SRY (sex determining region Y)-box 2, mRNA (cDNA clone
MGC:2413 IMAGE:2823424), complete cds.
ACCESSION BC013923
VERSION BC013923.2
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 1172)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 1172)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (07-SEP-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Aug 19, 2003 this sequence version replaced BC013923.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: DCTD/DTP
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Institute for Systems Biology
http://www.systemsbiology.org
contact: amadan@systemsbiology.org
Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 2 Row: b Column: 15
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 29826338.
FEATURES Location/Qualifiers
source 1..1172
/db_xref="H-InvDB:HIT000036409"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:2413 IMAGE:2823424"
/tissue_type="Lung, small cell carcinoma"
/clone_lib="NIH_MGC_7"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..1172
/gene="SOX2"
/gene_synonym="ANOP3"
/gene_synonym="MGC2413"
/db_xref="GeneID:6657"
/db_xref="HGNC:HGNC:11195"
/db_xref="MIM:184429"
CDS 76..1029
/gene="SOX2"
/gene_synonym="ANOP3"
/gene_synonym="MGC2413"
/codon_start=1
/product="SRY (sex determining region Y)-box 2"
/protein_id="AAH13923.1"
/db_xref="GeneID:6657"
/db_xref="HGNC:HGNC:11195"
/db_xref="MIM:184429"
/translation="MYNMMETELKPPGPQQTSGGGGGNSTAAAAGGNQKNSPDRVKRP
MNAFMVWSRGQRRKMAQENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKRLRALHM
KEHPDYKYRPRRKTKTLMKKDKYTLPGGLLAPGGNSMASGVGVGAGLGAGVNQRMDSY
AHMNGWSNGSYSMMQDQLGYPQHPGLNAHGAAQMQPMHRYDVSALQYNSMTSSQTYMN
GSPTYSMSYSQQGTPGMALGSMGSVVKSEASSSPPVVTSSSHSRAPCQAGDLRDMISM
YLPGAEVPEPAAPSRLHMSQHYQSGPVPGTAINGTLPLSHM"
BASE COUNT 281 a 377 c 376 g 138 t
ORIGIN
1 cccgggcccc ccaaagtccc ggccgggccg agggtcggcg gccgccggcg ggccgggccc
61 gcgcacagcg cccgcatgta caacatgatg gagacggagc tgaagccgcc gggcccgcag
121 caaacttcgg ggggcggcgg cggcaactcc accgcggcgg cggccggcgg caaccagaaa
181 aacagcccgg accgcgtcaa gcggcccatg aatgccttca tggtgtggtc ccgcgggcag
241 cggcgcaaga tggcccagga gaaccccaag atgcacaact cggagatcag caagcgcctg
301 ggcgccgagt ggaaactttt gtcggagacg gagaagcggc cgttcatcga cgaggctaag
361 cggctgcgag cgctgcacat gaaggagcac ccggattata aataccggcc ccggcggaaa
421 accaagacgc tcatgaagaa ggataagtac acgctgcccg gcgggctgct ggcccccggc
481 ggcaatagca tggcgagcgg ggtcggggtg ggcgccggcc tgggcgcggg cgtgaaccag
541 cgcatggaca gttacgcgca catgaacggc tggagcaacg gcagctacag catgatgcag
601 gaccagctgg gctacccgca gcacccgggc ctcaatgcgc acggcgcagc gcagatgcag
661 cccatgcacc gctacgacgt gagcgccctg cagtacaact ccatgaccag ctcgcagacc
721 tacatgaacg gctcgcccac ctacagcatg tcctactcgc agcagggcac ccctggcatg
781 gctcttggct ccatgggttc ggtggtcaag tccgaggcca gctccagccc ccctgtggtt
841 acctcttcct cccactccag ggcgccctgc caggccgggg acctccggga catgatcagc
901 atgtatctcc ccggcgccga ggtgccggaa cccgccgccc ccagcagact tcacatgtcc
961 cagcactacc agagcggccc ggtgcccggc acggccatta acggcacact gcccctctca
1021 cacatgtgag ggccggacag cgaactggag gggggagaaa ttttcaaaga aaaacgaggg
1081 aaatgggagg ggtgcaaaag aggagagtaa gaaacagcat ggagaaaacc cggtacgctc
1141 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa
//