LOCUS BC013923 1172 bp mRNA linear HUM 15-JUL-2006 DEFINITION Homo sapiens SRY (sex determining region Y)-box 2, mRNA (cDNA clone MGC:2413 IMAGE:2823424), complete cds. ACCESSION BC013923 VERSION BC013923.2 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1172) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 1172) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (07-SEP-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Aug 19, 2003 this sequence version replaced BC013923.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: DCTD/DTP cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Institute for Systems Biology http://www.systemsbiology.org contact: amadan@systemsbiology.org Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 2 Row: b Column: 15 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 29826338. FEATURES Location/Qualifiers source 1..1172 /db_xref="H-InvDB:HIT000036409" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:2413 IMAGE:2823424" /tissue_type="Lung, small cell carcinoma" /clone_lib="NIH_MGC_7" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..1172 /gene="SOX2" /gene_synonym="ANOP3" /gene_synonym="MGC2413" /db_xref="GeneID:6657" /db_xref="HGNC:HGNC:11195" /db_xref="MIM:184429" CDS 76..1029 /gene="SOX2" /gene_synonym="ANOP3" /gene_synonym="MGC2413" /codon_start=1 /product="SRY (sex determining region Y)-box 2" /protein_id="AAH13923.1" /db_xref="GeneID:6657" /db_xref="HGNC:HGNC:11195" /db_xref="MIM:184429" /translation="MYNMMETELKPPGPQQTSGGGGGNSTAAAAGGNQKNSPDRVKRP MNAFMVWSRGQRRKMAQENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKRLRALHM KEHPDYKYRPRRKTKTLMKKDKYTLPGGLLAPGGNSMASGVGVGAGLGAGVNQRMDSY AHMNGWSNGSYSMMQDQLGYPQHPGLNAHGAAQMQPMHRYDVSALQYNSMTSSQTYMN GSPTYSMSYSQQGTPGMALGSMGSVVKSEASSSPPVVTSSSHSRAPCQAGDLRDMISM YLPGAEVPEPAAPSRLHMSQHYQSGPVPGTAINGTLPLSHM" BASE COUNT 281 a 377 c 376 g 138 t ORIGIN 1 cccgggcccc ccaaagtccc ggccgggccg agggtcggcg gccgccggcg ggccgggccc 61 gcgcacagcg cccgcatgta caacatgatg gagacggagc tgaagccgcc gggcccgcag 121 caaacttcgg ggggcggcgg cggcaactcc accgcggcgg cggccggcgg caaccagaaa 181 aacagcccgg accgcgtcaa gcggcccatg aatgccttca tggtgtggtc ccgcgggcag 241 cggcgcaaga tggcccagga gaaccccaag atgcacaact cggagatcag caagcgcctg 301 ggcgccgagt ggaaactttt gtcggagacg gagaagcggc cgttcatcga cgaggctaag 361 cggctgcgag cgctgcacat gaaggagcac ccggattata aataccggcc ccggcggaaa 421 accaagacgc tcatgaagaa ggataagtac acgctgcccg gcgggctgct ggcccccggc 481 ggcaatagca tggcgagcgg ggtcggggtg ggcgccggcc tgggcgcggg cgtgaaccag 541 cgcatggaca gttacgcgca catgaacggc tggagcaacg gcagctacag catgatgcag 601 gaccagctgg gctacccgca gcacccgggc ctcaatgcgc acggcgcagc gcagatgcag 661 cccatgcacc gctacgacgt gagcgccctg cagtacaact ccatgaccag ctcgcagacc 721 tacatgaacg gctcgcccac ctacagcatg tcctactcgc agcagggcac ccctggcatg 781 gctcttggct ccatgggttc ggtggtcaag tccgaggcca gctccagccc ccctgtggtt 841 acctcttcct cccactccag ggcgccctgc caggccgggg acctccggga catgatcagc 901 atgtatctcc ccggcgccga ggtgccggaa cccgccgccc ccagcagact tcacatgtcc 961 cagcactacc agagcggccc ggtgcccggc acggccatta acggcacact gcccctctca 1021 cacatgtgag ggccggacag cgaactggag gggggagaaa ttttcaaaga aaaacgaggg 1081 aaatgggagg ggtgcaaaag aggagagtaa gaaacagcat ggagaaaacc cggtacgctc 1141 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa //