LOCUS       BC052274                2689 bp    mRNA    linear   HUM 17-JUL-2007
DEFINITION  Homo sapiens POU class 2 homeobox 1, mRNA (cDNA clone MGC:59844
            IMAGE:4622256), complete cds.
ACCESSION   BC052274
VERSION     BC052274.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2689)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2689)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (02-MAY-2003) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: DCTD/DTP/Gazdar
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 47 Row: b Column: 2
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 42476163.
FEATURES             Location/Qualifiers
     source          1..2689
                     /db_xref="H-InvDB:HIT000053762"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:59844 IMAGE:4622256"
                     /tissue_type="Lung, large cell carcinoma"
                     /clone_lib="NIH_MGC_18"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..2689
                     /gene="POU2F1"
                     /gene_synonym="OCT1"
                     /db_xref="GeneID:5451"
                     /db_xref="HGNC:HGNC:9212"
                     /db_xref="MIM:164175"
     CDS             163..2394
                     /gene="POU2F1"
                     /gene_synonym="OCT1"
                     /codon_start=1
                     /product="POU class 2 homeobox 1"
                     /protein_id="AAH52274.1"
                     /db_xref="GeneID:5451"
                     /db_xref="HGNC:HGNC:9212"
                     /db_xref="MIM:164175"
                     /translation="MNNPSETSKPSMESGDGNTGTQTNGLDFQKQPVPVGGAISTAQA
                     QAFLGHLHQVQLAGTSLQAAAQSLNVQSKSNEESGDSQQPSQPSQQPSVQAAIPQTQL
                     MLAGGQITGLTLTPAQQQLLLQQAQAQAQLLAAAVQQHSASQQHSAAGATISASAATP
                     MTQIPLSQPIQIAQDLQQLQQLQQQNLNLQQFVLVHPTTNLQPAQFIISQTPQGQQGL
                     LQAQNLLTQLPQQSQANLLQSQPSITLTSQPATPTRTIAATPIQTLPQSQSTPKRIDT
                     PSLEEPSDLEELEQFAKTFKQRRIKLGFTQGDVGLAMGKLYGNDFSQTTISRFEALNL
                     SFKNMCKLKPLLEKWLNDAENLSSDSSLSSPSALNSPGIEGLSRRRKKRTSIETNIRV
                     ALEKSFLENQKPTSEEITMIADQLNMEKEVIRVWFCNRRQKEKRINPPSSGGTSSSPI
                     KAIFPSPTSLVATTPSLVTSSAATTLTVSPVLPLTSAAVTNLSVTGTSDTTSNNTATV
                     ISTAPPASSAVTSPSLSPSPSASASTSEASSASETSTTQTTSTPLSSPLGTSQVMVTA
                     SGLQTAAAAALQGAAQLPANASLAAMAAAAGLNPSLMAPSQFAAGGALLSLNPGTLSG
                     ALSPALMSNSTLATIQALASGGSLPITSLDATGNLVFANAGGAPNIVTAPLFLNPQNL
                     SLLTSNPVSLVSAAAASAGNSAPVASLHATSTSAESIQNSLFTVASASGAASTTTTAS
                     KAQ"
BASE COUNT      743 a    804 c    589 g    553 t
ORIGIN
        1 cggaggagca gcgagtcaag atgagagttc agccgcggcg gcagcagcag cagactggaa
       61 aagtaagaag agctttcctg cctttttaat taccaaacta ctctcagttt tcaatgaatc
      121 agttcaaaga aagaatgcag tctttctata cctgactcaa gaatgaacaa tccgtcagaa
      181 accagtaaac catctatgga gagtggagat ggcaacacag gcacacaaac caatggtctg
      241 gactttcaga agcagcctgt gcctgtagga ggagcaatct caacagccca ggcgcaggct
      301 ttccttggac atctccatca ggtccaactc gctggaacaa gtttacaggc tgctgctcag
      361 tctttaaatg tacagtctaa atctaatgaa gaatcggggg attcgcagca gccaagccag
      421 ccttcccagc agccttcagt gcaggcagcc attccccaga cccagcttat gctagctgga
      481 ggacagataa ctgggcttac tttgacgcct gcccagcaac agttactact ccagcaggca
      541 caggcacagg cacagctgct ggctgctgca gtgcagcagc actccgccag ccagcagcac
      601 agtgctgctg gagccaccat ctccgcctct gctgccacgc ccatgacgca gatccccctg
      661 tctcagccca tacagatcgc acaggatctt caacaactgc aacagcttca acagcagaat
      721 ctcaacctgc aacagtttgt gttggtgcat ccaaccacca atttgcagcc agcgcagttt
      781 atcatctcac agacgcccca gggccagcag ggtctcctgc aagcgcaaaa tcttctaacg
      841 caactacctc agcaaagcca agccaacctc ctacagtcgc agccaagcat caccctcacc
      901 tcccagccag caaccccaac acgcacaata gcagcaaccc caattcagac acttccacag
      961 agccagtcaa caccaaagcg aattgatact cccagcttgg aggagcccag tgaccttgag
     1021 gagcttgagc agtttgccaa gaccttcaaa caaagacgaa tcaaacttgg attcactcag
     1081 ggtgatgttg ggctcgctat ggggaaacta tatggaaatg acttcagcca aactaccatc
     1141 tctcgatttg aagccttgaa cctcagcttt aagaacatgt gcaagttgaa gccactttta
     1201 gagaagtggc taaatgatgc agagaacctc tcatctgatt cgtccctctc cagcccaagt
     1261 gccctgaatt ctccaggaat tgagggcttg agccgtagga ggaagaaacg caccagcata
     1321 gagaccaaca tccgtgtggc cttagagaag agtttcttgg agaatcaaaa gcctacctcg
     1381 gaagagatca ctatgattgc tgatcagctc aatatggaaa aagaggtgat tcgtgtttgg
     1441 ttctgtaacc gccgccagaa agaaaaaaga atcaacccac caagcagtgg tgggaccagc
     1501 agctcaccta ttaaagcaat tttccccagc ccaacttcac tggtggcgac cacaccaagc
     1561 cttgtgacta gcagtgcagc aactaccctc acagtcagcc ctgtcctccc tctgaccagt
     1621 gctgctgtga cgaatctttc agttacaggc acttcagaca ccacctccaa caacacagca
     1681 accgtgattt ccacagcgcc tccagcttcc tcagcagtca cgtccccctc tctgagtccc
     1741 tccccttctg cctcagcctc cacctccgag gcatccagtg ccagtgagac cagcacaaca
     1801 cagaccacct ccactccttt gtcctcccct cttgggacca gccaggtgat ggtgacagca
     1861 tcaggtttgc aaacagcagc agctgctgcc cttcaaggag ctgcacagtt gccagcaaat
     1921 gccagtcttg ctgccatggc agctgctgca ggactaaacc caagcctgat ggcaccctca
     1981 cagtttgcgg ctggaggtgc cttactcagt ctgaatccag ggaccctgag cggtgctctc
     2041 agcccagctc taatgagcaa cagtacactg gcaactattc aagctcttgc ttctggtggc
     2101 tctcttccaa taacatcact tgatgcaact gggaacctgg tatttgccaa tgcgggagga
     2161 gcccccaaca tcgtgactgc ccctctgttc ctgaaccctc agaacctctc tctgctcacc
     2221 agcaaccctg ttagcttggt ctctgccgcc gcagcatctg cagggaactc tgcacctgta
     2281 gccagccttc acgccacctc cacctctgct gagtccatcc agaactctct cttcacagtg
     2341 gcctctgcca gcggggctgc gtccaccacc accaccgcct ccaaggcaca gtgagctggg
     2401 cagagctggg ctgccagaag cctttttcac tctgcagtgt gattggactg ccagccaggt
     2461 taataaactg aaaaatgtga ttggcttcct ctcgccgtgt tgtgagggca aaggagagaa
     2521 gggagaaaaa aaaaaaaaaa ccacacacac ccatacacac ataccagaaa aagaaagaaa
     2581 ggatggagac ggaacatttg cctaattttg taataaaaca ctgtcttttc aggattgctt
     2641 catggattgg agaactttct aaccaaaaat taaaaaaaaa aaaaaaaaa
//