LOCUS BC003571 2582 bp mRNA linear HUM 11-SEP-2007 DEFINITION Homo sapiens POU class 2 homeobox 1, mRNA (cDNA clone MGC:2114 IMAGE:3544378), complete cds. ACCESSION BC003571 VERSION BC003571.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2582) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2582) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (20-FEB-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Louis M. Staudt, M.D., Ph.D. cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Genome Sequence Centre, BC Cancer Agency, Vancouver, BC, Canada info@bcgsc.bc.ca Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao, Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang, Jacquie Schein, Asim Siddiqui,Steven Jones, Rob Holt, Marco Marra. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 8 Row: j Column: 4 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 4502890. FEATURES Location/Qualifiers source 1..2582 /db_xref="H-InvDB:HIT000031609" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:2114 IMAGE:3544378" /tissue_type="Lymph, Burkitt lymphoma" /clone_lib="NIH_MGC_8" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..2582 /gene="POU2F1" /gene_synonym="OCT1" /db_xref="GeneID:5451" /db_xref="HGNC:HGNC:9212" /db_xref="MIM:164175" CDS 55..2286 /gene="POU2F1" /gene_synonym="OCT1" /codon_start=1 /product="POU2F1 protein" /protein_id="AAH03571.1" /db_xref="GeneID:5451" /db_xref="HGNC:HGNC:9212" /db_xref="MIM:164175" /translation="MNNPSETSKPSMESGDGNTGTQTNGLDFQKQPVPVGGAISTAQA QAFLGHLHQVQLAGTSLQAAAQSLNVQSKSNEESGDSQQPSQPSQQPSVQAAIPQTQL MLAGGQITGLTLTPAQQQLLLQQAQAQAQLLAAAVQQHSASQQHSAAGATISASAATP MTQIPLSQPIQIAQDLQQLQQLQQQNLNLQQFVLVHPTTNLQPAQFIISQTPQGQQGL LQAQNLLTQLPQQSQANLLQSQPSITLTSQPATPTRTIAATPIQTLPQSQSTPKRIDT PSLEEPSDLEELEQFAKTFKQRRIKLGFTQGDVGLAMGKLYGNDFSQTTISRFEALNL SFKNMCKLKPLLEKWLNDAENLSSDSSLSSPSALNSPGIEGLSRRRKKRTSIETNIRV ALEKSFLENQKPTSEEITMIADQLNMEKEVIRVWFCNRRQKEKRINPPSSGGTSSSPI KAIFPSPTSLVATTPSLVTSSAATTLTVSPVLPLTSAAVTNLSVTGTSDTTSNNTATV ISTAPPASSAVTSPSLSPSPSASASTSEASSASETSTTQTTSTPLSSPLGTSQVMVTA SGLQTAAAAALQGAAQLPANASLAAMAAAAGLNPSLMAPSQFAAGGALLSLNPGTLSG ALSPALMSNSTLATIQALASGGSLPITSLDATGNLVFANAGGAPNIVTAPLFLNPQNL SLLTSNPVSLVSAAAASAGNSAPVASLHATSTSAESIQNSLFTVASASGAASTTTTAS KAQ" BASE COUNT 709 a 780 c 563 g 530 t ORIGIN 1 ctagaagatt tcacagcaat gctggactgc agtgactatg ttctagactc aagaatgaac 61 aatccgtcag aaaccagtaa accatctatg gagagtggag atggcaacac aggcacacaa 121 accaatggtc tggactttca gaagcagcct gtgcctgtag gaggagcaat ctcaacagcc 181 caggcgcagg ctttccttgg acatctccat caggtccaac tcgctggaac aagtttacag 241 gctgctgctc agtctttaaa tgtacagtct aaatctaatg aagaatcggg ggattcgcag 301 cagccaagcc agccttccca gcagccttca gtgcaggcag ccattcccca gacccagctt 361 atgctagctg gaggacagat aactgggctt actttgacgc ctgcccagca acagttacta 421 ctccagcagg cacaggcaca ggcacagctg ctggctgctg cagtgcagca gcactccgcc 481 agccagcagc acagtgctgc tggagccacc atctccgcct ctgctgccac gcccatgacg 541 cagatccccc tgtctcagcc catacagatc gcacaggatc ttcaacaact gcaacagctt 601 caacagcaga atctcaacct gcaacagttt gtgttggtgc atccaaccac caatttgcag 661 ccagcgcagt ttatcatctc acagacgccc cagggccagc agggtctcct gcaagcgcaa 721 aatcttctaa cgcaactacc tcagcaaagc caagccaacc tcctacagtc gcagccaagc 781 atcaccctca cctcccagcc agcaacccca acacgcacaa tagcagcaac cccaattcag 841 acacttccac agagccagtc aacaccaaag cgaattgata ctcccagctt ggaggagccc 901 agtgaccttg aggagcttga gcagtttgcc aagaccttca aacaaagacg aatcaaactt 961 ggattcactc agggtgatgt tgggctcgct atggggaaac tatatggaaa tgacttcagc 1021 caaactacca tctctcgatt tgaagccttg aacctcagct ttaagaacat gtgcaagttg 1081 aagccacttt tagagaagtg gctaaatgat gcagagaacc tctcatctga ttcgtccctc 1141 tccagcccaa gtgccctgaa ttctccagga attgagggct tgagccgtag gaggaagaaa 1201 cgcaccagca tagagaccaa catccgtgtg gccttagaga agagtttctt ggagaatcaa 1261 aagcctacct cggaagagat cactatgatt gctgatcagc tcaatatgga aaaagaggtg 1321 attcgtgttt ggttctgtaa ccgccgccag aaagaaaaaa gaatcaaccc accaagcagt 1381 ggtgggacca gcagctcacc tattaaagca attttcccca gcccaacttc actggtggcg 1441 accacaccaa gccttgtgac tagcagtgca gcaactaccc tcacagtcag ccctgtcctc 1501 cctctgacca gtgctgctgt gacgaatctt tcagttacag gcacttcaga caccacctcc 1561 aacaacacag caaccgtgat ttccacagcg cctccagctt cctcagcagt cacgtccccc 1621 tctctgagtc cctccccttc tgcctcagcc tccacctccg aggcatccag tgccagtgag 1681 accagcacaa cacagaccac ctccactcct ttgtcctccc ctcttgggac cagccaggtg 1741 atggtgacag catcaggttt gcaaacagca gcagctgctg cccttcaagg agctgcacag 1801 ttgccagcaa atgccagtct tgctgccatg gcagctgctg caggactaaa cccaagcctg 1861 atggcaccct cacagtttgc ggctggaggt gccttactca gtctgaatcc agggaccctg 1921 agcggtgctc tcagcccagc tctaatgagc aacagtacac tggcaactat tcaagctctt 1981 gcttctggtg gctctcttcc aataacatca cttgatgcaa ctgggaacct ggtatttgcc 2041 aatgcgggag gagcccccaa catcgtgact gcccctctgt tcctgaaccc tcagaacctc 2101 tctctgctca ccagcaaccc tgttagcttg gtctctgccg ccgcagcatc tgcagggaac 2161 tctgcacctg tagccagcct tcacgccacc tccacctctg ctgagtccat ccagaactct 2221 ctcttcacag tggcctctgc cagcggggct gcgtccacca ccaccaccgc ctccaaggca 2281 cagtgagctg ggcagagctg ggctgccaga agcctttttc actctgcagt gtgattggac 2341 tgccagccag gttaataaac tgaaaaatgt gattggcttc ctctcgccgt gttgtgaggg 2401 caaaggagag aagggagaaa aaaaaaaaaa aaccacacac acccatacac acataccaga 2461 aaaagaaaga aaggatggag acggaacatt tgcctaattt tgtaataaaa cactgtcttt 2521 tcaggattgc ttcatggatt ggagaacttt ctaaccaaaa attaaaaaaa aaaaaaaaaa 2581 aa //