LOCUS       BC003571                2582 bp    mRNA    linear   HUM 11-SEP-2007
DEFINITION  Homo sapiens POU class 2 homeobox 1, mRNA (cDNA clone MGC:2114
            IMAGE:3544378), complete cds.
ACCESSION   BC003571
VERSION     BC003571.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2582)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2582)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (20-FEB-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Louis M. Staudt, M.D., Ph.D.
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Genome Sequence Centre,
            BC Cancer Agency, Vancouver, BC, Canada
            info@bcgsc.bc.ca
            Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson
            Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen
            Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel
            Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave
            Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth
            Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao,
            Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR
            Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang,
            Jacquie Schein, Asim Siddiqui, Steven Jones, Rob Holt, Marco Marra.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 8 Row: j Column: 4
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 4502890.
FEATURES             Location/Qualifiers
     source          1..2582
                     /db_xref="H-InvDB:HIT000031609"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:2114 IMAGE:3544378"
                     /tissue_type="Lymph, Burkitt lymphoma"
                     /clone_lib="NIH_MGC_8"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..2582
                     /gene="POU2F1"
                     /gene_synonym="OCT1"
                     /db_xref="GeneID:5451"
                     /db_xref="HGNC:HGNC:9212"
                     /db_xref="MIM:164175"
     CDS             55..2286
                     /gene="POU2F1"
                     /gene_synonym="OCT1"
                     /codon_start=1
                     /product="POU2F1 protein"
                     /protein_id="AAH03571.1"
                     /db_xref="GeneID:5451"
                     /db_xref="HGNC:HGNC:9212"
                     /db_xref="MIM:164175"
                     /translation="MNNPSETSKPSMESGDGNTGTQTNGLDFQKQPVPVGGAISTAQA
                     QAFLGHLHQVQLAGTSLQAAAQSLNVQSKSNEESGDSQQPSQPSQQPSVQAAIPQTQL
                     MLAGGQITGLTLTPAQQQLLLQQAQAQAQLLAAAVQQHSASQQHSAAGATISASAATP
                     MTQIPLSQPIQIAQDLQQLQQLQQQNLNLQQFVLVHPTTNLQPAQFIISQTPQGQQGL
                     LQAQNLLTQLPQQSQANLLQSQPSITLTSQPATPTRTIAATPIQTLPQSQSTPKRIDT
                     PSLEEPSDLEELEQFAKTFKQRRIKLGFTQGDVGLAMGKLYGNDFSQTTISRFEALNL
                     SFKNMCKLKPLLEKWLNDAENLSSDSSLSSPSALNSPGIEGLSRRRKKRTSIETNIRV
                     ALEKSFLENQKPTSEEITMIADQLNMEKEVIRVWFCNRRQKEKRINPPSSGGTSSSPI
                     KAIFPSPTSLVATTPSLVTSSAATTLTVSPVLPLTSAAVTNLSVTGTSDTTSNNTATV
                     ISTAPPASSAVTSPSLSPSPSASASTSEASSASETSTTQTTSTPLSSPLGTSQVMVTA
                     SGLQTAAAAALQGAAQLPANASLAAMAAAAGLNPSLMAPSQFAAGGALLSLNPGTLSG
                     ALSPALMSNSTLATIQALASGGSLPITSLDATGNLVFANAGGAPNIVTAPLFLNPQNL
                     SLLTSNPVSLVSAAAASAGNSAPVASLHATSTSAESIQNSLFTVASASGAASTTTTAS
                     KAQ"
BASE COUNT          709 a          780 c          563 g          530 t
ORIGIN      
        1 ctagaagatt tcacagcaat gctggactgc agtgactatg ttctagactc aagaatgaac
       61 aatccgtcag aaaccagtaa accatctatg gagagtggag atggcaacac aggcacacaa
      121 accaatggtc tggactttca gaagcagcct gtgcctgtag gaggagcaat ctcaacagcc
      181 caggcgcagg ctttccttgg acatctccat caggtccaac tcgctggaac aagtttacag
      241 gctgctgctc agtctttaaa tgtacagtct aaatctaatg aagaatcggg ggattcgcag
      301 cagccaagcc agccttccca gcagccttca gtgcaggcag ccattcccca gacccagctt
      361 atgctagctg gaggacagat aactgggctt actttgacgc ctgcccagca acagttacta
      421 ctccagcagg cacaggcaca ggcacagctg ctggctgctg cagtgcagca gcactccgcc
      481 agccagcagc acagtgctgc tggagccacc atctccgcct ctgctgccac gcccatgacg
      541 cagatccccc tgtctcagcc catacagatc gcacaggatc ttcaacaact gcaacagctt
      601 caacagcaga atctcaacct gcaacagttt gtgttggtgc atccaaccac caatttgcag
      661 ccagcgcagt ttatcatctc acagacgccc cagggccagc agggtctcct gcaagcgcaa
      721 aatcttctaa cgcaactacc tcagcaaagc caagccaacc tcctacagtc gcagccaagc
      781 atcaccctca cctcccagcc agcaacccca acacgcacaa tagcagcaac cccaattcag
      841 acacttccac agagccagtc aacaccaaag cgaattgata ctcccagctt ggaggagccc
      901 agtgaccttg aggagcttga gcagtttgcc aagaccttca aacaaagacg aatcaaactt
      961 ggattcactc agggtgatgt tgggctcgct atggggaaac tatatggaaa tgacttcagc
     1021 caaactacca tctctcgatt tgaagccttg aacctcagct ttaagaacat gtgcaagttg
     1081 aagccacttt tagagaagtg gctaaatgat gcagagaacc tctcatctga ttcgtccctc
     1141 tccagcccaa gtgccctgaa ttctccagga attgagggct tgagccgtag gaggaagaaa
     1201 cgcaccagca tagagaccaa catccgtgtg gccttagaga agagtttctt ggagaatcaa
     1261 aagcctacct cggaagagat cactatgatt gctgatcagc tcaatatgga aaaagaggtg
     1321 attcgtgttt ggttctgtaa ccgccgccag aaagaaaaaa gaatcaaccc accaagcagt
     1381 ggtgggacca gcagctcacc tattaaagca attttcccca gcccaacttc actggtggcg
     1441 accacaccaa gccttgtgac tagcagtgca gcaactaccc tcacagtcag ccctgtcctc
     1501 cctctgacca gtgctgctgt gacgaatctt tcagttacag gcacttcaga caccacctcc
     1561 aacaacacag caaccgtgat ttccacagcg cctccagctt cctcagcagt cacgtccccc
     1621 tctctgagtc cctccccttc tgcctcagcc tccacctccg aggcatccag tgccagtgag
     1681 accagcacaa cacagaccac ctccactcct ttgtcctccc ctcttgggac cagccaggtg
     1741 atggtgacag catcaggttt gcaaacagca gcagctgctg cccttcaagg agctgcacag
     1801 ttgccagcaa atgccagtct tgctgccatg gcagctgctg caggactaaa cccaagcctg
     1861 atggcaccct cacagtttgc ggctggaggt gccttactca gtctgaatcc agggaccctg
     1921 agcggtgctc tcagcccagc tctaatgagc aacagtacac tggcaactat tcaagctctt
     1981 gcttctggtg gctctcttcc aataacatca cttgatgcaa ctgggaacct ggtatttgcc
     2041 aatgcgggag gagcccccaa catcgtgact gcccctctgt tcctgaaccc tcagaacctc
     2101 tctctgctca ccagcaaccc tgttagcttg gtctctgccg ccgcagcatc tgcagggaac
     2161 tctgcacctg tagccagcct tcacgccacc tccacctctg ctgagtccat ccagaactct
     2221 ctcttcacag tggcctctgc cagcggggct gcgtccacca ccaccaccgc ctccaaggca
     2281 cagtgagctg ggcagagctg ggctgccaga agcctttttc actctgcagt gtgattggac
     2341 tgccagccag gttaataaac tgaaaaatgt gattggcttc ctctcgccgt gttgtgaggg
     2401 caaaggagag aagggagaaa aaaaaaaaaa aaccacacac acccatacac acataccaga
     2461 aaaagaaaga aaggatggag acggaacatt tgcctaattt tgtaataaaa cactgtcttt
     2521 tcaggattgc ttcatggatt ggagaacttt ctaaccaaaa attaaaaaaa aaaaaaaaaa
     2581 aa
//