LOCUS       BC052274                2689 bp    mRNA    linear   HUM 17-JUL-2007
DEFINITION  Homo sapiens POU class 2 homeobox 1, mRNA (cDNA clone MGC:59844
            IMAGE:4622256), complete cds.
ACCESSION   BC052274
VERSION     BC052274.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2689)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2689)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (02-MAY-2003) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: DCTD/DTP/Gazdar
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D., McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 47 Row: b Column: 2
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 42476163.
FEATURES             Location/Qualifiers
     source          1..2689
                     /db_xref="H-InvDB:HIT000053762"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:59844 IMAGE:4622256"
                     /tissue_type="Lung, large cell carcinoma"
                     /clone_lib="NIH_MGC_18"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..2689
                     /gene="POU2F1"
                     /gene_synonym="OCT1"
                     /db_xref="GeneID:5451"
                     /db_xref="HGNC:HGNC:9212"
                     /db_xref="MIM:164175"
     CDS             163..2394
                     /gene="POU2F1"
                     /gene_synonym="OCT1"
                     /codon_start=1
                     /product="POU class 2 homeobox 1"
                     /protein_id="AAH52274.1"
                     /db_xref="GeneID:5451"
                     /db_xref="HGNC:HGNC:9212"
                     /db_xref="MIM:164175"
                     /translation="MNNPSETSKPSMESGDGNTGTQTNGLDFQKQPVPVGGAISTAQA
                     QAFLGHLHQVQLAGTSLQAAAQSLNVQSKSNEESGDSQQPSQPSQQPSVQAAIPQTQL
                     MLAGGQITGLTLTPAQQQLLLQQAQAQAQLLAAAVQQHSASQQHSAAGATISASAATP
                     MTQIPLSQPIQIAQDLQQLQQLQQQNLNLQQFVLVHPTTNLQPAQFIISQTPQGQQGL
                     LQAQNLLTQLPQQSQANLLQSQPSITLTSQPATPTRTIAATPIQTLPQSQSTPKRIDT
                     PSLEEPSDLEELEQFAKTFKQRRIKLGFTQGDVGLAMGKLYGNDFSQTTISRFEALNL
                     SFKNMCKLKPLLEKWLNDAENLSSDSSLSSPSALNSPGIEGLSRRRKKRTSIETNIRV
                     ALEKSFLENQKPTSEEITMIADQLNMEKEVIRVWFCNRRQKEKRINPPSSGGTSSSPI
                     KAIFPSPTSLVATTPSLVTSSAATTLTVSPVLPLTSAAVTNLSVTGTSDTTSNNTATV
                     ISTAPPASSAVTSPSLSPSPSASASTSEASSASETSTTQTTSTPLSSPLGTSQVMVTA
                     SGLQTAAAAALQGAAQLPANASLAAMAAAAGLNPSLMAPSQFAAGGALLSLNPGTLSG
                     ALSPALMSNSTLATIQALASGGSLPITSLDATGNLVFANAGGAPNIVTAPLFLNPQNL
                     SLLTSNPVSLVSAAAASAGNSAPVASLHATSTSAESIQNSLFTVASASGAASTTTTAS
                     KAQ"
BASE COUNT          743 a          804 c          589 g          553 t
ORIGIN      
        1 cggaggagca gcgagtcaag atgagagttc agccgcggcg gcagcagcag cagactggaa
       61 aagtaagaag agctttcctg cctttttaat taccaaacta ctctcagttt tcaatgaatc
      121 agttcaaaga aagaatgcag tctttctata cctgactcaa gaatgaacaa tccgtcagaa
      181 accagtaaac catctatgga gagtggagat ggcaacacag gcacacaaac caatggtctg
      241 gactttcaga agcagcctgt gcctgtagga ggagcaatct caacagccca ggcgcaggct
      301 ttccttggac atctccatca ggtccaactc gctggaacaa gtttacaggc tgctgctcag
      361 tctttaaatg tacagtctaa atctaatgaa gaatcggggg attcgcagca gccaagccag
      421 ccttcccagc agccttcagt gcaggcagcc attccccaga cccagcttat gctagctgga
      481 ggacagataa ctgggcttac tttgacgcct gcccagcaac agttactact ccagcaggca
      541 caggcacagg cacagctgct ggctgctgca gtgcagcagc actccgccag ccagcagcac
      601 agtgctgctg gagccaccat ctccgcctct gctgccacgc ccatgacgca gatccccctg
      661 tctcagccca tacagatcgc acaggatctt caacaactgc aacagcttca acagcagaat
      721 ctcaacctgc aacagtttgt gttggtgcat ccaaccacca atttgcagcc agcgcagttt
      781 atcatctcac agacgcccca gggccagcag ggtctcctgc aagcgcaaaa tcttctaacg
      841 caactacctc agcaaagcca agccaacctc ctacagtcgc agccaagcat caccctcacc
      901 tcccagccag caaccccaac acgcacaata gcagcaaccc caattcagac acttccacag
      961 agccagtcaa caccaaagcg aattgatact cccagcttgg aggagcccag tgaccttgag
     1021 gagcttgagc agtttgccaa gaccttcaaa caaagacgaa tcaaacttgg attcactcag
     1081 ggtgatgttg ggctcgctat ggggaaacta tatggaaatg acttcagcca aactaccatc
     1141 tctcgatttg aagccttgaa cctcagcttt aagaacatgt gcaagttgaa gccactttta
     1201 gagaagtggc taaatgatgc agagaacctc tcatctgatt cgtccctctc cagcccaagt
     1261 gccctgaatt ctccaggaat tgagggcttg agccgtagga ggaagaaacg caccagcata
     1321 gagaccaaca tccgtgtggc cttagagaag agtttcttgg agaatcaaaa gcctacctcg
     1381 gaagagatca ctatgattgc tgatcagctc aatatggaaa aagaggtgat tcgtgtttgg
     1441 ttctgtaacc gccgccagaa agaaaaaaga atcaacccac caagcagtgg tgggaccagc
     1501 agctcaccta ttaaagcaat tttccccagc ccaacttcac tggtggcgac cacaccaagc
     1561 cttgtgacta gcagtgcagc aactaccctc acagtcagcc ctgtcctccc tctgaccagt
     1621 gctgctgtga cgaatctttc agttacaggc acttcagaca ccacctccaa caacacagca
     1681 accgtgattt ccacagcgcc tccagcttcc tcagcagtca cgtccccctc tctgagtccc
     1741 tccccttctg cctcagcctc cacctccgag gcatccagtg ccagtgagac cagcacaaca
     1801 cagaccacct ccactccttt gtcctcccct cttgggacca gccaggtgat ggtgacagca
     1861 tcaggtttgc aaacagcagc agctgctgcc cttcaaggag ctgcacagtt gccagcaaat
     1921 gccagtcttg ctgccatggc agctgctgca ggactaaacc caagcctgat ggcaccctca
     1981 cagtttgcgg ctggaggtgc cttactcagt ctgaatccag ggaccctgag cggtgctctc
     2041 agcccagctc taatgagcaa cagtacactg gcaactattc aagctcttgc ttctggtggc
     2101 tctcttccaa taacatcact tgatgcaact gggaacctgg tatttgccaa tgcgggagga
     2161 gcccccaaca tcgtgactgc ccctctgttc ctgaaccctc agaacctctc tctgctcacc
     2221 agcaaccctg ttagcttggt ctctgccgcc gcagcatctg cagggaactc tgcacctgta
     2281 gccagccttc acgccacctc cacctctgct gagtccatcc agaactctct cttcacagtg
     2341 gcctctgcca gcggggctgc gtccaccacc accaccgcct ccaaggcaca gtgagctggg
     2401 cagagctggg ctgccagaag cctttttcac tctgcagtgt gattggactg ccagccaggt
     2461 taataaactg aaaaatgtga ttggcttcct ctcgccgtgt tgtgagggca aaggagagaa
     2521 gggagaaaaa aaaaaaaaaa ccacacacac ccatacacac ataccagaaa aagaaagaaa
     2581 ggatggagac ggaacatttg cctaattttg taataaaaca ctgtcttttc aggattgctt
     2641 catggattgg agaactttct aaccaaaaat taaaaaaaaa aaaaaaaaa
//