LOCUS       BC053847                2852 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens centromere protein B, 80kDa, mRNA (cDNA clone
            MGC:61764 IMAGE:6470289), complete cds.
ACCESSION   BC053847
VERSION     BC053847.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2852)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2852)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (13-JUN-2003) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D., McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 115 Row: o Column: 17.
FEATURES             Location/Qualifiers
     source          1..2852
                     /db_xref="H-InvDB:HIT000053988"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:61764 IMAGE:6470289"
                     /tissue_type="Uterus, leiomyosarcoma"
                     /clone_lib="NIH_MGC_71"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..2852
                     /gene="CENPB"
                     /db_xref="GeneID:1059"
                     /db_xref="HGNC:HGNC:1852"
                     /db_xref="MIM:117140"
     CDS             208..2007
                     /gene="CENPB"
                     /codon_start=1
                     /product="centromere protein B, 80kDa"
                     /protein_id="AAH53847.1"
                     /db_xref="GeneID:1059"
                     /db_xref="HGNC:HGNC:1852"
                     /db_xref="MIM:117140"
                     /translation="MGPKRRQLTFREKSRIIQEVEENPDLRKGEIARRFNIPPSTLST
                     ILKNKRAILASERKYGVASTCRKTNKLSPYDKLEGLLIAWFQQIRAAGLPVKGIILKE
                     KALRIAEELGMDDFTASNGWLDRFRRRHGVVSCSGVARARARNAAPRTPAAPASPAAV
                     PSEGSGGSTTGWRAREEQPPSVAEGYASQDVFSATETSLWYDFLPDQAAGLCGGDGRP
                     RQATQRLSVLLCANADGSEKLPPLVAGKSAKPRAGQAGLPCDYTANSKGGVTTQALAK
                     YLKALDTRMAAESRRVLLLAGRLAAQSLDTSGLRHVQLAFFPPGTVHPLERGVVQQVK
                     GHYRQAMLLKAMAALEGQDPSGLQLGLTEALHFVAAAWQAVEPSDIAACFREAGFGGG
                     PNATITTSLKSEGEEEEEEEEEEEEEEGEGEEEEEEGEEEEEEGGEGEELGEEEEVEE
                     EGDVDSDEEEEEDEESSSEGLEAEDWAQGVVEAGGSFGAYGAQEEAQCPTLHFLEGGE
                     DSDSDSEEEDDEEEDDEDEDDDDDEEDGDEVPVPSFGEAMAYFAMVKRYLTSFPIDDR
                     VQSHILHLEHDLVHVTRKNHARQAGVRGLGHQS"
BASE COUNT          512 a          902 c          977 g          461 t
ORIGIN      
        1 gcccgccgcc cggcgacgcg cccgccggcg tccccggagg tgcccccggc ccggccgggt
       61 cgtcgccccg ccgccgcgcc cgcgagccgc tttgtctcgg gcggggcgcg cgggagaggc
      121 cgccaggtgc cccccgccac gggcccgggc cccccgcccc ggggcggcgg cggcggcgcc
      181 gggccccggg gcggggggcg cgccgggatg ggccccaaga ggcgacagct gacgttccgg
      241 gagaagtcac ggatcatcca ggaggtggag gagaatccgg acctgcgcaa gggcgagatc
      301 gcgcggcgct tcaacatccc gccgtccacg ctgagcacga tcctgaagaa caagcgcgcc
      361 atcctggcgt cggagcgcaa gtacggggtg gcctccacct gccgcaagac caacaagctg
      421 tctccctacg acaagctcga gggcttgctc atcgcctggt tccagcagat ccgcgccgcc
      481 ggcctgccgg tcaagggcat catcctcaag gagaaggcgc tgcgcatagc cgaggagctg
      541 ggcatggacg acttcaccgc ctccaacggc tggctggacc gcttccgccg gcgccacggc
      601 gtggtgtcct gcagcggcgt ggcccgcgcc cgcgcgcgaa acgctgcccc ccgcaccccg
      661 gcggcgcctg ccagtccggc cgcggtgccc tcggagggca gtggcgggag cactactggt
      721 tggcgcgctc gggaggagca gccgccgtcg gtggccgagg gctacgcctc gcaggacgtg
      781 ttcagcgcca ccgagaccag tctatggtac gacttcctgc ccgaccaggc cgcggggctg
      841 tgcggaggcg acggacggcc gcgtcaagcc acccagcgcc tgagcgtcct gctatgcgcc
      901 aatgccgacg gcagcgagaa gctgcccccg ctggtggccg gcaagtcggc caagccccgc
      961 gcaggccaag ccggcctgcc ctgcgactac accgccaact ccaagggtgg tgtcaccacc
     1021 caggccctgg ccaagtactt gaaggccttg gacacccgaa tggctgcaga gtctcgccgg
     1081 gtcctgctgt tggccggccg cttggctgcc cagtccttgg acacctcggg cctgcggcat
     1141 gtgcagctgg ccttcttccc tcccggcacc gtgcatccgc tggagagggg agtggtccag
     1201 caggtgaagg gccactaccg ccaggccatg ctgctcaagg ccatggccgc gctagagggc
     1261 caggatccct caggcctgca gctgggtctc acggaggccc tgcactttgt ggctgccgcc
     1321 tggcaggcag tggagccttc ggacatagcc gcctgctttc gtgaggctgg ctttgggggt
     1381 ggccctaatg ccaccatcac cacttccctc aagagtgagg gagaggaaga ggaggaggag
     1441 gaggaagaag aggaggagga agagggtgaa ggagaggaag aggaggagga aggggaggag
     1501 gaggaggagg aaggggggga aggagaggaa ttgggggagg aagaggaggt ggaggaggag
     1561 ggtgatgttg atagtgatga agaagaggag gaagatgagg agagctcctc ggagggcttg
     1621 gaggctgagg actgggccca gggagtagtg gaggccggtg gcagcttcgg ggcttatggt
     1681 gcccaggagg aagcccagtg ccctactctg catttcctgg aaggtgggga ggactctgat
     1741 tcagacagtg aggaagagga cgatgaggaa gaggatgatg aagatgaaga cgacgatgat
     1801 gatgaggagg atggtgatga ggtgcctgta cccagctttg gggaggccat ggcttacttt
     1861 gccatggtca agaggtacct gacctccttc cccattgatg accgcgtgca gagccacatc
     1921 ctccacttgg aacacgatct ggttcatgtg accaggaaga accacgccag gcaggcggga
     1981 gttcgaggtc ttggacatca aagctgagtc actggaccta gctgtgcccc caacctagat
     2041 tggcagcacc accccagggc agaggactct ctgggcaccc gctgtgcatg gagccagagt
     2101 gcagagcccc agatccttta gtaatgcttc ccctggtcct gcaacaggcc cggtcacctc
     2161 ggccgggccc ggggctgagg tcagcctcac tgcctgctta ttgcctcttt ctcagaatcc
     2221 tctttcctcc ccatttggcc ctgggctcag gggaccaggt ggggcgggtg gggagctgtc
     2281 cggtgctacc acaccgtgcc ctcagtggac taaccacagc agcagccagg gatgggccct
     2341 ggaggttccc ggccggagag tgcctctccc ctctgccatc cacgtcaggt ctttggtggg
     2401 gggaccccaa agccattctg ggaagggctc cagaagaagg tccagcctag gccccctgca
     2461 aggctggcag cccccacccc caccccccag gccgccttga gaagcacagt ttaactcact
     2521 gcgggctcct gagcctgctt ctgcctgctt tccacctccc cagtcccttt ctctggccct
     2581 gtccatgtga ctttggccct tggttttctt tccagattgg aggtttccaa gaggcccccc
     2641 accgtggaag taaccaaggg cgcttccttg tgggcagctg caggccccat gcctctcctc
     2701 cctctctggc agggccccat cctgggcaga ggggcctggg gctgggccca gagtccagcc
     2761 gtccagctgc tcctttccca gtttgatttc aataaatctg tccactcccc ttttgtgggg
     2821 gtgaacgttt taacagcaaa aaaaaaaaaa aa
//