LOCUS BC053847 2852 bp mRNA linear HUM 15-JUL-2006 DEFINITION Homo sapiens centromere protein B, 80kDa, mRNA (cDNA clone MGC:61764 IMAGE:6470289), complete cds. ACCESSION BC053847 VERSION BC053847.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2852) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2852) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (13-JUN-2003) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 115 Row: o Column: 17. FEATURES Location/Qualifiers source 1..2852 /db_xref="H-InvDB:HIT000053988" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:61764 IMAGE:6470289" /tissue_type="Uterus, leiomyosarcoma" /clone_lib="NIH_MGC_71" /lab_host="DH10B" /note="Vector: pCMV-SPORT6" gene 1..2852 /gene="CENPB" /db_xref="GeneID:1059" /db_xref="HGNC:HGNC:1852" /db_xref="MIM:117140" CDS 208..2007 /gene="CENPB" /codon_start=1 /product="centromere protein B, 80kDa" /protein_id="AAH53847.1" /db_xref="GeneID:1059" /db_xref="HGNC:HGNC:1852" /db_xref="MIM:117140" /translation="MGPKRRQLTFREKSRIIQEVEENPDLRKGEIARRFNIPPSTLST ILKNKRAILASERKYGVASTCRKTNKLSPYDKLEGLLIAWFQQIRAAGLPVKGIILKE KALRIAEELGMDDFTASNGWLDRFRRRHGVVSCSGVARARARNAAPRTPAAPASPAAV PSEGSGGSTTGWRAREEQPPSVAEGYASQDVFSATETSLWYDFLPDQAAGLCGGDGRP RQATQRLSVLLCANADGSEKLPPLVAGKSAKPRAGQAGLPCDYTANSKGGVTTQALAK YLKALDTRMAAESRRVLLLAGRLAAQSLDTSGLRHVQLAFFPPGTVHPLERGVVQQVK GHYRQAMLLKAMAALEGQDPSGLQLGLTEALHFVAAAWQAVEPSDIAACFREAGFGGG PNATITTSLKSEGEEEEEEEEEEEEEEGEGEEEEEEGEEEEEEGGEGEELGEEEEVEE EGDVDSDEEEEEDEESSSEGLEAEDWAQGVVEAGGSFGAYGAQEEAQCPTLHFLEGGE DSDSDSEEEDDEEEDDEDEDDDDDEEDGDEVPVPSFGEAMAYFAMVKRYLTSFPIDDR VQSHILHLEHDLVHVTRKNHARQAGVRGLGHQS" BASE COUNT 512 a 902 c 977 g 461 t ORIGIN 1 gcccgccgcc cggcgacgcg cccgccggcg tccccggagg tgcccccggc ccggccgggt 61 cgtcgccccg ccgccgcgcc cgcgagccgc tttgtctcgg gcggggcgcg cgggagaggc 121 cgccaggtgc cccccgccac gggcccgggc cccccgcccc ggggcggcgg cggcggcgcc 181 gggccccggg gcggggggcg cgccgggatg ggccccaaga ggcgacagct gacgttccgg 241 gagaagtcac ggatcatcca ggaggtggag gagaatccgg acctgcgcaa gggcgagatc 301 gcgcggcgct tcaacatccc gccgtccacg ctgagcacga tcctgaagaa caagcgcgcc 361 atcctggcgt cggagcgcaa gtacggggtg gcctccacct gccgcaagac caacaagctg 421 tctccctacg acaagctcga gggcttgctc atcgcctggt tccagcagat ccgcgccgcc 481 ggcctgccgg tcaagggcat catcctcaag gagaaggcgc tgcgcatagc cgaggagctg 541 ggcatggacg acttcaccgc ctccaacggc tggctggacc gcttccgccg gcgccacggc 601 gtggtgtcct gcagcggcgt ggcccgcgcc cgcgcgcgaa acgctgcccc ccgcaccccg 661 gcggcgcctg ccagtccggc cgcggtgccc tcggagggca gtggcgggag cactactggt 721 tggcgcgctc gggaggagca gccgccgtcg gtggccgagg gctacgcctc gcaggacgtg 781 ttcagcgcca ccgagaccag tctatggtac gacttcctgc ccgaccaggc cgcggggctg 841 tgcggaggcg acggacggcc gcgtcaagcc acccagcgcc tgagcgtcct gctatgcgcc 901 aatgccgacg gcagcgagaa gctgcccccg ctggtggccg gcaagtcggc caagccccgc 961 gcaggccaag ccggcctgcc ctgcgactac accgccaact ccaagggtgg tgtcaccacc 1021 caggccctgg ccaagtactt gaaggccttg gacacccgaa tggctgcaga gtctcgccgg 1081 gtcctgctgt tggccggccg cttggctgcc cagtccttgg acacctcggg cctgcggcat 1141 gtgcagctgg ccttcttccc tcccggcacc gtgcatccgc tggagagggg agtggtccag 1201 caggtgaagg gccactaccg ccaggccatg ctgctcaagg ccatggccgc gctagagggc 1261 caggatccct caggcctgca gctgggtctc acggaggccc tgcactttgt ggctgccgcc 1321 tggcaggcag tggagccttc ggacatagcc gcctgctttc gtgaggctgg ctttgggggt 1381 ggccctaatg ccaccatcac cacttccctc aagagtgagg gagaggaaga ggaggaggag 1441 gaggaagaag aggaggagga agagggtgaa ggagaggaag aggaggagga aggggaggag 1501 gaggaggagg aaggggggga aggagaggaa ttgggggagg aagaggaggt ggaggaggag 1561 ggtgatgttg atagtgatga agaagaggag gaagatgagg agagctcctc ggagggcttg 1621 gaggctgagg actgggccca gggagtagtg gaggccggtg gcagcttcgg ggcttatggt 1681 gcccaggagg aagcccagtg ccctactctg catttcctgg aaggtgggga ggactctgat 1741 tcagacagtg aggaagagga cgatgaggaa gaggatgatg aagatgaaga cgacgatgat 1801 gatgaggagg atggtgatga ggtgcctgta cccagctttg gggaggccat ggcttacttt 1861 gccatggtca agaggtacct gacctccttc cccattgatg accgcgtgca gagccacatc 1921 ctccacttgg aacacgatct ggttcatgtg accaggaaga accacgccag gcaggcggga 1981 gttcgaggtc ttggacatca aagctgagtc actggaccta gctgtgcccc caacctagat 2041 tggcagcacc accccagggc agaggactct ctgggcaccc gctgtgcatg gagccagagt 2101 gcagagcccc agatccttta gtaatgcttc ccctggtcct gcaacaggcc cggtcacctc 2161 ggccgggccc ggggctgagg tcagcctcac tgcctgctta ttgcctcttt ctcagaatcc 2221 tctttcctcc ccatttggcc ctgggctcag gggaccaggt ggggcgggtg gggagctgtc 2281 cggtgctacc acaccgtgcc ctcagtggac taaccacagc agcagccagg gatgggccct 2341 ggaggttccc ggccggagag tgcctctccc ctctgccatc cacgtcaggt ctttggtggg 2401 gggaccccaa agccattctg ggaagggctc cagaagaagg tccagcctag gccccctgca 2461 aggctggcag cccccacccc caccccccag gccgccttga gaagcacagt ttaactcact 2521 gcgggctcct gagcctgctt ctgcctgctt tccacctccc cagtcccttt ctctggccct 2581 gtccatgtga ctttggccct tggttttctt tccagattgg aggtttccaa gaggcccccc 2641 accgtggaag taaccaaggg cgcttccttg tgggcagctg caggccccat gcctctcctc 2701 cctctctggc agggccccat cctgggcaga ggggcctggg gctgggccca gagtccagcc 2761 gtccagctgc tcctttccca gtttgatttc aataaatctg tccactcccc ttttgtgggg 2821 gtgaacgttt taacagcaaa aaaaaaaaaa aa //