LOCUS BC053847 2852 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens centromere protein B, 80kDa, mRNA (cDNA clone
MGC:61764 IMAGE:6470289), complete cds.
ACCESSION BC053847
VERSION BC053847.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2852)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2852)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (13-JUN-2003) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: National Institutes of Health Intramural
Sequencing Center (NISC),
Gaithersburg, Maryland;
Web site: http://www.nisc.nih.gov/
Contact: nisc_mgc@nhgri.nih.gov
Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
Young,A., Zhang,L.-H. and Green,E.D.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 115 Row: o Column: 17.
FEATURES Location/Qualifiers
source 1..2852
/db_xref="H-InvDB:HIT000053988"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:61764 IMAGE:6470289"
/tissue_type="Uterus, leiomyosarcoma"
/clone_lib="NIH_MGC_71"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene 1..2852
/gene="CENPB"
/db_xref="GeneID:1059"
/db_xref="HGNC:HGNC:1852"
/db_xref="MIM:117140"
CDS 208..2007
/gene="CENPB"
/codon_start=1
/product="centromere protein B, 80kDa"
/protein_id="AAH53847.1"
/db_xref="GeneID:1059"
/db_xref="HGNC:HGNC:1852"
/db_xref="MIM:117140"
/translation="MGPKRRQLTFREKSRIIQEVEENPDLRKGEIARRFNIPPSTLST
ILKNKRAILASERKYGVASTCRKTNKLSPYDKLEGLLIAWFQQIRAAGLPVKGIILKE
KALRIAEELGMDDFTASNGWLDRFRRRHGVVSCSGVARARARNAAPRTPAAPASPAAV
PSEGSGGSTTGWRAREEQPPSVAEGYASQDVFSATETSLWYDFLPDQAAGLCGGDGRP
RQATQRLSVLLCANADGSEKLPPLVAGKSAKPRAGQAGLPCDYTANSKGGVTTQALAK
YLKALDTRMAAESRRVLLLAGRLAAQSLDTSGLRHVQLAFFPPGTVHPLERGVVQQVK
GHYRQAMLLKAMAALEGQDPSGLQLGLTEALHFVAAAWQAVEPSDIAACFREAGFGGG
PNATITTSLKSEGEEEEEEEEEEEEEEGEGEEEEEEGEEEEEEGGEGEELGEEEEVEE
EGDVDSDEEEEEDEESSSEGLEAEDWAQGVVEAGGSFGAYGAQEEAQCPTLHFLEGGE
DSDSDSEEEDDEEEDDEDEDDDDDEEDGDEVPVPSFGEAMAYFAMVKRYLTSFPIDDR
VQSHILHLEHDLVHVTRKNHARQAGVRGLGHQS"
BASE COUNT 512 a 902 c 977 g 461 t
ORIGIN
1 gcccgccgcc cggcgacgcg cccgccggcg tccccggagg tgcccccggc ccggccgggt
61 cgtcgccccg ccgccgcgcc cgcgagccgc tttgtctcgg gcggggcgcg cgggagaggc
121 cgccaggtgc cccccgccac gggcccgggc cccccgcccc ggggcggcgg cggcggcgcc
181 gggccccggg gcggggggcg cgccgggatg ggccccaaga ggcgacagct gacgttccgg
241 gagaagtcac ggatcatcca ggaggtggag gagaatccgg acctgcgcaa gggcgagatc
301 gcgcggcgct tcaacatccc gccgtccacg ctgagcacga tcctgaagaa caagcgcgcc
361 atcctggcgt cggagcgcaa gtacggggtg gcctccacct gccgcaagac caacaagctg
421 tctccctacg acaagctcga gggcttgctc atcgcctggt tccagcagat ccgcgccgcc
481 ggcctgccgg tcaagggcat catcctcaag gagaaggcgc tgcgcatagc cgaggagctg
541 ggcatggacg acttcaccgc ctccaacggc tggctggacc gcttccgccg gcgccacggc
601 gtggtgtcct gcagcggcgt ggcccgcgcc cgcgcgcgaa acgctgcccc ccgcaccccg
661 gcggcgcctg ccagtccggc cgcggtgccc tcggagggca gtggcgggag cactactggt
721 tggcgcgctc gggaggagca gccgccgtcg gtggccgagg gctacgcctc gcaggacgtg
781 ttcagcgcca ccgagaccag tctatggtac gacttcctgc ccgaccaggc cgcggggctg
841 tgcggaggcg acggacggcc gcgtcaagcc acccagcgcc tgagcgtcct gctatgcgcc
901 aatgccgacg gcagcgagaa gctgcccccg ctggtggccg gcaagtcggc caagccccgc
961 gcaggccaag ccggcctgcc ctgcgactac accgccaact ccaagggtgg tgtcaccacc
1021 caggccctgg ccaagtactt gaaggccttg gacacccgaa tggctgcaga gtctcgccgg
1081 gtcctgctgt tggccggccg cttggctgcc cagtccttgg acacctcggg cctgcggcat
1141 gtgcagctgg ccttcttccc tcccggcacc gtgcatccgc tggagagggg agtggtccag
1201 caggtgaagg gccactaccg ccaggccatg ctgctcaagg ccatggccgc gctagagggc
1261 caggatccct caggcctgca gctgggtctc acggaggccc tgcactttgt ggctgccgcc
1321 tggcaggcag tggagccttc ggacatagcc gcctgctttc gtgaggctgg ctttgggggt
1381 ggccctaatg ccaccatcac cacttccctc aagagtgagg gagaggaaga ggaggaggag
1441 gaggaagaag aggaggagga agagggtgaa ggagaggaag aggaggagga aggggaggag
1501 gaggaggagg aaggggggga aggagaggaa ttgggggagg aagaggaggt ggaggaggag
1561 ggtgatgttg atagtgatga agaagaggag gaagatgagg agagctcctc ggagggcttg
1621 gaggctgagg actgggccca gggagtagtg gaggccggtg gcagcttcgg ggcttatggt
1681 gcccaggagg aagcccagtg ccctactctg catttcctgg aaggtgggga ggactctgat
1741 tcagacagtg aggaagagga cgatgaggaa gaggatgatg aagatgaaga cgacgatgat
1801 gatgaggagg atggtgatga ggtgcctgta cccagctttg gggaggccat ggcttacttt
1861 gccatggtca agaggtacct gacctccttc cccattgatg accgcgtgca gagccacatc
1921 ctccacttgg aacacgatct ggttcatgtg accaggaaga accacgccag gcaggcggga
1981 gttcgaggtc ttggacatca aagctgagtc actggaccta gctgtgcccc caacctagat
2041 tggcagcacc accccagggc agaggactct ctgggcaccc gctgtgcatg gagccagagt
2101 gcagagcccc agatccttta gtaatgcttc ccctggtcct gcaacaggcc cggtcacctc
2161 ggccgggccc ggggctgagg tcagcctcac tgcctgctta ttgcctcttt ctcagaatcc
2221 tctttcctcc ccatttggcc ctgggctcag gggaccaggt ggggcgggtg gggagctgtc
2281 cggtgctacc acaccgtgcc ctcagtggac taaccacagc agcagccagg gatgggccct
2341 ggaggttccc ggccggagag tgcctctccc ctctgccatc cacgtcaggt ctttggtggg
2401 gggaccccaa agccattctg ggaagggctc cagaagaagg tccagcctag gccccctgca
2461 aggctggcag cccccacccc caccccccag gccgccttga gaagcacagt ttaactcact
2521 gcgggctcct gagcctgctt ctgcctgctt tccacctccc cagtcccttt ctctggccct
2581 gtccatgtga ctttggccct tggttttctt tccagattgg aggtttccaa gaggcccccc
2641 accgtggaag taaccaaggg cgcttccttg tgggcagctg caggccccat gcctctcctc
2701 cctctctggc agggccccat cctgggcaga ggggcctggg gctgggccca gagtccagcc
2761 gtccagctgc tcctttccca gtttgatttc aataaatctg tccactcccc ttttgtgggg
2821 gtgaacgttt taacagcaaa aaaaaaaaaa aa
//