LOCUS       BC000881                1310 bp    mRNA    linear   HUM 30-JUN-2006
DEFINITION  Homo sapiens centromere protein A, mRNA (cDNA clone MGC:5165
            IMAGE:3461992), complete cds.
ACCESSION   BC000881
VERSION     BC000881.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1310)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 1310)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (15-NOV-2000) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA  94305
            Web site:       http://www-shgc.stanford.edu
            Contact:  (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
            R. M.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 4 Row: f Column: 8
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 4585861.
FEATURES             Location/Qualifiers
     source          1..1310
                     /db_xref="H-InvDB:HIT000029974"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:5165 IMAGE:3461992"
                     /tissue_type="Cervix, carcinoma"
                     /clone_lib="NIH_MGC_12"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..1310
                     /gene="CENPA"
                     /db_xref="GeneID:1058"
                     /db_xref="HGNC:HGNC:1851"
                     /db_xref="MIM:117139"
     CDS             130..474
                     /gene="CENPA"
                     /codon_start=1
                     /product="centromere protein A"
                     /protein_id="AAH00881.1"
                     /db_xref="GeneID:1058"
                     /db_xref="HGNC:HGNC:1851"
                     /db_xref="MIM:117139"
                     /translation="MGPRRRSRKPEAPRRRSPSPTPTPGPSRRGPSLGASSHQHSRRR
                     QGWLKEIRKLQKSTHLLIRKLPFSRLAAEAFLVHLFEDAYLLTLHAGRVTLFPKDVQL
                     ARRIRGLEEGLG"
BASE COUNT          339 a          304 c          307 g          360 t
ORIGIN      
        1 agcaccggct catgtgaggc tcgcggcaca gcgttctctg ggctccccag aagccagcct
       61 ttcgctcccg gacccggcag cccgagcagg agccgtggga ccgggcgcca gcaccctctg
      121 cggcgtgtca tgggcccgcg ccgccggagc cgaaagcccg aggccccgag gaggcgcagc
      181 ccgagcccga ccccgacccc cggcccctcc cggcggggcc cctccttagg cgcttcctcc
      241 catcaacaca gtcggcggag acaaggttgg ctaaaggaga tccgaaagct tcagaagagc
      301 acacacctct tgataaggaa gctgcccttc agccgcctgg cagcagaagc atttctagtt
      361 catctctttg aggacgccta tctcctcacc ttacatgcag gccgagttac tctcttccca
      421 aaggatgtgc aactggcccg gaggatccgg ggccttgagg agggactcgg ctgagctcct
      481 gcacccagtg tttctgtcag tctttcctgc tcagccaggg gggatgatac cggggactct
      541 ccagagccat gactagatcc aatggattct gcgatgctgt ctggactttg ctgtctctga
      601 acagtatgtg tgtgttgctt taaatatttt tctttttttt gagaaggaga agactgcatg
      661 actttcctct gtaacagagg taatatatga gacaatcaac accgttccaa aggcctgaaa
      721 ataattttca gataaagaga ctccaaggtt gactttagtt tgtgagttac tcatgtgact
      781 atttgaggat tttgaaaaca tcagatttgc tgtggtatgg gagaaaaggt tatgtactta
      841 ttattttagc tctttctgta atatttacat tttttaccat atgtacattt gtacttttat
      901 tttacacata agggaaaaaa taagaccact ttgagcagtt gcctggaagg ctgggcattt
      961 ccatcatata gacctctgcc cttcagagta gcctcaccat tagtggcagc atcatgtaac
     1021 tgagtggact gtgcttgtca acggatgtgt agcttttcag aaacttaatt ggggatgaat
     1081 agaaaacctg taagctttga tgttctggtt acttctagta aattcctgtc aaaatcaatt
     1141 cagaaattct aacttggaga atttaacatt ttactcttgt aaatcataga agatgtatca
     1201 taacagttca gaattttaaa gtacattttc gatgctttta tgggtatttt tgtagtttct
     1261 ttgtagagag ataataaaaa tcaaaatatt taatgaaaaa aaaaaaaaaa
//