LOCUS       BC002703                 759 bp    mRNA    linear   HUM 23-JUN-2006
DEFINITION  Homo sapiens centromere protein A, mRNA (cDNA clone MGC:3892
            IMAGE:3626578), complete cds.
ACCESSION   BC002703
VERSION     BC002703.2
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 759)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 759)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (05-FEB-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Aug 19, 2003 this sequence version replaced BC002703.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 12 Row: a Column: 2
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 4585861.
FEATURES             Location/Qualifiers
     source          1..759
                     /db_xref="H-InvDB:HIT000031112"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:3892 IMAGE:3626578"
                     /tissue_type="Uterus, endometrium adenocarcinoma"
                     /clone_lib="NIH_MGC_44"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..759
                     /gene="CENPA"
                     /db_xref="GeneID:1058"
                     /db_xref="HGNC:HGNC:1851"
                     /db_xref="MIM:117139"
     CDS             55..477
                     /gene="CENPA"
                     /codon_start=1
                     /product="CENPA protein"
                     /protein_id="AAH02703.1"
                     /db_xref="GeneID:1058"
                     /db_xref="HGNC:HGNC:1851"
                     /db_xref="MIM:117139"
                     /translation="MGPRRRSRKPEAPRRRSPSPTPTPGPSRRGPSLGASSHQHSRRR
                     QGWLKEIRKLQKSTHLLIRKLPFSRLAREICVKFTRGVDFNWQAQALLALQEAAEAFL
                     VHLFEDAYLLTLHAGRVTLFPKDVQLARRIRGLEEGLG"
BASE COUNT          174 a          231 c          198 g          156 t
ORIGIN      
        1 ggcagcccga gcaggagccg tgggaccggg cgccggcacc ctctgcggcg tgtcatgggc
       61 ccgcgccgcc ggagccgaaa gcccgaggcc ccgaggaggc gcagcccgag cccgaccccg
      121 acccccggcc cctcccggcg gggcccctcc ttaggcgctt cctcccatca acacagtcgg
      181 cggagacaag gttggctaaa ggagatccga aagcttcaga agagcacaca cctcttgata
      241 aggaagctgc ccttcagccg cctggcaaga gaaatatgtg ttaaattcac tcgtggtgtg
      301 gacttcaatt ggcaagccca ggccctattg gccctacaag aggcagcaga agcatttcta
      361 gttcatctct ttgaggacgc ctatctcctc accttacatg caggccgagt tactctcttc
      421 ccaaaggatg tgcaactggc ccggaggatc cggggccttg aggagggact cggctgagct
      481 cctgcaccca gtgtttctgt cagtctttcc tgctcagcca ggggggatga taccggggac
      541 tctccagagc catgactaga tccaatggat tctgcgatgc tgtctggact ttgctgtctc
      601 tgaacagctg gagtgcagca gcacgatctc ggctcactgc aacctccaac tcccaagttc
      661 aagcgattct cctgcctcag cctctcggac cctgtctcta ttttttttta aataaagaaa
      721 taaaaattct gtgtaacttc ataaaaaaaa aaaaaaaaa
//