LOCUS       BC012462                2132 bp    mRNA    linear   HUM 23-JUN-2006
DEFINITION  Homo sapiens centromere protein I, mRNA (cDNA clone MGC:21750
            IMAGE:4537558), complete cds.
ACCESSION   BC012462
VERSION     BC012462.2
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2132)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2132)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (15-AUG-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Sep 16, 2003 this sequence version replaced BC012462.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Baylor College of Medicine Human Genome
            Sequencing Center
            Center code: BCM-HGSC
            Web site: http://www.hgsc.bcm.tmc.edu/cdna/
            Contact: amg@bcm.tmc.edu
            Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
            Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
            A.N., Gibbs, R.A.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 28 Row: k Column: 3
            This clone was selected for full-length sequencing because it
            passed the following selection criteria: matched mRNA gi: 41352696.
FEATURES             Location/Qualifiers
     source          1..2132
                     /db_xref="H-InvDB:HIT000035826"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:21750 IMAGE:4537558"
                     /tissue_type="Testis, embryonal carcinoma"
                     /clone_lib="NIH_MGC_92"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..2132
                     /gene="CENPI"
                     /gene_synonym="CENP-I"
                     /gene_synonym="LRPR1"
                     /gene_synonym="Mis6"
                     /db_xref="GeneID:2491"
                     /db_xref="HGNC:HGNC:3968"
                     /db_xref="MIM:300065"
     CDS             15..1583
                     /gene="CENPI"
                     /gene_synonym="CENP-I"
                     /gene_synonym="LRPR1"
                     /gene_synonym="Mis6"
                     /codon_start=1
                     /product="CENPI protein"
                     /protein_id="AAH12462.1"
                     /db_xref="GeneID:2491"
                     /db_xref="HGNC:HGNC:3968"
                     /db_xref="MIM:300065"
                     /translation="MSPQKRVKNVQAQNRTSQGSSSFQTTLSAWKVKQDPSNSKNISK
                     HGQNNPVGDYEHADDQAEEDALQMAVGYFEKGPIKASQNKDKTLEKHLKTVENVAWKN
                     GLASEEIDILLNIALSGKFGNAVNTRILKCMIPATVISEDSVVKAVSWLCVGKCSGST
                     KVLFYRWLVAMFDFIDRKEQINLLYGFFFASLQDDALCPYVCHLLYLLTKKENVKPFR
                     VRKLLDLQAKMGMQPHLQALLSLYKFFAPALISVSLPVRKKIYFKNSENLWKTALLAV
                     KQRNRGPSPEPLKLMLGPANVRPLKRKWNSLSVIPVLNSSSYTKECGKKEMSLSDCLN
                     RSGSFPLEQLQSFPQLLQNIHCLELPSQMGSVLNNSLLLHYINCVRDEPVLLRFYYWL
                     SQTLQEECIWYKVNNYEHGKEFTNFLDTIIRAECFLQEGFYSCEAFLYKSLPLWDGLC
                     CRSQFLQLVSWIPFSSFSEVKPLLFDHLAQLFFTSTIYFKCSVLQSLKELLQNWLLWL
                     SMDIHMKPVTNSPL"
BASE COUNT          656 a          387 c          417 g          672 t
ORIGIN      
        1 gaacatatgc agtaatgtca cctcaaaaga gagttaagaa cgtccaggca caaaacagga
       61 cttcacaagg tagtagtagt tttcagacca cgctttcagc ctggaaagta aaacaggatc
      121 caagcaactc gaagaacatc tcaaaacatg gacaaaacaa tccagtggga gattatgaac
      181 atgctgatga tcaagctgaa gaagatgctt tgcaaatggc agtgggatat tttgagaaag
      241 gtcccattaa agcttcacag aataaagata aaaccttgga aaaacacttg aaaactgtgg
      301 aaaatgtggc ttggaagaat gggttagctt cagaagaaat tgatattcta ttaaatattg
      361 cactcagtgg caaatttgga aatgctgtaa acacacggat attgaagtgc atgatcccag
      421 caacagtaat atcagaagat tctgtggtta aggcagtctc ctggctttgt gttggcaagt
      481 gttctggtag caccaaggta cttttttatc gttggctggt tgcaatgttt gacttcattg
      541 atcgtaagga gcaaattaac ttgctctatg gcttcttttt tgcttcattg caagatgatg
      601 cactgtgccc ttatgtttgc catttgttat atttacttac caaaaaagag aatgtcaaac
      661 catttcgtgt gagaaaactg cttgatcttc aggccaaaat gggaatgcag cctcatctcc
      721 aggctttgtt gtcactgtat aagttctttg ctcctgctct gatttcagta tctttgcctg
      781 taaggaagaa gatatatttt aagaattcag agaatctatg gaagacggct ctgcttgccg
      841 tgaagcaaag aaaccgggga ccttctccag aacctctgaa gttgatgtta ggtccagcta
      901 atgttcgtcc tctaaaaaga aagtggaatt ctctctcagt tataccagtg ctcaattcca
      961 gtagctacac taaagaatgt ggaaaaaaag agatgagtct ttctgattgt ctgaatagaa
     1021 gtggatcatt tccactagaa caacttcaaa gcttccccca acttttacag aacatccatt
     1081 gcttagagct gccttctcag atgggctcag tgctaaacaa ctctctgctg cttcactaca
     1141 ttaactgtgt cagagatgag ccagtcttgc tgaggtttta ttactggttg agtcaaacat
     1201 tacaagaaga atgtatttgg tacaaggtga ataattatga acatggaaaa gaatttacca
     1261 acttcctgga taccatcatc agggcagagt gcttcttaca agaggggttt tattcctgtg
     1321 aagcattcct gtataagagc cttcctctct gggatggcct ttgttgtcgg tcacagttcc
     1381 ttcagcttgt gagctggatt ccttttagta gcttctctga ggtgaaacca cttctttttg
     1441 accatctagc gcagctcttc tttacatcaa ccatttattt caagtgtagt gtgcttcaga
     1501 gtctgaaaga gctattgcag aattggctgt tgtggctttc tatggacatt cacatgaaac
     1561 ctgttacaaa cagtcctctg tgagttatct aacggtcttc cacaagagac gatctgtata
     1621 caaatgtttg catatgttta atagtgtggc gtgaaacaat ttttttcatg taaatgctta
     1681 gtagtacttt tttggaagac ttttttgtga cttttcactc ttcttccccc acaatcctga
     1741 aaggaaggct aattctctaa gttcctattg tgtaaatcag aaagcaaccc ttatctgcta
     1801 aatgacttct ttcaagagtt aatataacaa aacactagag agatgagact agaatttgca
     1861 actttgactt ccattcagtt taactcattt tataaattat attcttatat ttaaagccaa
     1921 ttttcatgat ggctagaaaa atctctaacg taattttgcg tacaactttg tttttatgtt
     1981 tctgtgctgt attttggtta tatgtgttaa tttatcttaa ctttggagtt taaaataaga
     2041 gccctaaatt gttaaatgtg tgctttacct tcaaagtaaa agtaaaagac taaatttcaa
     2101 ggaactgcaa aaaaaaaaaa aaaaaaaaaa aa
//