LOCUS       BC012462                2132 bp    mRNA    linear   HUM 23-JUN-2006
DEFINITION  Homo sapiens centromere protein I, mRNA (cDNA clone MGC:21750
            IMAGE:4537558), complete cds.
ACCESSION   BC012462
VERSION     BC012462.2
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2132)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2132)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (15-AUG-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Sep 16, 2003 this sequence version replaced BC012462.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Baylor College of Medicine Human Genome
            Sequencing Center
            Center code: BCM-HGSC
            Web site: http://www.hgsc.bcm.tmc.edu/cdna/
            Contact: amg@bcm.tmc.edu
            Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
            Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
            A.N., Gibbs, R.A.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 28 Row: k Column: 3
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 41352696.
FEATURES             Location/Qualifiers
     source          1..2132
                     /db_xref="H-InvDB:HIT000035826"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:21750 IMAGE:4537558"
                     /tissue_type="Testis, embryonal carcinoma"
                     /clone_lib="NIH_MGC_92"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..2132
                     /gene="CENPI"
                     /gene_synonym="CENP-I"
                     /gene_synonym="LRPR1"
                     /gene_synonym="Mis6"
                     /db_xref="GeneID:2491"
                     /db_xref="HGNC:HGNC:3968"
                     /db_xref="MIM:300065"
     CDS             15..1583
                     /gene="CENPI"
                     /gene_synonym="CENP-I"
                     /gene_synonym="LRPR1"
                     /gene_synonym="Mis6"
                     /codon_start=1
                     /product="CENPI protein"
                     /protein_id="AAH12462.1"
                     /db_xref="GeneID:2491"
                     /db_xref="HGNC:HGNC:3968"
                     /db_xref="MIM:300065"
                     /translation="MSPQKRVKNVQAQNRTSQGSSSFQTTLSAWKVKQDPSNSKNISK
                     HGQNNPVGDYEHADDQAEEDALQMAVGYFEKGPIKASQNKDKTLEKHLKTVENVAWKN
                     GLASEEIDILLNIALSGKFGNAVNTRILKCMIPATVISEDSVVKAVSWLCVGKCSGST
                     KVLFYRWLVAMFDFIDRKEQINLLYGFFFASLQDDALCPYVCHLLYLLTKKENVKPFR
                     VRKLLDLQAKMGMQPHLQALLSLYKFFAPALISVSLPVRKKIYFKNSENLWKTALLAV
                     KQRNRGPSPEPLKLMLGPANVRPLKRKWNSLSVIPVLNSSSYTKECGKKEMSLSDCLN
                     RSGSFPLEQLQSFPQLLQNIHCLELPSQMGSVLNNSLLLHYINCVRDEPVLLRFYYWL
                     SQTLQEECIWYKVNNYEHGKEFTNFLDTIIRAECFLQEGFYSCEAFLYKSLPLWDGLC
                     CRSQFLQLVSWIPFSSFSEVKPLLFDHLAQLFFTSTIYFKCSVLQSLKELLQNWLLWL
                     SMDIHMKPVTNSPL"
BASE COUNT      656 a    387 c    417 g    672 t
ORIGIN
        1 gaacatatgc agtaatgtca cctcaaaaga gagttaagaa cgtccaggca caaaacagga
       61 cttcacaagg tagtagtagt tttcagacca cgctttcagc ctggaaagta aaacaggatc
      121 caagcaactc gaagaacatc tcaaaacatg gacaaaacaa tccagtggga gattatgaac
      181 atgctgatga tcaagctgaa gaagatgctt tgcaaatggc agtgggatat tttgagaaag
      241 gtcccattaa agcttcacag aataaagata aaaccttgga aaaacacttg aaaactgtgg
      301 aaaatgtggc ttggaagaat gggttagctt cagaagaaat tgatattcta ttaaatattg
      361 cactcagtgg caaatttgga aatgctgtaa acacacggat attgaagtgc atgatcccag
      421 caacagtaat atcagaagat tctgtggtta aggcagtctc ctggctttgt gttggcaagt
      481 gttctggtag caccaaggta cttttttatc gttggctggt tgcaatgttt gacttcattg
      541 atcgtaagga gcaaattaac ttgctctatg gcttcttttt tgcttcattg caagatgatg
      601 cactgtgccc ttatgtttgc catttgttat atttacttac caaaaaagag aatgtcaaac
      661 catttcgtgt gagaaaactg cttgatcttc aggccaaaat gggaatgcag cctcatctcc
      721 aggctttgtt gtcactgtat aagttctttg ctcctgctct gatttcagta tctttgcctg
      781 taaggaagaa gatatatttt aagaattcag agaatctatg gaagacggct ctgcttgccg
      841 tgaagcaaag aaaccgggga ccttctccag aacctctgaa gttgatgtta ggtccagcta
      901 atgttcgtcc tctaaaaaga aagtggaatt ctctctcagt tataccagtg ctcaattcca
      961 gtagctacac taaagaatgt ggaaaaaaag agatgagtct ttctgattgt ctgaatagaa
     1021 gtggatcatt tccactagaa caacttcaaa gcttccccca acttttacag aacatccatt
     1081 gcttagagct gccttctcag atgggctcag tgctaaacaa ctctctgctg cttcactaca
     1141 ttaactgtgt cagagatgag ccagtcttgc tgaggtttta ttactggttg agtcaaacat
     1201 tacaagaaga atgtatttgg tacaaggtga ataattatga acatggaaaa gaatttacca
     1261 acttcctgga taccatcatc agggcagagt gcttcttaca agaggggttt tattcctgtg
     1321 aagcattcct gtataagagc cttcctctct gggatggcct ttgttgtcgg tcacagttcc
     1381 ttcagcttgt gagctggatt ccttttagta gcttctctga ggtgaaacca cttctttttg
     1441 accatctagc gcagctcttc tttacatcaa ccatttattt caagtgtagt gtgcttcaga
     1501 gtctgaaaga gctattgcag aattggctgt tgtggctttc tatggacatt cacatgaaac
     1561 ctgttacaaa cagtcctctg tgagttatct aacggtcttc cacaagagac gatctgtata
     1621 caaatgtttg catatgttta atagtgtggc gtgaaacaat ttttttcatg taaatgctta
     1681 gtagtacttt tttggaagac ttttttgtga cttttcactc ttcttccccc acaatcctga
     1741 aaggaaggct aattctctaa gttcctattg tgtaaatcag aaagcaaccc ttatctgcta
     1801 aatgacttct ttcaagagtt aatataacaa aacactagag agatgagact agaatttgca
     1861 actttgactt ccattcagtt taactcattt tataaattat attcttatat ttaaagccaa
     1921 ttttcatgat ggctagaaaa atctctaacg taattttgcg tacaactttg tttttatgtt
     1981 tctgtgctgt attttggtta tatgtgttaa tttatcttaa ctttggagtt taaaataaga
     2041 gccctaaatt gttaaatgtg tgctttacct tcaaagtaaa agtaaaagac taaatttcaa
     2101 ggaactgcaa aaaaaaaaaa aaaaaaaaaa aa
//