LOCUS BC012462 2132 bp mRNA linear HUM 23-JUN-2006
DEFINITION Homo sapiens centromere protein I, mRNA (cDNA clone MGC:21750
IMAGE:4537558), complete cds.
ACCESSION BC012462
VERSION BC012462.2
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2132)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2132)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (15-AUG-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Sep 16, 2003 this sequence version replaced BC012462.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Baylor College of Medicine Human Genome
Sequencing Center
Center code: BCM-HGSC
Web site: http://www.hgsc.bcm.tmc.edu/cdna/
Contact: amg@bcm.tmc.edu
Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
A.N., Gibbs, R.A.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 28 Row: k Column: 3
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 41352696.
FEATURES Location/Qualifiers
source 1..2132
/db_xref="H-InvDB:HIT000035826"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:21750 IMAGE:4537558"
/tissue_type="Testis, embryonal carcinoma"
/clone_lib="NIH_MGC_92"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene 1..2132
/gene="CENPI"
/gene_synonym="CENP-I"
/gene_synonym="LRPR1"
/gene_synonym="Mis6"
/db_xref="GeneID:2491"
/db_xref="HGNC:HGNC:3968"
/db_xref="MIM:300065"
CDS 15..1583
/gene="CENPI"
/gene_synonym="CENP-I"
/gene_synonym="LRPR1"
/gene_synonym="Mis6"
/codon_start=1
/product="CENPI protein"
/protein_id="AAH12462.1"
/db_xref="GeneID:2491"
/db_xref="HGNC:HGNC:3968"
/db_xref="MIM:300065"
/translation="MSPQKRVKNVQAQNRTSQGSSSFQTTLSAWKVKQDPSNSKNISK
HGQNNPVGDYEHADDQAEEDALQMAVGYFEKGPIKASQNKDKTLEKHLKTVENVAWKN
GLASEEIDILLNIALSGKFGNAVNTRILKCMIPATVISEDSVVKAVSWLCVGKCSGST
KVLFYRWLVAMFDFIDRKEQINLLYGFFFASLQDDALCPYVCHLLYLLTKKENVKPFR
VRKLLDLQAKMGMQPHLQALLSLYKFFAPALISVSLPVRKKIYFKNSENLWKTALLAV
KQRNRGPSPEPLKLMLGPANVRPLKRKWNSLSVIPVLNSSSYTKECGKKEMSLSDCLN
RSGSFPLEQLQSFPQLLQNIHCLELPSQMGSVLNNSLLLHYINCVRDEPVLLRFYYWL
SQTLQEECIWYKVNNYEHGKEFTNFLDTIIRAECFLQEGFYSCEAFLYKSLPLWDGLC
CRSQFLQLVSWIPFSSFSEVKPLLFDHLAQLFFTSTIYFKCSVLQSLKELLQNWLLWL
SMDIHMKPVTNSPL"
BASE COUNT 656 a 387 c 417 g 672 t
ORIGIN
1 gaacatatgc agtaatgtca cctcaaaaga gagttaagaa cgtccaggca caaaacagga
61 cttcacaagg tagtagtagt tttcagacca cgctttcagc ctggaaagta aaacaggatc
121 caagcaactc gaagaacatc tcaaaacatg gacaaaacaa tccagtggga gattatgaac
181 atgctgatga tcaagctgaa gaagatgctt tgcaaatggc agtgggatat tttgagaaag
241 gtcccattaa agcttcacag aataaagata aaaccttgga aaaacacttg aaaactgtgg
301 aaaatgtggc ttggaagaat gggttagctt cagaagaaat tgatattcta ttaaatattg
361 cactcagtgg caaatttgga aatgctgtaa acacacggat attgaagtgc atgatcccag
421 caacagtaat atcagaagat tctgtggtta aggcagtctc ctggctttgt gttggcaagt
481 gttctggtag caccaaggta cttttttatc gttggctggt tgcaatgttt gacttcattg
541 atcgtaagga gcaaattaac ttgctctatg gcttcttttt tgcttcattg caagatgatg
601 cactgtgccc ttatgtttgc catttgttat atttacttac caaaaaagag aatgtcaaac
661 catttcgtgt gagaaaactg cttgatcttc aggccaaaat gggaatgcag cctcatctcc
721 aggctttgtt gtcactgtat aagttctttg ctcctgctct gatttcagta tctttgcctg
781 taaggaagaa gatatatttt aagaattcag agaatctatg gaagacggct ctgcttgccg
841 tgaagcaaag aaaccgggga ccttctccag aacctctgaa gttgatgtta ggtccagcta
901 atgttcgtcc tctaaaaaga aagtggaatt ctctctcagt tataccagtg ctcaattcca
961 gtagctacac taaagaatgt ggaaaaaaag agatgagtct ttctgattgt ctgaatagaa
1021 gtggatcatt tccactagaa caacttcaaa gcttccccca acttttacag aacatccatt
1081 gcttagagct gccttctcag atgggctcag tgctaaacaa ctctctgctg cttcactaca
1141 ttaactgtgt cagagatgag ccagtcttgc tgaggtttta ttactggttg agtcaaacat
1201 tacaagaaga atgtatttgg tacaaggtga ataattatga acatggaaaa gaatttacca
1261 acttcctgga taccatcatc agggcagagt gcttcttaca agaggggttt tattcctgtg
1321 aagcattcct gtataagagc cttcctctct gggatggcct ttgttgtcgg tcacagttcc
1381 ttcagcttgt gagctggatt ccttttagta gcttctctga ggtgaaacca cttctttttg
1441 accatctagc gcagctcttc tttacatcaa ccatttattt caagtgtagt gtgcttcaga
1501 gtctgaaaga gctattgcag aattggctgt tgtggctttc tatggacatt cacatgaaac
1561 ctgttacaaa cagtcctctg tgagttatct aacggtcttc cacaagagac gatctgtata
1621 caaatgtttg catatgttta atagtgtggc gtgaaacaat ttttttcatg taaatgctta
1681 gtagtacttt tttggaagac ttttttgtga cttttcactc ttcttccccc acaatcctga
1741 aaggaaggct aattctctaa gttcctattg tgtaaatcag aaagcaaccc ttatctgcta
1801 aatgacttct ttcaagagtt aatataacaa aacactagag agatgagact agaatttgca
1861 actttgactt ccattcagtt taactcattt tataaattat attcttatat ttaaagccaa
1921 ttttcatgat ggctagaaaa atctctaacg taattttgcg tacaactttg tttttatgtt
1981 tctgtgctgt attttggtta tatgtgttaa tttatcttaa ctttggagtt taaaataaga
2041 gccctaaatt gttaaatgtg tgctttacct tcaaagtaaa agtaaaagac taaatttcaa
2101 ggaactgcaa aaaaaaaaaa aaaaaaaaaa aa
//