LOCUS       BC000881                1310 bp    mRNA    linear   HUM 30-JUN-2006
DEFINITION  Homo sapiens centromere protein A, mRNA (cDNA clone MGC:5165
            IMAGE:3461992), complete cds.
ACCESSION   BC000881
VERSION     BC000881.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1310)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M.,
            Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H.,
            Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T.,
            Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K.,
            Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B.,
            Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J.,
            Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S.,
            Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J.,
            Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A.,
            Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M.,
            Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J.,
            Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A.,
            Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C.,
            Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W.,
            Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J.,
            Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I.,
            Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J.
            and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 1310)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (15-NOV-2000) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA
            94305
            Web site: http://www-shgc.stanford.edu
            Contact: (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
            R. M.
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            http://image.llnl.gov
            Series: IRAK Plate: 4 Row: f Column: 8
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi:
            4585861.
FEATURES             Location/Qualifiers
     source          1..1310
                     /db_xref="H-InvDB:HIT000029974"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:5165 IMAGE:3461992"
                     /tissue_type="Cervix, carcinoma"
                     /clone_lib="NIH_MGC_12"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..1310
                     /gene="CENPA"
                     /db_xref="GeneID:1058"
                     /db_xref="HGNC:HGNC:1851"
                     /db_xref="MIM:117139"
     CDS             130..474
                     /gene="CENPA"
                     /codon_start=1
                     /product="centromere protein A"
                     /protein_id="AAH00881.1"
                     /db_xref="GeneID:1058"
                     /db_xref="HGNC:HGNC:1851"
                     /db_xref="MIM:117139"
                     /translation="MGPRRRSRKPEAPRRRSPSPTPTPGPSRRGPSLGASSHQHSRRR
                     QGWLKEIRKLQKSTHLLIRKLPFSRLAAEAFLVHLFEDAYLLTLHAGRVTLFPKDVQL
                     ARRIRGLEEGLG"
BASE COUNT      339 a    304 c    307 g    360 t
ORIGIN
        1 agcaccggct catgtgaggc tcgcggcaca gcgttctctg ggctccccag aagccagcct
       61 ttcgctcccg gacccggcag cccgagcagg agccgtggga ccgggcgcca gcaccctctg
      121 cggcgtgtca tgggcccgcg ccgccggagc cgaaagcccg aggccccgag gaggcgcagc
      181 ccgagcccga ccccgacccc cggcccctcc cggcggggcc cctccttagg cgcttcctcc
      241 catcaacaca gtcggcggag acaaggttgg ctaaaggaga tccgaaagct tcagaagagc
      301 acacacctct tgataaggaa gctgcccttc agccgcctgg cagcagaagc atttctagtt
      361 catctctttg aggacgccta tctcctcacc ttacatgcag gccgagttac tctcttccca
      421 aaggatgtgc aactggcccg gaggatccgg ggccttgagg agggactcgg ctgagctcct
      481 gcacccagtg tttctgtcag tctttcctgc tcagccaggg gggatgatac cggggactct
      541 ccagagccat gactagatcc aatggattct gcgatgctgt ctggactttg ctgtctctga
      601 acagtatgtg tgtgttgctt taaatatttt tctttttttt gagaaggaga agactgcatg
      661 actttcctct gtaacagagg taatatatga gacaatcaac accgttccaa aggcctgaaa
      721 ataattttca gataaagaga ctccaaggtt gactttagtt tgtgagttac tcatgtgact
      781 atttgaggat tttgaaaaca tcagatttgc tgtggtatgg gagaaaaggt tatgtactta
      841 ttattttagc tctttctgta atatttacat tttttaccat atgtacattt gtacttttat
      901 tttacacata agggaaaaaa taagaccact ttgagcagtt gcctggaagg ctgggcattt
      961 ccatcatata gacctctgcc cttcagagta gcctcaccat tagtggcagc atcatgtaac
     1021 tgagtggact gtgcttgtca acggatgtgt agcttttcag aaacttaatt ggggatgaat
     1081 agaaaacctg taagctttga tgttctggtt acttctagta aattcctgtc aaaatcaatt
     1141 cagaaattct aacttggaga atttaacatt ttactcttgt aaatcataga agatgtatca
     1201 taacagttca gaattttaaa gtacattttc gatgctttta tgggtatttt tgtagtttct
     1261 ttgtagagag ataataaaaa tcaaaatatt taatgaaaaa aaaaaaaaaa
//