LOCUS BC000881 1310 bp mRNA linear HUM 30-JUN-2006
DEFINITION Homo sapiens centromere protein A, mRNA (cDNA clone MGC:5165
IMAGE:3461992), complete cds.
ACCESSION BC000881
VERSION BC000881.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 1310)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 1310)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (15-NOV-2000) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Sequencing Group at the Stanford Human Genome
Center, Stanford University School of Medicine, Stanford, CA 94305
Web site: http://www-shgc.stanford.edu
Contact: (Dickson, Mark) mcd@paxil.stanford.edu
Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
R. M.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 4 Row: f Column: 8
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 4585861.
FEATURES Location/Qualifiers
source 1..1310
/db_xref="H-InvDB:HIT000029974"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:5165 IMAGE:3461992"
/tissue_type="Cervix, carcinoma"
/clone_lib="NIH_MGC_12"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene 1..1310
/gene="CENPA"
/db_xref="GeneID:1058"
/db_xref="HGNC:HGNC:1851"
/db_xref="MIM:117139"
CDS 130..474
/gene="CENPA"
/codon_start=1
/product="centromere protein A"
/protein_id="AAH00881.1"
/db_xref="GeneID:1058"
/db_xref="HGNC:HGNC:1851"
/db_xref="MIM:117139"
/translation="MGPRRRSRKPEAPRRRSPSPTPTPGPSRRGPSLGASSHQHSRRR
QGWLKEIRKLQKSTHLLIRKLPFSRLAAEAFLVHLFEDAYLLTLHAGRVTLFPKDVQL
ARRIRGLEEGLG"
BASE COUNT 339 a 304 c 307 g 360 t
ORIGIN
1 agcaccggct catgtgaggc tcgcggcaca gcgttctctg ggctccccag aagccagcct
61 ttcgctcccg gacccggcag cccgagcagg agccgtggga ccgggcgcca gcaccctctg
121 cggcgtgtca tgggcccgcg ccgccggagc cgaaagcccg aggccccgag gaggcgcagc
181 ccgagcccga ccccgacccc cggcccctcc cggcggggcc cctccttagg cgcttcctcc
241 catcaacaca gtcggcggag acaaggttgg ctaaaggaga tccgaaagct tcagaagagc
301 acacacctct tgataaggaa gctgcccttc agccgcctgg cagcagaagc atttctagtt
361 catctctttg aggacgccta tctcctcacc ttacatgcag gccgagttac tctcttccca
421 aaggatgtgc aactggcccg gaggatccgg ggccttgagg agggactcgg ctgagctcct
481 gcacccagtg tttctgtcag tctttcctgc tcagccaggg gggatgatac cggggactct
541 ccagagccat gactagatcc aatggattct gcgatgctgt ctggactttg ctgtctctga
601 acagtatgtg tgtgttgctt taaatatttt tctttttttt gagaaggaga agactgcatg
661 actttcctct gtaacagagg taatatatga gacaatcaac accgttccaa aggcctgaaa
721 ataattttca gataaagaga ctccaaggtt gactttagtt tgtgagttac tcatgtgact
781 atttgaggat tttgaaaaca tcagatttgc tgtggtatgg gagaaaaggt tatgtactta
841 ttattttagc tctttctgta atatttacat tttttaccat atgtacattt gtacttttat
901 tttacacata agggaaaaaa taagaccact ttgagcagtt gcctggaagg ctgggcattt
961 ccatcatata gacctctgcc cttcagagta gcctcaccat tagtggcagc atcatgtaac
1021 tgagtggact gtgcttgtca acggatgtgt agcttttcag aaacttaatt ggggatgaat
1081 agaaaacctg taagctttga tgttctggtt acttctagta aattcctgtc aaaatcaatt
1141 cagaaattct aacttggaga atttaacatt ttactcttgt aaatcataga agatgtatca
1201 taacagttca gaattttaaa gtacattttc gatgctttta tgggtatttt tgtagtttct
1261 ttgtagagag ataataaaaa tcaaaatatt taatgaaaaa aaaaaaaaaa
//