LOCUS BC002703 759 bp mRNA linear HUM 23-JUN-2006
DEFINITION Homo sapiens centromere protein A, mRNA (cDNA clone MGC:3892
IMAGE:3626578), complete cds.
ACCESSION BC002703
VERSION BC002703.2
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 759)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 759)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (05-FEB-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Aug 19, 2003 this sequence version replaced BC002703.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: National Institutes of Health Intramural
Sequencing Center (NISC),
Gaithersburg, Maryland;
Web site: http://www.nisc.nih.gov/
Contact: nisc_mgc@nhgri.nih.gov
Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D., McCloskey,J.C.,
McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
Young,A., Zhang,L.-H. and Green,E.D.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 12 Row: a Column: 2
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 4585861.
FEATURES Location/Qualifiers
source 1..759
/db_xref="H-InvDB:HIT000031112"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:3892 IMAGE:3626578"
/tissue_type="Uterus, endometrium adenocarcinoma"
/clone_lib="NIH_MGC_44"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..759
/gene="CENPA"
/db_xref="GeneID:1058"
/db_xref="HGNC:HGNC:1851"
/db_xref="MIM:117139"
CDS 55..477
/gene="CENPA"
/codon_start=1
/product="CENPA protein"
/protein_id="AAH02703.1"
/db_xref="GeneID:1058"
/db_xref="HGNC:HGNC:1851"
/db_xref="MIM:117139"
/translation="MGPRRRSRKPEAPRRRSPSPTPTPGPSRRGPSLGASSHQHSRRR
QGWLKEIRKLQKSTHLLIRKLPFSRLAREICVKFTRGVDFNWQAQALLALQEAAEAFL
VHLFEDAYLLTLHAGRVTLFPKDVQLARRIRGLEEGLG"
BASE COUNT 174 a 231 c 198 g 156 t
ORIGIN
1 ggcagcccga gcaggagccg tgggaccggg cgccggcacc ctctgcggcg tgtcatgggc
61 ccgcgccgcc ggagccgaaa gcccgaggcc ccgaggaggc gcagcccgag cccgaccccg
121 acccccggcc cctcccggcg gggcccctcc ttaggcgctt cctcccatca acacagtcgg
181 cggagacaag gttggctaaa ggagatccga aagcttcaga agagcacaca cctcttgata
241 aggaagctgc ccttcagccg cctggcaaga gaaatatgtg ttaaattcac tcgtggtgtg
301 gacttcaatt ggcaagccca ggccctattg gccctacaag aggcagcaga agcatttcta
361 gttcatctct ttgaggacgc ctatctcctc accttacatg caggccgagt tactctcttc
421 ccaaaggatg tgcaactggc ccggaggatc cggggccttg aggagggact cggctgagct
481 cctgcaccca gtgtttctgt cagtctttcc tgctcagcca ggggggatga taccggggac
541 tctccagagc catgactaga tccaatggat tctgcgatgc tgtctggact ttgctgtctc
601 tgaacagctg gagtgcagca gcacgatctc ggctcactgc aacctccaac tcccaagttc
661 aagcgattct cctgcctcag cctctcggac cctgtctcta ttttttttta aataaagaaa
721 taaaaattct gtgtaacttc ataaaaaaaa aaaaaaaaa
//