LOCUS       BC002703                 759 bp    mRNA    linear   HUM 23-JUN-2006
DEFINITION  Homo sapiens centromere protein A, mRNA (cDNA clone MGC:3892
            IMAGE:3626578), complete cds.
ACCESSION   BC002703
VERSION     BC002703.2
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 759)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M.,
            Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H.,
            Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T.,
            Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K.,
            Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B.,
            Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J.,
            Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S.,
            Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J.,
            Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A.,
            Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M.,
            Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J.,
            Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A.,
            Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C.,
            Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W.,
            Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J.,
            Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I.,
            Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and
            Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 759)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (05-FEB-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Aug 19, 2003 this sequence version replaced BC002703.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC), Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,
            McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S.,
            Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A.,
            Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 12 Row: a Column: 2
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 4585861.
FEATURES             Location/Qualifiers
     source          1..759
                     /db_xref="H-InvDB:HIT000031112"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:3892 IMAGE:3626578"
                     /tissue_type="Uterus, endometrium adenocarcinoma"
                     /clone_lib="NIH_MGC_44"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..759
                     /gene="CENPA"
                     /db_xref="GeneID:1058"
                     /db_xref="HGNC:HGNC:1851"
                     /db_xref="MIM:117139"
     CDS             55..477
                     /gene="CENPA"
                     /codon_start=1
                     /product="CENPA protein"
                     /protein_id="AAH02703.1"
                     /db_xref="GeneID:1058"
                     /db_xref="HGNC:HGNC:1851"
                     /db_xref="MIM:117139"
                     /translation="MGPRRRSRKPEAPRRRSPSPTPTPGPSRRGPSLGASSHQHSRRR
                     QGWLKEIRKLQKSTHLLIRKLPFSRLAREICVKFTRGVDFNWQAQALLALQEAAEAFL
                     VHLFEDAYLLTLHAGRVTLFPKDVQLARRIRGLEEGLG"
BASE COUNT      174 a    231 c    198 g    156 t
ORIGIN
        1 ggcagcccga gcaggagccg tgggaccggg cgccggcacc ctctgcggcg tgtcatgggc
       61 ccgcgccgcc ggagccgaaa gcccgaggcc ccgaggaggc gcagcccgag cccgaccccg
      121 acccccggcc cctcccggcg gggcccctcc ttaggcgctt cctcccatca acacagtcgg
      181 cggagacaag gttggctaaa ggagatccga aagcttcaga agagcacaca cctcttgata
      241 aggaagctgc ccttcagccg cctggcaaga gaaatatgtg ttaaattcac tcgtggtgtg
      301 gacttcaatt ggcaagccca ggccctattg gccctacaag aggcagcaga agcatttcta
      361 gttcatctct ttgaggacgc ctatctcctc accttacatg caggccgagt tactctcttc
      421 ccaaaggatg tgcaactggc ccggaggatc cggggccttg aggagggact cggctgagct
      481 cctgcaccca gtgtttctgt cagtctttcc tgctcagcca ggggggatga taccggggac
      541 tctccagagc catgactaga tccaatggat tctgcgatgc tgtctggact ttgctgtctc
      601 tgaacagctg gagtgcagca gcacgatctc ggctcactgc aacctccaac tcccaagttc
      661 aagcgattct cctgcctcag cctctcggac cctgtctcta ttttttttta aataaagaaa
      721 taaaaattct gtgtaacttc ataaaaaaaa aaaaaaaaa
//