LOCUS BC015355 1387 bp mRNA linear HUM 15-JUL-2006 DEFINITION Homo sapiens centromere protein H, mRNA (cDNA clone MGC:21458 IMAGE:3451767), complete cds. ACCESSION BC015355 VERSION BC015355.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1387) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 1387) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (01-OCT-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Sequencing Group at the Stanford Human Genome Center, Stanford University School of Medicine, Stanford, CA 94305 Web site: http://www-shgc.stanford.edu Contact: (Dickson, Mark) mcd@paxil.stanford.edu Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers, R. M. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 20 Row: a Column: 21 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 21264590. FEATURES Location/Qualifiers source 1..1387 /db_xref="H-InvDB:HIT000037093" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:21458 IMAGE:3451767" /tissue_type="Placenta, choriocarcinoma" /clone_lib="NIH_MGC_10" /lab_host="DH10B" /note="Vector: pCMV-SPORT6" gene 1..1387 /gene="CENPH" /db_xref="GeneID:64946" /db_xref="HGNC:HGNC:17268" /db_xref="MIM:605607" CDS 70..813 /gene="CENPH" /codon_start=1 /product="centromere protein H" /protein_id="AAH15355.1" /db_xref="GeneID:64946" /db_xref="HGNC:HGNC:17268" /db_xref="MIM:605607" /translation="MEEQPQMQDADEPADSGGEGRAGGPPQVAGAQAACSEDRMTLLL RLRAQTKQQLLEYKSMVDASEEKTPEQIMQEKQIEAKIEDLENEIEEVKVAFEIKKLA LDRMRLSTALKKNLEKISRQSSVLMDNMKHLLELNKLIMKSQQESWDLEEKLLDIRKK RLQLKQASESKLLEIQTEKNKQKIDLDSMENSERIKIIRQNLQMEIKITTVIQHVFQN LILGSKVNWAEDPALKEIVLQLEKNVDMM" BASE COUNT 436 a 264 c 318 g 369 t ORIGIN 1 gtggcgggaa aagcgacctt ttctgagcgc gtttgcctgt tgagtggtag cctttcccct 61 caaccagcaa tggaggagca gccccagatg caagacgccg acgagcccgc ggactccgga 121 ggggaaggcc gggcaggcgg gccaccgcag gtcgccggcg cccaggcggc gtgcagcgag 181 gaccgcatga ccctgctcct caggctgaga gcacagacaa aacaacaact cttagaatat 241 aaatcaatgg ttgatgcaag tgaagaaaaa actccagaac aaattatgca agaaaagcaa 301 atcgaagcta aaattgaaga cctggaaaat gaaattgaag aggtaaaagt tgcttttgag 361 ataaaaaagc ttgcattaga caggatgaga ctttcaactg cacttaaaaa aaacctggag 421 aaaattagca gacagtctag tgtgctcatg gataacatga aacacctatt agagctaaat 481 aaattaataa tgaaatcaca gcaggaatct tgggatttag aggaaaaact gcttgatatt 541 agaaagaaga gattgcaatt aaaacaagct tcagaaagta agcttttaga aatacagact 601 gaaaagaaca aacagaagat tgatttggac agtatggaaa actcagagag gataaagatc 661 atacgacaaa acctacagat ggagataaaa attactactg ttattcaaca tgtgttccag 721 aaccttattt tggggagtaa agtcaattgg gcagaggatc ctgcccttaa ggaaattgtt 781 ctgcagcttg agaagaatgt tgacatgatg taataagaat tcatttctga catattttac 841 atttctggca atctcaactc ttatttggaa tacttctgtg catttgtctg tccaccgtaa 901 ttttagaaaa gcatatccat aacgtttaca gttgtagtac agttgtggtt agttatttgt 961 agtgggattg aaagtaattt ttttcttttt atatttctat atttagtttg tttttttgtt 1021 gttgttgttt tttgagatgg agtctcgctt tgttgcccag actggagggc agtggcgcga 1081 tctcggctca ctgcaacctc tgcctcccgg gttcaagcag ttctgcctca gcctcccaag 1141 tagctgtgac taaaggtgca cgccgccatg cccagctaat tttttgtatt ttagtagaga 1201 cggggtttca ccgtgttgcc caggctgctc tcagaactcc tgagctcagg cagtccaccg 1261 cctcggccta ccgaagtgct aggattacag acgtaagcca ccgagcctgg tctagtttgc 1321 attttttttc tatcagtttt ataagttaag aaataaaagg aattaatgtt aaaaaaaaaa 1381 aaaaaaa //