LOCUS BC012024 1379 bp mRNA linear HUM 15-JUL-2006 DEFINITION Homo sapiens centromere protein H, mRNA (cDNA clone MGC:21431 IMAGE:4510607), complete cds. ACCESSION BC012024 VERSION BC012024.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1379) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 1379) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (30-JUL-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Baylor College of Medicine Human Genome Sequencing Center Center code: BCM-HGSC Web site: http://www.hgsc.bcm.tmc.edu/cdna/ Contact: amg@bcm.tmc.edu Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H., Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati, A.N., Gibbs, R.A. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 28 Row: h Column: 15 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 21264590. FEATURES Location/Qualifiers source 1..1379 /db_xref="H-InvDB:HIT000035606" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:21431 IMAGE:4510607" /tissue_type="Testis, embryonal carcinoma" /clone_lib="NIH_MGC_92" /lab_host="DH10B" /note="Vector: pCMV-SPORT6" gene 1..1379 /gene="CENPH" /db_xref="GeneID:64946" /db_xref="HGNC:HGNC:17268" /db_xref="MIM:605607" CDS 64..807 /gene="CENPH" /codon_start=1 /product="centromere protein H" /protein_id="AAH12024.1" /db_xref="GeneID:64946" /db_xref="HGNC:HGNC:17268" /db_xref="MIM:605607" /translation="MEEQPQMQDADEPADSGGEGRAGGPPQVAGAQAACSEDRMTLLL RLRAQTKQQLLEYKSMVDASEEKTPEQIMQEKQIEAKIEDLENEIEEVKVAFEIKKLA LDRMRLSTALKKNLEKISRQSSVLMDNMKHLLELNKLIMKSQQESWDLEEKLLDIRKK RLQLKQASESKLLEIQTEKNKQKIDLDSMENSERIKIIRQNLQMEIKITTVIQHVFQN LILGSKVNWAEDPALKEIVLQLEKNVDMM" BASE COUNT 434 a 263 c 314 g 368 t ORIGIN 1 ggaaaagcga ccttttctga gcgcgtttgc ctgttgagtg gtagcctttc ccctcaacca 61 gcaatggagg agcagcccca gatgcaagac gccgacgagc ccgcggactc cggaggggaa 121 ggccgggcag gcgggccacc gcaggtcgcc ggcgcccagg cggcgtgcag cgaggaccgc 181 atgaccctgc tcctcaggct gagagcacag acaaaacaac aactcttaga atataaatca 241 atggttgatg caagtgaaga aaaaactcca gaacaaatta tgcaagaaaa gcaaatcgaa 301 gctaaaattg aagacctgga aaatgaaatt gaagaggtaa aagttgcttt tgagataaaa 361 aagcttgcat tagacaggat gagactttca actgcactta aaaaaaacct ggagaaaatt 421 agcagacagt ctagtgtgct catggataac atgaaacacc tattagagct aaataaatta 481 ataatgaaat cacagcagga atcttgggat ttagaggaaa aactgcttga tattagaaag 541 aagagattgc aattaaaaca agcttcagaa agtaagcttt tagaaataca gactgaaaag 601 aacaaacaga agattgattt ggacagtatg gaaaactcag agaggataaa gatcatacga 661 caaaacctac agatggagat aaaaattact actgttattc aacatgtgtt ccagaacctt 721 attttgggga gtaaagtcaa ttgggcagag gatcctgccc ttaaggaaat tgttctgcag 781 cttgagaaga atgttgacat gatgtaataa gaattcattt ctgacatatt ttacatttct 841 ggcaatctca actcttattt ggaatacttc tgtgcatttg tctgtccacc gtaattttag 901 aaaagcatat ccataacgtt tacagttgta gtacagttgt ggttagttat ttgtagtggg 961 attgaaagta atttttttct ttttatattt ctatatttag tttgtttttt tgttgttgtt 1021 gttttttgag atggagtctc gctttgttgc ccagactgga gggcagtggc gcgatctcgg 1081 ctcactgcaa cctctgcctc ccgggttcaa gcagttctgc ctcagcctcc caagtagctg 1141 tgactaaagg tgcacgccgc catgcccagc taattttttg tattttagta gagacggggt 1201 ttcaccgtgt tgcccaggct gctctcagaa ctcctgagct caggcagtcc accgcctcgg 1261 cctaccgaag tgctaggatt acagacgtaa gccaccgagc ctggtctagt ttgcattttt 1321 tttctatcag ttttataagt taagaaataa aaggaattaa tgttaaaaaa aaaaaaaaa //