LOCUS BC012024 1379 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens centromere protein H, mRNA (cDNA clone MGC:21431
IMAGE:4510607), complete cds.
ACCESSION BC012024
VERSION BC012024.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 1379)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 1379)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (30-JUL-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Baylor College of Medicine Human Genome
Sequencing Center
Center code: BCM-HGSC
Web site: http://www.hgsc.bcm.tmc.edu/cdna/
Contact: amg@bcm.tmc.edu
Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
A.N., Gibbs, R.A.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 28 Row: h Column: 15
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 21264590.
FEATURES Location/Qualifiers
source 1..1379
/db_xref="H-InvDB:HIT000035606"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:21431 IMAGE:4510607"
/tissue_type="Testis, embryonal carcinoma"
/clone_lib="NIH_MGC_92"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene 1..1379
/gene="CENPH"
/db_xref="GeneID:64946"
/db_xref="HGNC:HGNC:17268"
/db_xref="MIM:605607"
CDS 64..807
/gene="CENPH"
/codon_start=1
/product="centromere protein H"
/protein_id="AAH12024.1"
/db_xref="GeneID:64946"
/db_xref="HGNC:HGNC:17268"
/db_xref="MIM:605607"
/translation="MEEQPQMQDADEPADSGGEGRAGGPPQVAGAQAACSEDRMTLLL
RLRAQTKQQLLEYKSMVDASEEKTPEQIMQEKQIEAKIEDLENEIEEVKVAFEIKKLA
LDRMRLSTALKKNLEKISRQSSVLMDNMKHLLELNKLIMKSQQESWDLEEKLLDIRKK
RLQLKQASESKLLEIQTEKNKQKIDLDSMENSERIKIIRQNLQMEIKITTVIQHVFQN
LILGSKVNWAEDPALKEIVLQLEKNVDMM"
BASE COUNT 434 a 263 c 314 g 368 t
ORIGIN
1 ggaaaagcga ccttttctga gcgcgtttgc ctgttgagtg gtagcctttc ccctcaacca
61 gcaatggagg agcagcccca gatgcaagac gccgacgagc ccgcggactc cggaggggaa
121 ggccgggcag gcgggccacc gcaggtcgcc ggcgcccagg cggcgtgcag cgaggaccgc
181 atgaccctgc tcctcaggct gagagcacag acaaaacaac aactcttaga atataaatca
241 atggttgatg caagtgaaga aaaaactcca gaacaaatta tgcaagaaaa gcaaatcgaa
301 gctaaaattg aagacctgga aaatgaaatt gaagaggtaa aagttgcttt tgagataaaa
361 aagcttgcat tagacaggat gagactttca actgcactta aaaaaaacct ggagaaaatt
421 agcagacagt ctagtgtgct catggataac atgaaacacc tattagagct aaataaatta
481 ataatgaaat cacagcagga atcttgggat ttagaggaaa aactgcttga tattagaaag
541 aagagattgc aattaaaaca agcttcagaa agtaagcttt tagaaataca gactgaaaag
601 aacaaacaga agattgattt ggacagtatg gaaaactcag agaggataaa gatcatacga
661 caaaacctac agatggagat aaaaattact actgttattc aacatgtgtt ccagaacctt
721 attttgggga gtaaagtcaa ttgggcagag gatcctgccc ttaaggaaat tgttctgcag
781 cttgagaaga atgttgacat gatgtaataa gaattcattt ctgacatatt ttacatttct
841 ggcaatctca actcttattt ggaatacttc tgtgcatttg tctgtccacc gtaattttag
901 aaaagcatat ccataacgtt tacagttgta gtacagttgt ggttagttat ttgtagtggg
961 attgaaagta atttttttct ttttatattt ctatatttag tttgtttttt tgttgttgtt
1021 gttttttgag atggagtctc gctttgttgc ccagactgga gggcagtggc gcgatctcgg
1081 ctcactgcaa cctctgcctc ccgggttcaa gcagttctgc ctcagcctcc caagtagctg
1141 tgactaaagg tgcacgccgc catgcccagc taattttttg tattttagta gagacggggt
1201 ttcaccgtgt tgcccaggct gctctcagaa ctcctgagct caggcagtcc accgcctcgg
1261 cctaccgaag tgctaggatt acagacgtaa gccaccgagc ctggtctagt ttgcattttt
1321 tttctatcag ttttataagt taagaaataa aaggaattaa tgttaaaaaa aaaaaaaaa
//