LOCUS       BC012024                1379 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens centromere protein H, mRNA (cDNA clone MGC:21431
            IMAGE:4510607), complete cds.
ACCESSION   BC012024
VERSION     BC012024.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1379)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 1379)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (30-JUL-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Baylor College of Medicine Human Genome
            Sequencing Center
            Center code: BCM-HGSC
            Web site: http://www.hgsc.bcm.tmc.edu/cdna/
            Contact: amg@bcm.tmc.edu
            Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
            Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
            A.N., Gibbs, R.A.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 28 Row: h Column: 15
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 21264590.
FEATURES             Location/Qualifiers
     source          1..1379
                     /db_xref="H-InvDB:HIT000035606"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:21431 IMAGE:4510607"
                     /tissue_type="Testis, embryonal carcinoma"
                     /clone_lib="NIH_MGC_92"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..1379
                     /gene="CENPH"
                     /db_xref="GeneID:64946"
                     /db_xref="HGNC:HGNC:17268"
                     /db_xref="MIM:605607"
     CDS             64..807
                     /gene="CENPH"
                     /codon_start=1
                     /product="centromere protein H"
                     /protein_id="AAH12024.1"
                     /db_xref="GeneID:64946"
                     /db_xref="HGNC:HGNC:17268"
                     /db_xref="MIM:605607"
                     /translation="MEEQPQMQDADEPADSGGEGRAGGPPQVAGAQAACSEDRMTLLL
                     RLRAQTKQQLLEYKSMVDASEEKTPEQIMQEKQIEAKIEDLENEIEEVKVAFEIKKLA
                     LDRMRLSTALKKNLEKISRQSSVLMDNMKHLLELNKLIMKSQQESWDLEEKLLDIRKK
                     RLQLKQASESKLLEIQTEKNKQKIDLDSMENSERIKIIRQNLQMEIKITTVIQHVFQN
                     LILGSKVNWAEDPALKEIVLQLEKNVDMM"
BASE COUNT          434 a          263 c          314 g          368 t
ORIGIN      
        1 ggaaaagcga ccttttctga gcgcgtttgc ctgttgagtg gtagcctttc ccctcaacca
       61 gcaatggagg agcagcccca gatgcaagac gccgacgagc ccgcggactc cggaggggaa
      121 ggccgggcag gcgggccacc gcaggtcgcc ggcgcccagg cggcgtgcag cgaggaccgc
      181 atgaccctgc tcctcaggct gagagcacag acaaaacaac aactcttaga atataaatca
      241 atggttgatg caagtgaaga aaaaactcca gaacaaatta tgcaagaaaa gcaaatcgaa
      301 gctaaaattg aagacctgga aaatgaaatt gaagaggtaa aagttgcttt tgagataaaa
      361 aagcttgcat tagacaggat gagactttca actgcactta aaaaaaacct ggagaaaatt
      421 agcagacagt ctagtgtgct catggataac atgaaacacc tattagagct aaataaatta
      481 ataatgaaat cacagcagga atcttgggat ttagaggaaa aactgcttga tattagaaag
      541 aagagattgc aattaaaaca agcttcagaa agtaagcttt tagaaataca gactgaaaag
      601 aacaaacaga agattgattt ggacagtatg gaaaactcag agaggataaa gatcatacga
      661 caaaacctac agatggagat aaaaattact actgttattc aacatgtgtt ccagaacctt
      721 attttgggga gtaaagtcaa ttgggcagag gatcctgccc ttaaggaaat tgttctgcag
      781 cttgagaaga atgttgacat gatgtaataa gaattcattt ctgacatatt ttacatttct
      841 ggcaatctca actcttattt ggaatacttc tgtgcatttg tctgtccacc gtaattttag
      901 aaaagcatat ccataacgtt tacagttgta gtacagttgt ggttagttat ttgtagtggg
      961 attgaaagta atttttttct ttttatattt ctatatttag tttgtttttt tgttgttgtt
     1021 gttttttgag atggagtctc gctttgttgc ccagactgga gggcagtggc gcgatctcgg
     1081 ctcactgcaa cctctgcctc ccgggttcaa gcagttctgc ctcagcctcc caagtagctg
     1141 tgactaaagg tgcacgccgc catgcccagc taattttttg tattttagta gagacggggt
     1201 ttcaccgtgt tgcccaggct gctctcagaa ctcctgagct caggcagtcc accgcctcgg
     1261 cctaccgaag tgctaggatt acagacgtaa gccaccgagc ctggtctagt ttgcattttt
     1321 tttctatcag ttttataagt taagaaataa aaggaattaa tgttaaaaaa aaaaaaaaa
//