LOCUS       BC031559                1817 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens keratin 20, mRNA (cDNA clone MGC:35423 IMAGE:5189289),
            complete cds.
ACCESSION   BC031559
VERSION     BC031559.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1817)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 1817)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (06-JUN-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Life Technologies, Inc.
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Baylor College of Medicine Human Genome
            Sequencing Center
            Center code: BCM-HGSC
            Web site: http://www.hgsc.bcm.tmc.edu/cdna/
            Contact: amg@bcm.tmc.edu
            Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
            Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
            A.N., Gibbs, R.A.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 50 Row: i Column: 14
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 27894336.
FEATURES             Location/Qualifiers
     source          1..1817
                     /db_xref="H-InvDB:HIT000041311"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:35423 IMAGE:5189289"
                     /tissue_type="Colon, Kidney, Stomach, adult, whole pooled"
                     /clone_lib="NIH_MGC_116"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..1817
                     /gene="KRT20"
                     /gene_synonym="CK20"
                     /gene_synonym="K20"
                     /gene_synonym="KRT21"
                     /gene_synonym="MGC35423"
                     /db_xref="GeneID:54474"
                     /db_xref="HGNC:HGNC:20412"
                     /db_xref="MIM:608218"
     CDS             43..1317
                     /gene="KRT20"
                     /gene_synonym="CK20"
                     /gene_synonym="K20"
                     /gene_synonym="KRT21"
                     /gene_synonym="MGC35423"
                     /codon_start=1
                     /product="keratin 20"
                     /protein_id="AAH31559.1"
                     /db_xref="GeneID:54474"
                     /db_xref="HGNC:HGNC:20412"
                     /db_xref="MIM:608218"
                     /translation="MDFSRRSFHRSLSSSLQAPVVSTVGMQRLGTTPSVYGGAGGRGI
                     RISNSRHTVNYGSDLTGGGDLFVGNEKMAMQNLNDRLASYLEKVRTLEQSNSKLEVQI
                     KQWYETNAPRAGRDYSAYYRQIEELRSQIKDAQLQNARCVLQIDNAKLAAEDFRLKYE
                     TERGIRLTVEADLQGLNKVFDDLTLHKTDLEIQIEELNKDLALLKKEHQEEVDGLHKH
                     LGNTVNVEVDAAPGLNLGVIMNEMRQKYEVMAQKNLQEAKEQFERQTAVLQQQVTVNT
                     EELKGTEVQLTELRRTSQSLEIELQSHLSMKESLEHTLEETKARYSSQLANLQSLLSS
                     LEAQLMQIRSNMERQNNEYHILLDIKTRLEQEIATYRRLLEGEDVKTTEYQLSTLEER
                     DIKKTRKIKTVVQEVVDGKVVSSEVKEVEENI"
BASE COUNT          638 a          381 c          412 g          386 t
ORIGIN      
        1 caaccatcct gaagctacag gtgctccctc ctggaatctc caatggattt cagtcgcaga
       61 agcttccaca gaagcctgag ctcctccttg caggcccctg tagtcagtac agtgggcatg
      121 cagcgcctcg ggacgacacc cagcgtttat gggggtgctg gaggccgggg catccgcatc
      181 tccaactcca gacacacggt gaactatggg agcgatctca caggcggcgg ggacctgttt
      241 gttggcaatg agaaaatggc catgcagaac ctaaatgacc gtctagcgag ctacctagaa
      301 aaggtgcgga ccctggagca gtccaactcc aaacttgaag tgcaaatcaa gcagtggtac
      361 gaaaccaacg ccccgagggc tggtcgcgac tacagtgcat attacagaca aattgaagag
      421 ctgcgaagtc agattaagga tgctcaactg caaaatgctc ggtgtgtcct gcaaattgat
      481 aatgctaaac tggctgctga ggacttcaga ctgaagtatg agactgagag aggaatacgt
      541 ctaacagtgg aagctgatct ccaaggcctg aataaggtct ttgatgacct aaccctacat
      601 aaaacagatt tggagattca aattgaagaa ctgaataaag acctagctct cctcaaaaag
      661 gagcatcagg aggaagtcga tggcctacac aagcatctgg gcaacactgt caatgtggag
      721 gttgatgctg ctccaggcct gaaccttggc gtcatcatga atgaaatgag gcagaagtat
      781 gaagtcatgg cccagaagaa ccttcaagag gccaaagaac agtttgagag acagactgca
      841 gttctgcagc aacaggtcac agtgaatact gaagaattaa aaggaactga ggttcaacta
      901 acggagctga gacgcacctc ccagagcctt gagatagaac tccagtccca tctcagcatg
      961 aaagagtctt tggagcacac tctagaggag accaaggccc gttacagcag ccagttagcc
     1021 aacctccagt cgctgttgag ctctctggag gcccaactga tgcagattcg gagtaacatg
     1081 gaacgccaga acaacgaata ccatatcctt cttgacataa agactcgact tgaacaggaa
     1141 attgctactt accgccgcct tctggaagga gaagacgtaa aaactacaga atatcagtta
     1201 agcaccctgg aagagagaga tataaagaaa accaggaaga ttaagacagt cgtgcaagaa
     1261 gtagtggatg gcaaggtcgt gtcatctgaa gtcaaagagg tggaagaaaa tatctaaata
     1321 gctaccagaa ggagatgctg ctgaggtttt gaaagaaatt tggctataat cttatctttg
     1381 ctccctgcaa gaaatcagcc ataagaaagc actattaata ctctgcagtg attagaaggg
     1441 gtggggtggc gggaatccta tttatcagac tctgtaattg aatataaatg ttttactcag
     1501 aggagctgca aattgcctgc aaaaatgaaa tccagtgagc actagaatat ttaaaacatc
     1561 attactgcca tctttatcat gaagcacatc aattacaagc tgtagaccac ctaatatcaa
     1621 tttgtaggta atgttcctga aaattgcaat acatttcaat tatactaaac ctcacaaagt
     1681 agaggaatcc atgtaaattg caaataaacc actttctaat tttttcctgt ttctgaaaaa
     1741 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
     1801 aaaaaaaaaa aaaaaaa
//