LOCUS BC031559 1817 bp mRNA linear HUM 15-JUL-2006 DEFINITION Homo sapiens keratin 20, mRNA (cDNA clone MGC:35423 IMAGE:5189289), complete cds. ACCESSION BC031559 VERSION BC031559.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1817) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 1817) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (06-JUN-2002) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Life Technologies, Inc. cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Baylor College of Medicine Human Genome Sequencing Center Center code: BCM-HGSC Web site: http://www.hgsc.bcm.tmc.edu/cdna/ Contact: amg@bcm.tmc.edu Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H., Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati, A.N., Gibbs, R.A. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 50 Row: i Column: 14 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 27894336. FEATURES Location/Qualifiers source 1..1817 /db_xref="H-InvDB:HIT000041311" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:35423 IMAGE:5189289" /tissue_type="Colon, Kidney, Stomach, adult, whole pooled" /clone_lib="NIH_MGC_116" /lab_host="DH10B" /note="Vector: pCMV-SPORT6" gene 1..1817 /gene="KRT20" /gene_synonym="CK20" /gene_synonym="K20" /gene_synonym="KRT21" /gene_synonym="MGC35423" /db_xref="GeneID:54474" /db_xref="HGNC:HGNC:20412" /db_xref="MIM:608218" CDS 43..1317 /gene="KRT20" /gene_synonym="CK20" /gene_synonym="K20" /gene_synonym="KRT21" /gene_synonym="MGC35423" /codon_start=1 /product="keratin 20" /protein_id="AAH31559.1" /db_xref="GeneID:54474" /db_xref="HGNC:HGNC:20412" /db_xref="MIM:608218" /translation="MDFSRRSFHRSLSSSLQAPVVSTVGMQRLGTTPSVYGGAGGRGI RISNSRHTVNYGSDLTGGGDLFVGNEKMAMQNLNDRLASYLEKVRTLEQSNSKLEVQI KQWYETNAPRAGRDYSAYYRQIEELRSQIKDAQLQNARCVLQIDNAKLAAEDFRLKYE TERGIRLTVEADLQGLNKVFDDLTLHKTDLEIQIEELNKDLALLKKEHQEEVDGLHKH LGNTVNVEVDAAPGLNLGVIMNEMRQKYEVMAQKNLQEAKEQFERQTAVLQQQVTVNT EELKGTEVQLTELRRTSQSLEIELQSHLSMKESLEHTLEETKARYSSQLANLQSLLSS LEAQLMQIRSNMERQNNEYHILLDIKTRLEQEIATYRRLLEGEDVKTTEYQLSTLEER DIKKTRKIKTVVQEVVDGKVVSSEVKEVEENI" BASE COUNT 638 a 381 c 412 g 386 t ORIGIN 1 caaccatcct gaagctacag gtgctccctc ctggaatctc caatggattt cagtcgcaga 61 agcttccaca gaagcctgag ctcctccttg caggcccctg tagtcagtac agtgggcatg 121 cagcgcctcg ggacgacacc cagcgtttat gggggtgctg gaggccgggg catccgcatc 181 tccaactcca gacacacggt gaactatggg agcgatctca caggcggcgg ggacctgttt 241 gttggcaatg agaaaatggc catgcagaac ctaaatgacc gtctagcgag ctacctagaa 301 aaggtgcgga ccctggagca gtccaactcc aaacttgaag tgcaaatcaa gcagtggtac 361 gaaaccaacg ccccgagggc tggtcgcgac tacagtgcat attacagaca aattgaagag 421 ctgcgaagtc agattaagga tgctcaactg caaaatgctc ggtgtgtcct gcaaattgat 481 aatgctaaac tggctgctga ggacttcaga ctgaagtatg agactgagag aggaatacgt 541 ctaacagtgg aagctgatct ccaaggcctg aataaggtct ttgatgacct aaccctacat 601 aaaacagatt tggagattca aattgaagaa ctgaataaag acctagctct cctcaaaaag 661 gagcatcagg aggaagtcga tggcctacac aagcatctgg gcaacactgt caatgtggag 721 gttgatgctg ctccaggcct gaaccttggc gtcatcatga atgaaatgag gcagaagtat 781 gaagtcatgg cccagaagaa ccttcaagag gccaaagaac agtttgagag acagactgca 841 gttctgcagc aacaggtcac agtgaatact gaagaattaa aaggaactga ggttcaacta 901 acggagctga gacgcacctc ccagagcctt gagatagaac tccagtccca tctcagcatg 961 aaagagtctt tggagcacac tctagaggag accaaggccc gttacagcag ccagttagcc 1021 aacctccagt cgctgttgag ctctctggag gcccaactga tgcagattcg gagtaacatg 1081 gaacgccaga acaacgaata ccatatcctt cttgacataa agactcgact tgaacaggaa 1141 attgctactt accgccgcct tctggaagga gaagacgtaa aaactacaga atatcagtta 1201 agcaccctgg aagagagaga tataaagaaa accaggaaga ttaagacagt cgtgcaagaa 1261 gtagtggatg gcaaggtcgt gtcatctgaa gtcaaagagg tggaagaaaa tatctaaata 1321 gctaccagaa ggagatgctg ctgaggtttt gaaagaaatt tggctataat cttatctttg 1381 ctccctgcaa gaaatcagcc ataagaaagc actattaata ctctgcagtg attagaaggg 1441 gtggggtggc gggaatccta tttatcagac tctgtaattg aatataaatg ttttactcag 1501 aggagctgca aattgcctgc aaaaatgaaa tccagtgagc actagaatat ttaaaacatc 1561 attactgcca tctttatcat gaagcacatc aattacaagc tgtagaccac ctaatatcaa 1621 tttgtaggta atgttcctga aaattgcaat acatttcaat tatactaaac ctcacaaagt 1681 agaggaatcc atgtaaattg caaataaacc actttctaat tttttcctgt ttctgaaaaa 1741 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1801 aaaaaaaaaa aaaaaaa //