LOCUS BC020982 1416 bp mRNA linear HUM 15-JUL-2006 DEFINITION Homo sapiens keratin 18, mRNA (cDNA clone MGC:9716 IMAGE:3850724), complete cds. ACCESSION BC020982 VERSION BC020982.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1416) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 1416) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (03-JAN-2002) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Sequencing Group at the Stanford Human Genome Center, Stanford University School of Medicine, Stanford, CA 94305 Web site: http://www-shgc.stanford.edu Contact: (Dickson, Mark) mcd@paxil.stanford.edu Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers, R. M. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 20 Row: m Column: 12 This clone was selected for full length sequencing because it passed the following selection criteria: Hexamer frequency ORF analysis. FEATURES Location/Qualifiers source 1..1416 /db_xref="H-InvDB:HIT000039066" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:9716 IMAGE:3850724" /tissue_type="Colon, adenocarcinoma" /clone_lib="NIH_MGC_65" /lab_host="DH10B" /note="Vector: pCMV-SPORT6" gene 1..1416 /gene="KRT18" /gene_synonym="CYK18" /gene_synonym="K18" /db_xref="GeneID:3875" /db_xref="HGNC:HGNC:6430" /db_xref="MIM:148070" CDS 50..1342 /gene="KRT18" /gene_synonym="CYK18" /gene_synonym="K18" /codon_start=1 /product="keratin 18" /protein_id="AAH20982.1" /db_xref="GeneID:3875" /db_xref="HGNC:HGNC:6430" /db_xref="MIM:148070" /translation="MSFTTRSTFSTNYRSLGSVQAPSYGARPVSSAASVYAGAGGSGS RISVSRSTSFRGGMGSGGLATGIAGGLAGMGGIQNEKETMQSLNDRLASYLDRVRSLE TENRRLESKIREHLEKKGPQVRDWSHYFKIIEDLRAQIFANTVDNARIVLQIDNARLA ADDFRVKYETELAMRQSVENDIHGLRKVIDDTNITRLQLETEIEALKEELLFMKKNHE EEVKGLQAQIASSGLTVEVDAPKSQDLAKIMADIRAQYDELARKNREELDKYWSQQIE ESTTVVTTQSAEVGAAETTLTELRRTVQSLEIDLDSMRNLKASLENSLREVEARYALQ MEQLNGILLHLESELAQTRAEGQRQAQEYEALLNIKVKLEAEIATYRRLLEDGEDFNL GDALDSSNSMQTIQKTTTRRIVDGKVVSETNDTKVLRH" BASE COUNT 364 a 392 c 423 g 237 t ORIGIN 1 ccgtcgtccg caaagcctga gtcctgtcct ttctctctcc ccggacagca tgagcttcac 61 cactcgctcc accttctcca ccaactaccg gtccctgggc tctgtccagg cgcccagcta 121 cggcgcccgg ccggtcagca gcgcggccag cgtctatgca ggcgctgggg gctctggttc 181 ccggatctcc gtgtcccgct ccaccagctt caggggcggc atggggtccg ggggcctggc 241 caccgggata gccgggggtc tggcaggaat gggaggcatc cagaacgaga aggagaccat 301 gcaaagcctg aacgaccgcc tggcctctta cctggacaga gtgaggagcc tggagaccga 361 gaaccggagg ctggagagca aaatccggga gcacttggag aagaagggac cccaggtcag 421 agactggagc cattacttca agatcatcga ggacctgagg gctcagatct tcgcaaatac 481 tgtggacaat gcccgcatcg ttctgcagat tgacaatgcc cgtcttgctg ctgatgactt 541 tagagtcaag tatgagacag agctggccat gcgccagtct gtggagaacg acatccatgg 601 gctccgcaag gtcattgatg acaccaatat cacacgactg cagctggaga cagagatcga 661 ggctctcaag gaggagctgc tcttcatgaa gaagaaccac gaagaggaag taaaaggcct 721 acaagcccag attgccagct ctgggttgac cgtggaggta gatgccccca aatctcagga 781 cctcgccaag atcatggcag acatccgggc ccaatatgac gagctggctc ggaagaaccg 841 agaggagcta gacaagtact ggtctcagca gattgaggag agcaccacag tggtcaccac 901 acagtctgct gaggttggag ctgctgagac gacgctcaca gagctgagac gtacagtcca 961 gtccttggag atcgacctgg actccatgag aaatctgaag gccagcttgg agaacagcct 1021 gagggaggtg gaggcccgct acgccctaca gatggagcag ctcaacggga tcctgctgca 1081 ccttgagtca gagctggcac agacccgggc agagggacag cgccaggccc aggagtatga 1141 ggccctgctg aacatcaagg tcaagctgga ggctgagatc gccacctacc gccgcctgct 1201 ggaagatggc gaggacttta atcttggtga tgccttggac agcagcaact ccatgcaaac 1261 catccaaaag accaccaccc gccggatagt ggatggcaaa gtggtgtctg agaccaatga 1321 caccaaagtt ctgaggcatt aagccagcag aagcagggta ccctttgggg agcaggaggc 1381 caataaaaag ttcagagttc aaaaaaaaaa aaaaaa //