LOCUS BC031559 1817 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens keratin 20, mRNA (cDNA clone MGC:35423 IMAGE:5189289),
complete cds.
ACCESSION BC031559
VERSION BC031559.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 1817)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 1817)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (06-JUN-2002) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: Life Technologies, Inc.
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Baylor College of Medicine Human Genome
Sequencing Center
Center code: BCM-HGSC
Web site: http://www.hgsc.bcm.tmc.edu/cdna/
Contact: amg@bcm.tmc.edu
Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
A.N., Gibbs, R.A.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 50 Row: i Column: 14
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 27894336.
FEATURES Location/Qualifiers
source 1..1817
/db_xref="H-InvDB:HIT000041311"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:35423 IMAGE:5189289"
/tissue_type="Colon, Kidney, Stomach, adult, whole pooled"
/clone_lib="NIH_MGC_116"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene 1..1817
/gene="KRT20"
/gene_synonym="CK20"
/gene_synonym="K20"
/gene_synonym="KRT21"
/gene_synonym="MGC35423"
/db_xref="GeneID:54474"
/db_xref="HGNC:HGNC:20412"
/db_xref="MIM:608218"
CDS 43..1317
/gene="KRT20"
/gene_synonym="CK20"
/gene_synonym="K20"
/gene_synonym="KRT21"
/gene_synonym="MGC35423"
/codon_start=1
/product="keratin 20"
/protein_id="AAH31559.1"
/db_xref="GeneID:54474"
/db_xref="HGNC:HGNC:20412"
/db_xref="MIM:608218"
/translation="MDFSRRSFHRSLSSSLQAPVVSTVGMQRLGTTPSVYGGAGGRGI
RISNSRHTVNYGSDLTGGGDLFVGNEKMAMQNLNDRLASYLEKVRTLEQSNSKLEVQI
KQWYETNAPRAGRDYSAYYRQIEELRSQIKDAQLQNARCVLQIDNAKLAAEDFRLKYE
TERGIRLTVEADLQGLNKVFDDLTLHKTDLEIQIEELNKDLALLKKEHQEEVDGLHKH
LGNTVNVEVDAAPGLNLGVIMNEMRQKYEVMAQKNLQEAKEQFERQTAVLQQQVTVNT
EELKGTEVQLTELRRTSQSLEIELQSHLSMKESLEHTLEETKARYSSQLANLQSLLSS
LEAQLMQIRSNMERQNNEYHILLDIKTRLEQEIATYRRLLEGEDVKTTEYQLSTLEER
DIKKTRKIKTVVQEVVDGKVVSSEVKEVEENI"
BASE COUNT 638 a 381 c 412 g 386 t
ORIGIN
1 caaccatcct gaagctacag gtgctccctc ctggaatctc caatggattt cagtcgcaga
61 agcttccaca gaagcctgag ctcctccttg caggcccctg tagtcagtac agtgggcatg
121 cagcgcctcg ggacgacacc cagcgtttat gggggtgctg gaggccgggg catccgcatc
181 tccaactcca gacacacggt gaactatggg agcgatctca caggcggcgg ggacctgttt
241 gttggcaatg agaaaatggc catgcagaac ctaaatgacc gtctagcgag ctacctagaa
301 aaggtgcgga ccctggagca gtccaactcc aaacttgaag tgcaaatcaa gcagtggtac
361 gaaaccaacg ccccgagggc tggtcgcgac tacagtgcat attacagaca aattgaagag
421 ctgcgaagtc agattaagga tgctcaactg caaaatgctc ggtgtgtcct gcaaattgat
481 aatgctaaac tggctgctga ggacttcaga ctgaagtatg agactgagag aggaatacgt
541 ctaacagtgg aagctgatct ccaaggcctg aataaggtct ttgatgacct aaccctacat
601 aaaacagatt tggagattca aattgaagaa ctgaataaag acctagctct cctcaaaaag
661 gagcatcagg aggaagtcga tggcctacac aagcatctgg gcaacactgt caatgtggag
721 gttgatgctg ctccaggcct gaaccttggc gtcatcatga atgaaatgag gcagaagtat
781 gaagtcatgg cccagaagaa ccttcaagag gccaaagaac agtttgagag acagactgca
841 gttctgcagc aacaggtcac agtgaatact gaagaattaa aaggaactga ggttcaacta
901 acggagctga gacgcacctc ccagagcctt gagatagaac tccagtccca tctcagcatg
961 aaagagtctt tggagcacac tctagaggag accaaggccc gttacagcag ccagttagcc
1021 aacctccagt cgctgttgag ctctctggag gcccaactga tgcagattcg gagtaacatg
1081 gaacgccaga acaacgaata ccatatcctt cttgacataa agactcgact tgaacaggaa
1141 attgctactt accgccgcct tctggaagga gaagacgtaa aaactacaga atatcagtta
1201 agcaccctgg aagagagaga tataaagaaa accaggaaga ttaagacagt cgtgcaagaa
1261 gtagtggatg gcaaggtcgt gtcatctgaa gtcaaagagg tggaagaaaa tatctaaata
1321 gctaccagaa ggagatgctg ctgaggtttt gaaagaaatt tggctataat cttatctttg
1381 ctccctgcaa gaaatcagcc ataagaaagc actattaata ctctgcagtg attagaaggg
1441 gtggggtggc gggaatccta tttatcagac tctgtaattg aatataaatg ttttactcag
1501 aggagctgca aattgcctgc aaaaatgaaa tccagtgagc actagaatat ttaaaacatc
1561 attactgcca tctttatcat gaagcacatc aattacaagc tgtagaccac ctaatatcaa
1621 tttgtaggta atgttcctga aaattgcaat acatttcaat tatactaaac ctcacaaagt
1681 agaggaatcc atgtaaattg caaataaacc actttctaat tttttcctgt ttctgaaaaa
1741 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
1801 aaaaaaaaaa aaaaaaa
//