LOCUS BC071906 2239 bp mRNA linear HUM 07-AUG-2008
DEFINITION Homo sapiens keratin 5, mRNA (cDNA clone MGC:88585 IMAGE:6106996),
complete cds.
ACCESSION BC071906
VERSION BC071906.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2239)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2239)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (01-JUN-2004) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Sequencing Group at the Stanford Human Genome
Center, Stanford University School of Medicine, Stanford, CA 94305
Web site: http://www-shgc.stanford.edu
Contact: (Dickson, Mark) mcd@paxil.stanford.edu
Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
R. M.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 58 Row: i Column: 20
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 17318577.
FEATURES Location/Qualifiers
source 1..2239
/db_xref="H-InvDB:HIT000264717"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:88585 IMAGE:6106996"
/tissue_type="Pancreas, adenocarcinoma"
/clone_lib="NIH_MGC_39"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..2239
/gene="KRT5"
/gene_synonym="CK5"
/gene_synonym="DDD"
/gene_synonym="K5"
/gene_synonym="KRT5A"
/db_xref="GeneID:3852"
/db_xref="HGNC:HGNC:6442"
/db_xref="MIM:148040"
CDS 83..1855
/gene="KRT5"
/gene_synonym="CK5"
/gene_synonym="DDD"
/gene_synonym="K5"
/gene_synonym="KRT5A"
/codon_start=1
/product="keratin 5"
/protein_id="AAH71906.1"
/db_xref="GeneID:3852"
/db_xref="HGNC:HGNC:6442"
/db_xref="MIM:148040"
/translation="MSRQSSVSFRSGGSRSFSTASAITPSVSRTSFTSVSRSGGGGGG
GFGRVSLAGACGVGGYGSRSLYNLGGSKRISISTSGGSFRNRFGAGAGGGYGFGGGAG
SGFGFGGGAGGGFGLGGGAGFGGGFGGPGFPVCPPGGIQEVTVNQSLLTPLNLQIDPS
IQRVRTEEREQIKTLNNKFASFIDKVRFLEQQNKVLDTKWTLLQEQGTKTVRQNLEPL
FEQYINNLRRQLDSIVGERGRLDSELRNMQDLVEDFKNKYEDEINKRTTAENEFVMLK
KDVDAAYMNKVELEAKVDALMDEINFMKMFFDAELSQMQTHVSDTSVVLSMDNNRNLD
LDSIIAEVKAQYEEIANRSRTEAESWYQTKYEELQQTAGRHGDDLRNTKHEISEMNRM
IQRLRAEIDNVKKQCANLQNAIADAEQRGELALKDARNKLAELEEALQKAKQDMARLL
REYQELMNTKLALDVEIATYRKLLEGEECRLSGEGVGPVNISVVTSSVSSGYGSGSGY
GGGLGGGLGGGLGGGLAGGGSGSYYSSSSGGVGLSGGLSVGGSGFSASSGRGLGVGFG
SGGGSSSSVKFVSTTSSSRKSFKS"
BASE COUNT 507 a 593 c 672 g 467 t
ORIGIN
1 gcgtcctgct gagctctgtt ctctccagca cctcccaacc cactagtgcc tggttctctt
61 gctccaccag gaacaagcca ccatgtctcg ccagtcaagt gtgtccttcc ggagcggggg
121 cagtcgtagc ttcagcaccg cctctgccat caccccgtct gtctcccgca ccagcttcac
181 ctccgtgtcc cggtccgggg gtggcggtgg tggtggcttc ggcagggtca gccttgcggg
241 tgcttgtgga gtgggtggct atggcagccg gagcctctac aacctggggg gctccaagag
301 gatatccatc agcactagtg gtggcagctt caggaaccgg tttggtgctg gtgctggagg
361 cggctatggc tttggaggtg gtgccggtag tggatttggt ttcggcggtg gagctggtgg
421 tggctttggg ctcggtggcg gagctggctt tggaggtggc ttcggtggcc ctggctttcc
481 tgtctgccct cctggaggta tccaagaggt cactgtcaac cagagtctcc tgactcccct
541 caacctgcaa atcgacccca gcatccagag ggtgaggacc gaggagcgcg agcagatcaa
601 gaccctcaac aataagtttg cctccttcat cgacaaggtg cggttcctgg agcagcagaa
661 caaggttctg gacaccaagt ggaccctgct gcaggagcag ggcaccaaga ccgtgaggca
721 gaacctggag ccgttgttcg agcagtacat caacaacctc aggaggcagc tggacagcat
781 cgtgggggaa cggggccgcc tggactcaga gctaagaaac atgcaggacc tggtggaaga
841 cttcaagaac aagtatgagg atgaaatcaa caagcgtacc actgctgaga atgagtttgt
901 gatgctgaag aaggatgtag atgctgccta catgaacaag gtggagctgg aggccaaggt
961 tgatgcactg atggatgaga ttaacttcat gaagatgttc tttgatgcgg agctgtccca
1021 gatgcagacg catgtctctg acacctcagt ggtcctctcc atggacaaca accgcaacct
1081 ggacctggat agcatcatcg ctgaggtcaa ggcccagtat gaggagattg ccaaccgcag
1141 ccggaccgaa gccgagtcct ggtatcagac caagtatgag gagctgcagc agacagctgg
1201 ccggcatggc gatgacctcc gcaacaccaa gcatgagatc tctgagatga accggatgat
1261 ccagaggctg agagccgaga ttgacaatgt caagaaacag tgcgccaatc tgcagaacgc
1321 cattgcggat gccgagcagc gtggggagct ggccctcaag gatgccagga acaagctggc
1381 cgagctggag gaggccctgc agaaggccaa gcaggacatg gcccggctgc tgcgtgagta
1441 ccaggagctc atgaacacca agctggccct ggacgtggag atcgccactt accgcaagct
1501 gctggagggc gaggaatgca gactcagtgg agaaggagtt ggaccagtca acatctctgt
1561 tgtcacaagc agtgtttcct ctggatatgg cagtggcagt ggctatggcg gtggcctcgg
1621 tggaggtctt ggcggcggcc tcggtggagg tcttgccgga ggtggcagtg gaagctacta
1681 ctccagcagc agtgggggtg tcggcctaag tggtgggctc agtgtggggg gctctggctt
1741 cagtgcaagc agtggccgag ggctgggggt gggctttggc agtggcgggg gtagcagctc
1801 cagcgtcaaa tttgtctcca ccacctcctc ctcccggaag agcttcaaga gctaagaacc
1861 tgctgcaagt cactgccttc caagtgcagc aacccagccc atggagattg cctcttctag
1921 gcagttgctc aagccatgtt ttatcctttt ctggagagta gtctagacca agccaattgc
1981 agaaccacat tctttggttc ccaggagagc cccattccca gcccctggtc tcccgtgccg
2041 cagttctata ttctgcttca aatcagcctt caggtttccc acagcatggc ccctgctgac
2101 acgagaatcc aaagttttcc caaatctaaa tcatcaaaac agaatcccca ccccaatccc
2161 aaattttgtt ttggttctaa ctacctccag aatgtgttca ataaaatgct tttataatat
2221 aaaaaaaaaa aaaaaaaaa
//