LOCUS BC075839 1757 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens keratin 8, mRNA (cDNA clone MGC:88746 IMAGE:6285632),
complete cds.
ACCESSION BC075839
VERSION BC075839.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 1757)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 1757)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (06-JUL-2004) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: CGAP (Stanford)
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Sequencing Group at the Stanford Human Genome
Center, Stanford University School of Medicine, Stanford, CA 94305
Web site: http://www-shgc.stanford.edu
Contact: (Dickson, Mark) mcd@paxil.stanford.edu
Dickson, M., Schmutz, J., Grimwood, J., Rodriguez, A., and Myers,
R. M.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 58 Row: k Column: 16
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 68226427.
FEATURES Location/Qualifiers
source 1..1757
/db_xref="H-InvDB:HIT000265653"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:88746 IMAGE:6285632"
/tissue_type="Liver, hepatocellular carcinoma"
/clone_lib="NIH_MGC_100"
/lab_host="DH10B"
/note="Vector: pOTB7"
gene 1..1757
/gene="KRT8"
/gene_synonym="CARD2"
/gene_synonym="CK8"
/gene_synonym="CYK8"
/gene_synonym="K2C8"
/gene_synonym="K8"
/gene_synonym="KO"
/db_xref="GeneID:3856"
/db_xref="HGNC:HGNC:6446"
/db_xref="MIM:148060"
CDS 49..1500
/gene="KRT8"
/gene_synonym="CARD2"
/gene_synonym="CK8"
/gene_synonym="CYK8"
/gene_synonym="K2C8"
/gene_synonym="K8"
/gene_synonym="KO"
/codon_start=1
/product="keratin 8"
/protein_id="AAH75839.1"
/db_xref="GeneID:3856"
/db_xref="HGNC:HGNC:6446"
/db_xref="MIM:148060"
/translation="MSIRVTQKSYKVSTSGPRAFSSRSYTSGPGSRISSSSFSRVGSS
NFRGGLGGGYGGASGMGGITAVTVNQSLLSPLVLEVDPNIQAVRTQEKEQIKTLNNKF
ASFIDKVRFLEQQNKMLETKWSLLQQQKTARSNMDNMFESYINNLRRQLETLGQEKLK
LEAELGNMQGLVEDFKNKYEDEINKRTEMENEFVLIKKDVDEAYMNKVELESRLEGLT
DEINFLRQLYEEEIRELQSQISDTSVVLSMDNSRSLDMDSIIAEVKAQYEDIANRSRA
EAESMYQIKYEELQSLAGKHGDDLRRTKTEISEMNRNISRLQAEIEGLKGQRASLEAA
IADAEQRGELAIKDANAKLSELEAALQRAKQDMARQLREYQELMNVKLALDIEIATYR
KLLEGEESRLESGMQNMSIHTKTTSGYAGGLSSAYGGLTSPGLSYSLGSSFGSGAGSS
SFSRTSSSRAVVVKKIETRDGKLVSESSDVLPK"
BASE COUNT 423 a 503 c 530 g 301 t
ORIGIN
1 aggatctccg cctggttcgg cccgcctgcc tccactcctg cctctaccat gtccatcagg
61 gtgacccaga agtcctacaa ggtgtccacc tctggccccc gggccttcag cagccgctcc
121 tacacgagtg ggcccggttc ccgcatcagc tcctcgagct tctcccgagt gggcagcagc
181 aactttcgcg gtggcctggg cggcggctat ggtggggcca gcggcatggg aggcatcacc
241 gcagttacgg tcaaccagag cctgctgagc ccccttgtcc tggaggtgga ccccaacatc
301 caggccgtgc gcacccagga gaaggagcag atcaagaccc tcaacaacaa gtttgcctcc
361 ttcatagaca aggtacggtt cctggagcag cagaacaaga tgctggagac caagtggagc
421 ctcctgcagc agcagaagac ggctcgaagc aacatggaca acatgttcga gagctacatc
481 aacaacctta ggcggcagct ggagactctg ggccaggaga agctgaagct ggaggcggag
541 cttggcaaca tgcaggggct ggtggaggac ttcaagaaca agtatgagga tgagatcaat
601 aagcgtacag agatggagaa cgaatttgtc ctcatcaaga aggatgtgga tgaagcttac
661 atgaacaagg tagagctgga gtctcgcctg gaagggctga ccgacgagat caacttcctc
721 aggcagctat atgaagagga gatccgggag ctgcagtccc agatctcgga cacatctgtg
781 gtgctgtcca tggacaacag ccgctccctg gacatggaca gcatcattgc tgaggtcaag
841 gcacagtacg aggatattgc caaccgcagc cgggctgagg ctgagagcat gtaccagatc
901 aagtatgagg agctgcagag cctggctggg aagcacgggg atgacctgcg gcgcacaaag
961 actgagatct ctgagatgaa ccggaacatc agccggctcc aggctgagat tgagggcctc
1021 aaaggccaga gggcttccct ggaggccgcc attgcagatg ccgagcagcg tggagagctg
1081 gccattaagg atgccaacgc caagttgtcc gagctggagg ccgccctgca gcgggccaag
1141 caggacatgg cgcggcagct gcgtgagtac caggagctga tgaacgtcaa gctggccctg
1201 gacatcgaga tcgccaccta caggaagctg ctggagggcg aggagagccg gctggagtct
1261 gggatgcaga acatgagtat tcatacgaag accaccagcg gctatgcagg tggtctgagc
1321 tcggcctatg ggggcctcac aagccccggc ctcagctaca gcctgggctc cagctttggc
1381 tctggcgcgg gctccagctc cttcagccgc accagctcct ccagggccgt ggttgtgaag
1441 aagatcgaga cacgtgatgg gaagctggtg tctgagtcct ctgacgtcct gcccaagtga
1501 acagctgcgg cagcccctcc cagcctaccc ctcctgcgct gccccagagc ctgggaagga
1561 ggccgctatg cagggtagca ctgggaacag gagacccacc tgaggctcag ccctagccct
1621 cagcccacct ggggagttta ctacctgggg accccccttg cccatgcctc cagctacaaa
1681 acaattcaat tgcttttttt tttggtccaa aataaaacct cagctagctc tgccaaaaaa
1741 aaaaaaaaaa aaaaaaa
//