LOCUS BC042174 2163 bp mRNA linear HUM 23-JUN-2006
DEFINITION Homo sapiens keratin 4, mRNA (cDNA clone MGC:52326 IMAGE:5453644),
complete cds.
ACCESSION BC042174
VERSION BC042174.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2163)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2163)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (02-JAN-2003) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: National Institutes of Health Intramural
Sequencing Center (NISC),
Gaithersburg, Maryland;
Web site: http://www.nisc.nih.gov/
Contact: nisc_mgc@nhgri.nih.gov
Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
Young,A., Zhang,L.-H. and Green,E.D.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 44 Row: j Column: 12
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 17318573.
FEATURES Location/Qualifiers
source 1..2163
/db_xref="H-InvDB:HIT000052671"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:52326 IMAGE:5453644"
/tissue_type="Brain, astrocytoma, grade IV"
/clone_lib="NIH_MGC_98"
/lab_host="DH10B"
/note="Vector: pOTB7"
gene 1..2163
/gene="KRT4"
/gene_synonym="CK4"
/gene_synonym="FLJ31692"
/gene_synonym="K4"
/db_xref="GeneID:3851"
/db_xref="HGNC:HGNC:6441"
/db_xref="MIM:123940"
CDS 22..1626
/gene="KRT4"
/gene_synonym="CK4"
/gene_synonym="FLJ31692"
/gene_synonym="K4"
/codon_start=1
/product="KRT4 protein"
/protein_id="AAH42174.1"
/db_xref="GeneID:3851"
/db_xref="HGNC:HGNC:6441"
/db_xref="MIM:123940"
/translation="MIARQQCVRGGPRGFSCGSAIVGGGKRGAFSSVSMSGGAGRCSS
GGFGSRSLYNLRGNKSISMSVAGSRQGACFGGAGGFGTGGFGAGGFGAGFGTGGFGGG
FGGSFSGKGGPGFPVCPAGGIQEVTINQSLLTPLHVEIDPEIQKVRTEEREQIKLLNN
KFASFIDKVQFLEQQNKVLETKWNLLQQQTTTTSSKNLEPLFETYLSVLRKQLDTLGN
DKGRLQSELKTMQDSVEDFKTKYEEEINKRTAAENDFVVLKKDVDAAYLNKVELEAKV
DSLNDEINFLKVLYDAELSQMQTHVSDTSVVLSMDNNRNLDLDSIIAEVRAQYEEIAQ
RSKAEAEALYQTKVQQLQISVDQHGDNLKNTKSEIAELNRMIQRLRAEIENIKKQCQT
LQVSVADAEQRGENALKDAHSKRVELEAALQQAKEELARMLREYQELMSVKLALDIEI
ATYRKLLEGEEYRMSGECQSAVSISVVSGSTSTGGISGGLGSGSGFGLSSGFGSGSGS
GFGFGGSVSGSSSSKIISTTTLNKRR"
BASE COUNT 511 a 580 c 610 g 462 t
ORIGIN
1 cccagctcgc tctctgcagc catgattgcc agacagcagt gtgtccgagg cgggccccgg
61 ggcttcagct gtggctcggc cattgtaggc ggtggcaaga gaggtgcctt cagctcagtc
121 tccatgtctg gaggtgctgg ccgatgctct tctgggggat ttggcagcag aagcctctac
181 aacctcaggg ggaacaaaag catctccatg agtgtggctg ggtcacgaca aggtgcctgc
241 tttgggggtg ctggaggctt tggcactggt ggctttggtg ccggcggctt cggagctggt
301 ttcggcactg gtggctttgg tggtggattt gggggctcct tcagtggtaa gggtggccct
361 ggcttccccg tctgccccgc tgggggaatt caggaggtca ccatcaacca gagcttgctc
421 acccccctcc acgtggagat tgaccctgag atccagaaag tccggacgga agagcgcgaa
481 cagatcaagc tcctcaacaa caagtttgcc tccttcatcg acaaggtgca gttcttagag
541 caacagaata aggtcctgga gaccaaatgg aacctgctcc agcagcagac gaccaccacc
601 tccagcaaaa accttgagcc cctctttgag acctacctca gtgtcctgag gaagcagcta
661 gataccttgg gcaatgacaa agggcgcctg cagtctgagc tgaagaccat gcaggacagc
721 gtggaggact tcaagactaa gtatgaagag gagatcaaca aacgcacagc agccgagaat
781 gactttgtgg tcctaaagaa ggacgtggat gctgcctacc tgaacaaggt ggagttggag
841 gccaaggtgg acagtcttaa tgacgagatc aacttcctga aggtcctcta tgatgcggag
901 ctgtcccaga tgcagaccca tgtcagcgac acgtccgtgg tcctttccat ggacaacaac
961 cgcaacctgg acctggacag cattattgcc gaggtccgtg cccagtacga ggagattgcc
1021 cagaggagca aggctgaggc tgaagccctg taccagacca aggtccagca gctccagatc
1081 tcggttgacc aacatggtga caacctgaag aacaccaaga gtgaaattgc agagctcaac
1141 aggatgatcc agaggctgcg ggcagagatc gagaacatca agaagcagtg ccagactctt
1201 caggtatccg tggctgatgc agagcagcga ggtgagaatg cccttaaaga tgcccacagc
1261 aagcgcgtag agctggaggc tgccctgcag caggccaagg aggagctggc acgaatgctg
1321 cgtgagtacc aggagctcat gagtgtgaag ctggccttgg acatcgagat cgccacctac
1381 cgcaaactgc tggagggcga ggagtacaga atgtctggag aatgccagag tgccgtgagc
1441 atctctgtgg tcagcggtag caccagcact ggaggcatca gcggaggatt aggaagtggc
1501 tccgggtttg gcctgagtag tggctttggc tccggctctg gaagtggctt tgggtttggt
1561 ggcagtgtct ctggcagttc cagcagcaag atcatctcta ccaccaccct gaacaagaga
1621 cgatagagga gacgaggtcc ctgcagctca ctgtgtccag ctgggcccag cactggtgtc
1681 tctgtgcttc cttcacttca cctccatcct ctgtctctgg ggctcatctt actagtatcc
1741 cctccactat cccatgggct ctctctgccc caggatgatc ttctgtgctg ggacagggac
1801 tctgcctctt ggagtttggt agctacttct tgatttgggc ctggtgaccc acctggaatg
1861 ggaaggatgt cagctgacct ctcacctccc atgggcagag aagaaaatga ccaggagtgt
1921 catctccaga attattgggg tcacatatgt cccttcccag tccaatgcca tctcccacta
1981 gatcctgtat tatccatcta catcagaacc aaactacttc tccaacaccc ggcagcactt
2041 ggccctgcaa gcttaggatg agaaccactt agtgtcccat tctactcctc tcattccctc
2101 ttatccatct gcaggtgaat cttcaataaa atgcttttgt cattcaaaaa aaaaaaaaaa
2161 aaa
//