LOCUS BC042174 2163 bp mRNA linear HUM 23-JUN-2006 DEFINITION Homo sapiens keratin 4, mRNA (cDNA clone MGC:52326 IMAGE:5453644), complete cds. ACCESSION BC042174 VERSION BC042174.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2163) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2163) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (02-JAN-2003) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 44 Row: j Column: 12 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 17318573. FEATURES Location/Qualifiers source 1..2163 /db_xref="H-InvDB:HIT000052671" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:52326 IMAGE:5453644" /tissue_type="Brain, astrocytoma, grade IV" /clone_lib="NIH_MGC_98" /lab_host="DH10B" /note="Vector: pOTB7" gene 1..2163 /gene="KRT4" /gene_synonym="CK4" /gene_synonym="FLJ31692" /gene_synonym="K4" /db_xref="GeneID:3851" /db_xref="HGNC:HGNC:6441" /db_xref="MIM:123940" CDS 22..1626 /gene="KRT4" /gene_synonym="CK4" /gene_synonym="FLJ31692" /gene_synonym="K4" /codon_start=1 /product="KRT4 protein" /protein_id="AAH42174.1" /db_xref="GeneID:3851" /db_xref="HGNC:HGNC:6441" /db_xref="MIM:123940" /translation="MIARQQCVRGGPRGFSCGSAIVGGGKRGAFSSVSMSGGAGRCSS GGFGSRSLYNLRGNKSISMSVAGSRQGACFGGAGGFGTGGFGAGGFGAGFGTGGFGGG FGGSFSGKGGPGFPVCPAGGIQEVTINQSLLTPLHVEIDPEIQKVRTEEREQIKLLNN KFASFIDKVQFLEQQNKVLETKWNLLQQQTTTTSSKNLEPLFETYLSVLRKQLDTLGN DKGRLQSELKTMQDSVEDFKTKYEEEINKRTAAENDFVVLKKDVDAAYLNKVELEAKV DSLNDEINFLKVLYDAELSQMQTHVSDTSVVLSMDNNRNLDLDSIIAEVRAQYEEIAQ RSKAEAEALYQTKVQQLQISVDQHGDNLKNTKSEIAELNRMIQRLRAEIENIKKQCQT LQVSVADAEQRGENALKDAHSKRVELEAALQQAKEELARMLREYQELMSVKLALDIEI ATYRKLLEGEEYRMSGECQSAVSISVVSGSTSTGGISGGLGSGSGFGLSSGFGSGSGS GFGFGGSVSGSSSSKIISTTTLNKRR" BASE COUNT 511 a 580 c 610 g 462 t ORIGIN 1 cccagctcgc tctctgcagc catgattgcc agacagcagt gtgtccgagg cgggccccgg 61 ggcttcagct gtggctcggc cattgtaggc ggtggcaaga gaggtgcctt cagctcagtc 121 tccatgtctg gaggtgctgg ccgatgctct tctgggggat ttggcagcag aagcctctac 181 aacctcaggg ggaacaaaag catctccatg agtgtggctg ggtcacgaca aggtgcctgc 241 tttgggggtg ctggaggctt tggcactggt ggctttggtg ccggcggctt cggagctggt 301 ttcggcactg gtggctttgg tggtggattt gggggctcct tcagtggtaa gggtggccct 361 ggcttccccg tctgccccgc tgggggaatt caggaggtca ccatcaacca gagcttgctc 421 acccccctcc acgtggagat tgaccctgag atccagaaag tccggacgga agagcgcgaa 481 cagatcaagc tcctcaacaa caagtttgcc tccttcatcg acaaggtgca gttcttagag 541 caacagaata aggtcctgga gaccaaatgg aacctgctcc agcagcagac gaccaccacc 601 tccagcaaaa accttgagcc cctctttgag acctacctca gtgtcctgag gaagcagcta 661 gataccttgg gcaatgacaa agggcgcctg cagtctgagc tgaagaccat gcaggacagc 721 gtggaggact tcaagactaa gtatgaagag gagatcaaca aacgcacagc agccgagaat 781 gactttgtgg tcctaaagaa ggacgtggat gctgcctacc tgaacaaggt ggagttggag 841 gccaaggtgg acagtcttaa tgacgagatc aacttcctga aggtcctcta tgatgcggag 901 ctgtcccaga tgcagaccca tgtcagcgac acgtccgtgg tcctttccat ggacaacaac 961 cgcaacctgg acctggacag cattattgcc gaggtccgtg cccagtacga ggagattgcc 1021 cagaggagca aggctgaggc tgaagccctg taccagacca aggtccagca gctccagatc 1081 tcggttgacc aacatggtga caacctgaag aacaccaaga gtgaaattgc agagctcaac 1141 aggatgatcc agaggctgcg ggcagagatc gagaacatca agaagcagtg ccagactctt 1201 caggtatccg tggctgatgc agagcagcga ggtgagaatg cccttaaaga tgcccacagc 1261 aagcgcgtag agctggaggc tgccctgcag caggccaagg aggagctggc acgaatgctg 1321 cgtgagtacc aggagctcat gagtgtgaag ctggccttgg acatcgagat cgccacctac 1381 cgcaaactgc tggagggcga ggagtacaga atgtctggag aatgccagag tgccgtgagc 1441 atctctgtgg tcagcggtag caccagcact ggaggcatca gcggaggatt aggaagtggc 1501 tccgggtttg gcctgagtag tggctttggc tccggctctg gaagtggctt tgggtttggt 1561 ggcagtgtct ctggcagttc cagcagcaag atcatctcta ccaccaccct gaacaagaga 1621 cgatagagga gacgaggtcc ctgcagctca ctgtgtccag ctgggcccag cactggtgtc 1681 tctgtgcttc cttcacttca cctccatcct ctgtctctgg ggctcatctt actagtatcc 1741 cctccactat cccatgggct ctctctgccc caggatgatc ttctgtgctg ggacagggac 1801 tctgcctctt ggagtttggt agctacttct tgatttgggc ctggtgaccc acctggaatg 1861 ggaaggatgt cagctgacct ctcacctccc atgggcagag aagaaaatga ccaggagtgt 1921 catctccaga attattgggg tcacatatgt cccttcccag tccaatgcca tctcccacta 1981 gatcctgtat tatccatcta catcagaacc aaactacttc tccaacaccc ggcagcactt 2041 ggccctgcaa gcttaggatg agaaccactt agtgtcccat tctactcctc tcattccctc 2101 ttatccatct gcaggtgaat cttcaataaa atgcttttgt cattcaaaaa aaaaaaaaaa 2161 aaa //