LOCUS       BC042174                2163 bp    mRNA    linear   HUM 23-JUN-2006
DEFINITION  Homo sapiens keratin 4, mRNA (cDNA clone MGC:52326 IMAGE:5453644),
            complete cds.
ACCESSION   BC042174
VERSION     BC042174.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2163)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2163)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (02-JAN-2003) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 44 Row: j Column: 12
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 17318573.
FEATURES             Location/Qualifiers
     source          1..2163
                     /db_xref="H-InvDB:HIT000052671"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:52326 IMAGE:5453644"
                     /tissue_type="Brain, astrocytoma, grade IV"
                     /clone_lib="NIH_MGC_98"
                     /lab_host="DH10B"
                     /note="Vector: pOTB7"
     gene            1..2163
                     /gene="KRT4"
                     /gene_synonym="CK4"
                     /gene_synonym="FLJ31692"
                     /gene_synonym="K4"
                     /db_xref="GeneID:3851"
                     /db_xref="HGNC:HGNC:6441"
                     /db_xref="MIM:123940"
     CDS             22..1626
                     /gene="KRT4"
                     /gene_synonym="CK4"
                     /gene_synonym="FLJ31692"
                     /gene_synonym="K4"
                     /codon_start=1
                     /product="KRT4 protein"
                     /protein_id="AAH42174.1"
                     /db_xref="GeneID:3851"
                     /db_xref="HGNC:HGNC:6441"
                     /db_xref="MIM:123940"
                     /translation="MIARQQCVRGGPRGFSCGSAIVGGGKRGAFSSVSMSGGAGRCSS
                     GGFGSRSLYNLRGNKSISMSVAGSRQGACFGGAGGFGTGGFGAGGFGAGFGTGGFGGG
                     FGGSFSGKGGPGFPVCPAGGIQEVTINQSLLTPLHVEIDPEIQKVRTEEREQIKLLNN
                     KFASFIDKVQFLEQQNKVLETKWNLLQQQTTTTSSKNLEPLFETYLSVLRKQLDTLGN
                     DKGRLQSELKTMQDSVEDFKTKYEEEINKRTAAENDFVVLKKDVDAAYLNKVELEAKV
                     DSLNDEINFLKVLYDAELSQMQTHVSDTSVVLSMDNNRNLDLDSIIAEVRAQYEEIAQ
                     RSKAEAEALYQTKVQQLQISVDQHGDNLKNTKSEIAELNRMIQRLRAEIENIKKQCQT
                     LQVSVADAEQRGENALKDAHSKRVELEAALQQAKEELARMLREYQELMSVKLALDIEI
                     ATYRKLLEGEEYRMSGECQSAVSISVVSGSTSTGGISGGLGSGSGFGLSSGFGSGSGS
                     GFGFGGSVSGSSSSKIISTTTLNKRR"
BASE COUNT          511 a          580 c          610 g          462 t
ORIGIN      
        1 cccagctcgc tctctgcagc catgattgcc agacagcagt gtgtccgagg cgggccccgg
       61 ggcttcagct gtggctcggc cattgtaggc ggtggcaaga gaggtgcctt cagctcagtc
      121 tccatgtctg gaggtgctgg ccgatgctct tctgggggat ttggcagcag aagcctctac
      181 aacctcaggg ggaacaaaag catctccatg agtgtggctg ggtcacgaca aggtgcctgc
      241 tttgggggtg ctggaggctt tggcactggt ggctttggtg ccggcggctt cggagctggt
      301 ttcggcactg gtggctttgg tggtggattt gggggctcct tcagtggtaa gggtggccct
      361 ggcttccccg tctgccccgc tgggggaatt caggaggtca ccatcaacca gagcttgctc
      421 acccccctcc acgtggagat tgaccctgag atccagaaag tccggacgga agagcgcgaa
      481 cagatcaagc tcctcaacaa caagtttgcc tccttcatcg acaaggtgca gttcttagag
      541 caacagaata aggtcctgga gaccaaatgg aacctgctcc agcagcagac gaccaccacc
      601 tccagcaaaa accttgagcc cctctttgag acctacctca gtgtcctgag gaagcagcta
      661 gataccttgg gcaatgacaa agggcgcctg cagtctgagc tgaagaccat gcaggacagc
      721 gtggaggact tcaagactaa gtatgaagag gagatcaaca aacgcacagc agccgagaat
      781 gactttgtgg tcctaaagaa ggacgtggat gctgcctacc tgaacaaggt ggagttggag
      841 gccaaggtgg acagtcttaa tgacgagatc aacttcctga aggtcctcta tgatgcggag
      901 ctgtcccaga tgcagaccca tgtcagcgac acgtccgtgg tcctttccat ggacaacaac
      961 cgcaacctgg acctggacag cattattgcc gaggtccgtg cccagtacga ggagattgcc
     1021 cagaggagca aggctgaggc tgaagccctg taccagacca aggtccagca gctccagatc
     1081 tcggttgacc aacatggtga caacctgaag aacaccaaga gtgaaattgc agagctcaac
     1141 aggatgatcc agaggctgcg ggcagagatc gagaacatca agaagcagtg ccagactctt
     1201 caggtatccg tggctgatgc agagcagcga ggtgagaatg cccttaaaga tgcccacagc
     1261 aagcgcgtag agctggaggc tgccctgcag caggccaagg aggagctggc acgaatgctg
     1321 cgtgagtacc aggagctcat gagtgtgaag ctggccttgg acatcgagat cgccacctac
     1381 cgcaaactgc tggagggcga ggagtacaga atgtctggag aatgccagag tgccgtgagc
     1441 atctctgtgg tcagcggtag caccagcact ggaggcatca gcggaggatt aggaagtggc
     1501 tccgggtttg gcctgagtag tggctttggc tccggctctg gaagtggctt tgggtttggt
     1561 ggcagtgtct ctggcagttc cagcagcaag atcatctcta ccaccaccct gaacaagaga
     1621 cgatagagga gacgaggtcc ctgcagctca ctgtgtccag ctgggcccag cactggtgtc
     1681 tctgtgcttc cttcacttca cctccatcct ctgtctctgg ggctcatctt actagtatcc
     1741 cctccactat cccatgggct ctctctgccc caggatgatc ttctgtgctg ggacagggac
     1801 tctgcctctt ggagtttggt agctacttct tgatttgggc ctggtgaccc acctggaatg
     1861 ggaaggatgt cagctgacct ctcacctccc atgggcagag aagaaaatga ccaggagtgt
     1921 catctccaga attattgggg tcacatatgt cccttcccag tccaatgcca tctcccacta
     1981 gatcctgtat tatccatcta catcagaacc aaactacttc tccaacaccc ggcagcactt
     2041 ggccctgcaa gcttaggatg agaaccactt agtgtcccat tctactcctc tcattccctc
     2101 ttatccatct gcaggtgaat cttcaataaa atgcttttgt cattcaaaaa aaaaaaaaaa
     2161 aaa
//