LOCUS       BC030811                2468 bp    mRNA    linear   HUM 07-OCT-2003
DEFINITION  Homo sapiens Kruppel-like factor 4 (gut), mRNA (cDNA clone
            MGC:22411 IMAGE:4703002), complete cds.
ACCESSION   BC030811
VERSION     BC030811.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2468)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2468)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (03-JUN-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: CLONTECH
            cDNA Library Preparation: CLONTECH Laboratories, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA  94305
            Web site:       http://www-shgc.stanford.edu
            Contact:  (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriguez, A., and Myers,
            R. M.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 37 Row: b Column: 17
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 4758321.
FEATURES             Location/Qualifiers
     source          1..2468
                     /db_xref="H-InvDB:HIT000041148"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:22411 IMAGE:4703002"
                     /tissue_type="Lung"
                     /clone_lib="NIH_MGC_77"
                     /lab_host="DH10B"
                     /note="Vector: pDNR-LIB"
     gene            1..2468
                     /gene="KLF4"
                     /gene_synonym="EZF"
                     /gene_synonym="GKLF"
                     /db_xref="GeneID:9314"
                     /db_xref="MIM:602253"
     CDS             30..1544
                     /gene="KLF4"
                     /gene_synonym="EZF"
                     /gene_synonym="GKLF"
                     /codon_start=1
                     /product="KLF4 protein"
                     /protein_id="AAH30811.1"
                     /db_xref="GeneID:9314"
                     /db_xref="MIM:602253"
                     /translation="MAVSDALLPSFSTFASGPAGREKTLRQAGAPNNRWREELSHMKR
                     LPPVLPGRPYDLAAATVATDLESGGAGAACGGSNLAPLPRRETEEFNDLLDLDFILSN
                     SLTHPPESVAATVSSSASASSSSSPSSSGPASAPSTCSFTYPIRAGNDPGVAPGGTGG
                     GLLYGRESAPPPTAPFNLADINDVSPSGGFVAELLRPELDPVYIPPQQPQPPGGGLMG
                     KFVLKASLSAPGSEYGSPSVISVSKGSPDGSHPVVVAPYNGGPPRTCPKIKQEAVSSC
                     THLGAGPPLSNGHRPAAHDFPLGRQLPSRTTPTLGLEEVLSSRDCHPALPLPPGFHPH
                     PGPNYPSFLPDQMQPQVPPLHYQGQSRGFVARAGEPCVCWPHFGTHGMMLTPPSSPLE
                     LMPPGSCMPEEPKPKRGRRSWPRKRTATHTCDYAGCGKTYTKSSHLKAHLRTHTGEKP
                     YHCDWDGCGWKFARSDELTRHYRKHTGHRPFQCQKCDRAFSRSDHLALHMKRHF"
     misc_feature    1275..1532
                     /gene="KLF4"
                     /gene_synonym="EZF"
                     /gene_synonym="GKLF"
                     /note="COG5048; Region: COG5048, FOG: Zn-finger [General
                     function prediction only]"
                     /db_xref="CDD:COG5048"
BASE COUNT          609 a          696 c          628 g          535 t
ORIGIN      
        1 taatgaggca gccacctggc gagtctgaca tggctgtcag cgacgcgctg ctcccatctt
       61 tctccacgtt cgcgtctggc ccggcgggaa gggagaagac actgcgtcaa gcaggtgccc
      121 cgaataaccg ctggcgggag gagctctccc acatgaagcg acttccccca gtgcttcccg
      181 gccgccccta tgacctggcg gcggcgaccg tggccacaga cctggagagc ggcggagccg
      241 gtgcggcttg cggcggtagc aacctggcgc ccctacctcg gagagagacc gaggagttca
      301 acgatctcct ggacctggac tttattctct ccaattcgct gacccatcct ccggagtcag
      361 tggccgccac cgtgtcctcg tcagcgtcag cctcctcttc gtcgtcgccg tcgagcagcg
      421 gccctgccag cgcgccctcc acctgcagct tcacctatcc gatccgggcc gggaacgacc
      481 cgggcgtggc gccgggcggc acgggcggag gcctcctcta tggcagggag tccgctcccc
      541 ctccgacggc tcccttcaac ctggcggaca tcaacgacgt gagcccctcg ggcggcttcg
      601 tggccgagct cctgcggcca gaattggacc cggtgtacat tccgccgcag cagccgcagc
      661 cgccaggtgg cgggctgatg ggcaagttcg tgctgaaggc gtcgctgagc gcccctggca
      721 gcgagtacgg cagcccgtcg gtcatcagcg tcagcaaagg cagccctgac ggcagccacc
      781 cggtggtggt ggcgccctac aacggcgggc cgccgcgcac gtgccccaag atcaagcagg
      841 aggcggtctc ttcgtgcacc cacttgggcg ctggaccccc tctcagcaat ggccaccggc
      901 cggctgcaca cgacttcccc ctggggcggc agctccccag caggactacc ccgaccctgg
      961 gtcttgagga agtgctgagc agcagggact gtcaccctgc cctgccgctt cctcccggct
     1021 tccatcccca cccggggccc aattacccat ccttcctgcc cgatcagatg cagccgcaag
     1081 tcccgccgct ccattaccaa ggtcagtccc ggggatttgt agctcgggct ggggagccct
     1141 gtgtgtgctg gccccacttc gggacacacg ggatgatgct caccccacct tcttcacccc
     1201 tagagctcat gccacccggt tcctgcatgc cagaggagcc caagccaaag aggggaagac
     1261 gatcgtggcc ccggaaaagg accgccaccc acacttgtga ttacgcgggc tgcggcaaaa
     1321 cctacacaaa gagttcccat ctcaaggcac acctgcgaac ccacacaggt gagaaacctt
     1381 accactgtga ctgggacggc tgtggatgga aattcgcccg ctcagatgaa ctgaccaggc
     1441 actaccgtaa acacacgggg caccgcccgt tccagtgcca aaaatgcgac cgagcatttt
     1501 ccaggtcgga ccacctcgcc ttacacatga agaggcattt ttaaatccca gacagtggat
     1561 atgacccaca ctgccagaag agaattcagt attttttact tttcacactg tcttcccgat
     1621 gagggaagga gcccagccag aaagcactac aatcatggtc aagttcccaa ctgagtcatc
     1681 ttgtgagtgg ataatcagga aaaatgagga atccaaaaga caaaaatcaa agaacagatg
     1741 gggtctgtga ctggatcttc tatcattcca attctaaatc cgacttgaat attcctggac
     1801 ttacaaaatg ccaagggggt gactggaagt tgtggatatc agggtataaa ttatatccgt
     1861 gagttggggg agggaagacc agaattccct tgaattgtgt attgatgcaa tataagcata
     1921 aaagatcacc ttgtattctc tttaccttct aaaagccatt attatgatgt tagaagaaga
     1981 ggaagaaatt caggtacaga aaacatgttt aaatagccta aatgatggtg cttggtgagt
     2041 cttggttcta aaggtaccaa acaaggaagc caaagttttc aaactgctgc atactttgac
     2101 aaggaaaatc tatatttgtc ttccgatcaa catttatgac ctaagtcagg taatatacct
     2161 ggtttacttc tttagcattt ttatgcagac agtctgttat gcactgtggt ttcagatgtg
     2221 caataatttg tacaatggtt tattcccaag tatgccttaa gcagaacaaa tgtgtttttc
     2281 tatatagttc cttgccttaa taaatatgta atataaattt aagcaaacgt ctattttgta
     2341 tatttgtaaa ctacaaagta aaatgaacat tttgtggagt ttgtattttg catactcaag
     2401 gtgagaatta agttttaaat aaacctataa tattttataa aaaaaaaaaa aaaaaaaaaa
     2461 aaaaaaaa
//