LOCUS       BAS33068.1              1342 aa    PRT              BCT 15-SEP-2015
DEFINITION  Klebsiella pneumoniae cellulose synthase operon C domain-
            containingprotein protein.
ACCESSION   AP014950-208
PROTEIN_ID  BAS33068.1
SOURCE      Klebsiella pneumoniae
  ORGANISM  Klebsiella pneumoniae
            Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales;
            Enterobacteriaceae; Klebsiella/Raoultella group; Klebsiella.
REFERENCE   1  (bases 1 to 5520319)
  AUTHORS   Iwase,T., Ogura,Y., Ishiwata,K., Hayashi,T., Yoneda,M. and
            Mizunoe,Y.
  TITLE     Direct Submission
  JOURNAL   Submitted (14-SEP-2015) to the DDBJ/EMBL/GenBank databases.
            Contact:Tadayuki Iwase
            Jikei University School of Medicine, Bacteriology; 3-25-8
            Nishi-shinbashi, Minato-ku, Tokyo 105-8461, Japan
REFERENCE   2
  AUTHORS   Iwase,T., Ogura,Y., Ishiwata,K., Hayashi,T., Yoneda,M. and
            Mizunoe,Y.
  TITLE     Complete genome sequence of Klebsiella pneumoniae YH43
  JOURNAL   Unpublished (2015)
COMMENT     The genome sequence of K. pneumoniae strain YH43 was determined
            using 454 GS FLX Titanium system. A total of 524,416 single-end
            reads (218 Mb) and 142,440 8-kb paired-end reads (46 Mb) were
            assembled with GS Assembler software version 2.6 into one scaffold
            containing 58 gaps. The finishing was first performed by in silico
            analysis using GenoFinisher software 
            (http://www.ige.tohoku.ac.jp/joho/gf_e/) and then remaining gaps
            were closed by sequencing of gap-spanning PCR products using an
            ABI3130xl DNA sequencer. To correct sequence error by resequencing
            of YH43 genome, a paired-end library of YH43 was constructed using
            the TruSeq DNA Sample Prep kit and sequenced by the Illumina MiSeq
            platform (2x150 bp). The mapping of obtained 2,085,021 reads (626
            Mb) to YH43 genome sequence and the SNPs-calling were performed
            using BWAand SAMtools, respectively. Total 38 in-dels were
            corrected by the resequencing. The gene identification and
            annotation were conducted by the Microbial Genome Annotation
            Pipeline (MiGAP).
            
            ##Genome-Assembly-Data-START##
            Assembly Method      :: GS Assembler software v. 2.6
            Genome Coverage      :: 40x
            Sequencing Technolog :: 454 GS FLX; Illumina MiSeq
            ##Genome-Assembly-Data-END##
FEATURES             Qualifiers
     source          /db_xref="taxon:573"
                     /mol_type="genomic DNA"
                     /organism="Klebsiella pneumoniae"
                     /strain="YH43"
     protein         /locus_tag="KPYH43_c0208"
                     /transl_table=11
BEGIN
        1 MKTSRRLTTF CLTGALALGA SGGVLAAGND AALQALFAQA NYWHEKSHDE LAMESLQKVL
       61 SVDANNTQAL YLMALWSQQG GDMQAAAQWR ARLAKAAPDS PGLQDLDNAK KMSQVPQGQL
      121 SLARQQARGG NIPGALATWR SMFNGNTPPA GLAAEYYLTM ASDKSLYPQA ISELRQYVAQ
      181 HPQENAPRVA LGKALTWREE TRREGIALLE PMASGNKEAD SGLRQALLWL GPQAGDEQYY
      241 DTWMQRHPQD SEVQNYFRER RSGQARGQGY ANLNSGNTTA AKQQFEEVLQ TNPQDADALA
      301 GMGYIAQRSG DYQAASQYLS RAADLGGDAS ATRRQQAADA LFYGQLAQAQ QAYKQGNISQ
      361 ALALSAPLAQ QSGARGASAK LFRADVLRHN KDLPQAEQTL RSLLSDDPQN AAARENLYYV
      421 LREQNKSAEA QAMLQTLPQS LQQKLQPRVV AGMPGDAVRR QAQAQVSSGN PGGAIATLRE
      481 GVARYPDDPW LRLDLARLLQ KSGNGSEASS LMSAAYRPGA SNNALYAAAL FASENGAWQQ
      541 AQTLLARIPG GSQTSDMRDL RQRVNYNLQL VTAENYLAQG NSTAASNTLR AMASTPPKAP
      601 ADAGKLARLL AESGDLTTAV SLVRNNISSG VSGNAGDYAD QIAVLNQAGL TGEAQNLLSN
      661 PQLQASSTPT QLASIRNGYV INEADRLREQ GNYAAAYDKL IRAMQSDPQN TDLMFAMARL
      721 YQSGKMNKEA GVVYDYLMTR DTPNQDARAG AIDVALSAGN NDRAEQLAGG LRQDNSPDRL
      781 LLLARVAEAQ GHHQQAMTYL RSARGKLLGM QSTNSSETPT VGGVLAADNP FIGVSQTSAP
      841 TRTASTYGQY MPWQVAQSAA APGSSLPGIQ RPDLPVDTAE TRMLRQVDTM MESLQEKTGS
      901 WLQGGMDVRG RDGESGTSKL TEIRTPLTWS SSPFGDSRFD FTVTPVSLNA GTASGDAWRR
      961 YGANPLANAV SNMVSTATSE QAAIASMTEA ERTAYFASNP GAEALSGLGT LNAADFNPTT
     1021 SSGMENLAKL GSYDSGQVAS YLASSSLKPN VDQTSGSTDS QKANGVELAL ALSGDDYRVD
     1081 IGSTPLGQDL NTVVGGVKWS PKLSNYLSLI LTGERRSLTD SLLSYVGLKD TYSGKTWGQV
     1141 TKNGGTLQLS YDDGDAGFYV GGGGYSYLGQ NVASNTSINA NAGVYLRPYH DEYRQLQTGL
     1201 SMSYMDYSKN LSYFTYGQGG YFSPQNYVSV SLPVSLTEKY DNWTMKLGGS VGYQSYSQDK
     1261 SAYFPTNSEW QQTLETAVSN GFAKEAYYSA TSKSGIGYTL RAGADYKVNK QMTLGGQIGY
     1321 DTFGDYNEST AGLYIRYMLG DH
//