LOCUS       ABU78442.1               475 aa    PRT              BCT 31-JAN-2014
DEFINITION  Cronobacter sakazakii ATCC BAA-894 hypothetical protein.
ACCESSION   CP000783-3133
PROTEIN_ID  ABU78442.1
SOURCE      Cronobacter sakazakii ATCC BAA-894
  ORGANISM  Cronobacter sakazakii ATCC BAA-894
            Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacterales;
            Enterobacteriaceae; Cronobacter.
REFERENCE   1  (bases 1 to 4368373)
  AUTHORS   Kucerova,E., Clifton,S.W., Xia,X.Q., Long,F., Porwollik,S.,
            Fulton,L., Fronick,C., Minx,P., Kyung,K., Warren,W., Fulton,R.,
            Feng,D., Wollam,A., Shah,N., Bhonagiri,V., Nash,W.E.,
            Hallsworth-Pepin,K., Wilson,R.K., McClelland,M. and Forsythe,S.J.
  TITLE     Genome sequence of Cronobacter sakazakii BAA-894 and comparative
            genomic hybridization analysis with other Cronobacter species
  JOURNAL   PLoS ONE 5 (3), E9556 (2010)
   PUBMED   20221447
  REMARK    Publication Status: Online-Only
REFERENCE   2  (bases 1 to 4368373)
  AUTHORS   McClelland,M., Sanderson,E.K., Porwollik,S., Spieth,J.,
            Clifton,W.S., Fulton,B., Wollam,A., Shah,N., Pepin,K.,
            Bhonagiri,V., Nash,W., Johnson,M., Thiruvilangam,P. and Wilson,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUL-2007) Genetics, Genome Sequencing Center, 4444
            Forest Park Parkway, St. Louis, MO 63108, USA
COMMENT     C. sakazakii--Cronobacter sakazakii is rarely encountered in
            clinical specimens, and is more prevalent in the environment and in
            food. However, Enterobacter sakazakii is strongly implicated in
            foodborne diseases causing severe meningitis or enteritis,
            especially in neonates and infants (Nazarowec-White and Farber, Int
            J Food Microbiol. 1997 Feb;34(2):103-13).
            
            The strain of Enterobacter sakazakii being sequenced was isolated
            from powdered milk formula fed to a hospitalized neonate that
            developed an infection (Centers for Disease Control and
            Prevention). It is available from the American Type Culture
            Collection as ATCC BAA-894 or from the Salmonella Genetic Stock
            Centre as SGSC4695. The genome was sequenced to 8X coverage, using
            plasmid and fosmid libraries, and was finished to an error rate of
            less than 1 per 10,000 bases. Automated annotation was performed
            and manual annotation will continue in the labs of Michael
            McClelland and Kenneth Sanderson. The National Institute of Allergy
            and Infectious Diseases (NIAID), National Institutes of Health
            (NIH) has funded this project.
            
            Coding sequences below are predicted using GeneMark v3.3 and
            Glimmer2 v2.13. Intergenic regions not spanned by GeneMark and
            Glimmer2 were blasted against NCBI's non-redundant (NR) database
            and predictions generated based on protein alignments. RNA genes
            were determined using tRNAscan-SE 1.23 or Rfam v8.0. This sequence
            was finished as follows unless otherwise noted: all regions were
            double stranded, sequenced with an alternate chemistry or covered
            by high quality data (i.e., phred quality >=30); an attempt was
            made to resolve all sequencing problems, such as compressions and
            repeats; all regions were covered by sequence from more than one
            M13 subclone.
FEATURES             Qualifiers
     source          /organism="Cronobacter sakazakii ATCC BAA-894"
                     /mol_type="genomic DNA"
                     /strain="ATCC BAA-894"
                     /culture_collection="ATCC:BAA-894"
                     /db_xref="taxon:290339"
     protein         /locus_tag="ESA_03220"
                     /inference="protein motif:BlastProDom:IPR001327"
                     /inference="protein motif:Gene3D:IPR004099"
                     /inference="protein motif:HMMPfam:IPR001327"
                     /inference="protein motif:HMMPfam:IPR004099"
                     /inference="protein motif:HMMPfam:IPR013027"
                     /inference="protein motif:HMMTigr:IPR006258"
                     /inference="protein motif:ScanRegExp:IPR012999"
                     /inference="similar to AA sequence:REFSEQ:YP_215140.1"
                     /note="KEGG: sec:SC0153 7.0e-250 lpdA; lipoamide
                     dehydrogenase (NADH); component of 2-oxodehydrogenase and
                     pyruvate complexes; L protein of glycine cleavage complex
                     second part K00382; COG: COG1249 Pyruvate/2-oxoglutarate
                     dehydrogenase complex, dihydrolipoamide dehydrogenase (E3)
                     component, and related enzymes; Psort location:
                     Cytoplasmic, score:9.97"
                     /transl_table=11
                     /db_xref="InterPro:IPR001327"
                     /db_xref="InterPro:IPR004099"
                     /db_xref="InterPro:IPR006258"
                     /db_xref="InterPro:IPR012999"
                     /db_xref="InterPro:IPR013027"
BEGIN
        1 MMSTEIKTQV VVLGAGPAGY SAAFRCADLG LETVIVERYS TLGGVCLNVG CIPSKALLHV
       61 AKVIEEAKAL AEHGIVFGEP KTDIDKIRTW KEKVINQLTG GLSGMAKGRK VKVVNGLGKF
      121 TGANTLEVEG ENGKTVINFD NAIIAAGSRP IQLPFIPHED PRVWDSTDAL ELKEVPKRML
      181 VMGGGIIGLE MGTVYHALGS EIDVVEMFDQ VIPAADKDVV KVFTKRISKK FNLMLETKVT
      241 AVEAKEDGIY VSMEGKKAPA EAQRYDAVLV AIGRVPNGKN LDAGKAGVEV DDRGFIRVDK
      301 QMRTNVPHIY AIGDIVGQPM LAHKGVHEGH VAAEVIAGMK HYFDPKVIPS IAYTEPEVAW
      361 VGLTEKEAKE KGISYETATF PWAASGRAIA SDCADGMTKL IFDKETHRVI GGAIVGTNGG
      421 ELLGEIGLAI EMGCDAEDIA LTIHAHPTLH ESVGLAAEVF EGSITDLPNP KAKKK
//