LOCUS       ABU76308.1              1176 aa    PRT              BCT 31-JAN-2014
DEFINITION  Cronobacter sakazakii ATCC BAA-894 hypothetical protein protein.
ACCESSION   CP000783-999
PROTEIN_ID  ABU76308.1
SOURCE      Cronobacter sakazakii ATCC BAA-894
  ORGANISM  Cronobacter sakazakii ATCC BAA-894
            Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacterales;
            Enterobacteriaceae; Cronobacter.
REFERENCE   1  (bases 1 to 4368373)
  AUTHORS   Kucerova,E., Clifton,S.W., Xia,X.Q., Long,F., Porwollik,S.,
            Fulton,L., Fronick,C., Minx,P., Kyung,K., Warren,W., Fulton,R.,
            Feng,D., Wollam,A., Shah,N., Bhonagiri,V., Nash,W.E.,
            Hallsworth-Pepin,K., Wilson,R.K., McClelland,M. and Forsythe,S.J.
  TITLE     Genome sequence of Cronobacter sakazakii BAA-894 and comparative
            genomic hybridization analysis with other Cronobacter species
  JOURNAL   PLoS ONE 5 (3), E9556 (2010)
   PUBMED   20221447
  REMARK    Publication Status: Online-Only
REFERENCE   2  (bases 1 to 4368373)
  AUTHORS   McClelland,M., Sanderson,E.K., Porwollik,S., Spieth,J.,
            Clifton,W.S., Fulton,B., Wollam,A., Shah,N., Pepin,K.,
            Bhonagiri,V., Nash,W., Johnson,M., Thiruvilangam,P. and Wilson,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUL-2007) Genetics, Genome Sequencing Center, 4444
            Forest Park Parkway, St. Louis, MO 63108, USA
COMMENT     C. sakazakii--Cronobacter sakazakii is rarely encountered in
            clinical specimens, and is more prevalent in the environment and in
            food. However, Enterobacter sakazakii is strongly implicated in
            food borne diseases causing severe meningitis or enteritis,
            especially in neonates and infants (Nazarowec-White and Farber, Int
            J FoodMicrobiol. 1997 Feb;34(2):103-13).
            
            The strain of Enterobacter sakazakii being sequenced was isolated
            from powdered milk formula fed to a hospitalized neonate that
            developed an infection (Centers for Disease Control and
            Prevention). It is available from the American Type Culture
            Collection as ATCC BAA-894 or from the Salmonella Genetic Stock
            Centre as SGSC4695. The genome was sequenced to 8X coverage, using
            plasmid and fosmid libraries, and was finished to an error rate of
            less than 1 per 10,000 bases. Automated annotation was performed
            and manual annotation will continue in the labs of Michael
            McClelland and Kenneth Sanderson. The National Institute of Allergy
            and Infectious Diseases (NIAID), National Institutes of Health
            (NIH) has funded this project.
            
            Coding sequences below are predicted using GeneMark v3.3 and
            Glimmer2  v2.13. Intergenic regions not spanned by GeneMark and
            Glimmer2 were blasted against NCBI's non-redundant (NR) database
            and predictions generated based on protein alignments. RNA genes
            were determined  using tRNAscan-SE 1.23 or Rfam v8.0. This sequence
            was finished as follows unless otherwise noted: all regions were
            double stranded, sequenced with an alternate chemistries or covered
            by high quality data(i.e., phred quality >=30);an attempt was made
            to resolve all sequencing problems, such as compressions and
            repeats; all regionswere covered by sequence from more than one m13
            subclone.
FEATURES             Qualifiers
     source          /organism="Cronobacter sakazakii ATCC BAA-894"
                     /mol_type="genomic DNA"
                     /strain="ATCC BAA-894"
                     /culture_collection="ATCC:BAA-894"
                     /db_xref="taxon:290339"
     protein         /locus_tag="ESA_01039"
                     /inference="protein motif:HMMPfam:IPR009302"
                     /inference="protein motif:HMMTigr:IPR006431"
                     /inference="protein motif:HMMTigr:IPR013491"
                     /inference="protein motif:ScanRegExp:IPR000223"
                     /inference="similar to AA sequence:REFSEQ:NP_288651.1"
                     /note="KEGG: eci:UTI89_C5128 1.3e-35 putative tail length
                     tape measure protein precursor K00924; COG: COG5281
                     Phage-related minor tail protein; Psort location:
                     OuterMembrane, score:9.49"
                     /transl_table=11
                     /db_xref="InterPro:IPR000223"
                     /db_xref="InterPro:IPR006431"
                     /db_xref="InterPro:IPR009302"
                     /db_xref="InterPro:IPR013491"
BEGIN
        1 MATLRELIIK ISANSQSFQT EIARASRMGQ DYYRTMQNGG RQAAAAARES EQALSELTNG
       61 FASAGRAAAA ATAAFATGKI VQIADEWTSV NARLKQASSS TDDFTNSQMQ LMQISQRTGT
      121 AFTDNANLFA RAAASMREFG YNSEDVLRVT EAVSTGLKIS GASAEESGSV ITQFSQALAQ
      181 GVLRGEEFNA VNESGDRVVR ALAAGMGVAR KDLKAMADQG QLTIDKVVPA MISQLDTLRG
      241 EFGSMPQTVS GSMQKVTNSF MAWVGGVNQA TGATDALSGG LNGLAGALDG ITRSGVSGAL
      301 DDVADNMSTI TTVAGGLIGV GLAKYLGGIV SSATGATTSL ISAAKAEVAL AVAQDKAAQS
      361 SVAASRADVY RAQQALQAAR GADVQAAQQE KIAAAEAKVT AAQTRLNAAL AGGTATEKVR
      421 ARAALERAQS GLAAAKNADV QTAAEKRLAA AQAALNRNLA GRVTTQNNLN SVTAVGTRLL
      481 GGAMGLIGGV PGLVMLGAGA WYALYQNQEQ ARQSARDYAG TIEEIRNKTR EMSLPEATDN
      541 ESKTREALKE QNRLISDQET KIRSIKDAIS GYQQMLANPG VTINGYMVNH LTSMDEATTE
      601 LANSTDALAV EQERLSQMQA KSNEIQSVLE GLEHRRVDLI RQQAAEQNKV YQSLLIMNGE
      661 HAEFNRLLGL GNNLLAARQG LANIPMRIPQ ADSTQQQKDA LEKSRRDLEL SRLKGEAKEI
      721 ARLGYVADDL GLTTDPQYQN NRQALINIGL ETWRNNEANK PQKKGPKTEA EKTEDIYKRL
      781 IKQQQEQIAL AGQSTELAKV KYQVSQGELA TLSQAQKAEV MRNASLIDQV KLREQLRAYE
      841 ANLADSNASA RAANQADLIG FGQGSRFRQR AQEMLNIRKD FEQKNVELQR QYQGGDITED
      901 LYQKELALNK QYLAERLRDQ QDFYVASDAQ RGDWVGGMRE GFADWADNAS DYASQSAELV
      961 SSSMDSALDN VTAMLVGNKA SWKDWGLSVL QTISKVALQM AVVNAMGGGS SGSGLFGSLL
     1021 GGIAGGIAGG AAGDTNAGTA IQNYGASFQF NAKGGVYSSA DLSSYSGSVV DTPTFFAFAK
     1081 GAGVMGEAGP EAIMPLTRDA TGRLGVKALG SGTQGGAGVS LSIGTINFSG GTGGAQGNTN
     1141 AAGAVANQLT GAIIDTINTQ LRKPGTPLWN ATQGKR
//