LOCUS       ABU78954.1              1860 aa    PRT              BCT 31-JAN-2014
DEFINITION  Cronobacter sakazakii ATCC BAA-894 hypothetical protein protein.
ACCESSION   CP000783-3645
PROTEIN_ID  ABU78954.1
SOURCE      Cronobacter sakazakii ATCC BAA-894
  ORGANISM  Cronobacter sakazakii ATCC BAA-894
            Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacterales;
            Enterobacteriaceae; Cronobacter.
REFERENCE   1  (bases 1 to 4368373)
  AUTHORS   Kucerova,E., Clifton,S.W., Xia,X.Q., Long,F., Porwollik,S.,
            Fulton,L., Fronick,C., Minx,P., Kyung,K., Warren,W., Fulton,R.,
            Feng,D., Wollam,A., Shah,N., Bhonagiri,V., Nash,W.E.,
            Hallsworth-Pepin,K., Wilson,R.K., McClelland,M. and Forsythe,S.J.
  TITLE     Genome sequence of Cronobacter sakazakii BAA-894 and comparative
            genomic hybridization analysis with other Cronobacter species
  JOURNAL   PLoS ONE 5 (3), E9556 (2010)
   PUBMED   20221447
  REMARK    Publication Status: Online-Only
REFERENCE   2  (bases 1 to 4368373)
  AUTHORS   McClelland,M., Sanderson,E.K., Porwollik,S., Spieth,J.,
            Clifton,W.S., Fulton,B., Wollam,A., Shah,N., Pepin,K.,
            Bhonagiri,V., Nash,W., Johnson,M., Thiruvilangam,P. and Wilson,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUL-2007) Genetics, Genome Sequencing Center, 4444
            Forest Park Parkway, St. Louis, MO 63108, USA
COMMENT     C. sakazakii--Cronobacter sakazakii is rarely encountered in
            clinical specimens, and is more prevalent in the environment and in
            food. However, Enterobacter sakazakii is strongly implicated in
            food borne diseases causing severe meningitis or enteritis,
            especially in neonates and infants (Nazarowec-White and Farber, Int
            J FoodMicrobiol. 1997 Feb;34(2):103-13).
            
            The strain of Enterobacter sakazakii being sequenced was isolated
            from powdered milk formula fed to a hospitalized neonate that
            developed an infection (Centers for Disease Control and
            Prevention). It is available from the American Type Culture
            Collection as ATCC BAA-894 or from the Salmonella Genetic Stock
            Centre as SGSC4695. The genome was sequenced to 8X coverage, using
            plasmid and fosmid libraries, and was finished to an error rate of
            less than 1 per 10,000 bases. Automated annotation was performed
            and manual annotation will continue in the labs of Michael
            McClelland and Kenneth Sanderson. The National Institute of Allergy
            and Infectious Diseases (NIAID), National Institutes of Health
            (NIH) has funded this project.
            
            Coding sequences below are predicted using GeneMark v3.3 and
            Glimmer2  v2.13. Intergenic regions not spanned by GeneMark and
            Glimmer2 were blasted against NCBI's non-redundant (NR) database
            and predictions generated based on protein alignments. RNA genes
            were determined  using tRNAscan-SE 1.23 or Rfam v8.0. This sequence
            was finished as follows unless otherwise noted: all regions were
            double stranded, sequenced with an alternate chemistries or covered
            by high quality data(i.e., phred quality >=30);an attempt was made
            to resolve all sequencing problems, such as compressions and
            repeats; all regionswere covered by sequence from more than one m13
            subclone.
FEATURES             Qualifiers
     source          /organism="Cronobacter sakazakii ATCC BAA-894"
                     /mol_type="genomic DNA"
                     /strain="ATCC BAA-894"
                     /culture_collection="ATCC:BAA-894"
                     /db_xref="taxon:290339"
     protein         /locus_tag="ESA_03758"
                     /inference="similar to AA sequence:REFSEQ:YP_049205.1"
                     /note="KEGG: ava:Ava_4160 4.1e-34 VCBS K01317; COG:
                     COG5295 Autotransporter adhesin"
                     /transl_table=11
BEGIN
        1 MATILDGSAT AQALTIQTLA RHGGELITTV QAQGTRTLIL SEPSVVRIQA TPDAALHYER
       61 QGDDLILHMQ NGSTVVCKNY FAEQDGYHSE LLFDDGKHPP VHAVFPSNET MAASAPGLTP
      121 EYQTVDSIEP LLIGDSHTMA ILGPILGAVA LGGVIAAAGS GGGGGDNDRG GDNGGNSGSN
      181 GTIKLNTLSG DGMLNAQEAQ NALVVSGQTV NVAPGTTVTV TINGKTYTTT VAADNTWSLT
      241 VPAADVQAFP DGPLPVRVTT TDTAGHTISA ESSLDVAAQF LPDPQIAPVF GDDMLVLTES
      301 QGNQTISGTT GITGAGQSAV VVINGVSYPV EVDGQGNWSL TLTPAQLSTL PQGELPISVI
      361 VTDAAGNTGS NTIIATVDTI PPPVTVLPLT GDNLVNAIES KLPIAINGTS EPGAQITVSY
      421 NNQQYTTTTG ADGKWSVQIP ANALDGMANG NYPLTVTAKD AAGNTGTTSE TVTMALTPPA
      481 PTLNTPFGDD LLNNNDTKVT QLLTGKTGAF GGSQGVIVNI GGLDVLSHAT PVRGSDGKWS
      541 LNVDPVAGGN NYLATVDENG NWQLALPPDV LQQFADGEIT ITVVAVDGAG NFGAAPQQSF
      601 DVDTTPPTLA VTPVTGDDII TALESGADVV VSGTSSGLES GQSVSVLING VTYVTAAGAD
      661 GSWSVTIPAA NVQALPQGDV PISVSASDAA GNRGVAGSTV TVDTEVALTV NPVAGDDIIN
      721 SLEIAGDVPV SGTASVEDAG RTVTVTLNGQ DYTTTVQPDG SWTVNLPAAD LQALADGQQP
      781 LTVSLSDAVG NSVTINHPVT IDASAATLPV LSINPLSGDG YLNAFEHTRP LLLSGTSTNV
      841 EAGQTVTLTL NGKTYTATTQ SDGSWSVEVP ATDVLLLTDQ QWTVNASVTD TSGNPASASG
      901 DLTVVAQTNP TISVDPIATD NIINATEKGI DLTITGDSTA IEAGQPVTVL FNGVTYNAQV
      961 GSDGSWSVTV PATALSGLND GPLSVVVGSQ DIAGNQVSAG QTVTVDSSVT LTIDTIAVDD
     1021 IISDAESQSD VTISGTASTA DAGRTVTVTL GGNTYNATVQ PDGTWSVDVP AADVQAQSDG
     1081 ELTVTASLTD AAGNAVSVDH LVTLDTTAPV VTINPIATDD IVSAAEAGGL VTLSGTAEAN
     1141 STVVITIGTL TFNATTNASG VWTTDVSLAA LADGDYTANV VATDAAGNSG TTNRPFEMLV
     1201 TPPAPTVEAL PFGNAFLSQV ESQTDQIVTG TTGTAHAANV EVVINGITYT ATFDASGTWQ
     1261 ITVPAADLQA MPADGSQIPI TVTVTDTAGN SGNGNNSFSA DFVPPSLTVN DVTGDNLINP
     1321 TEAAGVIAVS GTSDPGSVVT FTLNGETYGP VTTAGNGNWS INIPAGQLAS LVDDVYDMVV
     1381 TAEDDAGNTV TETVPLTIDF TAPVITINEP LAADGYINQA EAQSGVTITG TGEADLPLTL
     1441 TLNGVTYNTT VLANGTWSVQ LPASALGSVP NGDYTLSATQ TDAAGNSGSD TAPVSVLRTL
     1501 PAPTITLAYG DGFLSQAESQ TDQTITGTTG LNSSALVESA TVTIGGIAYT PTINSNGTWS
     1561 VTLTPTQLQA LPQGAVPIRV DVTDVAGNAG SRQTFSTVDT VIPTVDVLPV TGDDIIDQTE
     1621 IAGGGLLISG DSEPGAEVTV LFGSVTLTTT VLGDGSWSVA ISPDALAQPE GSYTLDITSR
     1681 DDAGNQVSTQ HTVQLQNGVA PPALMALFAE PLLISDVALT GTDGNDHFIL NSLDFSRIDG
     1741 GAGVDTLTLG DKLQTLNLAQ LGFKVTHIDV LDLGTSGNGS LTLNLDNALN LKDDPHEPLR
     1801 ITGDNGGEVT LLNTPEGIWS VSGVETIGDH VFDVYHNSAL GSTNTLGDLL IQENVHVKLM
//