LOCUS ABU78954.1 1860 aa PRT BCT 31-JAN-2014 DEFINITION Cronobacter sakazakii ATCC BAA-894 hypothetical protein protein. ACCESSION CP000783-3645 PROTEIN_ID ABU78954.1 SOURCE Cronobacter sakazakii ATCC BAA-894 ORGANISM Cronobacter sakazakii ATCC BAA-894 Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Cronobacter. REFERENCE 1 (bases 1 to 4368373) AUTHORS Kucerova,E., Clifton,S.W., Xia,X.Q., Long,F., Porwollik,S., Fulton,L., Fronick,C., Minx,P., Kyung,K., Warren,W., Fulton,R., Feng,D., Wollam,A., Shah,N., Bhonagiri,V., Nash,W.E., Hallsworth-Pepin,K., Wilson,R.K., McClelland,M. and Forsythe,S.J. TITLE Genome sequence of Cronobacter sakazakii BAA-894 and comparative genomic hybridization analysis with other Cronobacter species JOURNAL PLoS ONE 5 (3), E9556 (2010) PUBMED 20221447 REMARK Publication Status: Online-Only REFERENCE 2 (bases 1 to 4368373) AUTHORS McClelland,M., Sanderson,E.K., Porwollik,S., Spieth,J., Clifton,W.S., Fulton,B., Wollam,A., Shah,N., Pepin,K., Bhonagiri,V., Nash,W., Johnson,M., Thiruvilangam,P. and Wilson,R. TITLE Direct Submission JOURNAL Submitted (25-JUL-2007) Genetics, Genome Sequencing Center, 4444 Forest Park Parkway, St. Louis, MO 63108, USA COMMENT C. sakazakii--Cronobacter sakazakii is rarely encountered in clinical specimens, and is more prevalent in the environment and in food. However, Enterobacter sakazakii is strongly implicated in food borne diseases causing severe meningitis or enteritis, especially in neonates and infants (Nazarowec-White and Farber, Int J FoodMicrobiol. 1997 Feb;34(2):103-13). The strain of Enterobacter sakazakii being sequenced was isolated from powdered milk formula fed to a hospitalized neonate that developed an infection (Centers for Disease Control and Prevention). It is available from the American Type Culture Collection as ATCC BAA-894 or from the Salmonella Genetic Stock Centre as SGSC4695. The genome was sequenced to 8X coverage, using plasmid and fosmid libraries, and was finished to an error rate of less than 1 per 10,000 bases. Automated annotation was performed and manual annotation will continue in the labs of Michael McClelland and Kenneth Sanderson. The National Institute of Allergy and Infectious Diseases (NIAID), National Institutes of Health (NIH) has funded this project. Coding sequences below are predicted using GeneMark v3.3 and Glimmer2 v2.13. Intergenic regions not spanned by GeneMark and Glimmer2 were blasted against NCBI's non-redundant (NR) database and predictions generated based on protein alignments. RNA genes were determined using tRNAscan-SE 1.23 or Rfam v8.0. This sequence was finished as follows unless otherwise noted: all regions were double stranded, sequenced with an alternate chemistries or covered by high quality data(i.e., phred quality >=30);an attempt was made to resolve all sequencing problems, such as compressions and repeats; all regionswere covered by sequence from more than one m13 subclone. FEATURES Qualifiers source /organism="Cronobacter sakazakii ATCC BAA-894" /mol_type="genomic DNA" /strain="ATCC BAA-894" /culture_collection="ATCC:BAA-894" /db_xref="taxon:290339" protein /locus_tag="ESA_03758" /inference="similar to AA sequence:REFSEQ:YP_049205.1" /note="KEGG: ava:Ava_4160 4.1e-34 VCBS K01317; COG: COG5295 Autotransporter adhesin" /transl_table=11 BEGIN 1 MATILDGSAT AQALTIQTLA RHGGELITTV QAQGTRTLIL SEPSVVRIQA TPDAALHYER 61 QGDDLILHMQ NGSTVVCKNY FAEQDGYHSE LLFDDGKHPP VHAVFPSNET MAASAPGLTP 121 EYQTVDSIEP LLIGDSHTMA ILGPILGAVA LGGVIAAAGS GGGGGDNDRG GDNGGNSGSN 181 GTIKLNTLSG DGMLNAQEAQ NALVVSGQTV NVAPGTTVTV TINGKTYTTT VAADNTWSLT 241 VPAADVQAFP DGPLPVRVTT TDTAGHTISA ESSLDVAAQF LPDPQIAPVF GDDMLVLTES 301 QGNQTISGTT GITGAGQSAV VVINGVSYPV EVDGQGNWSL TLTPAQLSTL PQGELPISVI 361 VTDAAGNTGS NTIIATVDTI PPPVTVLPLT GDNLVNAIES KLPIAINGTS EPGAQITVSY 421 NNQQYTTTTG ADGKWSVQIP ANALDGMANG NYPLTVTAKD AAGNTGTTSE TVTMALTPPA 481 PTLNTPFGDD LLNNNDTKVT QLLTGKTGAF GGSQGVIVNI GGLDVLSHAT PVRGSDGKWS 541 LNVDPVAGGN NYLATVDENG NWQLALPPDV LQQFADGEIT ITVVAVDGAG NFGAAPQQSF 601 DVDTTPPTLA VTPVTGDDII TALESGADVV VSGTSSGLES GQSVSVLING VTYVTAAGAD 661 GSWSVTIPAA NVQALPQGDV PISVSASDAA GNRGVAGSTV TVDTEVALTV NPVAGDDIIN 721 SLEIAGDVPV SGTASVEDAG RTVTVTLNGQ DYTTTVQPDG SWTVNLPAAD LQALADGQQP 781 LTVSLSDAVG NSVTINHPVT IDASAATLPV LSINPLSGDG YLNAFEHTRP LLLSGTSTNV 841 EAGQTVTLTL NGKTYTATTQ SDGSWSVEVP ATDVLLLTDQ QWTVNASVTD TSGNPASASG 901 DLTVVAQTNP TISVDPIATD NIINATEKGI DLTITGDSTA IEAGQPVTVL FNGVTYNAQV 961 GSDGSWSVTV PATALSGLND GPLSVVVGSQ DIAGNQVSAG QTVTVDSSVT LTIDTIAVDD 1021 IISDAESQSD VTISGTASTA DAGRTVTVTL GGNTYNATVQ PDGTWSVDVP AADVQAQSDG 1081 ELTVTASLTD AAGNAVSVDH LVTLDTTAPV VTINPIATDD IVSAAEAGGL VTLSGTAEAN 1141 STVVITIGTL TFNATTNASG VWTTDVSLAA LADGDYTANV VATDAAGNSG TTNRPFEMLV 1201 TPPAPTVEAL PFGNAFLSQV ESQTDQIVTG TTGTAHAANV EVVINGITYT ATFDASGTWQ 1261 ITVPAADLQA MPADGSQIPI TVTVTDTAGN SGNGNNSFSA DFVPPSLTVN DVTGDNLINP 1321 TEAAGVIAVS GTSDPGSVVT FTLNGETYGP VTTAGNGNWS INIPAGQLAS LVDDVYDMVV 1381 TAEDDAGNTV TETVPLTIDF TAPVITINEP LAADGYINQA EAQSGVTITG TGEADLPLTL 1441 TLNGVTYNTT VLANGTWSVQ LPASALGSVP NGDYTLSATQ TDAAGNSGSD TAPVSVLRTL 1501 PAPTITLAYG DGFLSQAESQ TDQTITGTTG LNSSALVESA TVTIGGIAYT PTINSNGTWS 1561 VTLTPTQLQA LPQGAVPIRV DVTDVAGNAG SRQTFSTVDT VIPTVDVLPV TGDDIIDQTE 1621 IAGGGLLISG DSEPGAEVTV LFGSVTLTTT VLGDGSWSVA ISPDALAQPE GSYTLDITSR 1681 DDAGNQVSTQ HTVQLQNGVA PPALMALFAE PLLISDVALT GTDGNDHFIL NSLDFSRIDG 1741 GAGVDTLTLG DKLQTLNLAQ LGFKVTHIDV LDLGTSGNGS LTLNLDNALN LKDDPHEPLR 1801 ITGDNGGEVT LLNTPEGIWS VSGVETIGDH VFDVYHNSAL GSTNTLGDLL IQENVHVKLM //