LOCUS ABU78681.1 951 aa PRT BCT 31-JAN-2014 DEFINITION Cronobacter sakazakii ATCC BAA-894 hypothetical protein protein. ACCESSION CP000783-3372 PROTEIN_ID ABU78681.1 SOURCE Cronobacter sakazakii ATCC BAA-894 ORGANISM Cronobacter sakazakii ATCC BAA-894 Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Cronobacter. REFERENCE 1 (bases 1 to 4368373) AUTHORS Kucerova,E., Clifton,S.W., Xia,X.Q., Long,F., Porwollik,S., Fulton,L., Fronick,C., Minx,P., Kyung,K., Warren,W., Fulton,R., Feng,D., Wollam,A., Shah,N., Bhonagiri,V., Nash,W.E., Hallsworth-Pepin,K., Wilson,R.K., McClelland,M. and Forsythe,S.J. TITLE Genome sequence of Cronobacter sakazakii BAA-894 and comparative genomic hybridization analysis with other Cronobacter species JOURNAL PLoS ONE 5 (3), E9556 (2010) PUBMED 20221447 REMARK Publication Status: Online-Only REFERENCE 2 (bases 1 to 4368373) AUTHORS McClelland,M., Sanderson,E.K., Porwollik,S., Spieth,J., Clifton,W.S., Fulton,B., Wollam,A., Shah,N., Pepin,K., Bhonagiri,V., Nash,W., Johnson,M., Thiruvilangam,P. and Wilson,R. TITLE Direct Submission JOURNAL Submitted (25-JUL-2007) Genetics, Genome Sequencing Center, 4444 Forest Park Parkway, St. Louis, MO 63108, USA COMMENT C. sakazakii--Cronobacter sakazakii is rarely encountered in clinical specimens, and is more prevalent in the environment and in food. However, Enterobacter sakazakii is strongly implicated in food borne diseases causing severe meningitis or enteritis, especially in neonates and infants (Nazarowec-White and Farber, Int J Food Microbiol. 1997 Feb;34(2):103-13). The strain of Enterobacter sakazakii being sequenced was isolated from powdered milk formula fed to a hospitalized neonate that developed an infection (Centers for Disease Control and Prevention). It is available from the American Type Culture Collection as ATCC BAA-894 or from the Salmonella Genetic Stock Centre as SGSC4695.
The genome was sequenced to 8X coverage, using plasmid and fosmid libraries, and was finished to an error rate of less than 1 per 10,000 bases. Automated annotation was performed and manual annotation will continue in the labs of Michael McClelland and Kenneth Sanderson. The National Institute of Allergy and Infectious Diseases (NIAID), National Institutes of Health (NIH) has funded this project. Coding sequences below are predicted using GeneMark v3.3 and Glimmer2 v2.13. Intergenic regions not spanned by GeneMark and Glimmer2 were blasted against NCBI's non-redundant (NR) database and predictions generated based on protein alignments. RNA genes were determined using tRNAscan-SE 1.23 or Rfam v8.0. This sequence was finished as follows unless otherwise noted: all regions were double stranded, sequenced with an alternate chemistry, or covered by high quality data (i.e., phred quality >=30); an attempt was made to resolve all sequencing problems, such as compressions and repeats; all regions were covered by sequence from more than one m13 subclone. FEATURES Qualifiers source /organism="Cronobacter sakazakii ATCC BAA-894" /mol_type="genomic DNA" /strain="ATCC BAA-894" /culture_collection="ATCC:BAA-894" /db_xref="taxon:290339" protein /locus_tag="ESA_03466" /inference="protein motif:HMMPfam:IPR002300" /inference="protein motif:HMMPfam:IPR013155" /inference="protein motif:HMMTigr:IPR002303" /inference="protein motif:ScanRegExp:IPR001412" /inference="protein motif:superfamily:IPR009008" /inference="protein motif:superfamily:IPR009080" /inference="protein motif:superfamily:IPR010978" /note="KEGG: ecs:ECs5235 0.
valine tRNA synthetase K01873; COG: COG0525 Valyl-tRNA synthetase; Psort location: Cytoplasmic, score:9.26" /transl_table=11 /db_xref="InterPro:IPR001412" /db_xref="InterPro:IPR002300" /db_xref="InterPro:IPR002303" /db_xref="InterPro:IPR009008" /db_xref="InterPro:IPR009080" /db_xref="InterPro:IPR010978" /db_xref="InterPro:IPR013155" BEGIN 1 MEKTYNPQDI EQPLYEHWEQ QGYFKPNGDE SQESFCIMIP PPNVTGSLHM GHAFQQTIMD 61 TMIRYQRMQG KNTLWQVGTD HAGIATQMVV ERKIAAEEGK TRHDYGRDAF IDKIWEWKAE 121 SGGTITRQMR RLGNSVDWER ERFTMDEGLS NAVKEVFVRL YKEDLIYRGK RLVNWDPKLR 181 TAISDLEVEN RESKGSMWHI RYPLADGAKT ADGKDYLVVA TTRPETLLGD TGVAVNPEDP 241 RYKDLIGKYV VLPLVNRRIP IVGDEHADME KGTGCVKITP AHDFNDYEVG RRHQLPMINI 301 LTFDGDIRES AEVYDTKGNE SDVYSNEIPA QFQKMERFAA RKAIVAAVDE LGLLEEIKPH 361 DLTVPYGDRG GVVIEPMLTD QWYVRADVLA KPAVEAVENG DIQFVPKQYE NMYFSWMRDI 421 QDWCISRQLW WGHRIPAWYD AEGNVYVGRS EEEVRQENNL SADVALRQDD DVLDTWFSSA 481 LWTFSTLGWP ENTDALRQFH PTSVMVSGFD IIFFWIARMI MMTMHFIKDE NGKPQVPFKT 541 VYMTGLIRDD EGQKMSKSKG NVIDPLDMVD GITLEELLEK RTGNMMQPQL AEKIRKRTEK 601 QFPEGIEPHG TDALRFTLAA LASTGRDINW DMKRLEGYRN FCNKLWNASR FVLMNTEDQD 661 CGFNGGEKVL SLADRWILAE FNQTVKAYRD ALDNFRFDIA AGILYEFTWN QFCDWYLELT 721 KPVMNGGSEA ELRGTRNTLV TVLEGLLRLA HPIIPFITET IWQRVKVIAG IDADTIMLQP 781 FPAFDASRVD DAALADTEWL KQAIIAVRNI RAEMNIAPGK PLALLLRGCS QDAQRRVNDN 841 RGFLQTLARL ESITVLPADD KGPVSVTKII DGAELLIPMA GLIDKDAELA RLAKEVAKIE 901 GEIGRIESKL GNEGFVARAP EAVIAKEREK LAGYHEAKAK LIEQQGVISA L //