LOCUS ABX70073.1 895 aa PRT BCT 31-JAN-2014 DEFINITION Salmonella enterica subsp. enterica serovar Paratyphi B str. SPB7 hypothetical protein protein. ACCESSION CP000886-4630 PROTEIN_ID ABX70073.1 SOURCE Salmonella enterica subsp. enterica serovar Paratyphi B str. SPB7 ORGANISM Salmonella enterica subsp. enterica serovar Paratyphi B str. SPB7 Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Salmonella. REFERENCE 1 (bases 1 to 4858887) AUTHORS McClelland,M., Sanderson,E.K., Porwollik,S., Spieth,J., Clifton,W.S., Fulton,R., Cordes,M., Wollam,A., Shah,N., Pepin,K., Bhonagiri,V., Nash,W., Johnson,M., Thiruvilangam,P. and Wilson,R. CONSRTM The Salmonella enterica serovar Paratyphi B Genome Sequencing Project TITLE Direct Submission JOURNAL Submitted (13-NOV-2007) Genetics, Genome Sequencing Center, 4444 Forest Park Parkway, St. Louis, MO 63108, USA COMMENT The bacterium Salmonella Paratyphi B is host-specialized, for it grows well and causes disease only in humans, whereas most strains of Salmonella can grow in the gut of almost all animals, both domesticated and wild. Humans usually acquire Salmonella Paratyphi B by the ingestion of water or of food that has been contaminated through fecal contact with humans. Paratyphi B is quite diverse and human infection is sometimes not associated with human-to- human system infection but rather associated with food borne infection. The specific strain of S. Paratyphi that was sequenced was isolated from the stool of a female in Penang Malaysia, and was obtained from Kwai-Lin Thong. It can be obtained from the Salmonella Genetic Stock Center as SGSC4150 (www.ucalgary.ca/ kesander). The genome was sequenced to 8X coverage, using plasmid and fosmid libraries, and was manually finished. Automated annotation has been performed and manual annotation will continue in the labs of Michael McClelland and Kenneth Sanderson. The National Institute of Allergy and Infectious Diseases (NIAID), National Institutes of Health (NIH) has funded this project. Coding sequences below are predicted using GeneMark v3.3 and Glimmer2 v2.13.Intergenic regions not spanned by GeneMark and Glimmer2 were blasted against NCBI's non-redundant (NR) database and predictions generated based on protein alignments. RNA genes were determined using tRNAscan-SE 1.23 or Rfam v8.0. This sequence was finished as follows unless otherwise noted: all regions were double stranded, sequenced with an alternate chemistries or covered by high quality data(i.e., phred quality >=30);an attempt was made to resolve all sequencing problems, such as compressions and repeats; all regions were covered by sequence from more than one m13 subclone. FEATURES Qualifiers source /organism="Salmonella enterica subsp. enterica serovar Paratyphi B str. SPB7" /mol_type="genomic DNA" /strain="SGSC4150; SPB7" /serovar="Paratyphi B" /sub_species="enterica" /culture_collection="ATCC:BAA-1250" /db_xref="taxon:1016998" protein /locus_tag="SPAB_04762" /inference="protein motif:BlastProDom:IPR001789" /inference="protein motif:Gene3D:IPR003594" /inference="protein motif:HMMPfam:IPR001789" /inference="protein motif:HMMPfam:IPR003594" /inference="protein motif:HMMPfam:IPR003660" /inference="protein motif:HMMPfam:IPR003661" /inference="protein motif:HMMPfam:IPR008207" /inference="protein motif:HMMSmart:IPR001789" /inference="protein motif:HMMSmart:IPR003594" /inference="protein motif:HMMSmart:IPR003660" /inference="protein motif:HMMSmart:IPR003661" /inference="protein motif:HMMSmart:IPR008207" /inference="protein motif:superfamily:IPR003594" /inference="protein motif:superfamily:IPR008207" /inference="protein motif:superfamily:IPR009082" /inference="protein motif:superfamily:IPR011006" /note="KEGG: stt:t3692 0. torS; two-component sensor protein histidine protein kinase. K07647; COG: COG0784 FOG: CheY-like receiver; Psort location: CytoplasmicMembrane, score:9.82" /transl_table=11 /db_xref="InterPro:IPR001789" /db_xref="InterPro:IPR003594" /db_xref="InterPro:IPR003660" /db_xref="InterPro:IPR003661" /db_xref="InterPro:IPR008207" /db_xref="InterPro:IPR009082" /db_xref="InterPro:IPR011006" BEGIN 1 MAALTLLSTV IGWISLRVIS QVEQTNTQAL LPTMNMARQL SEASAYELFS AQNLTNADSE 61 GVWLAQGKML KAQSLKINHL LLALSEQGFN TSAIARQEKE IAQTLGQQGT LVGEILTLRA 121 QQQQLSRQIA EAAESIAAQA HGQANNAATS AGATQAGIYD LIESGKGDQA ERALDRLIDI 181 DLEYVNQMNE LRVNALRFKQ LIVTLKDAQG LSDAEETDEK LNQLVKILSR RQQRIEDPTV 241 RAQIADALEK INQYTTLVTL FRKENAIRDQ LQTLMANNLF QFTRFSTEVS QLVNAIEKRN 301 EAGLARLTHA SQRGQIGLVI LGILALCSLS FILWRVVYRS VSRPLAQQTQ ALQRLLEGDI 361 DSPFPEAAGV SELDTISRLM EAFRANVRKL NRHREDLAEQ VRSQTAELHA LVLEHRQARA 421 EAEKANEAKS TFLAAMSHEI RTPLYGILGT VQLLADKPLM ANYRDDLQAI NDSGESLLAI 481 LNDILDYSAI EVGGTNVSIS EEPFEPRQLL NSALHLMHSR VQVALIADFS EQLPSTLQGD 541 PRRIRQIVIN LLSNAAKFTD RGSIVLRTFC DDQSWFIEVE DTGCGIPEAK LTAIFKPFVQ 601 ATGRRGGTGL GLAISASLAE AMGGTLTVTS TLHVGSCFRL QLPVRHPKPA SKSAFRKPIN 661 LNGLRLLLIE DNMLTQRITA EMLTGKGVKV SVAESANDAL RCLAEGESFD VALVDFDLPD 721 YDGLTLAQQL MSQYPAMKRI GFSAHVIDDN LRQRTAGLFC GIIQKPVPRE ELYRMIAHYL 781 QGKSHNARAM LNEHQLAGDM ASVGPEKLRQ WIALFKDSAL PLVEEIEAAR AMNDDVNIKR 841 LAHKLKSGCA SLGMTQATEA CRELELQPLS DIDIKTIVTQ GVTALDAWIA DHPSP //