LOCUS       ABX68840.1               394 aa    PRT              BCT 31-JAN-2014
DEFINITION  Salmonella enterica subsp. enterica serovar Paratyphi
            B str. SPB7 hypothetical protein protein.
ACCESSION   CP000886-3396
PROTEIN_ID  ABX68840.1
SOURCE      Salmonella enterica subsp. enterica serovar Paratyphi B str. SPB7
  ORGANISM  Salmonella enterica subsp. enterica serovar Paratyphi B str. SPB7
            Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacterales;
            Enterobacteriaceae; Salmonella.
REFERENCE   1  (bases 1 to 4858887)
  AUTHORS   McClelland,M., Sanderson,E.K., Porwollik,S., Spieth,J.,
            Clifton,W.S., Fulton,R., Cordes,M., Wollam,A., Shah,N., Pepin,K.,
            Bhonagiri,V., Nash,W., Johnson,M., Thiruvilangam,P. and Wilson,R.
  CONSRTM   The Salmonella enterica serovar Paratyphi B Genome Sequencing
            Project
  TITLE     Direct Submission
  JOURNAL   Submitted (13-NOV-2007) Genetics, Genome Sequencing Center, 4444
            Forest Park Parkway, St. Louis, MO 63108, USA
COMMENT     The bacterium Salmonella Paratyphi B is host-specialized, for it
            grows well and causes disease only in humans, whereas most strains
            of Salmonella can grow in the gut of almost all animals, both
            domesticated and wild. Humans usually acquire Salmonella Paratyphi
            B by the ingestion of water or of food that has been contaminated
            through fecal contact with humans.  Paratyphi B is quite diverse
            and human infection is sometimes not associated with human-to-
            human system infection but rather associated with food borne
            infection.
            
            The specific strain of S. Paratyphi that was sequenced was isolated
            from the stool of a female in Penang Malaysia, and was obtained
            from Kwai-Lin Thong. It can be obtained from the Salmonella Genetic
            Stock Center as SGSC4150 (www.ucalgary.ca/
            kesander). The genome was sequenced to 8X coverage, using plasmid
            and fosmid libraries, and was manually finished.  Automated
            annotation has been performed and manual annotation will continue
            in the labs of Michael McClelland and Kenneth Sanderson.  The
            National Institute of Allergy and Infectious Diseases (NIAID),
            National Institutes of Health (NIH) has funded this project.
            
            Coding sequences below are predicted using GeneMark v3.3 and
            Glimmer2  v2.13.Intergenic regions not spanned by GeneMark and
            Glimmer2 were blasted against NCBI's non-redundant (NR) database
            and predictions generated based on protein alignments. RNA genes
            were determined  using tRNAscan-SE 1.23 or Rfam v8.0. This sequence
            was finished as follows unless otherwise noted: all regions were
            double stranded, sequenced with an alternate chemistries or covered
            by high quality data(i.e., phred quality >=30);an attempt was made
            to resolve all sequencing problems, such as compressions and
            repeats; all regions were covered by sequence from more than one
            m13 subclone.
FEATURES             Qualifiers
     source          /organism="Salmonella enterica subsp. enterica serovar
                     Paratyphi B str. SPB7"
                     /mol_type="genomic DNA"
                     /strain="SGSC4150; SPB7"
                     /serovar="Paratyphi B"
                     /sub_species="enterica"
                     /culture_collection="ATCC:BAA-1250"
                     /db_xref="taxon:1016998"
     protein         /locus_tag="SPAB_03493"
                     /inference="protein motif:HMMPfam:IPR011701"
                     /inference="similar to AA sequence:INSD:AAL21697.1"
                     /note="KEGG: shn:Shewana3_1692 3.7e-17 Xaa-His dipeptidase
                     K01270; COG: COG0477 Permeases of the major facilitator
                     superfamily; Psort location: CytoplasmicMembrane,
                     score:10.00"
                     /transl_table=11
                     /db_xref="InterPro:IPR011701"
BEGIN
        1 MTKPTHELSP ALIVLMSVAT GLAVASNYYA QPLLDTIAHH FSLSASSAGF IVTAAQLGYA
       61 AGLLFLVPLG DMFERRTLIV SMTLLAAGGM LITASSQSLS MMILGTALTG LFSVVAQILV
      121 PLAATLATPA TRGKVVGTIM SGLLLGILLA RTVAGLLANL GGWRTVFWVA SALMALMAVA
      181 LWRGLPKLKS DTHLNYPQLL GSVFSLFIHD KLLRTRALLG CLTFANFSIL WTSMAFLLAA
      241 PPFSYSEGMI GLFGLAGAAG ALGARPAGGF ADKGKSHLTT TFGLLLLLLS WLAIWLGHTS
      301 VLALIIGILV LDLTVQGVHI TNQTVIYRLH PDARNRLTAG YMTSYFIGGA AGSLISASAW
      361 QHAGWAGVCL AGVTVALLNL LVWWRGFHRQ EAVN
//