LOCUS       ABX67550.1               577 aa    PRT              BCT 31-JAN-2014
DEFINITION  Salmonella enterica subsp. enterica serovar Paratyphi
            B str. SPB7 hypothetical protein protein.
ACCESSION   CP000886-2106
PROTEIN_ID  ABX67550.1
SOURCE      Salmonella enterica subsp. enterica serovar Paratyphi B str. SPB7
  ORGANISM  Salmonella enterica subsp. enterica serovar Paratyphi B str. SPB7
            Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacterales;
            Enterobacteriaceae; Salmonella.
REFERENCE   1  (bases 1 to 4858887)
  AUTHORS   McClelland,M., Sanderson,E.K., Porwollik,S., Spieth,J.,
            Clifton,W.S., Fulton,R., Cordes,M., Wollam,A., Shah,N., Pepin,K.,
            Bhonagiri,V., Nash,W., Johnson,M., Thiruvilangam,P. and Wilson,R.
  CONSRTM   The Salmonella enterica serovar Paratyphi B Genome Sequencing
            Project
  TITLE     Direct Submission
  JOURNAL   Submitted (13-NOV-2007) Genetics, Genome Sequencing Center, 4444
            Forest Park Parkway, St. Louis, MO 63108, USA
COMMENT     The bacterium Salmonella Paratyphi B is host-specialized, for it
            grows well and causes disease only in humans, whereas most strains
            of Salmonella can grow in the gut of almost all animals, both
            domesticated and wild. Humans usually acquire Salmonella Paratyphi
            B by the ingestion of water or of food that has been contaminated
            through fecal contact with humans.  Paratyphi B is quite diverse
            and human infection is sometimes not associated with human-to-
            human system infection but rather associated with food borne
            infection.
            
            The specific strain of S. Paratyphi that was sequenced was isolated
            from the stool of a female in Penang Malaysia, and was obtained
            from Kwai-Lin Thong. It can be obtained from the Salmonella Genetic
            Stock Center as SGSC4150 (www.ucalgary.ca/
            kesander). The genome was sequenced to 8X coverage, using plasmid
            and fosmid libraries, and was manually finished.  Automated
            annotation has been performed and manual annotation will continue
            in the labs of Michael McClelland and Kenneth Sanderson.  The
            National Institute of Allergy and Infectious Diseases (NIAID),
            National Institutes of Health (NIH) has funded this project.
            
            Coding sequences below are predicted using GeneMark v3.3 and
            Glimmer2  v2.13.Intergenic regions not spanned by GeneMark and
            Glimmer2 were blasted against NCBI's non-redundant (NR) database
            and predictions generated based on protein alignments. RNA genes
            were determined  using tRNAscan-SE 1.23 or Rfam v8.0. This sequence
            was finished as follows unless otherwise noted: all regions were
            double stranded, sequenced with an alternate chemistries or covered
            by high quality data(i.e., phred quality >=30);an attempt was made
            to resolve all sequencing problems, such as compressions and
            repeats; all regions were covered by sequence from more than one
            m13 subclone.
FEATURES             Qualifiers
     source          /organism="Salmonella enterica subsp. enterica serovar
                     Paratyphi B str. SPB7"
                     /mol_type="genomic DNA"
                     /strain="SGSC4150; SPB7"
                     /serovar="Paratyphi B"
                     /sub_species="enterica"
                     /culture_collection="ATCC:BAA-1250"
                     /db_xref="taxon:1016998"
     protein         /locus_tag="SPAB_02167"
                     /inference="protein motif:HMMPfam:IPR005021"
                     /note="COG: COG4626 Phage terminase-like protein, large
                     subunit"
                     /transl_table=11
                     /db_xref="InterPro:IPR005021"
BEGIN
        1 MSRKSYPNVN AANQYARNVV RGKITACQYV IQACQRHIDD MAAEKSKRFR YRFDKDMAEK
       61 AAKFIQLLPH TKGEWAFKRM PITLEPWQLF IVCCAFGWVQ KGTKLRRFRE VYTEIPRKNG
      121 KSAISAGVAL YCFTCDNEFG AEVYSGATTE KQAWEVFRPA RLMCKRTPLL VEAFGIEVNA
      181 SNLNRPEDGA RFEPLIGNPG DGASPHCAIV DEYHEHPTDA LYTTMLTGMG ARRQPLMWAI
      241 TTAGYNIEGP CYDKRREVIE MLNGSVPNNE LFGVIYTVDE GDDWTDPKVL EKANPNIGVS
      301 VYRDFLLSQQ QRAINNARHA GVFKTKHLNV WVAARTAFFN LVSWQNCEDK TLTLELFEGQ
      361 PCVLAFDLAR KLDMNSMARL FTREIDGKTH FYSVAPRFWV PYDTVYSVEK NEDRRTAERF
      421 QKWVEMGFLT VTDGAEVDYR YILEEAKAAN KLNPVSESPI DPFGATGLSH DLADENLNPV
      481 TIIQNYTNMS DPMKELEAAI ESGRFHHDGN PIMTWCIGNV VGKTIPGNDD VVKPIKEQAE
      541 NKIDGAVALI MAVGRAMLYE KADTLSDHIE SYGIRSL
//