LOCUS       ABX68170.1               933 aa    PRT              BCT 31-JAN-2014
DEFINITION  Salmonella enterica subsp. enterica serovar Paratyphi
            B str. SPB7 hypothetical protein protein.
ACCESSION   CP000886-2726
PROTEIN_ID  ABX68170.1
SOURCE      Salmonella enterica subsp. enterica serovar Paratyphi B str. SPB7
  ORGANISM  Salmonella enterica subsp. enterica serovar Paratyphi B str. SPB7
            Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacterales;
            Enterobacteriaceae; Salmonella.
REFERENCE   1  (bases 1 to 4858887)
  AUTHORS   McClelland,M., Sanderson,E.K., Porwollik,S., Spieth,J.,
            Clifton,W.S., Fulton,R., Cordes,M., Wollam,A., Shah,N., Pepin,K.,
            Bhonagiri,V., Nash,W., Johnson,M., Thiruvilangam,P. and Wilson,R.
  CONSRTM   The Salmonella enterica serovar Paratyphi B Genome Sequencing
            Project
  TITLE     Direct Submission
  JOURNAL   Submitted (13-NOV-2007) Genetics, Genome Sequencing Center, 4444
            Forest Park Parkway, St. Louis, MO 63108, USA
COMMENT     The bacterium Salmonella Paratyphi B is host-specialized, for it
            grows well and causes disease only in humans, whereas most strains
            of Salmonella can grow in the gut of almost all animals, both
            domesticated and wild. Humans usually acquire Salmonella Paratyphi
            B by the ingestion of water or of food that has been contaminated
            through fecal contact with humans.  Paratyphi B is quite diverse
            and human infection is sometimes not associated with human-to-
            human system infection but rather associated with food borne
            infection.
            
            The specific strain of S. Paratyphi that was sequenced was isolated
            from the stool of a female in Penang Malaysia, and was obtained
            from Kwai-Lin Thong. It can be obtained from the Salmonella Genetic
            Stock Center as SGSC4150 (www.ucalgary.ca/
            kesander). The genome was sequenced to 8X coverage, using plasmid
            and fosmid libraries, and was manually finished.  Automated
            annotation has been performed and manual annotation will continue
            in the labs of Michael McClelland and Kenneth Sanderson.  The
            National Institute of Allergy and Infectious Diseases (NIAID),
            National Institutes of Health (NIH) has funded this project.
            
            Coding sequences below are predicted using GeneMark v3.3 and
            Glimmer2  v2.13.Intergenic regions not spanned by GeneMark and
            Glimmer2 were blasted against NCBI's non-redundant (NR) database
            and predictions generated based on protein alignments. RNA genes
            were determined  using tRNAscan-SE 1.23 or Rfam v8.0. This sequence
            was finished as follows unless otherwise noted: all regions were
            double stranded, sequenced with an alternate chemistries or covered
            by high quality data(i.e., phred quality >=30);an attempt was made
            to resolve all sequencing problems, such as compressions and
            repeats; all regions were covered by sequence from more than one
            m13 subclone.
FEATURES             Qualifiers
     source          /organism="Salmonella enterica subsp. enterica serovar
                     Paratyphi B str. SPB7"
                     /mol_type="genomic DNA"
                     /strain="SGSC4150; SPB7"
                     /serovar="Paratyphi B"
                     /sub_species="enterica"
                     /culture_collection="ATCC:BAA-1250"
                     /db_xref="taxon:1016998"
     protein         /locus_tag="SPAB_02798"
                     /inference="protein motif:HMMPfam:IPR001017"
                     /inference="protein motif:HMMPfam:IPR005475"
                     /inference="protein motif:HMMPIR:IPR011603"
                     /inference="protein motif:HMMTigr:IPR011603"
                     /inference="protein motif:superfamily:IPR009014"
                     /note="KEGG: sec:SC0740 0. sucA; 2-oxoglutarate
                     dehydrogenase (decarboxylase component) K00164; COG:
                     COG0567 2-oxoglutarate dehydrogenase complex,
                     dehydrogenase (E1) component, and related enzymes; Psort
                     location: Cytoplasmic, score:9.26"
                     /transl_table=11
                     /db_xref="InterPro:IPR001017"
                     /db_xref="InterPro:IPR005475"
                     /db_xref="InterPro:IPR009014"
                     /db_xref="InterPro:IPR011603"
BEGIN
        1 MQNSALKAWL DSSYLSGSNQ SWIEQLYEDF LTDPDSVDAN WRLTFQQLPG TGVKPDQLHS
       61 KTREYFRRQA LAGSRHSSTI SDPDTNVKQV KVLQLINAYR FRGHQHANLD PLGLWKQERV
      121 ADLDPSFHDL TEADFQETFN VGSFASGKET MKLGELLDAL KQTYCGPIGA EYMHITSTEE
      181 KRWIQQRIES GRAAFSADEK KRFLNELTAA EGLERYLGAK FPGAKRFSLE GGDALIPMLK
      241 EMVRHAGNSG TREVVLGMAH RGRLNVLINV LGKKPQDLFD EFAGKHKEHL GTGDVKYHMG
      301 FSSDIETEGG LVHLALAFNP SHLEIVSPVV MGSVRARLDR LDEPSSNKVL PITIHGDAAV
      361 TGQGVVQETL NMSKARGYEV GGTVRIVINN QVGFTTSNPL DARSTPYCTD IGKMVQAPIF
      421 HVNADDPEAV AFVTRLALDF RNTFKRDVFI DLVCYRRHGH NEADEPSATQ PLMYQKIKKH
      481 PTPRKIYADK LEADKVATLE DATEMVNLYR DALDAGECVV KEWRPMNMHS FTWSPYLNHE
      541 WDESYPNKVE MKRLQELAKR ISTVPEAIEM QSRVAKIYGD RQAMAAGEKL FDWGGAENLA
      601 YATLVDEGIP VRLSGEDSGR GTFFHRHAVI HNQTNGSTYT PLQHIHSGQG QFKVWDSVLS
      661 EEAVLAFEYG YATAEPRTLT IWEAQFGDFA NGAQVVIDQF ISSGEQKWGR MCGLVMLLPH
      721 GYEGQGPEHS SARLERYLQL CAEQNMQVCV PSTPAQVYHM LRRQALRGMR RPLVVMSPKS
      781 LLRHPLAVST LDELANGSFQ PAIGEIDELD PKAVKRVVMC SGKVYYDLLE QRRKNDQKDV
      841 AIVRIEQLYP FPHKAVQEAL QPYAHVHDFV WCQEEPLNQG AWYCSQHHFR EVIPFGAALR
      901 YAGRPASASP AVGYMSVHQK QQQDLVNDAL NVD
//