LOCUS       BAX22404.1               407 aa    PRT              BCT 27-MAR-2019
DEFINITION  Escherichia coli general secretion pathway protein F.
ACCESSION   AP017620-3062
PROTEIN_ID  BAX22404.1
SOURCE      Escherichia coli
  ORGANISM  Escherichia coli
            Bacteria; Pseudomonadati; Pseudomonadota; Gammaproteobacteria;
            Enterobacterales; Enterobacteriaceae; Escherichia.
REFERENCE   1  (bases 1 to 5042704)
  AUTHORS   Sekizuka,T., Kawanishi,M., Ohnishi,M., Shima,A., Kato,K.,
            Yamashita,A., Matsui,M., Suzuki,S. and Kuroda,M.
  TITLE     Direct Submission
  JOURNAL   Submitted (27-JUL-2016)
            Contact:Tsuyoshi Sekizuka
            National Institute of Infectious Diseases, Pathogen Genomics
            Center; 1-23-1 Toyama, Shinjuku-ku, Tokyo 162-8640, Japan
REFERENCE   2
  AUTHORS   Sekizuka,T., Kawanishi,M., Ohnishi,M., Shima,A., Kato,K.,
            Yamashita,A., Matsui,M., Suzuki,S. and Kuroda,M.
  TITLE     Elucidation of quantitative structural diversity of remarkable
            rearrangement regions, shufflons, in IncI2 plasmids
  JOURNAL   Sci. Rep. 7, 928 (2017)
  REMARK    Publication Status: Online-Only
            DOI:10.1038/s41598-017-01082-y
COMMENT     ##Genome-Assembly-Data-START##
            Assembly Method       :: HGAP Assembly v. 3.0; Celera Assembler v.
                                     8.2
            Genome Coverage       :: 200x
            Sequencing Technology :: MiSeq; PacBio RSII
            ##Genome-Assembly-Data-END##
FEATURES             Qualifiers
     source          /collection_date="2013"
                     /db_xref="taxon:562"
                     /geo_loc_name="Japan"
                     /host="Bos taurus"
                     /mol_type="genomic DNA"
                     /organism="Escherichia coli"
                     /strain="MRY15-131"
     protein         /gene="gspF"
                     /locus_tag="MRY15131_c30620"
                     /transl_table=11
BEGIN
        1 MALFYYQALE RNGRKTKGMI EADSARHARQ LLRGKELIPV HIEARLNAST GGMLQRRRHA
       61 HRRVAAADLA LFTRQLATLV QAAMPLETCL QAVSEQSEKL HVKSLGMALR SRIQEGYTLS
      121 DSLREHPRVF DSLFCSMVAA GEKSGHLDVV LNRLADYTEQ RQRLKSRLLQ AMLYPLVLLV
      181 VATGVVTILL TAVVPKIIEQ FDHLGHALPA STRMLIAMSD TLQTSGVYWL AGLLGLLVLG
      241 QRLLKNPAMR LRWDKTLLRL PVTGRVARGL NTARFSRTLS ILTASSVPLL EGIQTAAAVS
      301 ANRYVEQQLL LAADRVREGS SLRAALADLR LFPPMMLYMI ASGEQSGELE TMLEQAAVNQ
      361 EREFDTQVGL ALGLFEPALV VMMAGVVLFI VIAILEPMLQ LNNMVGM
//