LOCUS       BCA70822.1              1295 aa    PRT              BCT 06-NOV-2020
DEFINITION  Escherichia coli serine protease Sat protein.
ACCESSION   AP022811-4146
PROTEIN_ID  BCA70822.1
SOURCE      Escherichia coli
  ORGANISM  Escherichia coli
            Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales;
            Enterobacteriaceae; Escherichia.
REFERENCE   1  (bases 1 to 5283470)
  AUTHORS   Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases.
            Contact:Ken-ichi Lee
            National Institute of Infectious Diseases, Deaprtment of
            Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan
REFERENCE   2
  AUTHORS   Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S.
  TITLE     Global distribution of epidemic-related Shiga toxin 2 encoding
            phages among enteroaggregative Escherichia coli
  JOURNAL   Sci. Rep. 10, 11738 (2020)
  REMARK    Publication Status: Online-Only
            DOI:10.1038/s41598-020-68462-9
COMMENT     ##Genome-Assembly-Data-START##
            Assembly Method       :: Unicycler v. 0.4.7
            Genome Coverage       :: 117.0X
            Sequencing Technology :: Illumina MiSeq, PacBio RSII
            ##Genome-Assembly-Data-END##
FEATURES             Qualifiers
     source          /collection_date="1999"
                     /db_xref="taxon:562"
                     /host="Homo sapiens"
                     /mol_type="genomic DNA"
                     /organism="Escherichia coli"
                     /strain="JE86-ST02"
     protein         /gene="sat"
                     /locus_tag="JE86ST02C_41460"
                     /transl_table=11
BEGIN
        1 MNKIYSLKYS AATGGLIAVS ELAKRVSGKT NRKLVATMLS LAVAGTVNAA NIDISNVWAR
       61 DYLDLAQNKG IFQPGATDVT ITLKNGDKFS FHNLSIPDFS GAAASGAATA IGGSYSVTVA
      121 HNKKNPQAAE TQVYAQSSYK VVDRRNSNDF EIQRLNKFVV ETVGATPAET NPTTYSDALE
      181 RYGIVTSDGS KKIIGFRAGS GGTSFINGES KISTNSAYSH DLLSASLFEV TQWDSYGMMI
      241 YKNDKTFRNL EIFGDSGSGA YLYDNKLEKW VLVGTTHGIA SVNGDQLTWI TKYNDKLVSK
      301 LKDTYSHKIN LNGNNVTIKN TDITLHQNNA DTTGTQEKIT KDKDIVFTNG GNVLFKDNLD
      361 FGSGGIIFDE GHEYNINGQG FTFKGAGIDI GKESIVNWNA LYSSDDVLHK IGPGTLNVQK
      421 KQGANIKIGE GNVILNEEGT FNNIYLASGN GKVILNKDNS LGNDQYAGIF FTKRGGTLDL
      481 NGHNQTFTRI AATDDGTTIT NSDTTKEAVL AINNEDSYIY HGNINGNIKL THNINSQDKK
      541 TNAKLILDGS VNTKNDVEVS NASLTMQGHA TEHAIFRSTA NHCSLVFLCG TDWVTVLKET
      601 ESSYNKKFNS DYKSNNQQTS FDQPDWKTGV FKFDTLHLNN ADFSISRNAN VEGNISANKS
      661 AITIGDKNAY IDNLAGKNIT NNGFDFKQTI STNLSIGETK FTGGITAHNS QIAIGDQAVV
      721 TLNGATFLDN TPISIDKGAK VIAQNSMFTT KGIDISGELT MMGIPEQNSK AVTPGLHYAA
      781 DGFRLSGGNA NFIARNMASV TGNIYADDAA TITLGQPETE TPTISSAYQA WAETLLYGFD
      841 TAYRGAITAP KATVSMNNAI WHLNSQSSIN RLETKDSMVR FTGDNGKFTT LTVDNLTIDD
      901 SAFVLRANLA QADQLVVNKS LSGKNNLLLV DFIEKNGNSN GLNIDLVSAP KGTAVDVFKA
      961 TTRSIGFSDV TPVIEQKNDT DKATWTLIGY KSVANADAAK KATLLMSGGY KAFLAEVNNL
     1021 NKRMGDLRDI NGESGAWARI MSGTGSAGGG FSDNYTHVQV GADNKHELDG LDLFTGVTMT
     1081 YTDSHAGSDA FSGETKSVGA GLYASAMFES GAYIDLIGKY VHHDNEYTAT FAGLGTRDYS
     1141 SHSWYAGAEV GYRYHVTDSA WIEPQAELVY GAVSGKQFSW KDQGMNLTMK DKDFNPLIGR
     1201 TGVDVGKSFS GKDWKVTARA GLGYQFDLFA NGETVLRDAS GEKRIKGEKD GRMLMNVGLN
     1261 AEIRDNLRFG LEFEKSAFGK YNVDNAINAN FRYSF
//