LOCUS       BCA70822.1   1295 aa   PRT   BCT   06-NOV-2020
DEFINITION  Escherichia coli serine protease Sat protein.
ACCESSION   AP022811-4146
PROTEIN_ID  BCA70822.1
SOURCE      Escherichia coli
  ORGANISM  Escherichia coli
            Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales;
            Enterobacteriaceae; Escherichia.
REFERENCE   1  (bases 1 to 5283470)
  AUTHORS   Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases.
            Contact:Ken-ichi Lee
            National Institute of Infectious Diseases, Department of
            Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan
REFERENCE   2
  AUTHORS   Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S.
  TITLE     Global distribution of epidemic-related Shiga toxin 2 encoding
            phages among enteroaggregative Escherichia coli
  JOURNAL   Sci. Rep. 10, 11738 (2020)
  REMARK    Publication Status: Online-Only
            DOI:10.1038/s41598-020-68462-9
COMMENT     ##Genome-Assembly-Data-START##
            Assembly Method       :: Unicycler v. 0.4.7
            Genome Coverage       :: 117.0X
            Sequencing Technology :: Illumina MiSeq, PacBio RSII
            ##Genome-Assembly-Data-END##
FEATURES             Qualifiers
     source          /collection_date="1999"
                     /db_xref="taxon:562"
                     /host="Homo sapiens"
                     /mol_type="genomic DNA"
                     /organism="Escherichia coli"
                     /strain="JE86-ST02"
     protein         /gene="sat"
                     /locus_tag="JE86ST02C_41460"
                     /transl_table=11
BEGIN
        1 MNKIYSLKYS AATGGLIAVS ELAKRVSGKT NRKLVATMLS LAVAGTVNAA NIDISNVWAR
       61 DYLDLAQNKG IFQPGATDVT ITLKNGDKFS FHNLSIPDFS GAAASGAATA IGGSYSVTVA
      121 HNKKNPQAAE TQVYAQSSYK VVDRRNSNDF EIQRLNKFVV ETVGATPAET NPTTYSDALE
      181 RYGIVTSDGS KKIIGFRAGS GGTSFINGES KISTNSAYSH DLLSASLFEV TQWDSYGMMI
      241 YKNDKTFRNL EIFGDSGSGA YLYDNKLEKW VLVGTTHGIA SVNGDQLTWI TKYNDKLVSK
      301 LKDTYSHKIN LNGNNVTIKN TDITLHQNNA DTTGTQEKIT KDKDIVFTNG GNVLFKDNLD
      361 FGSGGIIFDE GHEYNINGQG FTFKGAGIDI GKESIVNWNA LYSSDDVLHK IGPGTLNVQK
      421 KQGANIKIGE GNVILNEEGT FNNIYLASGN GKVILNKDNS LGNDQYAGIF FTKRGGTLDL
      481 NGHNQTFTRI AATDDGTTIT NSDTTKEAVL AINNEDSYIY HGNINGNIKL THNINSQDKK
      541 TNAKLILDGS VNTKNDVEVS NASLTMQGHA TEHAIFRSTA NHCSLVFLCG TDWVTVLKET
      601 ESSYNKKFNS DYKSNNQQTS FDQPDWKTGV FKFDTLHLNN ADFSISRNAN VEGNISANKS
      661 AITIGDKNAY IDNLAGKNIT NNGFDFKQTI STNLSIGETK FTGGITAHNS QIAIGDQAVV
      721 TLNGATFLDN TPISIDKGAK VIAQNSMFTT KGIDISGELT MMGIPEQNSK AVTPGLHYAA
      781 DGFRLSGGNA NFIARNMASV TGNIYADDAA TITLGQPETE TPTISSAYQA WAETLLYGFD
      841 TAYRGAITAP KATVSMNNAI WHLNSQSSIN RLETKDSMVR FTGDNGKFTT LTVDNLTIDD
      901 SAFVLRANLA QADQLVVNKS LSGKNNLLLV DFIEKNGNSN GLNIDLVSAP KGTAVDVFKA
      961 TTRSIGFSDV TPVIEQKNDT DKATWTLIGY KSVANADAAK KATLLMSGGY KAFLAEVNNL
     1021 NKRMGDLRDI NGESGAWARI MSGTGSAGGG FSDNYTHVQV GADNKHELDG LDLFTGVTMT
     1081 YTDSHAGSDA FSGETKSVGA GLYASAMFES GAYIDLIGKY VHHDNEYTAT FAGLGTRDYS
     1141 SHSWYAGAEV GYRYHVTDSA WIEPQAELVY GAVSGKQFSW KDQGMNLTMK DKDFNPLIGR
     1201 TGVDVGKSFS GKDWKVTARA GLGYQFDLFA NGETVLRDAS GEKRIKGEKD GRMLMNVGLN
     1261 AEIRDNLRFG LEFEKSAFGK YNVDNAINAN FRYSF
//