LOCUS BCA68905.1 686 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli protease II protein. ACCESSION AP022811-2229 PROTEIN_ID BCA68905.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5283470) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 117.0X Sequencing Technology :: Illumina MiSeq, PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="1999" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST02" protein /EC_number="3.4.21.83" /gene="ptrB" /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /inference="similar to AA sequence:INSD:AAC74915.1" /locus_tag="JE86ST02C_22290" /note="DFAST-ECOLI:AAC74915.1 protease II [pid:99.1%, q_cov:100.0%, s_cov:100.0%, Eval:0.0e+00]" /note="MGA_2231" /transl_table=11 BEGIN 1 MLPKAARIPH AMTLHGDTRI DNYYWLRDDT RSQPEVLDYL QQENSYGHRV MASQQALQDR 61 ILKEIIDRIP QREVSAPYIK NGYRYRHIYE PGCEYAIYQR QSAFSEEWDE WETLLDANKR 121 AAHSEFYSMG GMAITPDNTI MALAEDFLSR RQYGIRFRNL ETGNWYPELL DNVEPSFVWA 181 NDSWTFYYVR KHPVTLLPYQ VWRHAIGTPA SQDKLIYEEK DDTYYVSLHK TTSKHYVVIH 241 LASATTSEVR LLDAEMADAE PFVFLPRRKD HEYSLDHYQH RFYLRSNRHG KNFGLYRTRM 301 RDEQQWEELI PPRENIMLEG FTLFTDWLVV EERQRGLTSL RQINRKTREV IGIAFDDPAY 361 VTWIAYNPEP ETARLRYGYS SMTTPDTLFE LDMDTGERRV LKQTEVPGFD AANYRSEHLW 421 IVARDGVEVP VSLVYHRKHF RKGHNPLLVY GYGSYGASID ADFSFSRLSL LDRGFVYAIV 481 HVRGGGELGQ QWYEDGKFLK KKNTFNDYLD ACDALLKLGY GSPSLCYAMG GSAGGMLMGV 541 AINQRPELFH GVIAQVPFVD VVTTMLDESI PLTTGEFEEW GNPQDPQYYE YMKSYSPYDN 601 VTAQAYPHLL VTTGLHDSQV QYWEPAKWVA KLRELKTDDH LLLLCTDMDS GHGGKSGRFK 661 SYEGVAMEYA FLVALAQGTL PATPAD //