LOCUS BCA68304.1 501 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli phage prohead protease protein. ACCESSION AP022811-1628 PROTEIN_ID BCA68304.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5283470) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 117.0X Sequencing Technology :: Illumina MiSeq, PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="1999" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST02" protein /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /inference="similar to AA sequence:INSD:AAN79917.1" /locus_tag="JE86ST02C_16280" /note="DFAST-ECOLI:AAN79917.1 phage prohead protease [pid:95.0%, q_cov:100.0%, s_cov:100.0%, Eval:5.4e-269]" /note="MGA_1629" /transl_table=11 BEGIN 1 MRRNLSHIIA AAFNEPLLLE PAYARVFFCA LGREMGAASL SVPQQQVQLD APGMLAETDE 61 YMAGGKRPAR VYRVVNGIAV LPVTGTLVHR LGGMRPFSGM TGYDGIVACL QQAMADSQVR 121 GILLDIDSPG GQAAGAFDCA DMIYRLRQQK PVWALCNDTA CSAAMLLASA CSRRLVTQTS 181 RIGSIGVMMS HVSYAGHLAQ AGVDITLIYS GAHKVDGNQF EALPAEVRQD MQQRIDAARR 241 MFAEKVAMFT GLSVDAVTGT EAAVFEGQSG IEAGLADELI NASDAISVMA TALNSNVRGG 301 TMPQLTATEA AEQENQRVMG ILTCQEAKGR EQLATMLAGQ QGMSVEQARA ILAAAAPQQP 361 VASAQSEADR IMACEEANGR EQLAATLAAM PEMTVEKARP ILAAAPLADA GPSLRDQIMA 421 LDEAKGAEAQ AEKLAACPGM TVENARAVLA AGSGKAEPVS ASTTALFEHF MANHSPAAVR 481 GGVSQTSADG DADVKMLMAM P //