LOCUS BCA76823.1 615 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli methyl-directed mismatch repair protein MutL protein. ACCESSION AP022815-4853 PROTEIN_ID BCA76823.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5327513) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 135.0X Sequencing Technology :: Illumina MiSeq, Oxford Nanopore MinION ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="2014" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST05" protein /gene="mutL" /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /inference="similar to AA sequence:INSD:ABF06303.1" /locus_tag="JE86ST05C_48520" /note="DFAST-ECOLI:ABF06303.1 methyl-directed mismatch repair protein MutL [pid:95.2%, q_cov:100.0%, s_cov:100.0%, Eval:0.0e+00]" /note="MGA_4866" /transl_table=11 BEGIN 1 MPIQVLPPQL ANQIAAGEVV ERPASVVKEL VENSLDAGAT RIDIDIERGG AKLIRIRDNG 61 CGIKKDELAL ALARHATSKI ASLDDLEAII SLGFRGEALA SISSVSRLTL TSRTAEQQEA 121 WQAYAEGRDM NVTVKPAAHP VGTTLEVLDL FYNTPARRKF LRTEKTEFNH IDEIIRRIAL 181 ARFDVTINLS HNGKIVRQYR AVPEGGQKER RLGAICGTAF LEQALAIEWQ HGDLTLRGWV 241 ADPNHTTPAL AEIQYCYVNG RMMRDRLINH AIRQACEDKL GADQQPAFVL YLEIDPHQVD 301 VNVHPAKHEV RFHQSRLVHD FIYQGVLSVL QQQLETPLPL DDEPQPAPRS IPENRVAAGR 361 NHFAEPAARE PVAPRYTPAP ASGSRPAAPW PNAQPGYQKQ QGEVYRQLLQ TPAPMQKPKA 421 PEPQEPALAA NSQSFGRVLT IVHSDCALLE RDGNISLLSL PVAERWLRQA QLTPGEAPVC 481 AQPLLIPLRL KVSAEEKSAL EKAQSALAEL GIDFQSDAQH VTIRAVPLPL RQQNLQILIP 541 ELIGYLAKQS VFEPGNIAQW IARNLMSEHA QWSMAQAITL LADVERLCPQ LVKTPPGGLL 601 QSVDLHPAIK ALKDE //