LOCUS BCA67478.1 421 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli hypothetical protein protein. ACCESSION AP022811-802 PROTEIN_ID BCA67478.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5283470) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 117.0X Sequencing Technology :: Illumina MiSeq, PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="1999" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST02" protein /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /locus_tag="JE86ST02C_08020" /note="DFAST-ECOLI:ABE06228.1 cell envelope integrity inner membrane protein TolA [pid:93.2%, q_cov:31.6%, s_cov:32.0%, Eval:3.4e-67, partial hit]" /note="MGA_803" /note="WP_000030637.1 cell envelope integrity/translocation protein TolA (Escherichia coli O104:H4 str. 2011C-3493) [pid:93.2%, q_cov:31.6%, s_cov:31.6%, Eval:1.1e-65, partial hit]" /transl_table=11 BEGIN 1 MSKATEQNDK LKRAIIISAV LHVILFAALI WSSFDENIEA SAGGGGGSSI DAVMVDSGAV 61 VEQYKRMQSQ ESSAKRSDEQ RKMKEQQAAE ELREKQAAEQ ERLKQLEKER LAAQEQKKQA 121 EEAAKQAELK QKQAEEAAAK AAADAKAKAE ADAKAAEEAA KKAAADAKKK AEAEAAKAAA 181 EAQKKAEAAT AALKKKAEAA EAAAAEARKK AATEAAEKAK AEAEKKAAAE KAAADKKAAA 241 EKAAADKKAA EKAAAEKAAA DKKAAAEKAA ADKKAAAAKA AAEKAAAAKA AAEADDIFGE 301 LSSGKNAPKT GGGAKGNNAS PAGSGNTKNN GASGADINNY AGQIKSAIES KFYDASSYAG 361 KTCTLRIKLA PDGMLLDIKP EGGDPALCQA ALAAAKLAKI PKPPSQAVYE VFKNAPLDFK 421 P //