LOCUS BCA69234.1 467 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli hypothetical protein protein. ACCESSION AP022811-2558 PROTEIN_ID BCA69234.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5283470) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 117.0X Sequencing Technology :: Illumina MiSeq, PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="1999" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST02" protein /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /locus_tag="JE86ST02C_25580" /note="DFAST-ECOLI:BAI31371.1 hypothetical protein [pid:96.1%, q_cov:100.0%, s_cov:63.6%, Eval:1.9e-260, partial hit]" /note="MGA_2560" /note="WP_001294422.1 hypothetical protein (Escherichia coli O104:H4 str. 2011C-3493) [pid:96.8%, q_cov:100.0%, s_cov:61.5%, Eval:1.4e-260, partial hit]" /note="frameshifted, deletion at around 2648243,2648246" /transl_table=11 BEGIN 1 MAAAHLHAMA LAQLRGHTLP LRTDWLDAIA GSLIKEALNA PLPWSYRGVI HPDTDPILLT 61 LIDTLAGDGF GKLAPSTPQP PLPKDVTCEL ERTAISLPAE LTLNRFNPNG LAQSQVLHRL 121 AILEIPGIVR QQGSTLTLAG NGEERWKLTR PLSQHAALIE AACFGATLQE AARHKLEADM 181 LDAGGIGSIT TCLSQAALAG LASFSQQLLE QLTLLIAQEN QFAEMGQALE VLYALWRLDE 241 ISGMQGAQIL QTTLCAAIDR TLWLCESNGR PDEKEFHAHL HSWQALCHIL RDLHSGVQLP 301 GISLSAAVAL LERRSQAIHA PALDRGAAHG ALMRLEHPNA SAEAALTMLA QLSPAQSGEA 361 LHGLLALARH QLACQPTFIA GFSSHLNQLS DADFINALPD LRAAMAWLPP RERGTLAHQV 421 LEHYQLAQLP VSALQMPLHC PPQAIAHHQQ LEQQALASLQ NWGVFHV //