LOCUS BCA72627.1 414 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli hypothetical protein protein. ACCESSION AP022815-657 PROTEIN_ID BCA72627.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5327513) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 135.0X Sequencing Technology :: Illumina MiSeq, Oxford Nanopore MinION ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="2014" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST05" protein /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /locus_tag="JE86ST05C_06570" /note="DFAST-ECOLI:BAB38392.1 phage portal protein [pid:97.8%, q_cov:100.0%, s_cov:74.7%, Eval:3.8e-241, partial hit]" /note="MGA_658" /note="WP_001057672.1 hypothetical protein (Escherichia coli O157:H7 str. Sakai) [pid:97.8%, q_cov:100.0%, s_cov:74.7%, Eval:1.2e-239, partial hit]" /note="frameshifted, insertion at around 710536" /transl_table=11 BEGIN 1 MTYMIRFPNS GFKIQALSSR PSNLRGLQGD VVIDEAAFHE SLDELLKAAM ALTMWGARVR 61 IISTHNGVDN LFNQYIQEAR EGRKDYSVHR ITLDDAIADG LYRRICYVTD QEWSPESEQK 121 WRDDLYKNAP TREDADEEYG CIPKKSGGAY IPHALIEMAM IRDIPILTFE APDNFISRAA 181 WLRESEVLTW CEEHLKPLLE KLNPRSRFSF GEDFARTGDL SCFVLLEITE SLAKREVFRV 241 ELRNLPYAQQ EQVMMYILTR VPALVGAAFD ATGNGGYLAE AALLAFGPDI IDCVMLSPKW 301 YGEWMPKLKA EFEDQNILVA RHQTTLDDLR HVKVVNGIPQ IDKGRTKDQN ATAANARRHG 361 DFAVALCMAN RASYMEGFIL DESACQALPE RSRVMEGGYR DDDEAYHEFD RGGW //