LOCUS BCA69792.1 345 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli hypothetical protein protein. ACCESSION AP022811-3116 PROTEIN_ID BCA69792.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5283470) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 117.0X Sequencing Technology :: Illumina MiSeq, PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="1999" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST02" protein /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /locus_tag="JE86ST02C_31160" /note="DFAST-ECOLI:ABF04923.1 alpha-amylase [pid:97.4%, q_cov:100.0%, s_cov:46.0%, Eval:3.1e-201, partial hit]" /note="MGA_3120" /note="Q8X958 hypothetical protein (Escherichia coli O157:H7 str. Sakai) [pid:97.7%, q_cov:100.0%, s_cov:46.0%, Eval:7.8e-202, partial hit]" /note="internal stop codon at [3243911:3243913](+)" /transl_table=11 BEGIN 1 MFSIKPGPRN LPIDNPTLLS WNITDGDLNS KLNTLEYLNC ITNIINSCGV YPQGLKDREI 61 ISTFHAEKVI NDLLKNDYKI SLSPDTTYRE LNKAAQRSIT APDRIGEGKT WVYQRDTMIE 121 RGDNSGVHQY GPAEHFTHII SDKPSPKDKY VAYAINIPDY ELAADVYNIN VTSHSGQQET 181 FKILINPEHL RQTLERKSLT AVQKSQCEII TPKKPGEAIL HAFNATYQQI RENMSEFARC 241 HYGYIQIPPV TTFRADGPET PEEEKGYWFH AYQPEDLCTI HNPMGDLQDF IALVKDAKKF 301 GIDIIPDYTF NFMGIGGSGK NDLDYPSADI RAKISKDIEG GIPGY //