LOCUS BCA72727.1 374 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli hypothetical protein protein. ACCESSION AP022815-757 PROTEIN_ID BCA72727.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5327513) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 135.0X Sequencing Technology :: Illumina MiSeq, Oxford Nanopore MinION ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="2014" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST05" protein /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /locus_tag="JE86ST05C_07570" /note="DFAST-ECOLI:CAU96554.1 type IV secretion protein Rhs [pid:98.9%, q_cov:100.0%, s_cov:26.8%, Eval:2.1e-230, partial hit]" /note="MGA_758" /note="WP_014966181.1 RHS element protein RhsA (Escherichia coli O104:H4 str. 2011C-3493) [pid:98.9%, q_cov:100.0%, s_cov:26.8%, Eval:6.4e-229, partial hit]" /note="internal stop codon at [806344:806346](+)" /transl_table=11 BEGIN 1 MTWYGWDGDR LTTIQNDRSR IQTIYQPGSF TPLIRVETAT GELARTQRRS LADTLQQSGG 61 EDGGSVVFPP VLVQMLDRLE SEILADRVSE ESRRWLASCG LTVEQMQNQM DPVYTPARKI 121 HLYHCDHRGL PLALISTEGA TAWCAEYDEW GNLLNEENPH QLQQLIRLPG QQYDEESGLY 181 YNRHRYYDPL QGRYITQDPI GLKGGWNFYQ YPLNPISNID PLGLETLKCI KPLHSMGGTG 241 ERSGPDIWGN PFYHQYLCVP DGKGDYTCGG QDQRGESKGD GLWGPGKASN DTKEAAGRCD 301 LVETDNSCVE NCLKGKFKEV RPRYSVLPDI FTPINLGLFK NCQDWSNDSL ETCKMKCSGN 361 NIGRFIRFVF TGVM //