LOCUS       BCA67410.1    625 aa    PRT    BCT    06-NOV-2020
DEFINITION  Escherichia coli hypothetical protein.
ACCESSION   AP022811-734
PROTEIN_ID  BCA67410.1
SOURCE      Escherichia coli
  ORGANISM  Escherichia coli
            Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales;
            Enterobacteriaceae; Escherichia.
REFERENCE   1  (bases 1 to 5283470)
  AUTHORS   Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases.
            Contact:Ken-ichi Lee
            National Institute of Infectious Diseases, Department of
            Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan
REFERENCE   2
  AUTHORS   Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S.
  TITLE     Global distribution of epidemic-related Shiga toxin 2 encoding
            phages among enteroaggregative Escherichia coli
  JOURNAL   Sci. Rep. 10, 11738 (2020)
  REMARK    Publication Status: Online-Only
            DOI:10.1038/s41598-020-68462-9
COMMENT     ##Genome-Assembly-Data-START##
            Assembly Method       :: Unicycler v. 0.4.7
            Genome Coverage       :: 117.0X
            Sequencing Technology :: Illumina MiSeq, PacBio RSII
            ##Genome-Assembly-Data-END##
FEATURES             Qualifiers
     source          /collection_date="1999"
                     /db_xref="taxon:562"
                     /host="Homo sapiens"
                     /mol_type="genomic DNA"
                     /organism="Escherichia coli"
                     /strain="JE86-ST02"
     protein         /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator"
                     /inference="similar to AA sequence:INSD:ABG68707.1"
                     /locus_tag="JE86ST02C_07340"
                     /note="DFAST-ECOLI:ABG68707.1 hypothetical protein [pid:89.3%, q_cov:100.0%, s_cov:100.0%, Eval:0.0e+00]"
                     /note="MGA_735"
                     /transl_table=11
BEGIN
        1 MSASSVKPLN VQLPAITLIL FALCVGIFCY LAQWMSYEEV DQSALIHLGA NVASLTLSGE
       61 SWRLFSSVFL HSSFSHLLMN MFALLVVGAV AERILGKWRL LIIWLFSGVF GGLISACYAL
      121 RDSDQIVISV GASGAIMGIA GAALATQLAS GAGTYHKNQR RVFSLLGMVA LTLLYGARQA
      181 GIDNACHIGG LIAGGALGWL SARLSGQNRL VTEGGIIVAG SLLLTGGIWL AQQQMDESVL
      241 QVRQSLREAF YPQEIEQERR QKKQQLAEER NALRETLSAP VSREQASGDL LAEIADIHDM
      301 AISRDGNMLY AAIENTNSIV VFDLGQKKIL HTFTAPIAKE KSVKHCGGCK DQGVRSLALS
      361 PDEKLIYATS FEANALSVIN VATGEIIQSI TTGAHPDSLI LSRDGTKAWV MNRTSNSVSA
      421 IDLVTYQHVA DIPLEKYDGT GTSNKPGAWV MALSPDEKIL LIPGMVRGDI VRINTITHQK
      481 ESYPASDARG TISAMRFRPE NGDVIFADSQ GISRISVGDQ QASIMTQWCS RSVYSVEGIS
      541 PDGQYLALVS YGLQGYVILL NINAGQIIGV YPASYVNHLR FSADGRKIFV MAKNGLIQLD
      601 RTRSLDPQAI IRHPQYGNVA CIPEP
//