LOCUS BCA75958.1 686 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli hypothetical protein protein. ACCESSION AP022815-3988 PROTEIN_ID BCA75958.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5327513) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 135.0X Sequencing Technology :: Illumina MiSeq, Oxford Nanopore MinION ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="2014" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST05" protein /gene="yhjG" /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /inference="similar to AA sequence:INSD:AAN82771.1" /locus_tag="JE86ST05C_39880" /note="DFAST-ECOLI:AAN82771.1 hypothetical protein [pid:94.6%, q_cov:100.0%, s_cov:99.3%, Eval:0.0e+00]" /note="MGA_3995" /transl_table=11 BEGIN 1 MSKAGKITAA ISGAFLLLIV VAIILIATFD WNRLKPTINQ KVSAELNRPF AIRGDLGVVW 61 ERQKQETGWR SWVPWPHVHA EDIILGNPPD IPEVTMVHLP RVEATLAPLA LLTKTVWLPW 121 IKLEKPDARL IRLSEKNNNW TFNLANDDNK DANAKPSAWS FRLDNILFDQ GRIAIDDKVS 181 KADLEIFVDP LGKPLPFSEV TGSKGKADKE KVGDYVFGLK AQGRYNGEPL TGTGKIGGML 241 ALRGEGTPFP VQADFRSGNT RVAFDGVVND PMKMGGVDLR LKFSGDSLGD LYELTGVLLP 301 DTPPFETDGR LVAKIDTEKS SVFDYRGFNG RIGDSDIHGS LIYTTGKPRP KLEGDVESRQ 361 LRLADLGPLI GVDSGKGAEK SKRSEQKKGE KSVQPAGKVL PYDRFETDKW DVMDADVRFK 421 GRRIEHGSSL PISDLSTHII LKNADLRLQP LKFGMAGGSI AANIHLEGDK KPMQGRADIQ 481 ARRLKLKELM PDVELMQKTL GEMNGDAELR GSGNSVAALL GNSNGNLKLL MNDGLVSRNL 541 MEIVGLNVGN YIVGAIFGDD EVRVNCAAAN LDIANGVARP QIFAFDTENA LINVTGTASF 601 ASEQLDLTID PESKGIRIIT LRSPLYVRGT FKNPQAGVKA GPLIARGAVA AALATLVTPA 661 AALLALISPS EGEANQCRTI LSQMKK //