LOCUS BCA76914.1 500 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli hypothetical protein protein. ACCESSION AP022815-4944 PROTEIN_ID BCA76914.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5327513) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 135.0X Sequencing Technology :: Illumina MiSeq, Oxford Nanopore MinION ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="2014" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST05" protein /gene="yjgR" /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /inference="similar to AA sequence:INSD:AAN45645.1" /locus_tag="JE86ST05C_49430" /note="DFAST-ECOLI:AAN45645.1 hypothetical protein [pid:97.0%, q_cov:100.0%, s_cov:96.9%, Eval:7.5e-279]" /note="MGA_4957" /transl_table=11 BEGIN 1 MSEPLLIART PDTELFLLPG MANRHGLITG ATGTGKTVTL QKLAESLSEI GVPVFMADVK 61 GDLTGIAQAG TASEKLLARL KNIGVNDWQP HANPVVVWDI FGEKGHPVRA TVSDLGPLLL 121 ARLLNLNDVQ SGVLDIIFRI ADDQGLLLLD FKDLRAITQY IGDNAKSFQN QYGNISSASV 181 GAIQRGLLSL EQQGAAHFFG EPMLDIKDWM RTDANGKGVI NILSAEKLYQ MPKLYAASLL 241 WMLSELYEQL PEAGDLEKPK LVFFFDEAHL LFNDAPQVLL DKIEQVIRLI RSKGVGVWFV 301 SQNPSDIPDN VLGQLGNRVQ HALRAFTPKD QKAVKAAAQT MRPTPAFDTE KAIQELGTGE 361 ALISFLDAKG SPSVVERAMV IAPCSRMGPV TEDERNGLIN HSPVYGKYED DVDRESAYEM 421 LQKGVQASTE QQNNPPAKGK EVAVDDGILG GLKDILFGTT GPRGGKKDGV VQTMAKSAAR 481 QVTNQIVRGM LGSLLGGRKR //