LOCUS       BCA69482.1 631 aa PRT BCT 06-NOV-2020
DEFINITION  Escherichia coli hypothetical protein.
ACCESSION   AP022811-2806
PROTEIN_ID  BCA69482.1
SOURCE      Escherichia coli
  ORGANISM  Escherichia coli
            Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales;
            Enterobacteriaceae; Escherichia.
REFERENCE   1 (bases 1 to 5283470)
  AUTHORS   Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases.
            Contact: Ken-ichi Lee
            National Institute of Infectious Diseases, Department of
            Bacteriology I; 1-23-1, Toyama, Shinjuku, Tokyo 162-8640, Japan
REFERENCE   2
  AUTHORS   Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S.
  TITLE     Global distribution of epidemic-related Shiga toxin 2 encoding
            phages among enteroaggregative Escherichia coli
  JOURNAL   Sci. Rep. 10, 11738 (2020)
  REMARK    Publication Status: Online-Only
            DOI:10.1038/s41598-020-68462-9
COMMENT     ##Genome-Assembly-Data-START##
            Assembly Method :: Unicycler v. 0.4.7
            Genome Coverage :: 117.0X
            Sequencing Technology :: Illumina MiSeq, PacBio RSII
            ##Genome-Assembly-Data-END##
FEATURES             Qualifiers
     source          /collection_date="1999"
                     /db_xref="taxon:562"
                     /host="Homo sapiens"
                     /mol_type="genomic DNA"
                     /organism="Escherichia coli"
                     /strain="JE86-ST02"
     protein         /gene="yjhS_7"
                     /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator"
                     /inference="similar to AA sequence:INSD:CAU96912.1"
                     /locus_tag="JE86ST02C_28060"
                     /note="DFAST-ECOLI:CAU96912.1 hypothetical protein [pid:89.4%, q_cov:100.0%, s_cov:100.0%, Eval:0.0e+00]"
                     /note="MGA_2808"
                     /transl_table=11
BEGIN
        1 MAFKHYDVVR AASPSDLAER ITQKLKEGWQ PYGSALISTA GYGAEFIQPV VSEGSISSPE
       61 EPGNRPTTSA PSVAPEYYYV IALAGQSNGM SYGEGLPLPD TFDSPDPRIK QLARRSTVTP
      121 GGAACKYNDI IPADHCLHDV QDMSTINHPR ADLSKGQYGC VGQGLHIAKK LLPYIPNNAG
      181 ILLVPCCRGG SAFTQGTEGT FSESTGASQD SARWGVGKPL YQDLLFRTKA ALQKNPKNVL
      241 LAICWMQGEF DMKNASYAQQ PAAFLAMVQQ FRADLAGLAA QCHGGSPASV PWICGDTTYA
      301 WKQEHGTQYE VVYGAYKGKE SQQIYFVPFM TDGSGVNTPT NNPSEDPDIA GSGYYGSASR
      361 TNKNWVSSNR PTHFSSWARR GIIPDRMATA ILNVAGRTLA FISGKAPEIK PSPGGDTPSG
      421 PSEDASVRTI SLLPTAGDAA AQGWTIKDGG IQLSGGVFKI AKQSNKTWAL TRPVDDAVSL
      481 LTRGGRLSCK FRLSGALTNN QFGLGIYLYT DVALPDVVAM TGTGNPFLMS FFTQTTDGKL
      541 NLMHHKKAGN TKLGEFGNYS NDWQTLELVF TAGSATVTPK LNGVAGPAFQ VIKDSLTLGL
      601 NALTLTDITK NAAYGVEIES LVLEINAPAS S
//