LOCUS BCA67026.1 472 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli hypothetical protein protein. ACCESSION AP022811-350 PROTEIN_ID BCA67026.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5283470) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 117.0X Sequencing Technology :: Illumina MiSeq, PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="1999" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST02" protein /gene="yahG" /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /inference="similar to AA sequence:INSD:AAC73424.1" /locus_tag="JE86ST02C_03500" /note="DFAST-ECOLI:AAC73424.1 hypothetical protein [pid:99.2%, q_cov:100.0%, s_cov:100.0%, Eval:1.8e-274]" /note="MGA_351" /transl_table=11 BEGIN 1 MSQSLFSQPL NVINVGIAMF SDNLKKQHVE VTQLDWTPPG QGNMQVVQAL DNIADSPLAD 61 KIAAANQQAL ERIIQSHPVL IGFDQAINVV PGMTPKTILH AGPPITWEKM CGAMKGAVTG 121 ALVFEGLAKD LDEATELAAS GEITFSPCHE HDCVGSMAGV TSASMFMHIV KNKTYGNIAY 181 TNMSEQMAKI LRMGANDQSV IDRLNWMRDV QGPMLRDAMK IIGEIDLRLM LAQALHMGDE 241 CHNRNNAGTT LLIQALTPGI IQAGYSVEQQ REVFEFVASS DYFSGPTWMA MCKAAMDAAH 301 GIEYSTVVTT MARNGVEFGL RVSGLPGQWF TGPAQQVIGP MFAGYKPEDS GLDIGDSAIT 361 ETYGIGGFAM ATAPAIVALV GGTVEEAIDF SRQMREITLG ENPNVTIPLL GFMGVPSAID 421 ITRVGSSGIL PVINTAIAHK DAGVGMIGAG IVHPPFACFE KAILGWCERY GV //