LOCUS       BCA68676.1               502 aa            PRT       BCT 06-NOV-2020
DEFINITION  Escherichia coli hypothetical protein protein.
ACCESSION   AP022811-2000
PROTEIN_ID  BCA68676.1
SOURCE      Escherichia coli
  ORGANISM  Escherichia coli
            Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales;
            Enterobacteriaceae; Escherichia.
REFERENCE   1  (bases 1 to 5283470)
  AUTHORS   Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases.
            Contact:Ken-ichi Lee National Institute of Infectious Diseases,
            Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo
            162-8640, Japan
REFERENCE   2
  AUTHORS   Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S.
  TITLE     Global distribution of epidemic-related Shiga toxin 2 encoding
            phages among enteroaggregative Escherichia coli
  JOURNAL   Sci. Rep. 10, 11738 (2020)
  REMARK    Publication Status: Online-Only
            DOI:10.1038/s41598-020-68462-9
COMMENT     ##Genome-Assembly-Data-START##
            Assembly Method       :: Unicycler v. 0.4.7
            Genome Coverage       :: 117.0X
            Sequencing Technology :: Illumina MiSeq, PacBio RSII
            ##Genome-Assembly-Data-END##
FEATURES             Qualifiers
     source          /collection_date="1999"
                     /db_xref="taxon:562"
                     /host="Homo sapiens"
                     /mol_type="genomic DNA"
                     /organism="Escherichia coli"
                     /strain="JE86-ST02"
     protein         /gene="ydgA"
                     /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator"
                     /inference="similar to AA sequence:INSD:AAC74686.1"
                     /locus_tag="JE86ST02C_20000"
                     /note="DFAST-ECOLI:AAC74686.1 hypothetical protein [pid:97.6%, q_cov:100.0%, s_cov:100.0%, Eval:5.8e-279]"
                     /note="MGA_2002"
                     /transl_table=11
BEGIN
        1 MNKSLVAVGV IVALGVVWTG GAWYTGKKIE THLEDMVAQA NAQLKLTAPE SNLEVSYQNY
       61 HRGVFSSQLQ LLVKPIAGKE NPWIKSGQSV IFNESVDHGP FPLAQLKKLN LIPSMASIQT
      121 TLVNNEVSKP LFDMAKGETP FEINSRIGYS GDSSSDISLK PLNYEQKDEK VAFSGGEFQL
      181 NADRDGKAIS LSGEAQSGRI DAVNEYNQKV QLTFNNLKTD GSSTLASFGE RVGNQKLSLE
      241 KMTISVEGKE LALLEGMEIS GKSDLVNDGK TINSQLDYSL NSLKVQNQDL GSGKLTLKVG
      301 QIDGEAWHQF SQQYNAQTQA LLAQPEIANN PELYQEKVTE AFFSALPLML KGDPVITIAP
      361 LSWKNSQGES ALNLSLFLKD PATTKEAPQT LAQEVDRSVK SLDAKLTIPV DMATEFMTQV
      421 AKLEGYQEDQ AKKLAKQQVE GASAMGQMFR LTTLQDNTIT TSLQYTNGQI TLNGQKMPLE
      481 DFVGMFAMPA LNVPVVPAIP QQ
//