LOCUS BCA69188.1 635 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli hypothetical protein protein. ACCESSION AP022811-2512 PROTEIN_ID BCA69188.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5283470) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 117.0X Sequencing Technology :: Illumina MiSeq, PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="1999" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST02" protein /EC_number="3.6.1.-" /gene="yegI" /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /inference="similar to AA sequence:INSD:AAN81053.1" /locus_tag="JE86ST02C_25120" /note="DFAST-ECOLI:AAN81053.1 hypothetical protein [pid:93.6%, q_cov:99.1%, s_cov:96.9%, Eval:0.0e+00]" /note="MGA_2514" /transl_table=11 BEGIN 1 MTTLGLELGK GGEGAVYDIN EFVDSVAKIY HTPPPALKQD KLAFMAATAD AQLLNYVAWP 61 QATLHGGRGG KVIGFMMPKV SGKEPIHMIY SPAHRRQRYP HCAWDFLLYV ARNIASSFAT 121 VHEHGHVVGD VNQNSFMVGR DSKVVLIDSD SFQINANGTL HLCEVGVSHF TPPELQTLSS 181 FVGFERTKNH DNFGLALLIF HVLFGGRHPY SGVPLISDAG NALETDITHF RYAYASDNQR 241 RGLKPPPRSI PLSMLPSDVE AMFQQAFTES GVATGRPTAK AWVAALDSLR QQLKKCPVSA 301 MHVYPAHLTD CPWCALDNQG VIYFIDLGEE VITTGGDFVL AKVWAMVMAS VAPPALQLPL 361 PDHFQPTGRP LPLGLLRREY IILLEIALSA LSLLLCGLQA EPRYIILVPV LAAIWIIGSL 421 TSKAYKAEVQ QRREAFNRAK MDYDHLVRQI QQVGGLEGFI AKRTMLEKMK DEILGLPEEE 481 KRALAALHDT ARERQKQKFL EGFFIDVASI PGVGPARKAA LRSFGIETAA DVTRRGVKQV 541 KGFGDHLTQA VIDWKASCER RFVFRPNEAI TPADRQAVMA KMTAKRHRLE STLTVGATEL 601 QRFRLHAPAR TMPLMEPLRQ AAEKLAQAQA DLSRC //