LOCUS BCA71007.1 348 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli enterobacterial common antigen polysaccharide chain length modulation protein. ACCESSION AP022811-4331 PROTEIN_ID BCA71007.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5283470) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Department of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 
0.4.7 Genome Coverage :: 117.0X Sequencing Technology :: Illumina MiSeq, PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="1999" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST02" protein /gene="wzzE" /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /inference="similar to AA sequence:INSD:AAN83138.1" /locus_tag="JE86ST02C_43300" /note="DFAST-ECOLI:AAN83138.1 enterobacterial common antigen polysaccharide chain length modulation protein [pid:98.3%, q_cov:100.0%, s_cov:99.7%, Eval:3.9e-196]" /note="MGA_4338" /transl_table=11 BEGIN 1 MTQPMPGKPA EDAENELDIR GLFRTLWAGK LWIIGMGLAF ALIALAYTFF ARQEWSSTAI 61 TDRPTVNMLG GYYSQQQFLR NLDVRSNMAS ADQPSVMDEA YKEFVMQLAS WDTRREFWLQ 121 TDYYKQRMVG NSKADAALLD EMINNIQFIP GDFTRAVNDS VKLIAETAPD ANNLLRQYVA 181 FASQRAASHL NDELKGAWAA RTIQMKAQVK RQEEVAKAIY DRRMNSIEQA LKIAEQHNIS 241 RSATDVPAEE LPDSEMFLLG RPMLQARLEN LQAVGPAFDL DYDQNRAMLN TLNVGPTLDP 301 RFQTYRYLRT PEEPVKRDSP RRAFLMIMWG IVGGLIGAGV ALTRRCSK //