LOCUS BCA67422.1 648 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli PTS N-acetylglucosamine EIICBA component protein. ACCESSION AP022811-746 PROTEIN_ID BCA67422.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5283470) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 117.0X Sequencing Technology :: Illumina MiSeq, PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="1999" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST02" protein /EC_number="2.7.1.69" /gene="nagE" /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /inference="similar to AA sequence:INSD:AAC73773.1" /locus_tag="JE86ST02C_07460" /note="DFAST-ECOLI:AAC73773.1 PTS N-acetylglucosamine EIICBA component [pid:94.4%, q_cov:100.0%, s_cov:100.0%, Eval:0.0e+00]" /note="MGA_747" /transl_table=11 BEGIN 1 MNILGFFQRL GRALQLPIAV LPVAALLLRF GQPDLLNVAF IAQAGGAIFD NLALIFAIGV 61 ASSWSKDSAG AAALAGAVGY FVLTKAMVTI NPEINMGVLA GIITGLVGGA AYNRWSDIKL 121 PDFLSFFGGK RFVPIATGFF CLVLAAIFGY VWPPVQHAIH AGGEWIVSAG ALGSGIFGFI 181 NRLLIPTGLH QVLNTIAWFQ IGEFTNAAGT VFHGDINRFY AGDGTAGMFM SGFFPIMMFG 241 LPGAALAMYF AAPKERRPMV GGMLLSVAVT AFLTGVTEPL EFLFMFLAPL LYLLHALLTG 301 ISLFVATLLG IHAGFSFSAG AIDYALMYNL PAASQNVWML LVMGVVFFAI YFVVFSLVIR 361 MFNLKTPGRE DKEDEIVTEE ANSNTEEGLN QLATNYIAAV GGTDNLKAID ACITRLRLTV 421 ADSARVNDTM CKRLGASGVV KLNKQTIQVI VGAKAESIGD AMKKVVARGP VAAASAEATP 481 ATAAPVAKPQ AVPNAVSIAE LVSPITGDVV ALDQVPDEAF ASKAVGDGVA VKPTDKIVVS 541 PAAGTIVKIF NTNHAFCLET EKGAEIVVHM GIDTVALEGK GFKRLVEEGA QVSAGQPILE 601 MDLDYLNANA RSMISPVVCS NIDDFSGLII KAQGHVVAGQ TPLYEIKK //