LOCUS BCA69246.1 765 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli beta-D-glucoside glucohydrolase protein. ACCESSION AP022811-2570 PROTEIN_ID BCA69246.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5283470) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 117.0X Sequencing Technology :: Illumina MiSeq, PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="1999" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST02" protein /EC_number="3.2.1.21" /gene="bglX" /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /inference="similar to AA sequence:INSD:ABG70170.1" /locus_tag="JE86ST02C_25700" /note="DFAST-ECOLI:ABG70170.1 beta-D-glucoside glucohydrolase [pid:99.7%, q_cov:100.0%, s_cov:97.0%, Eval:0.0e+00]" /note="MGA_2572" /transl_table=11 BEGIN 1 MKWLCSVGIA VSLALQPALA DDLFGNHPLT PEARDAFVTE LLKKMTVDEK IGQLRLISVG 61 PDNPKEAIRE MIKDGQVGAI FNTVTRQDIR AMQDQVMELS RLKIPLFFAY DVLHGQRTVF 121 PISLGLASSF NLDAVKTVGR VSAYEAADDG LNMTWAPMVD VSRDPRWGRA SEGFGEDTYL 181 TSTMGKTMVE AMQGKSPADR YSVMTSVKHF AAYGAVEGGK EYNTVDMSPQ RLFNDYMPPY 241 KAGLDAGSGA VMVALNSLNG TPATSDSWLL KDVLRDQWGF KGITVSDHGA IKELIKHGTA 301 ADPEDAVRVA LKSGINMSMS DEYYSKYLPG LIKSGKVTME ELDDAARHVL NVKYDMGLFN 361 DPYSHLGPKE SDPVDTNAES RLHRKEAREV ARESLVLLKN RLETLPLKKS ATIAVVGPLA 421 DSKRDVMGSW SAAGVADQSV TVLTGIKNSV GENGKVLYAK GANVTSDKGI IDFLNQYEEA 481 VKVDPRSPQE MIDEAVQTAK QSDVVVAVVG EAQGMAHEAS SRTDITIPQS QRDLIAALKA 541 TGKPLVLVLM NGRPLALVKE DQQADAILET WFAGTEGGNA IADVLFGDYN PSGKLPMSFP 601 RSVGQIPVYY SHLNTGRPYN ADKPNKYTSR YFDEANGALY PFGYGLSYTT FTVSDVKLSA 661 PTMKRDGKVT ASVQVTNTGK REGATVVQMY LQDVTASMSR PVKQLKGFEK ITLKPGETQT 721 VSFPIDIEAL KFWNQQMKYD AEPGKFNVFI GTDSARVKKG EFELL //