LOCUS BCA70949.1 464 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli cryptic 6-phospho-beta-glucosidase protein. ACCESSION AP022811-4273 PROTEIN_ID BCA70949.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5283470) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 117.0X Sequencing Technology :: Illumina MiSeq, PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="1999" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST02" protein /EC_number="3.2.1.86" /gene="bglB_2" /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /inference="similar to AA sequence:INSD:AAC76744.1" /locus_tag="JE86ST02C_42720" /note="DFAST-ECOLI:AAC76744.1 cryptic 6-phospho-beta-glucosidase [pid:99.6%, q_cov:100.0%, s_cov:98.7%, Eval:6.3e-280]" /note="MGA_4279" /transl_table=11 BEGIN 1 MKAFPETFLW GGATAANQVE GAWQEDGKGI STSDLQPHGV MGKMEPRILG KENIKDVAID 61 FYHRYPEDIA LFAEMGFTCL RISIAWARIF PQGDEVEPNE AGLAFYDRLF DEMAQAGIKP 121 LVTLSHYEMP YGLVKNYGGW ANRAVIDHFE HYARTVFTRY QHKVALWLTF NEINISLHEP 181 FTGVGLAEES GEAEVYQAIH HQLVASARAV KACHSLLPEA KIGNMLLGGL VYPLTCQPQD 241 MLQAMEENRR WMFFGDVQAR GQYPGYMQRF FRDHNITIEM TESDAEDLKH TVDFISFSYY 301 MTGCVSHDES INKNAQGNIL NMIPNPHLKS SEWGWQIDPV GLRVLLNTLW DRYQKPLFIV 361 ENGLGAKDSV EADGSIQDDY RIAYLNDHLV QVNEAIADGV DIMGYTSWGP IDLVSASHSQ 421 MSKRYGFIYV DRDDNGEGSL TRTRKKSFGW YAEVIKTRGL SLKK //