LOCUS BCA70672.1 519 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli cellulose production protein protein. ACCESSION AP022811-3996 PROTEIN_ID BCA70672.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5283470) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 117.0X Sequencing Technology :: Illumina MiSeq, PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="1999" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST02" protein /gene="bcsE" /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /inference="similar to AA sequence:INSD:AAC76561.1" /locus_tag="JE86ST02C_39960" /note="DFAST-ECOLI:AAC76561.1 cellulose production protein[pid:99.8%, q_cov:100.0%, s_cov:99.2%, Eval:7.3e-309]" /note="MGA_4003" /transl_table=11 BEGIN 1 MDPVFSIGIS SLWDELRHMP AGGVWWFNVD RHEDAISLAN QTIASQAETA HVAVISMDSD 61 PAKIFQLDDS QGPEKIKLFS MLNHEKGLYY LTRDLQCSID PHNYLFILVC ANNAWQNIPA 121 ERLRSWLDKM NKWSRLNHCS LLVINPGNNN DKQFSLLLEE YRSLFGLASL RFQGDQHLLD 181 IAFWCNEKGV SARQQLSVQQ QNGIWTLVQS EEAEIQPRSD EKRILSNVAV LEGAPPLSEH 241 WQLFNNNEVL FNEARTAQAA TVVFSLQQNA QIEPLARSIH TLRRQRGSAM KILVRENTAS 301 LRATDERLLL ACGANMVIPW NAPLSRCLTM IESVQGQKFS RYVPEDITTL LSMTQPLKLR 361 GFQKWDVFCN AVNNMMNNPL LPAHGKGVLV ALRPVPGIRV EQALTLCRPN RTGDIMTIGG 421 NRLVLFLSFC RINDLDTALN HIFPLPTGDI FSNRMVWFED DQISAELVQM RLLAPEQWGM 481 PLPLTQSSKP VINAEHDGRH WRRIPEPMRL LDDAVERSS //