LOCUS BCA68573.1 440 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli PTS system cellobiose-specific IIC component protein. ACCESSION AP022811-1897 PROTEIN_ID BCA68573.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5283470) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 117.0X Sequencing Technology :: Illumina MiSeq, PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="1999" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST02" protein /EC_number="2.7.1.69" /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /inference="similar to AA sequence:INSD:AAN80415.1" /locus_tag="JE86ST02C_18970" /note="DFAST-ECOLI:AAN80415.1 PTS system cellobiose-specific IIC component [pid:99.5%, q_cov:100.0%, s_cov:97.1%, Eval:2.5e-254]" /note="MGA_1899" /transl_table=11 BEGIN 1 MGLMASFERG MERFLVPVAI KLNSQKHVAA VRDGFVFTFP IIMASSLIIL INFAILSPDG 61 FIAGLLHLNS IFPNLEKAQA IFTPVMNGSV NIMSIMIAFL VARNMAISYE QDDLLCGLTA 121 IGAFFIVYTP YQMIDGQAFL TTKYLGAQGL FVAVIVALIT SEIFCRLARN PKITITMPAA 181 VPPAVARSFK VLLPIFFVMV FFSALNYCLT LISPAGLNDL IYTLIQTPLK HMGTNIFAVI 241 ILGAVGNFLW VLGIHGPNTT SAIRETVFSE ANMENLSWAA QHGTTWGAPY PITWTSINDA 301 FANCGGSGMT LGLLLAIFIA SKRAEYRDLA KMSFIPGIFN INEPIMFGLP IVLNPIMMVP 361 FIMVPIVNCA IGYFFVSMEI IPPVAYAVPW TTPGPLIAFL GTGGNWLALL VGFLCLGVAT 421 MIYLPFVIAA NKVNNMTTNG //