LOCUS       BCA67939.1 421 aa PRT BCT 06-NOV-2020
DEFINITION  Escherichia coli tail fiber protein.
ACCESSION   AP022811-1263
PROTEIN_ID  BCA67939.1
SOURCE      Escherichia coli
  ORGANISM  Escherichia coli
            Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales;
            Enterobacteriaceae; Escherichia.
REFERENCE   1 (bases 1 to 5283470)
  AUTHORS   Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases.
            Contact: Ken-ichi Lee
            National Institute of Infectious Diseases, Department of
            Bacteriology I; 1-23-1, Toyama, Shinjuku, Tokyo 162-8640, Japan
REFERENCE   2
  AUTHORS   Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S.
  TITLE     Global distribution of epidemic-related Shiga toxin 2 encoding
            phages among enteroaggregative Escherichia coli
  JOURNAL   Sci. Rep. 10, 11738 (2020)
  REMARK    Publication Status: Online-Only
            DOI:10.1038/s41598-020-68462-9
COMMENT     ##Genome-Assembly-Data-START##
            Assembly Method       :: Unicycler v. 0.4.7
            Genome Coverage       :: 117.0X
            Sequencing Technology :: Illumina MiSeq, PacBio RSII
            ##Genome-Assembly-Data-END##
FEATURES    Qualifiers
  source    /collection_date="1999"
            /db_xref="taxon:562"
            /host="Homo sapiens"
            /mol_type="genomic DNA"
            /organism="Escherichia coli"
            /strain="JE86-ST02"
  protein   /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator"
            /inference="similar to AA sequence:RefSeq:WP_001387718.1"
            /locus_tag="JE86ST02C_12630"
            /note="DFAST-ECOLI:ADD54981.1 phage tail fiber protein [pid:56.4%, q_cov:71.3%, s_cov:62.7%, Eval:1.2e-80, partial hit]"
            /note="MGA_1264"
            /note="WP_001387718.1 tail fiber protein (Escherichia coli O104:H4 str. 2011C-3493) [pid:63.8%, q_cov:100.0%, s_cov:85.2%, Eval:3.3e-144]"
            /note="frameshifted, insertion at around 1301025"
            /transl_table=11
BEGIN
        1 MYEDSRPGTL NDFLGAMTED DARPEALRRF ELMVEEVVRN AEEAKKNAGE AETSARNAGI
       61 SASQAEESAA NADTSAGDAS ESARQAAESA ASAKQSEEAS SSSASAAAQK ASESLQSATD
      121 AELSKKTAES AAGNAARDAT TATEKARESA ESAQSAEQSR IAAEEAVNRI PTVVGPPGPK
      181 GEPGPAGPQG PKGDKGERGD TGPAGATGER GPAGDAGPAG PQGPKGDRGE TGLTGNAGPQ
      241 GPKGDAGAAG PAGPQGPKGD TGAAGPAGPQ GPKGDAGAAG PAGPQGPKGD TGAAGPTGPQ
      301 GPKGDTGAAG PAGPQGPKGD AGVAGPAGPQ GPSGSPDSGL FGVGSFVLAA YYQTHYSGDR
      361 APGSTVAGSS LSACCLSNGT PLVASGNVGE TRLPGTWRAC GPMLWTSSPG IRQAGLFQRI
      421 S
//