LOCUS BCA67931.1 867 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli phage tail length tape measure protein protein. ACCESSION AP022811-1255 PROTEIN_ID BCA67931.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5283470) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 117.0X Sequencing Technology :: Illumina MiSeq, PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="1999" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST02" protein /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /inference="similar to AA sequence:INSD:CAU96932.1" /locus_tag="JE86ST02C_12550" /note="DFAST-ECOLI:CAU96932.1 phage tail length tape measure protein [pid:93.5%, q_cov:100.0%, s_cov:99.7%, Eval:0.0e+00]" /note="MGA_1256" /transl_table=11 BEGIN 1 MAGNFADLTA VLTLDSTRFS EEAARVKKEL GETSDLADLM AGRVSQSFKK QAAAVEQGLS 61 RQALAAQKAG ISVGQYKAAM RTLPAQFTDI ATQLAGGQNP WLILLQQGGQ VKDSFGGMIP 121 MFRGLAGAIT LPMVGVTSLA VATGALVYAW YQGDSTLSAF NKTLVLSGNQ SGLTADRMLT 181 LSRAGQAAGL TFNQARESLA ALVNAGVRGG EQFDAINQSV ARFASASGVE VDKVAEAFGK 241 LTTDPTSGLM AMARQFRNVT AEQIAYVAQL QRSGDEAGAL QAANDIATKG FDEQTRRLKE 301 NMGTLETWAD KTGKAFKSMW DAILDIGRPE SSADMLASAQ KAFDEADKKW QWYQSRSQRR 361 GKTSSFRANL QGAWDDRENA RLGLAAATLQ SDMEKAGELA ARDRAEREAS QLKYTGEAQK 421 AYERLLTPLE KYTARQEELN KALKDGKILQ ADYNTLMASA KKDYESTQKK PSGVKVSAGE 481 RQEDQAHAAL LALETELRTL EKHSGANEKI SQQRRDLWKA ENQYVVLKEA ATKRQLSEQE 541 KSLLAHEKET LEYKRQLAEL GDKIEHQKRL NELAQQAARF EQQQSAKQAA ISAKARGLTD 601 RQAQRESEEQ RLREVYGDNP AALAKATSAL KNTWSAEEQL RGSWMAGMKS GWGEWAESAT 661 DSMSQVKSAA TQTFDGIAQN MAAMLTGSEQ NWRSFTRSVL SMMTEILLKQ AMVGIVGSIG 721 SAIGGAVGGG ASASGGTAIQ AAAAKFHFAT GGFTGTGGKY EPAGIVHRGE FVFTKEATSR 781 IGVGNLYRLM RGYAEGGYVG GAGSPAQMRR AEGINFNQNN HVVIQNDGTN GLPGPQMMKA 841 VYDMARKGAR DEIQAQMRDG GLFSGGG //