LOCUS       BCA67329.1   685 aa   PRT   BCT   06-NOV-2020
DEFINITION  Escherichia coli tail tape measure protein.
ACCESSION   AP022811-653
PROTEIN_ID  BCA67329.1
SOURCE      Escherichia coli
  ORGANISM  Escherichia coli
            Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales;
            Enterobacteriaceae; Escherichia.
REFERENCE   1  (bases 1 to 5283470)
  AUTHORS   Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases.
            Contact:Ken-ichi Lee National Institute of Infectious Diseases,
            Department of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo
            162-8640, Japan
REFERENCE   2
  AUTHORS   Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S.
  TITLE     Global distribution of epidemic-related Shiga toxin 2 encoding
            phages among enteroaggregative Escherichia coli
  JOURNAL   Sci. Rep. 10, 11738 (2020)
  REMARK    Publication Status: Online-Only
            DOI:10.1038/s41598-020-68462-9
COMMENT     ##Genome-Assembly-Data-START##
            Assembly Method       :: Unicycler v. 0.4.7
            Genome Coverage       :: 117.0X
            Sequencing Technology :: Illumina MiSeq, PacBio RSII
            ##Genome-Assembly-Data-END##
FEATURES             Qualifiers
     source          /collection_date="1999"
                     /db_xref="taxon:562"
                     /host="Homo sapiens"
                     /mol_type="genomic DNA"
                     /organism="Escherichia coli"
                     /strain="JE86-ST02"
     protein         /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator"
                     /inference="similar to AA sequence:RefSeq:WP_000113525.1"
                     /locus_tag="JE86ST02C_06530"
                     /note="DFAST-ECOLI:BAG78100.1 phage tape measure protein
                     [pid:41.8%, q_cov:68.0%, s_cov:60.2%, Eval:1.3e-87,
                     partial hit]"
                     /note="MGA_654"
                     /note="WP_000113525.1 tail tape measure protein
                     (Escherichia coli O157:H7 str. Sakai) [pid:66.3%,
                     q_cov:100.0%, s_cov:100.0%, Eval:4.3e-259]"
                     /transl_table=11
BEGIN
        1 MAKNLKASLI VDLLGNISAK SRQWSQELGA FSRSGRTGLG GLGNAARRAG QETELVGSRM
       61 QRTLAGVRGS IRTVTSDFDR LQGSITGTIG RISNLYGMLA GGAAVYGFNK AFIRPAAEME
      121 NYILRLNAIN HGDTAKTEAV KAWAVQNAKD TTWGLAGVMQ EYASSRGFGM SDREARRFIT
      181 MLQDQGGYHG WSLSDAQGAS LQLKQMFARQ SIQAADANIL TGYGINVYQL LADKLGVNQK
      241 IIREKGEKGK LGPDSIRLLF QVMAEQAKGA QKNAMNSWTG MTSMMGDVWD GFAREVMAKG
      301 PFDSLKKSLK GFLDYADAAQ KSGLQDKLAT QTASALNQGF EYARDAATGF YRVIQKVRET
      361 LQALRDAGYG DALDRIGQGA QTAAKYLMYM YLASRALKVL RFAGTGALRL GATPLRYGMA
      421 MTSVLTSPFR KPQTTVPGTQ PGRAGRFLNF LTGVNPAAVQ PVLVTNWPAG GLASGGGDVV
      481 VSGDGKTVRG RKKRGPGRGR GVTTVVTAGE QLAESAGKQG FFGRMMSRAG GLLTAAGNRM
      541 GLGRFAGLFR GAGRLGGGAL WAGAMAAPVL LDSSASAADK AGAVGSLAGS IAGGALGAAA
      601 GPVGVAIGST VGSYLGDYLG GWLTQAWQKL RGGSDENGGQ ATAKTAARVE LVAPEGWRAR
      661 SIDVDDTAQH GLDVNVWNGG NYGLY
//