LOCUS       BCA67548.1              1021 aa    PRT              BCT 06-NOV-2020
DEFINITION  Escherichia coli phage tail length tape measure protein.
ACCESSION   AP022811-872
PROTEIN_ID  BCA67548.1
SOURCE      Escherichia coli
  ORGANISM  Escherichia coli
            Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales;
            Enterobacteriaceae; Escherichia.
REFERENCE   1  (bases 1 to 5283470)
  AUTHORS   Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases.
            Contact:Ken-ichi Lee
            National Institute of Infectious Diseases, Department of
            Bacteriology I; 1-23-1, Toyama, Shinjuku, Tokyo 162-8640, Japan
REFERENCE   2
  AUTHORS   Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S.
  TITLE     Global distribution of epidemic-related Shiga toxin 2 encoding
            phages among enteroaggregative Escherichia coli
  JOURNAL   Sci. Rep. 10, 11738 (2020)
  REMARK    Publication Status: Online-Only
            DOI:10.1038/s41598-020-68462-9
COMMENT     ##Genome-Assembly-Data-START##
            Assembly Method       :: Unicycler v. 0.4.7
            Genome Coverage       :: 117.0X
            Sequencing Technology :: Illumina MiSeq, PacBio RSII
            ##Genome-Assembly-Data-END##
FEATURES             Qualifiers
     source          /collection_date="1999"
                     /db_xref="taxon:562"
                     /host="Homo sapiens"
                     /mol_type="genomic DNA"
                     /organism="Escherichia coli"
                     /strain="JE86-ST02"
     protein         /EC_number="2.7.1.-"
                     /inference="COORDINATES:ab initio
                     prediction:MetaGeneAnnotator"
                     /inference="similar to AA sequence:INSD:CAR06179.1"
                     /locus_tag="JE86ST02C_08720"
                     /note="DFAST-ECOLI:CAR06179.1 phage tail length tape
                     measure protein [pid:87.7%, q_cov:100.0%, s_cov:99.0%,
                     Eval:0.0e+00]"
                     /note="MGA_873"
                     /transl_table=11
BEGIN
        1 MDQIANLVID LGIDAAEFKN EIPRIKNLLN GAASDAERSS ARMQRFMERQ TQAARQTTQA
       61 ASSAATAASV HAQTVEKNAQ AHERMAREVE KTRQRMEALS QKMREEQAQA MALAEAQDKA
      121 AAAFYRQIDS VKQASAGLQE LQRIQQQIRQ ARNSGGIGQQ DYLALISEVT AKTRVLTQAE
      181 EEATRQKVAF IRQLKEQATR QNLSSSELLR AKAAQLGVSS AAEVYIRKME QAGKATHSLG
      241 LKSAAARQEI GVLIGELARG NLGALRGSGI TLANRAGWID TLMSPKGMML GGVIGGIAAA
      301 VYGLGKAWYD GQKEGEEFNR QLSLTGHYAG VTAGQLWTLS RTISGNGITQ HAAAGALAQV
      361 VGSGAFRGND IGMVARAAAK MERSVGQSVS DTINQFKRLK DDPVNAAKAL DNELHFLTAT
      421 QLEQIRVLGD QGRSSDAARI AMSALAEETG RRTADIDNNL NALGSTLKYL SDLWSRFWDA
      481 AMNIGREDSL DEQIAALQEK VSRAKRLPWT ASSSQVEYDQ QRLNDLQEKK RQKDLQDAKE
      541 QAERNYQEQQ KRRNAENAAL NRMNETEAAR HQREIARINA MQYADQAVRD AAIQRENERY
      601 EKAIKKKTSE NRNDEATRLL LQYSQQQAQV EGQIAAARQS AGLTTEKMTE AHKQLLALQQ
      661 RISDLAGKKL TADEKSVLAH KDELIQALTL LDAKQQELQK QTALNDLKKK SIQLASQLAE
      721 EERALRQQHD LDIATTGMGD KQRQRYQTQF SLQQKYQQQM EQLERDSKQK GTYGTDEYRN
      781 AEQMLTDSLN RQLNENRRYW QEQELMQADW KNGAMRAFQN FTESADNAAG TAEQMFTAAF
      841 NSAGNALATF CTTGKLNFRS FTASLLSDLA KIMSQMVMMQ AVKGIGSAFG WGSAAAASVT
      901 PNADGGVYQS ADLSRYSGTV VNRPTFFAFA KGAGVMGEAG PEAILPLRRG ADGKLGVVAD
      961 IGGSGMAMFA PQYNIEINND GTNGQIGPAA LKVVYDLGKK AAADFMQQQA RDGGRLSGAY
     1021 R
//