LOCUS       BCA66908.1               353 aa    PRT              BCT 06-NOV-2020
DEFINITION  Escherichia coli H repeat-associated protein protein.
ACCESSION   AP022811-232
PROTEIN_ID  BCA66908.1
SOURCE      Escherichia coli
  ORGANISM  Escherichia coli
            Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales;
            Enterobacteriaceae; Escherichia.
REFERENCE   1  (bases 1 to 5283470)
  AUTHORS   Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases.
            Contact:Ken-ichi Lee
            National Institute of Infectious Diseases, Deaprtment of
            Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan
REFERENCE   2
  AUTHORS   Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S.
  TITLE     Global distribution of epidemic-related Shiga toxin 2 encoding
            phages among enteroaggregative Escherichia coli
  JOURNAL   Sci. Rep. 10, 11738 (2020)
  REMARK    Publication Status: Online-Only
            DOI:10.1038/s41598-020-68462-9
COMMENT     ##Genome-Assembly-Data-START##
            Assembly Method       :: Unicycler v. 0.4.7
            Genome Coverage       :: 117.0X
            Sequencing Technology :: Illumina MiSeq, PacBio RSII
            ##Genome-Assembly-Data-END##
FEATURES             Qualifiers
     source          /collection_date="1999"
                     /db_xref="taxon:562"
                     /host="Homo sapiens"
                     /mol_type="genomic DNA"
                     /organism="Escherichia coli"
                     /strain="JE86-ST02"
     protein         /gene="ydcC_1"
                     /inference="COORDINATES:ab initio
                     prediction:MetaGeneAnnotator"
                     /inference="similar to AA sequence:INSD:AAC74542.1"
                     /locus_tag="JE86ST02C_02320"
                     /note="DFAST-ECOLI:AAC74542.1 H repeat-associated protein
                     [pid:96.9%, q_cov:100.0%, s_cov:93.4%, Eval:1.0e-199]"
                     /note="MGA_233"
                     /transl_table=11
BEGIN
        1 MSGILLLTIF AVISGAESWE DIEDFGETHL DFLKQYGDFE NGIPVHDTIA RVVSCISPAK
       61 FHECFINWMR DCHSSDDKDV IAIDGKTLRH SYDKSRRRGA IHVISAFSTM HSLVIGQIRT
      121 DEKSNEITAI PELLNMLDIK GKIITTDAMG CQKDIAEKIQ KQGGDYLFAV KGNQGRLNKA
      181 FEEKFPLKEL NNPEHDSYAM SEKSHGREEI RLHIVCDVPD ELIDFTFEWK GLKKLCGAVS
      241 FRSIIAEQKK EPEMTVRYYI SSADLTAEKF ATAIRNHWHL ENKLHWRLDV VMNEDDCKIR
      301 RGNAAELFSG IRHIAINILT NDKVFKAGLR RKMRKAAMDR NYLASVLAGS GLS
//