LOCUS       BCA74979.1              1653 aa    PRT              BCT 06-NOV-2020
DEFINITION  Escherichia coli bacterial alpha2-macroglobulin colonization
            factor ECAM protein.
ACCESSION   AP022815-3009
PROTEIN_ID  BCA74979.1
SOURCE      Escherichia coli
  ORGANISM  Escherichia coli
            Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales;
            Enterobacteriaceae; Escherichia.
REFERENCE   1  (bases 1 to 5327513)
  AUTHORS   Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases.
            Contact:Ken-ichi Lee
            National Institute of Infectious Diseases, Deaprtment of
            Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan
REFERENCE   2
  AUTHORS   Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S.
  TITLE     Global distribution of epidemic-related Shiga toxin 2 encoding
            phages among enteroaggregative Escherichia coli
  JOURNAL   Sci. Rep. 10, 11738 (2020)
  REMARK    Publication Status: Online-Only
            DOI:10.1038/s41598-020-68462-9
COMMENT     ##Genome-Assembly-Data-START##
            Assembly Method       :: Unicycler v. 0.4.7
            Genome Coverage       :: 135.0X
            Sequencing Technology :: Illumina MiSeq, Oxford Nanopore MinION
            ##Genome-Assembly-Data-END##
FEATURES             Qualifiers
     source          /collection_date="2014"
                     /db_xref="taxon:562"
                     /host="Homo sapiens"
                     /mol_type="genomic DNA"
                     /organism="Escherichia coli"
                     /strain="JE86-ST05"
     protein         /gene="yfhM"
                     /inference="COORDINATES:ab initio
                     prediction:MetaGeneAnnotator"
                     /inference="similar to AA sequence:INSD:AAC75573.1"
                     /locus_tag="JE86ST05C_30090"
                     /note="DFAST-ECOLI:AAC75573.1 bacterial
                     alpha2-macroglobulin colonization factor ECAM [pid:98.5%,
                     q_cov:100.0%, s_cov:100.0%, Eval:0.0e+00]"
                     /note="MGA_3012"
                     /transl_table=11
BEGIN
        1 MKKLRVAACM LMLALAGCDN NDNAPTAVKK DAPSEVTKAA SSENASSAKL SVPERQKLAQ
       61 QSAGKVLTLL DLSEVQLDGA ATLVLTFSIP LDPDQDFSRV IHVVDKKSGK VDGAWELSDN
      121 LKELRLRHLE PKRDLIVTIG KEVKALNNAT FSKDYEKTIT TRDIQPSVGF ASRGSLLPGK
      181 VVEGLPVMAL NVNNVDVNFF RVKPESLPAF ISQWEYRNSL ANWQSDKLLQ MADLVYTGRF
      241 DLNPARNTRE KLLLPLGDIK PLQQAGVYLA VMNQAGRYDY SNPATLFTLS DIGVSAHRYH
      301 NRLDIFTQSL ENGAAQQGIE VSLLNEKGQT LTQATSDAQG HVQLENDKNA ALLLARKDGQ
      361 TTLLDLKLPA LDLAEFNIAG APGYSKQFFM FGPRDLYRPG ETVILNGLLR DADGKALPNQ
      421 PIKLDVIKPD GQVLRSVVSQ PENGLYHFTW PLDSNAATGM WHIRANTGDN QYRMWDFHVE
      481 DFMPERMALN LTGEKTPLTP KDEVKFSVVG YYLYGAPANG NTLQGQLFLR PLREAVSALP
      541 GFEFGDIAAE NLSRTLDEVQ LTLDDKGRGE VSTESQWKET HSPLQVIFQG SLLESGGRPV
      601 TRRAEQAIWP ADALPGIRPQ FASKSVYDYR TDSTVKQPIV DEGSNAAFDI VYSDAQGVKK
      661 AVSGLQVRLI RERRDYYWNW SEDEGWQSQF DQKDLIENEQ TLDLKADETG KVSFPVEWGA
      721 YRLEVKAPNE AVSSVRFWAG YSWQDNSDGS GAVRPDRVTL KLDKASYRPG DTIKLHIAAP
      781 TAGKGYAMVE SSEGPLWWQE IDVRAQGLDL TIPVDKTWNR HDLYLSTLVV RPGDKSRSAT
      841 PKRAVGVLHL PLGDENRRLD LALETPAKMR PNQPLTVKIK ASTKNGEKPK QVNVLVSAVD
      901 SGVLNITDYV TPDPWQAFFG QKRYGADIYD IYGQVIEGQG RLAALRFGGD GDELKRGGKP
      961 PVNHVNIVAQ QALPVTLNEQ GEGSVTLPIG DFNGELRVMA QAWTADDFGS NESKVIVAAP
     1021 VIAELNMPRF MASGDTSRLT LDITNLTDKP QKLNVALTAS GLLELVSDSP AAVELAPGVR
     1081 TTLFIPVRAL PGYGDGEIQA TISGLALPDE TVADQQKQWK IGVRPAFPAQ TVNYGTALQP
     1141 GETWAIPADG LQNFSPVTLE GQLLLSGKPP LNIARYIKEL KAYPYGCLEQ TASGLFPSLY
     1201 TNAAQLQALG IKGDSDEKRR ASVDIGISRL LQMQRDNGGF ALWDKNGDEE YWLTAYVMDF
     1261 LVRAGEQGYS VPTDAINRGN ERLLRYLQDP GMMSIPYADN LKASKFAVQS YAALVLARQQ
     1321 KAPLGALREI WEHRADAASG LPLLQLGVAL KTMGDATRGE EAIVLALKTP RNSDERIWLG
     1381 DYGSPLRDNA LMLSLLEENK LLPDEQYTLL NTLSQQAFGE RWLSTQESNA LFLAARTIQD
     1441 LPGKWQAQTS FSAEPLTGEK TLNSNLNSDQ LATLQVRNSG DQPLWLRMDA SGYPQSAPLP
     1501 ANNVLQIERH ILGTDGKSKS LDSLRSGDLV LVWLQVKASN SVPDALVVDL LPAGLELENQ
     1561 NLANGSASLE QSGGEVQNLL NQMQQASIKH IEFRDDRFVA AVAVDEYQPV TLVYLARAVT
     1621 PGTYQVPQPM VESMYVPQWR ATGAAEDLLI VRP
//