LOCUS       BAX21844.1               724 aa    PRT              BCT 27-MAR-2019
DEFINITION  Escherichia coli invasin protein.
ACCESSION   AP017620-2502
PROTEIN_ID  BAX21844.1
SOURCE      Escherichia coli
  ORGANISM  Escherichia coli
            Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales;
            Enterobacteriaceae; Escherichia.
REFERENCE   1  (bases 1 to 5042704)
  AUTHORS   Sekizuka,T., Kawanishi,M., Ohnishi,M., Shima,A., Kato,K.,
            Yamashita,A., Matsui,M., Suzuki,S. and Kuroda,M.
  TITLE     Direct Submission
  JOURNAL   Submitted (27-JUL-2016) to the DDBJ/EMBL/GenBank databases.
            Contact:Tsuyoshi Sekizuka
            National Institute of Infectious Diseases, Pathogen Genomics
            Center; 1-23-1 Toyama, Shinjuku-ku, Tokyo 162-8640, Japan
REFERENCE   2
  AUTHORS   Sekizuka,T., Kawanishi,M., Ohnishi,M., Shima,A., Kato,K.,
            Yamashita,A., Matsui,M., Suzuki,S. and Kuroda,M.
  TITLE     Elucidation of quantitative structural diversity of remarkable
            rearrangement regions, shufflons, in IncI2 plasmids
  JOURNAL   Sci. Rep. 7, 928 (2017)
  REMARK    Publication Status: Online-Only
            DOI:10.1038/s41598-017-01082-y
COMMENT     ##Genome-Assembly-Data-START##
            Assembly Method       :: HGAP Assembly v. 3.0; Celera Assembler v.
                                     8.2
            Genome Coverage       :: 200x
            Sequencing Technology :: MiSeq; PacBio RSII
            ##Genome-Assembly-Data-END##
FEATURES             Qualifiers
     source          /collection_date="2013"
                     /country="Japan"
                     /db_xref="taxon:562"
                     /host="Bos taurus"
                     /mol_type="genomic DNA"
                     /organism="Escherichia coli"
                     /strain="MRY15-131"
     protein         /locus_tag="MRY15131_c25020"
                     /transl_table=11
BEGIN
        1 MLRWTRCIML TFISGAAFAA PEINVEHKES LPDLGSQAAQ QEEQTNKGKS LKERGADYVI
       61 NSATQGFENL TPEALESQAR SYLQSQITST TQSYIEDTLS PYGKVRTNLS LGQGGDLDGS
      121 SIDYFVPWYD NQTTVYFSQF SAQRKEDRTI GNIGLGVRHN FDKWLLGGNI FYDYDFTRGH
      181 RRLGLGAEAW TDYLKFSGNY YHPLSDWKDS EDFDFYEERP ARGWDIRAEA WLPAYPQLGG
      241 KIVFEQYYGN EVALFGTDNL EKDPFAVTLG VKYQPVPLIA VGTDFKAGTG DNTDLSVNAT
      301 LNYQFGVPLK DQLDPDKVSA AHSLMGSRHD FVERNNFIVL EYKEKDPLDV TLWLKADATN
      361 EHPECVIKDT PEEAIGLEKC KWTINALINH HYKIVAASWQ AKNNAARTLV MPVIKENTLT
      421 EGNNNHWNLV LPAWQYSSDK AEQEKLNTWR VRLALEDEKG NRQNSGVVEI TVQQDRKIEL
      481 IVNNIADVPD ENNHSHEASA QADGVDGVVM DLDITDSFGD NTDRNGNVLP QDNLNPQLFD
      541 ANDKKVTLTN KPCTTETPCV FIAKQDKEKG TVTLSSTLPG TFRWKAKAAP YDDSNYVDVT
      601 FLGSDIGGLN AFIYRVGAAK PVNLIGNKEP LPLNSSYRFV LWRDANKDGV FQLSEKLTEE
      661 EMKQYDYQWE FTGHSVNGNT GAQANTTNAD IEIPATNKEA ATKFSAQVTD GVQGYGLQVN
      721 YSKK
//