LOCUS       BAX20240.1               759 aa    PRT              BCT 27-MAR-2019
DEFINITION  Escherichia coli hypothetical protein protein.
ACCESSION   AP017620-898
PROTEIN_ID  BAX20240.1
SOURCE      Escherichia coli
  ORGANISM  Escherichia coli
            Bacteria; Pseudomonadati; Pseudomonadota; Gammaproteobacteria;
            Enterobacterales; Enterobacteriaceae; Escherichia.
REFERENCE   1  (bases 1 to 5042704)
  AUTHORS   Sekizuka,T., Kawanishi,M., Ohnishi,M., Shima,A., Kato,K.,
            Yamashita,A., Matsui,M., Suzuki,S. and Kuroda,M.
  TITLE     Direct Submission
  JOURNAL   Submitted (27-JUL-2016)
            Contact:Tsuyoshi Sekizuka
            National Institute of Infectious Diseases, Pathogen Genomics
            Center; 1-23-1 Toyama, Shinjuku-ku, Tokyo 162-8640, Japan
REFERENCE   2
  AUTHORS   Sekizuka,T., Kawanishi,M., Ohnishi,M., Shima,A., Kato,K.,
            Yamashita,A., Matsui,M., Suzuki,S. and Kuroda,M.
  TITLE     Elucidation of quantitative structural diversity of remarkable
            rearrangement regions, shufflons, in IncI2 plasmids
  JOURNAL   Sci. Rep. 7, 928 (2017)
  REMARK    Publication Status: Online-Only
            DOI:10.1038/s41598-017-01082-y
COMMENT     ##Genome-Assembly-Data-START##
            Assembly Method       :: HGAP Assembly v. 3.0; Celera Assembler v.
                                     8.2
            Genome Coverage       :: 200x
            Sequencing Technology :: MiSeq; PacBio RSII
            ##Genome-Assembly-Data-END##
FEATURES             Qualifiers
     source          /collection_date="2013"
                     /db_xref="taxon:562"
                     /geo_loc_name="Japan"
                     /host="Bos taurus"
                     /mol_type="genomic DNA"
                     /organism="Escherichia coli"
                     /strain="MRY15-131"
     protein         /locus_tag="MRY15131_c08980"
                     /transl_table=11
BEGIN
        1 MSEPLIVGIR HHSPACARLV KSLIESQRPR YVLIEGPADF NDRVDELFLA HQLPVAIYSY
       61 CQYQDGTAPG RGAWTPFAEF SPEWQALQAA RRIQAQTYFI DLPCWAQSEE DDDSPDTQEE
      121 SQTLLLRATR MDNSDTLWDH LFEDESQQTA LPSALAHYFA QLRGDSPGDA LNRQREAFMA
      181 RWIAWAMQQN NGDVLVVCGG WHAPALAKMW REYTQEINTP ELPSLADVVT GCYLTPYSEK
      241 RLDVLAGYLS GMPAPVWQNW CWRWGLQQAG EQLLKTVLTR LRQHHLPAST ADMAAAHLHA
      301 MALAQLRGHT LPLRTDWLDA IAGSLIKEAL NAPLPWSYRG VIHPDTDPIL LTLIDTLAGD
      361 GFGKLAPSTP QPPLPKDVTC ELERTAISLP AELTLNRFNP NGLAQSQVLH RLAILEIPGI
      421 VRQQGSTLTL AGNGEERWKL TRPLSQHAAL IEAACFGATL QEAARHKLEA DMLDAGGIGS
      481 ITTCLSQAAL AGLASFSQQL LEQLTLLIAQ ENQFAEMGQA LEVLYALWQL DEISGMQGAQ
      541 ILQTTLCAAI DRTLWLCESN GRPDEKEFHA HLHSWQALCH ILRDLHSGVN LPGVSLSAAV
      601 ALLERRSQAI HAPALDRGAA LGALMRLEHP NASAEAALTM LAQLSPAQSG EALHGLLALA
      661 RHQLACQLAF IAGFSSHLNQ LSDDDFINAL PDLRAAMAWL PPRERGTLAH QVLEHYQLAQ
      721 LPISALQMPL HCPPQAIAHH QQLEQQALAS LQHWGVFHV
//