LOCUS BAX21844.1 724 aa PRT BCT 27-MAR-2019 DEFINITION Escherichia coli invasin protein. ACCESSION AP017620-2502 PROTEIN_ID BAX21844.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5042704) AUTHORS Sekizuka,T., Kawanishi,M., Ohnishi,M., Shima,A., Kato,K., Yamashita,A., Matsui,M., Suzuki,S. and Kuroda,M. TITLE Direct Submission JOURNAL Submitted (27-JUL-2016) to the DDBJ/EMBL/GenBank databases. Contact:Tsuyoshi Sekizuka National Institute of Infectious Diseases, Pathogen Genomics Center; 1-23-1 Toyama, Shinjuku-ku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Sekizuka,T., Kawanishi,M., Ohnishi,M., Shima,A., Kato,K., Yamashita,A., Matsui,M., Suzuki,S. and Kuroda,M. TITLE Elucidation of quantitative structural diversity of remarkable rearrangement regions, shufflons, in IncI2 plasmids JOURNAL Sci. Rep. 7, 928 (2017) REMARK Publication Status: Online-Only DOI:10.1038/s41598-017-01082-y COMMENT ##Genome-Assembly-Data-START## Assembly Method :: HGAP Assembly v. 3.0; Celera Assembler v. 8.2 Genome Coverage :: 200x Sequencing Technology :: MiSeq; PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="2013" /country="Japan" /db_xref="taxon:562" /host="Bos taurus" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="MRY15-131" protein /locus_tag="MRY15131_c25020" /transl_table=11 BEGIN 1 MLRWTRCIML TFISGAAFAA PEINVEHKES LPDLGSQAAQ QEEQTNKGKS LKERGADYVI 61 NSATQGFENL TPEALESQAR SYLQSQITST TQSYIEDTLS PYGKVRTNLS LGQGGDLDGS 121 SIDYFVPWYD NQTTVYFSQF SAQRKEDRTI GNIGLGVRHN FDKWLLGGNI FYDYDFTRGH 181 RRLGLGAEAW TDYLKFSGNY YHPLSDWKDS EDFDFYEERP ARGWDIRAEA WLPAYPQLGG 241 KIVFEQYYGN EVALFGTDNL EKDPFAVTLG VKYQPVPLIA VGTDFKAGTG DNTDLSVNAT 301 LNYQFGVPLK DQLDPDKVSA AHSLMGSRHD FVERNNFIVL EYKEKDPLDV TLWLKADATN 361 EHPECVIKDT PEEAIGLEKC KWTINALINH HYKIVAASWQ AKNNAARTLV MPVIKENTLT 421 EGNNNHWNLV LPAWQYSSDK AEQEKLNTWR VRLALEDEKG NRQNSGVVEI TVQQDRKIEL 481 IVNNIADVPD ENNHSHEASA QADGVDGVVM DLDITDSFGD NTDRNGNVLP QDNLNPQLFD 541 ANDKKVTLTN KPCTTETPCV FIAKQDKEKG TVTLSSTLPG TFRWKAKAAP YDDSNYVDVT 601 FLGSDIGGLN AFIYRVGAAK PVNLIGNKEP LPLNSSYRFV LWRDANKDGV FQLSEKLTEE 661 EMKQYDYQWE FTGHSVNGNT GAQANTTNAD IEIPATNKEA ATKFSAQVTD GVQGYGLQVN 721 YSKK //