LOCUS BAX23171.1 3321 aa PRT BCT 27-MAR-2019 DEFINITION Escherichia coli invasin protein. ACCESSION AP017620-3829 PROTEIN_ID BAX23171.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadati; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5042704) AUTHORS Sekizuka,T., Kawanishi,M., Ohnishi,M., Shima,A., Kato,K., Yamashita,A., Matsui,M., Suzuki,S. and Kuroda,M. TITLE Direct Submission JOURNAL Submitted (27-JUL-2016) Contact:Tsuyoshi Sekizuka National Institute of Infectious Diseases, Pathogen Genomics Center; 1-23-1 Toyama, Shinjuku-ku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Sekizuka,T., Kawanishi,M., Ohnishi,M., Shima,A., Kato,K., Yamashita,A., Matsui,M., Suzuki,S. and Kuroda,M. TITLE Elucidation of quantitative structural diversity of remarkable rearrangement regions, shufflons, in IncI2 plasmids JOURNAL Sci. Rep. 7, 928 (2017) REMARK Publication Status: Online-Only DOI:10.1038/s41598-017-01082-y COMMENT ##Genome-Assembly-Data-START## Assembly Method :: HGAP Assembly v. 3.0; Celera Assembler v. 8.2 Genome Coverage :: 200x Sequencing Technology :: MiSeq; PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="2013" /db_xref="taxon:562" /geo_loc_name="Japan" /host="Bos taurus" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="MRY15-131" protein /locus_tag="MRY15131_c38290" /transl_table=11 BEGIN 1 MAGKAHGNGD RRGDNTICGL GDRLRRLTAG ICLITQTIFP VMAAAPTHIN PAHSDTAASL 61 ILPNVKTIPY TLGALESPPT VAARFGITVD ELRRLNQFRT FARGFDNVRQ GDEIDVPLIN 121 SNSPEARNLK AMQMERDGKD PQMQVAEVAQ QSGTLLARDM DSEQAASMAR GWVASSASAQ 181 ATDWLSRWGT ARVSLGVDED FSLKSSSFEF LHPWYETPDN LVFSQHTLHR TDDRTQTNHG 241 IGWRYFTSSW MSGVNMFIDH DLTRYHTRTG MGVEYWRDYL KLSGNGYLRL SNWRSAPELD 301 NDYEARPANG WDLRAEGWLP AWPQLGGKLV YEQYYGDEVA LFGKDERQND PHAITAGLSY 361 TPVPLISFSA EQRQSKQGEN DTRIGMELTL QPGHSLQKQL DPAEVAARRS LVGSRYDLVD 421 RNNNIVLEYR KKELVRLTLT DPLKGKPGEV KSLVSSLQTK YALKGYDIEA ASLQSAGGKV 481 AVSGKDIQVT IPPYRFTAMP ETDNTYPIAV TAEDSKGNFS RREESMVVVE KPTLSLTDST 541 LSVDLQILLA DGKSTSTLTY TARDSSGKPI PGMTLKTQVK GLQDFALSEW KDNGNGTYTQ 601 IVTAGKTSGA LSLMPQFNGD DIAKTPALIA IVANTASRAD STIETDQDNY VAGKPIVVKV 661 TLRDDNGNGV TGRKELLKQT VKVDNTKADD VSAWTEESEG IYKASYTAHL IGDKLTAQLT 721 MPGWQTKHSD AFSIAGDKDT AKIAAMQITA NNAVARRDHN TVAVTVRDVH QNLLQGQNVT 781 FTVVNGAAVF ADPNGGIVTT DKDGIASVNL ASDQAVNSLI KAEINGSSQS VEVSFITGDI 841 SQLTSTIKTD DVSYTAGGKI KVSVTLMDEQ KNRVKGMASL LAGSGVVEVS GTDKNETGNW 901 SEESDGVYTT TRTAKIAGDR HYATLKLSTW SSAQQSDAYA IRESGAVLAY SSIVTDKTAY 961 TAGGAIKVTV TLKDSYENLV GGQRDAINLA IQLPNTKAES IAWNEDQKGI YTATYTALLP 1021 GTGLKAQLQM SGWANALTSN DYSISGDAAS AQIVAMQVTT GNPDVLANGS DRHTVNVRVE 1081 DQFGNVLPEQ TVTFTVTKGA AVFANAGQSA DIRTDAHGMA EVDLSSTVAD ASTVEAKINQ 1141 SSDSKTVNFV ADVSTAQVAE LVVTQDGSVA DGATANTLRA RVTDAFGNAL AGQTVSVLAG 1201 NGATTAPTVT TQPDGTVEIS VTSQTAGTSV ITASVNNSSQ SRNVTFIADV STAQIADLVV 1261 SQDNAVADGA TANTLQVRVT DAFGNALAGQ TVSVLADNGA TVAPVVTTQP DGTVEISVTS 1321 QTAGSSAVTV SINSSSQSRD VTFIADVRTA QIADLVVIKD DSVADGAMAN MLRARVTDVF 1381 GNALAGQTVS VMADNGAAVA STMTTKPDGT VEISVTSQTA GISVVTASIN NSSQSQNVTF 1441 VADVRTAKIA DLVVSQDNAV ADGSTANTLR ARVTDAFGNT LAGQTVSVMA GNGATVAPAV 1501 ITEPDGTAEI SVTSQTAGVS AVTASINNSS QSRDVTFIAD VRTAKIADLV VTRDNSVADG 1561 AMANTLRVRV TDAFGNTLAG QTVSVLADNG ATVAPTVITG QDGTVEISVT SQTAGISTVT 1621 ATINSSSLSR NVTFVADVRT AQIADLVVIK DGSEADGATA NTLRARVTDA FGNALAGQTV 1681 SVMADNGATT APTVITEPDG TVEISVTSRT AGISTVTATI NSSSQSQNVT FIADIRTAQI 1741 ASLEVTQDNS VADGTMANTL RVKITDAFGN TLGGQTVSVL ADNGATTAPT VTTQPDGTVE 1801 ISVTSQTAGT SAVTASINSS SQSRDVTFIA DVRTAKIADL VVIKDGSVAD GAMANMLRAR 1861 VTDAFGNTLA GQTVSVLADN GATTAPTVTT QPDGTVEISV TSQTAGTSAV TASINSSTAS 1921 RNVTFVADVR TAKIADLVVI KDGSVADGAM ANTLRVKITD AFGNTLAGQT VSVLADNGAT 1981 TAPTVTTQPD GTVEISATSQ TAGTSAVTAS INNSSQSQNV TFVADVRTAK IADLVVSQDN 2041 AVADGSTANT LRARVTDVFG NTLAGQTVSV MAGNGATVAP TVITEPDGTA EISVTSQTAG 2101 VSAVTASINN SSQSRNVTFV ADVRTAKIAD LVVTRDNSVA DGAMANMLRA RVSDAFGNAL 2161 AGQTVSVLAD NGATTAPTVT TQPDGTVEIS VTSQTAGTSA VTATINSSSQ SRDVTFIADI 2221 RTAKIADLVV IKDGSEADGS TANTLRARVT DAFGNALGGQ TVSVMADNGA TTAPTVITEP 2281 DGTVEISVTS RTAGISTVTA TINNSSLSRN VTFIADIRTA QIASLEVTQD NAVADGAMAN 2341 TLRARVTDAF GNALAGQTVS VLADNGATTA PTVITEPDGT VEISVTSQTA GTSAVTASIN 2401 NSSQSRNVTF IADVRTAQIA ELVVIKDGSA ADGVMANMLR ARVTDANGNA LAGQTVSVSA 2461 GNSATVAPAV ITEPDGTVEI SVTSQTAGIS AVTASINSSS QSRDVTFIAD VRTAKIAELE 2521 VIRDNAVADG STANTLQVKV TDANGNKLAG QTVSVLAGNS ATVAPTVTTQ PDGTVEISVT 2581 SQTAGTSTVT ASINNSSQSQ NVTFVPGDAS QLTSTVETNK SNYTVGETIT ITVTLRDAFD 2641 NLVTGAASQL AADGVLTVAG TDPSETGSWV ESGGVYTTTR MATIASTNQH ANLQLQTWSD 2701 GVTSDRYDIQ SGSPAQATST IATDKNAYTA GDTITVAVTL KDAHGNLVEG GESLLSGDNV 2761 TVEGAVRSGG WSETAGVYTA TWSAQMAGDS HHATLKLSEW GSSKQSESYS IHSGAPVQAN 2821 SAIRTDKSAY IAGEPLTVTI TLRDEFGNPA LGLTSEVIES YIDSFAVGGA TPDSMRWVEQ 2881 NNGEYTIVWT AWVAEENLVA SLKLKTWATE IKSSLYGIQP GAAAKNQSTI VADKTIYIAG 2941 DSITVTVVLK DAQGNFITDG VVQLNEENVQ VRNADPIQGN NWVYNGNGQY QRQYMAHFAE 3001 ANLNAQLKMA GWSDANYSNN YTIKPGEVSP LGSQLRIREV LVVEGADLPV SVLLVDDFGN 3061 PVDNGLDLLD DTVYLQNVEK KEGEKWRYVG DGIYERTYMA YQEGENLTSF MEIKGWRIYG 3121 QPSYTILPFV EVELLSVNGV KFRATDGFPE TGFDGAKFTL LLTHNMKNTD YNWTAGIYGI 3181 NVDSNGEVTL SVLIRSEVTI TGKPKNGKGN DVVFKFKIKK WFTSLGATSS NTWDIINTSC 3241 SYGQMPSSLE LAQRPSGGVV PRKVGTLWGE YGNLKIYGNA FSGTDYWTST QLMGVHEKFN 3301 PETGISELGT GKSSGLCVEY Y //