LOCUS BAX21446.1 754 aa PRT BCT 27-MAR-2019 DEFINITION Escherichia coli ComEC family competence protein protein. ACCESSION AP017620-2104 PROTEIN_ID BAX21446.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5042704) AUTHORS Sekizuka,T., Kawanishi,M., Ohnishi,M., Shima,A., Kato,K., Yamashita,A., Matsui,M., Suzuki,S. and Kuroda,M. TITLE Direct Submission JOURNAL Submitted (27-JUL-2016) to the DDBJ/EMBL/GenBank databases. Contact:Tsuyoshi Sekizuka National Institute of Infectious Diseases, Pathogen Genomics Center; 1-23-1 Toyama, Shinjuku-ku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Sekizuka,T., Kawanishi,M., Ohnishi,M., Shima,A., Kato,K., Yamashita,A., Matsui,M., Suzuki,S. and Kuroda,M. TITLE Elucidation of quantitative structural diversity of remarkable rearrangement regions, shufflons, in IncI2 plasmids JOURNAL Sci. Rep. 7, 928 (2017) REMARK Publication Status: Online-Only DOI:10.1038/s41598-017-01082-y COMMENT ##Genome-Assembly-Data-START## Assembly Method :: HGAP Assembly v. 3.0; Celera Assembler v. 8.2 Genome Coverage :: 200x Sequencing Technology :: MiSeq; PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="2013" /country="Japan" /db_xref="taxon:562" /host="Bos taurus" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="MRY15-131" protein /locus_tag="MRY15131_c21040" /transl_table=11 BEGIN 1 MKITTVGVCI ICGIFPLLIL PQLPGTVTLA FLTLFACVLA FIPVKTVRYI ALTLLFFVWG 61 ILAAKQILWA GETLTGATQD AIVEITATDG MTTHYGQITH LQSRRIFPAP GLVLYGEYLP 121 QAVCAGQVWS MKLKVRAVHG QLNDGGFDSQ RYAIAQHQPL TGRFLQASVI EPNCSLRAQY 181 LASLQTTLQP YPWNAVILGL GMGERLSVPK EIKNIMRDTG TAHLMAISGL HIAFAALLAA 241 GLIRSGQIFL PGRWIHWQMP LIGGICCAAF YAWLTGMQPP ALRTVVALAM WGMLKLSGRQ 301 WSGWDVWICC LAAILLMDPV AILSQSLWLS AAAVAALIFW YQWFPCPEWQ LPPVLRAVVA 361 LIHLQLGITL LLMPVQIVVF HGISLTSFIA NLFAIPLVTF ITVPLILAAM VMHLSGPLIV 421 EQGVWFLADR SLALLFWGLK SLPEGWINIA ERWQWLSFSP WFLLVVWRLN AWRTLPAMCV 481 AGGLLMCWPL WQKPRPDEWQ VYMLDVGQGL AMVIARNGKA ILYDTGLAWP EGDSGQQLII 541 PWLHWHNLEP EGVILSHEHL DHRGGLDSIL HTWPMLWIRS PLNWEHHQPC VRGEAWQWQG 601 LRFSAHWPLQ GSNDKGNNHS CVVKVDDGTN SILLTGDIEA PAEQKMLSRY WQQMQATLLQ 661 VPHHGSNTSS SLPLIQRVNG KVALASASRY NAWRLPSSKV KHRYQQQGYK WLDTPHQGQI 721 TVNFSAQGWR ISSLREQILP RWYHQWFGVP VDNG //