LOCUS BCA69899.1 899 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli CRISPR-associated helicase/endonuclease Cas3 protein. ACCESSION AP022811-3223 PROTEIN_ID BCA69899.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5283470) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact: Ken-ichi Lee National Institute of Infectious Diseases, Department of Bacteriology I; 1-23-1, Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 
0.4.7 Genome Coverage :: 117.0X Sequencing Technology :: Illumina MiSeq, PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="1999" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST02" protein /gene="ygcB" /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /inference="similar to AA sequence:INSD:ACB16501.1" /locus_tag="JE86ST02C_32230" /note="DFAST-ECOLI:ACB16501.1 CRISPR-associated helicase/endonuclease Cas3 [pid:92.9%, q_cov:100.0%, s_cov:100.0%, Eval:0.0e+00]" /note="MGA_3227" /transl_table=11 BEGIN 1 MRKYPLSLLK DKNIVTFFDF WGKTRRGEKD GGDDYHLLCW HSLDVAAMGY LMVKRNCFGL 61 ADYFRQLGIS DKEQAAQFFA WLLCWHDIGK FARSFQQLYL PPELKIQEGA RKNYEKISHS 121 TLGYWLWNHY LSECQELLPS SSLSPRKLRR VIEMWMPVTT GHHGRPPDRM DELDNFLPED 181 KAAARDFLLE IKPLFPLIEI PAFWDDDEGI ELIKHLSWYI SATVVLADWT GSSTRFFPRV 241 AHPMDIKGYW QKTLIQAQNA LTVFPLKAKV APFNGINTLF PFIENPTPLQ QKVLDLDISQ 301 QGPQLFILED VTGAGKTEAA LILAHRLIAA GKAQGLFFGL PTMATANAMY DRLVKTWLAF 361 YSPESRPSLV LAHSARTLMD RFNESLWSGD LVGSEEPDEQ TFSQGCAAWF ADSNKKALLA 421 EIGVGTLDQA MMAVMPFKHN NLRLLGLSNK ILLADEIHAC DAYMSCILEG LIERQARGGN 481 SVILLSATLS QQQRDKLVAA FARGTEGQQE APFLEKDDYP WLTHVTKSDV NSHRVATRKD 541 VERSVSVGWL HSEQECIARI ESAVSQGKCI AWIRNSVDDA IKVHRQLLAR GVIPASSLSL 601 FHSRFAFSDR QRIETETLAR FGKYCSLQRA SQVIVCTQVI EQSVDIDLDE MISDLAPVDL 661 LIQRAGRLQR HIRDINGQLK RDGKDERSPP ELLILAPVWD DSPGDEWFGS AMRNSAFVYP 721 DHGRIWLTQR VLREQGAIQM PHAARLLIES VYGEDVVMPE GFARSEQEQV GKYYCDRAMA 781 KKFVLNFRPG YAANINDYLP EKLSTRLAEE SVSLWLATCI DGVVKPYATG AHAWEMSVVR 841 VRRSWWKKHR DEFSLLEGEA FRLWCIEQRQ DPEMANVILV NDDESCGYSA TEGLIGKVG //