LOCUS BCA67228.1 504 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli IS21 family transposase protein. ACCESSION AP022811-552 PROTEIN_ID BCA67228.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5283470) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 117.0X Sequencing Technology :: Illumina MiSeq, PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="1999" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST02" protein /gene="istA" /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /inference="similar to AA sequence:RefSeq:WP_001324342.1" /locus_tag="JE86ST02C_05520" /note="DFAST-ECOLI:BAI58011.1 transposase [pid:27.2%, q_cov:69.8%, s_cov:86.7%, Eval:3.6e-26, partial hit]" /note="MGA_553" /note="WP_001324342.1 IS21 family transposase (Escherichiacoli O83:H1 str. NRG 857C) [pid:98.8%, q_cov:100.0%,s_cov:99.4%, Eval:6.8e-296]" /transl_table=11 BEGIN 1 MAILSAIRRW HFRDGASIRE IARRSGLSRN TVRKYLQSKV VEPQYPARDS VGKLSPFEPK 61 LRQWLSTEHK KTKKLRRNLR SMYRDLVALG FTGSYDRVCA FARQWKDSEQ FKAQTSGKGC 121 FIPLRFACGE AFQFDWSEDF ARIAGKQVKL QIAQFKLAHS RAFVLRAYYQ QKHEMLFDAH 181 WHAFQIFGGI PKRGIYDNMK TAVDSVGRGK ERRVNQRFTA MVSHYLFDAQ FCNPASGWEK 241 GQIEKNVQDS RQRLWQGAPD FQSLADLNVW LEHRCKALWS ELRHPELDQT VQEAFADEQG 301 ELMALPNAFD AFVEQTKRVT STCLVHHEGN RYSVPASYAN RAISLRIYAD KLVMAAEGQH 361 IAEHPRLFGS GHARRGHTQY DWHHYLSVLQ KKPGALRNGA PFAELPPAFK KLQSILLQRP 421 GGDRDMVEIL ALVLHHDEGA VLSAVELALE CGKPSKEHVL NLLGRLTEEP PPKPIPIPKG 481 LRLTLEPQAN VNRYDSLRRA HDAA //