LOCUS BCA67537.1 700 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli phage terminase large subunit protein. ACCESSION AP022811-861 PROTEIN_ID BCA67537.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5283470) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 117.0X Sequencing Technology :: Illumina MiSeq, PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="1999" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST02" protein /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /inference="similar to AA sequence:INSD:BAB34248.1" /locus_tag="JE86ST02C_08610" /note="DFAST-ECOLI:BAB34248.1 phage terminase large subunit [pid:98.7%, q_cov:100.0%, s_cov:100.0%, Eval:0.0e+00]" /note="MGA_862" /transl_table=11 BEGIN 1 MLNQETAKAA RTDSGYILRA PRRMRVADAV AQYMRVPMGA GNSVPWDPLV APYVIEPMNC 61 LASREYDAVI FVGPARTGKT IGLIDGWVIY NVICDPADML IIQMTEEKAR EHSKKRLART 121 FRVSPEVVSR LSPNKNDNNV YDRTFLAGNY LKIGWPSVNI MSSSDYKCVA LTDYDRFPED 181 IDGEGDAFSL ASKRTTTFMS SGMTLVESSP GRDVKDVKWR RTSPHEAPPT TGILSLYNRG 241 DRRRWYWPCP HCGEYFQPCG DVVAGFRDIA DPVLASEAAY IQCPSCSGRI MPEQKRELNG 301 RGVWLRDGES INADGSRYGD PRRSRIASFW MEGPAAAYQT LSQLVYKLLT AEQEYETTGS 361 EETLKTVINT DWGLPYLPRA SMEQRKSELL EQRAEPVPSR SVPDGVNFLV ATVDVQAGRH 421 RRFVVQVTGY GSRGERWIID RYNITQSLRG DSDGESQRID PASYPEDWDV LLTDVFHKSW 481 PLASDPSQQM RLMAMAVDSG GEDGVTDNAY KFWRRCRRDG LGKRIYLFKG DSIRRAKLIT 541 RTFPDNTGRT GRRAQAAGDV PLWLLQTDAL KDRVNNALWR DSPGPGYVHF PDWLGSWFYD 601 ELTYEERSSD GKWSKPGRGA NEAFDLMVYA EALVILHGYE KIRWPDAPEW ASRETWLECV 661 PDSTKPSPSP EPVSTPVKKQ KRKKTVTDDV NPWLTSGGWL //