LOCUS BCA68301.1 582 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli phage terminase large subunit protein. ACCESSION AP022811-1625 PROTEIN_ID BCA68301.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5283470) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 117.0X Sequencing Technology :: Illumina MiSeq, PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="1999" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST02" protein /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /inference="similar to AA sequence:INSD:ABJ00550.1" /locus_tag="JE86ST02C_16250" /note="DFAST-ECOLI:ABJ00550.1 phage terminase large subunit [pid:99.0%, q_cov:100.0%, s_cov:88.7%, Eval:0.0e+00]" /note="MGA_1626" /transl_table=11 BEGIN 1 MNCMGNDLIR TVNLIKSARV GYTKMLLGVE AYFIEHKSRN SLLFQPTDSA AEDFMKSHVE 61 PTIREVPALL ELAPWFGRKH RDNTLTLKRF SSGVGFWCLG GAAAKNYREK SVDVVCYDEL 121 SSFEPDVEKE GSPTLLGDKR IEGSVWPKSI RGSTPKIKGS CQIEKAANES AHFMRFYVPC 181 PHCGEEQYLK FGDDASPFGL KWEKNKPESV FYLCEHHGCV IHQSELDQSN GRWICENTGM 241 WTRDGLTFFS AADNEIPPPR SITFHIWTAY SPFTTWVQIV YDWLDALKDP NGLKTFVNTT 301 LGETWEEAVG EKIDHQVLMD KVVRYTAAVP ARVVYLTAGI DSQRNRFEMY VWGWAPGEEA 361 FLVDKIIIMG RPDEEETLLR VDAAINKKYR HADGTEMTIS RVCWDIGGID GEIVYQRSKK 421 HGVFRVLPVK GASVYGKPVI TMPKTRNQRG VYLCEVGTDT AKEILYARMK ADPTPVDEAT 481 SYAIRFPDDP EIFSQTEAQQ LVAEELVEKW EKGKMRLLWD NKKRRNEALD CLVYAYAALR 541 VSVQRWQLDL AVLAKSREEE TTRPTLKELA AKLSGGVNGY SR //