LOCUS       BCA66960.1   577 aa   PRT   BCT   06-NOV-2020
DEFINITION  Escherichia coli phage terminase large subunit protein.
ACCESSION   AP022811-284
PROTEIN_ID  BCA66960.1
SOURCE      Escherichia coli
  ORGANISM  Escherichia coli
            Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales;
            Enterobacteriaceae; Escherichia.
REFERENCE   1  (bases 1 to 5283470)
  AUTHORS   Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases.
            Contact:Ken-ichi Lee
            National Institute of Infectious Diseases, Department of
            Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan
REFERENCE   2
  AUTHORS   Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S.
  TITLE     Global distribution of epidemic-related Shiga toxin 2 encoding
            phages among enteroaggregative Escherichia coli
  JOURNAL   Sci. Rep. 10, 11738 (2020)
  REMARK    Publication Status: Online-Only
            DOI:10.1038/s41598-020-68462-9
COMMENT     ##Genome-Assembly-Data-START##
            Assembly Method       :: Unicycler v. 0.4.7
            Genome Coverage       :: 117.0X
            Sequencing Technology :: Illumina MiSeq, PacBio RSII
            ##Genome-Assembly-Data-END##
FEATURES             Qualifiers
     source          /collection_date="1999"
                     /db_xref="taxon:562"
                     /host="Homo sapiens"
                     /mol_type="genomic DNA"
                     /organism="Escherichia coli"
                     /strain="JE86-ST02"
     protein         /gene="ymfN"
                     /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator"
                     /inference="similar to AA sequence:INSD:ABE08441.1"
                     /locus_tag="JE86ST02C_02840"
                     /note="DFAST-ECOLI:ABE08441.1 phage terminase large subunit [pid:99.8%, q_cov:100.0%, s_cov:100.0%, Eval:0.0e+00]"
                     /note="MGA_285"
                     /transl_table=11
BEGIN
        1 MSRKSYPNVN AANQYARDVV RGKIAACQFV IQACQRHLDD LMAEKSKSFR YRFDKDLAER
       61 AAKFIQLLPH TKGEWAFKRM PITLEPWQLF VICCAFGWVN KGSRLRRFRE VYTEIPRKNG
      121 KSAISAGVAL YCFACDNEFG AEVYSGATTE KQAWEVFRPA RLMCKRTPML TEAFGIEVNA
      181 SNMNRPEDGA RFEPLIGNPG DGSSPHCAVV DEYHEHATDA LYTTMLTGMG ARRQPLMWAI
      241 TTAGYNIEGP CYDKRREVIE MLNGSVPNDE LFGIIYTVDE GDDWTDPQVL EKANPNIGVS
      301 VYREFLLSQQ QRAKNNARLA NVFKTKHLNI WVSARSAYFN LVSWQSCEDK SLTLEQFEGQ
      361 PCILAFDLAR KLDMNSMARL YTREIDGKTH YYSVAPRFWV PYDTVYSVEK NEDRRTAERF
      421 QKWVEMGVLT VTDGAEVDYR YILEEAKAAN KISPVSESPI DPFGATGLSH DLADEDLNPI
      481 TIIQNYTNMS DPMKELEAAI ESGRFHHDGN PIMTWCIGNV VGKTIPGNDD VVKPVKEQAE
      541 NKIDGAVALI MAVGRAMLYE KEDTLSDHIE SYGIRSL
//