LOCUS BCA67556.1 635 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli hypothetical protein protein. ACCESSION AP022811-880 PROTEIN_ID BCA67556.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5283470) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 117.0X Sequencing Technology :: Illumina MiSeq, PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="1999" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST02" protein /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /locus_tag="JE86ST02C_08800" /note="DFAST-ECOLI:CAU97268.1 phage tail fiber protein [pid:76.4%, q_cov:100.0%, s_cov:56.7%, Eval:6.4e-278, partial hit]" /note="MGA_881" /note="WP_000216502.1 tail fiber protein (Escherichia coliO104:H4 str. 2011C-3493) [pid:75.6%, q_cov:100.0%,s_cov:63.5%, Eval:1.3e-273, partial hit]" /note="internal stop codon at [933065:933067](+)" /transl_table=11 BEGIN 1 MEDASLTTKG VVKLSSAVDS TSESLAATPK AVKAANDNAN SRVPSNRKVN GKALTADITL 61 TPKDIGTLNS VTMSFSGGAG WFKLATVTMP QASSIVYIAL IGGAGYNVGS PHQAGISELV 121 LRAGNGNPKG ITGALWKRTA VGLTNFAWIN TSGDTYDIYV EIGNYATRVN IHWDCTANAT 181 VSIYTSPTYS ASKPPSVTDG VVYTMYSTHQ KPTPLDIGAL PTTGGTVSGP LSVTGGLTGS 241 LNGNASTATK LQTARSIGGV VFDGSANINL PGVNTTGNQN TTGNAATATK LQTARKISGV 301 PFDGSTDITL TAAHVAAFAR RATDAYADAD GGVPWNAESG AYNVTRSGDS YILVNFYTGV 361 GSCRTLQMKA HYRNGGLFYR SSRDGYGFEE DWAEVYTSKN LPPESYPVGA PIPWPSDTVP 421 SGYALMQGQT FDKSAYPKLA AAYPSGVIPD MRGWTIKGKP ASGRAVLSQE QDGIKSHTHS 481 ASASSTDLGT KTTSSFDYGT KSTNNTGAHT HSISGTANSA GAHQHKSSGA FGGTNTSIFP 541 NGYTAISNPS AGIMSTTSGS GQTRNAGKTS SDGAHTHSLS GTAASAGAHA HTVGIGAHTH 601 SVAIGSHGHT ITVNAAGNAE NTVKNIAFNY IVRLA //