LOCUS BCA68050.1 684 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli hypothetical protein protein. ACCESSION AP022811-1374 PROTEIN_ID BCA68050.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5283470) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 117.0X Sequencing Technology :: Illumina MiSeq, PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="1999" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST02" protein /gene="yjhS_5" /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /inference="similar to AA sequence:INSD:BAI30503.1" /locus_tag="JE86ST02C_13740" /note="DFAST-ECOLI:BAI30503.1 hypothetical protein [pid:72.9%, q_cov:85.4%, s_cov:90.2%, Eval:2.5e-259]" /note="MGA_1375" /transl_table=11 BEGIN 1 MSDSKWKMLS AMPGEFSVKV AGGTVAFIES PFRPSGNKGG ITFADCVIRF STKEPLWVMP 61 VSGNPSAEIT SSGVTGIIPI TSDVAGTLTP SDWNAPDNAG PSGSTNGSSS EPEYYFVVIL 121 AGQSNGMAYG EGLPLPDSFD RPDPRIKQLA RRSTVTPNGT SCTYNDIIPA DHCLHDVQDM 181 SGINHPKADL AKGQYGTVGQ GLHIAKKLLP YIPQNAGILL VPCCRGGSAF TTGADGSFSE 241 ASGASADSSR WGAGKPLYQD LVSRTKVALA KNPKNKLLAV VWMQGEADLA SGSQQHNSLF 301 TAMVQQFRTD LAPLAAQCVS GNATTVPWIC GDTTYYWKNA GTDKYEAVYG GYKGKEAQKI 361 FFVPFLTDEN GQSTPTNAPA EDPDIVAVGY YGAASRTQGS FVSTQRDSHF SSWARRGVIS 421 DRLASAIMLH AGRTAELMRG QTVTPPDEKP SPDTPSKPST PPADTTTMST LFAYRASESE 481 GRLTPQGWAA GGGKAQIVDD AGASGGKAMK LTKETGRAPW YLEHDAGNGA DLLGKGGLVS 541 CRFKLDGALT ANQYALALYW PVSALPQGVT LEGNAGHNLL ASFYVQSDAT DLNVMYHKGN 601 TAQNTKLGSF GAFNNEWHTL GFRFAGNNSI EVTPVIDGKD GTPFMLSQSP VGTFTADKLR 661 VTDITKNATY PVLIDSIVVE VNNA //