LOCUS BCA69236.1 666 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli hypothetical protein protein. ACCESSION AP022811-2560 PROTEIN_ID BCA69236.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5283470) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 117.0X Sequencing Technology :: Illumina MiSeq, PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="1999" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST02" protein /gene="yehQ" /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /inference="similar to AA sequence:INSD:BAB36354.1" /locus_tag="JE86ST02C_25600" /note="DFAST-ECOLI:BAB36354.1 hypothetical protein [pid:96.1%, q_cov:100.0%, s_cov:89.5%, Eval:0.0e+00]" /note="MGA_2562" /transl_table=11 BEGIN 1 MNSLRPELLE LTPQALTALS NAGFVKRSLK ELENGNVPEI SHENGALIAT FSDGVRTQLA 61 NSQALKEAQC SCGASGMCRH RVMLVLSYQR LCATVQPTEK EEEWDPAIWL EELATLPDAT 121 RKRAQALVAK GITIELFCAP GEIPSARLPM SDVRFYSRSS IRFARCDCIE GTLCEHVVLA 181 VQAFVEAKTQ QAEFTHLIWQ MRSEHVTSSD DPFANDEGNA CRQYVQQLSQ ALWLGGISQP 241 LIHYEAAFSR AQQAAERCNW RWVSESLRQL RASVDAFHAR ASHYHAGECL RQLAALNSRL 301 NCAQEMARRD SVGEVPPVPW RTVVGSGIAG EAKLDHLRLV SLGMRCWQDI EHYGLRIWFT 361 DPDTGSILHL SRSWPRSEQE NSPAATRRLF SFQAGALAGG QIVSQAAKRS ADGELLLATR 421 NRLSSVVPLS PDAWQMLSAP LRQPGIVALR EYLRQRPPAC IRPLNQVDNL FILPVAECIS 481 LGWDSSRQTL DAQVISGEGE DNLLTLSLPA SACSPFAVER MAALLQQTDD PVSLVSGFVS 541 FVDGQLTLEP RVMMTKTRAW ALDAETAPVA PLPSASVLPV PSTAHQLLMR CQALLIQLLH 601 NGWRYQEQSA ISQAELLAND LSAVGFYRLA HVLGQFRNTE SEARVEAMNN GVLLCEQLFP 661 MLQQQG //