LOCUS BCA67342.1 529 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli hypothetical protein protein. ACCESSION AP022811-666 PROTEIN_ID BCA67342.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5283470) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 117.0X Sequencing Technology :: Illumina MiSeq, PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="1999" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST02" protein /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /inference="similar to AA sequence:INSD:BAB38393.1" /locus_tag="JE86ST02C_06660" /note="DFAST-ECOLI:BAB38393.1 hypothetical protein [pid:95.9%, q_cov:95.8%, s_cov:99.8%, Eval:4.7e-287]" /note="MGA_667" /transl_table=11 BEGIN 1 MGRIIDTDGK PFSFDPEMQS AALDIPQIAS RYIEHPASGI TPNRAAQCLR GAERGDLIAQ 61 SDLAADIEEK DTHLFAELGK RRLAIQGVPW SIEPPPNASA NEKKDAEMLD EYLHSADWFD 121 AMLFDATDAI LKGYSCMEIE HGMLGKMHII RAIRWRDSGH FCLNPDDLSE LRLRDGSHSG 181 VAFQPFGWIV HQSRSRTGYG GATGLVRTLI WPFIFKNYSV RDLAEFLEVY GLPMKVGKYP 241 SGATPEQKSA LMRAVMDIGR RTGGIIPAGM SLEFQAAANG QADPFETMIS WGERSISKAI 301 LGGTLTTEAG DKGARSLGEV HNEVRREIRD SDLRQLAATL NRDLVYPLYA LNTTHTIDIR 361 RLPRICFQTK EPGDITKITS AVMQLSTGMD ISDPWIREQT GIPQPTPGEA IFRVRQSGNE 421 PAQTDRETPP EKQEKTEQTA LSARLPEAKS TPRDELDDMG DAVPARRLQE AIDPVLEPVI 481 DAIRTRGLAD ALADLPALYR EMDDSRLMTL LSDAMFAAEM KGMLDDTGD //