LOCUS BCA68494.1 344 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli hypothetical protein protein. ACCESSION AP022811-1818 PROTEIN_ID BCA68494.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5283470) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 117.0X Sequencing Technology :: Illumina MiSeq, PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="1999" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST02" protein /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /locus_tag="JE86ST02C_18180" /note="DFAST-ECOLI:ADD56394.1 type VI secretion system Vgrfamily protein [pid:99.1%, q_cov:100.0%, s_cov:48.2%,Eval:4.0e-201, partial hit]" /note="MGA_1819" /note="WP_000103276.1 type IV secretion protein Rhs (Escherichia coli O157:H7 str. Sakai) [pid:98.8%, q_cov:100.0%, s_cov:49.0%, Eval:4.7e-199, partial hit]" /note="frameshifted, deletion at around 1847208" /transl_table=11 BEGIN 1 MLKPLVDGPQ SAVVTGPAGE EIFCDEHGRV RVKFNWDRYN PSNQDSSCWI RVAQAWAGTG 61 FGNLAIPRVG QEVIVDFLNG DPDQPIIMGR TYHHENRTPG SLPGTKTQMT IRSKTYKGSG 121 FNELKFDDAT GKEQVYIHAQ KNMNTEVLNN RTTDVINNHA ETIGNNQMIA VTNNQIQTVG 181 VNQIETVGSN QIIKVGSVQV ETIGLVRALT VGVAYQTTVG GIMNTSVALM QSSQIGLHKS 241 LRVGLGYDVK VGNNVTFTVG KTKKDDTGQT AIYSAGEHLE LCCGKARLVL TKDGQIFLNG 301 TKIHLQGKEQ VNGDSLLINW NCAASKSPPK TPDEKQDTPD MREY //