LOCUS BCA70023.1 541 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli hypothetical protein protein. ACCESSION AP022811-3347 PROTEIN_ID BCA70023.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5283470) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 117.0X Sequencing Technology :: Illumina MiSeq, PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="1999" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST02" protein /EC_number="1.17.1.4" /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /inference="similar to AA sequence:INSD:AAC75913.1" /locus_tag="JE86ST02C_33470" /note="DFAST-ECOLI:AAC75913.1 hypothetical protein [pid:99.6%, q_cov:100.0%, s_cov:100.0%, Eval:9.0e-310]" /note="MGA_3352" /transl_table=11 BEGIN 1 MNIFTEAAKL EEQNCPFAMA QIVDSRGSTP RHSAQMLVRA DGSIVGTIGG GMVERKVIEE 61 SLQALQERKP RLFHGRMARN GADAVGSDCG GAMSVFISVH GMRPRLVLIG AGHVNRAIAQ 121 SAALLGFDIA VADIYRESLN PELFPPSTTL LHAESFGAAV EALDIRPDNF VLIATNNQDR 181 EALDKLIEKP IAWLGLLASR RKVQLFLRQL REKGVAEEHI ARLHAPVGYN IGAETPQEIA 241 ISVLAEILQV KNNAPGGLMM KPSHPSGHQL VVIRGAGDIA SGVALRLYHA GFKVIMLEVE 301 KPTVIRCTVA FAQAVFDGEM TVEGVTARLA TSSAEAMKLT ERGFIPVMAD PACSLLDELK 361 PLCVVDAILA KQNLGTRADM APVTIALGPG FTAGKDCHAV IETNRGHWLG QVIYSGCAQE 421 NTGVPGNIMG HTTRRVIRAP AAGIMRSNVK LGDLVKEGDV IAWIGEHEIK APLTGMVRGL 481 LNDGLAVVGG FKIGDIDPRG ETADFTSVSD KARAIGGGVL EALMMLMHQG VKATKEVLEV 541 A //