LOCUS BCA69870.1 420 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli hypothetical protein protein. ACCESSION AP022811-3194 PROTEIN_ID BCA69870.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5283470) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 117.0X Sequencing Technology :: Illumina MiSeq, PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="1999" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST02" protein /gene="ygbK" /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /inference="similar to AA sequence:INSD:AAN81747.1" /locus_tag="JE86ST02C_31940" /note="DFAST-ECOLI:AAN81747.1 hypothetical protein [pid:98.3%, q_cov:100.0%, s_cov:100.0%, Eval:3.8e-236]" /note="MGA_3198" /transl_table=11 BEGIN 1 MIKIGVIADD FTGATDIASF LVENGLPTVQ INGVPTGKMP EAIDALVISL KTRSCPVVEA 61 TQQSLAALSW LQQQGCKQIY FKYCSTFDST AKGNIGPVTD ALMDALDTPF TVFSPALPVN 121 GRTVYQGYLF VMNQLLAESG MRHHPVNPMT DSYLPRLVEA QSTGLCGVVS AHVFEQGVDA 181 VRQELARLQQ EGYRYAVLDA LTEHHLEIQG EALRDAPLVT GGSGLAIGLA RQWAQENGNQ 241 AREAGRPLAG RGVVLSGSCS QMTNRQVAHY RQIAPAREVD VARCLSTETL AAYAHELAEW 301 VLGQESVLAP LVFATASTDA LAAIQQQYGA QKASQAVETL FSKLAARLAA EGVTRFIVAG 361 GETSGVVTQS LGIKGFHIGP TISPGVPWVN ALDKPVSLAL KSGNFGDEAF FSRAQREFLS //