LOCUS BCA70621.1 540 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli hypothetical protein protein. ACCESSION AP022811-3945 PROTEIN_ID BCA70621.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5283470) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 117.0X Sequencing Technology :: Illumina MiSeq, PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="1999" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST02" protein /gene="yhiJ" /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /inference="similar to AA sequence:INSD:AAC76513.1" /locus_tag="JE86ST02C_39450" /note="DFAST-ECOLI:AAC76513.1 hypothetical protein [pid:96.9%, q_cov:100.0%, s_cov:100.0%, Eval:8.7e-313]" /note="MGA_3952" /transl_table=11 BEGIN 1 MKIGTVAGTN DSITTIATND MVQEHVTNFT KELFGYIANG IGDDISSIAR TMLGEVVEKI 61 DDWQIERFQQ SIQDDKISFT IQTDHSEKYS MLSGMRAHIL RRNNSYQFIV TINSKNYGCP 121 LDDTDINWCS IVYLLNNMTV NDNANDVAVT ESYKPIWNWK ISQYNVSDIK FETIIKLQFA 181 DRTYFSNCSL VDPTSTRPTY FGDTDGSVGA VLYALFATGH LGIMAEGVNF LSQLLNIEDE 241 VLNVLLRENF NEQLDTNVNT IISILNRRDI ILESLQPYLV INKDAVTPCT FLGDQTGDRF 301 SNICGDQFII DLLKRIMSIN DNVHVLAGNH ETNCNGNYMQ NFTRMKPLDE DTYAGIKDYP 361 VCFYDSKYKI MANHHGITFD DQRKRYIIGP ITVSIDEMTN ALDPVELAAI INKKHHTIIN 421 GKKFKTSRAI SCRSFNRYFS VSTDYRPKLE ALLACSQMLG INQVVAHNGN GGRERIGETG 481 TVLGLNARDS KHAGRMFSMH NCQINPGAGP EITTPWKSYQ HEKNKNGLMP LIRRRTMLQL //