LOCUS BCA66789.1 567 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli hypothetical protein protein. ACCESSION AP022811-113 PROTEIN_ID BCA66789.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5283470) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 117.0X Sequencing Technology :: Illumina MiSeq, PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="1999" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST02" protein /gene="yacH" /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /inference="similar to AA sequence:INSD:AAC73228.1" /locus_tag="JE86ST02C_01130" /note="DFAST-ECOLI:AAC73228.1 hypothetical protein [pid:90.5%, q_cov:100.0%, s_cov:94.0%, Eval:1.4e-305]" /note="MGA_113" /transl_table=11 BEGIN 1 MEPQSTQQTA PDITAVTLPA TVSAPPVTPA VVKSAFSTAQ IDQWVAPVAL YPDSLLSQVL 61 MASTYPANVA QAVQWSHDNP LKQGDAAIQA VSDQPWDASV KSLVAFPQLM ALMGENPQWV 121 QNLGDAFLAQ PQDVMDSVQR LRQLAQQTGS LKSSTEQKVI TTTKKAVPVK QTVTAPVIPS 181 NTVLTASPVI TEPATTVISI EPANPDVVYI PNYNPTVVYG NWANTAYPPV YLPPPAGEPF 241 IDSFVRGFGY SMGVATTYAL FSSIDWDDDD HDHHHRDGNG WQHNGDNINI DVNNFNRITG 301 EHLTDKNMAW RHNPNYRNGV PYHDQDMAKR FHQTDVNGGM SATQLPAPTR DSQRQAAASQ 361 FQQRTHAAPV ITRDTQRQAA AQRFNEAENY GSYDDFRDFS RRQPLTQQQK DAARQRYQSA 421 SPEQRQAVRE KMQTNPQIQQ RRDAARERIQ SASPEQRQAV REKMQTNPQI QQRRDAARER 481 IQSASPEQRQ VFKEKVQQRP LNQQQRDNYR QRVQSASPEQ RQVFREKVQE SRPQRLNDSN 541 HTVRLNNEQR SAVRERLSER GARRQER //