LOCUS BCA71076.1 476 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli hypothetical protein protein. ACCESSION AP022811-4400 PROTEIN_ID BCA71076.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5283470) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 117.0X Sequencing Technology :: Illumina MiSeq, PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="1999" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST02" protein /gene="yihF" /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /inference="similar to AA sequence:INSD:AAN45367.2" /locus_tag="JE86ST02C_43990" /note="DFAST-ECOLI:AAN45367.2 hypothetical protein [pid:98.3%, q_cov:100.0%, s_cov:97.1%, Eval:3.2e-271]" /note="MGA_4408" /transl_table=11 BEGIN 1 MIRKSATGVI VALAVIWGGG TWYTGTQIQP GVEKFIKDFN DAKKKGEHAY DMTLSYKNFD 61 KGFFNSRFQM QMTFDNGAPD LNIKPGQKVV FDVDVEHGPL PITMLMHGNV IPALAAAKVN 121 LVNNELTQPL FIAAKNKSPV EATLRFAFGG SFSTTLDVAP AEYGKFSFGE GQFTFNGDGS 181 SLSNLDVEGK VEDIVLQLSP MNKVTAKSFT IDSLARLEEK KFPVGESESK FNQINIINHG 241 EDVAQIDAFV AKTRLDRVKD KDYINVNLTY ELDKLTKGNQ QLGSGEWSLI AESIDPSAVR 301 QFIIQYNIAM QKQLAAHPEL ANDEVALQEV NAALFKEYLP LLQKSEPTIK QPVKWKNALG 361 ELNANLDISI ADPAKSSSST NKDIKSLNFD VKLPLNVATE TAKQLNLSEG MDAEKAQKRA 421 DKQISGMMTL GQMFQLITID NNTASLQLRY TPGKVVFNGQ EMSEEEFMSR AGRFVH //