LOCUS BCA67005.1 720 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli hypothetical protein protein. ACCESSION AP022811-329 PROTEIN_ID BCA67005.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5283470) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 117.0X Sequencing Technology :: Illumina MiSeq, PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="1999" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST02" protein /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /locus_tag="JE86ST02C_03290" /note="DFAST-ECOLI:CBI99862.1 invasin [pid:81.8%, q_cov:99.9%, s_cov:50.8%, Eval:0.0e+00, partial hit]" /note="MGA_330" /note="WP_000092564.1 intimin-like adhesin FdeC (Escherichia coli O104:H4 str. 2011C-3493) [pid:93.9%, q_cov:100.0%, s_cov:50.8%, Eval:0.0e+00, partial hit]" /note="frameshifted, insertion at around 353144, deletion at around 352394,352388,352340,353175" /transl_table=11 BEGIN 1 MVGGTVTAIW TVKDAYDNPV TSLTPEAPSL AGAAAVGSTA SGWTNNGDGT WTAQITLGST 61 AGELEVMPKL NGQDAAANAA KVTVVADALS SNQSKVSVAE DHVKAGESTT VTLVAKDAHG 121 NAISGLSLSA SLTGTASEGA TVSSWTEKGD GSYVATLTTG GKTGELRVMP LFNGQPAATE 181 AVQLTVIAGE MSSANSTLVA DNKAPTVKTT TELTFTVKDA YGNPVTGMKP DAPVFSGAAN 241 TGSERPSAGN WTEKGNGVYV STLTLGSAAG QLSVMPRVNG QNAVAQPLVL NVAGDASKAE 301 IGDMTVKVDN QLANGQSTNQ VTLTVVDTYG NPLQGQEVTL NLPQGVTSKT GNTVTTNAAG 361 KSDIELISTV AGELEIAAAV KNSQKTVTVK FNADASTGQA NLQVDTAVQK VANGKDAFTL 421 TATVEDKNGN PVPGSLVTFN LPRGVKPLTG DNVWVKANDE GKAELQVVSV TAGTYEITAS 481 AGNSQPSNTQ TITFVADKAT ATVSGIEVMG NYALADGKAK QTYKVTVTDA NNNLVKDSEV 541 TLTASPASLN LEPNGTATTN EQGQAIFTAT TTVAATYTLK AQVSQTNGQV STKTAESKFV 601 ADDKNAVLTA SSDMQSLVAD GKSTAKLEVT LMSANNPVGG NMWVDIQTPE GVTEKDYQFL 661 PSKNDHFVSG KITRKFSTSK PGVYTFTFNA LTYGGYEMKP VTVTITAVDA DTAKDEEAMK //