LOCUS       BBL41608.1    645 aa    PRT    BCT    11-OCT-2019
DEFINITION  Escherichia coli O111:H- major capsid protein.
ACCESSION   AP019761-1889
PROTEIN_ID  BBL41608.1
SOURCE      Escherichia coli O111:H-
  ORGANISM  Escherichia coli O111:H-
            Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia.
REFERENCE   1 (bases 1 to 5302257)
  AUTHORS   Sekizuka,T., Iyoda,S., Isobe,J., Mitobe,J., Lee,K., Sata,T., Kuroda,M., Ohnishi,M. and Watahiki,M.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-JUN-2019) to the DDBJ/EMBL/GenBank databases. Contact:Tsuyoshi Sekizuka National Institute of Infectious Diseases, Pathogen Genomics Center; 1-23-1 Toyama, Shinjuku-ku, Tokyo 162-8640, Japan
REFERENCE   2
  AUTHORS   Sekizuka,T., Lee,K., Kimata,K., Isobe,J., Kuroda,M., Iyoda,S., Ohnishi,M., Sata,T. and Watahiki,M.
  TITLE     Complete Genome Sequence of an Enterohemorrhagic Escherichia coli O111:H8 Strain Recovered from a Large Outbreak in Japan Associated with Consumption of Raw Beef
  JOURNAL   Microbiol Resour Announc 8, e00882-19 (2019)
  REMARK    Publication Status: Online-Only DOI:10.1128/MRA.00882-19
COMMENT     ##Genome-Assembly-Data-START##
            Assembly Method :: HGAP Assembly v. 3.0; Pilon v. 1.18; A5 MiSeq v. 20140513
            Genome Coverage :: 200x
            Sequencing Technology :: MiSeq; PacBio RSII
            ##Genome-Assembly-Data-END##
FEATURES    Qualifiers
source
            /collection_date="2011-05-01"
            /country="Japan"
            /db_xref="taxon:168927"
            /host="Homo sapiens"
            /isolation_source="bloody diarrhea"
            /mol_type="genomic DNA"
            /note="Enterohemorrhagic Escherichia coli ; EHEC"
            /organism="Escherichia coli O111:H-"
            /serotype="O111:H-"
            /strain="110512"
protein
            /locus_tag="EC110512_C18890"
            /transl_table=11
BEGIN
        1 MTLKRACSLL TVKSFSEDER VITGIASTPS PDRDGDILEP EGAEFGSAIP FLWQHDHSRP
       61 VGQCTVRRVS EGLEITATLV KPVPDMPSQL AARLDEVWAA IKTGLVRGLS VGFRPHEYTF
      121 LDGGGLHFLR WELMEVSAVT VPANAECTIR TIKSYDRPFS AASGNRKPVV KIASSAGAAA
      181 QSTTVFHKEK TIMNIGEQIK SFENKRAALA ASLEEVMTKA AEEGRTLDVE EEEHYDNTAA
      241 EIRQVDAHLK RLRELEAGKA ATAQPVKQAG NGNVAAVASA PVIRVEQKLD KGIGFARFAK
      301 SLAAAKGVRS EALEVARRQY PDDSCLHHVL KSAVGAGTTT DPQWAGSLSE YQEYAQDFID
      361 YLRPQTIIGR FGQGGIPALR QVPFNIRVHA QVSGGAAGWV GEGKARPLTK FDFESITFSH
      421 AKVSAIAVLT EELIRFSSPA ADALVRNALA EAVVARLDTD FVDPKKAAVA DVSPASITHD
      481 VKGTASTGNP DADAEAAFGQ FVAANLQPTG AVWLMSSTNA LALSMRKNAL GQKEYPDMTL
      541 LGGSFQGLPV IVSQYVGDQL VLVNAPDIYL ADDGGVAVDM SREASLEMQS EPTGDSTTPS
      601 PVELVSMFQT GSVAIRAERW INWRRRRTAA VAVITGVNYG SASGG
//