LOCUS       BCA71239.1               698 aa    PRT    BCT 06-NOV-2020
DEFINITION  Escherichia coli O-antigen capsule production periplasmic protein.
ACCESSION   AP022811-4563
PROTEIN_ID  BCA71239.1
SOURCE      Escherichia coli
  ORGANISM  Escherichia coli
            Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales;
            Enterobacteriaceae; Escherichia.
REFERENCE   1  (bases 1 to 5283470)
  AUTHORS   Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases.
            Contact:Ken-ichi Lee
            National Institute of Infectious Diseases, Department of
            Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan
REFERENCE   2
  AUTHORS   Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S.
  TITLE     Global distribution of epidemic-related Shiga toxin 2 encoding
            phages among enteroaggregative Escherichia coli
  JOURNAL   Sci. Rep. 10, 11738 (2020)
  REMARK    Publication Status: Online-Only
            DOI:10.1038/s41598-020-68462-9
COMMENT     ##Genome-Assembly-Data-START##
            Assembly Method       :: Unicycler v. 0.4.7
            Genome Coverage       :: 117.0X
            Sequencing Technology :: Illumina MiSeq, PacBio RSII
            ##Genome-Assembly-Data-END##
FEATURES             Qualifiers
     source          /collection_date="1999"
                     /db_xref="taxon:562"
                     /host="Homo sapiens"
                     /mol_type="genomic DNA"
                     /organism="Escherichia coli"
                     /strain="JE86-ST02"
     protein         /gene="gfcD_2"
                     /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator"
                     /inference="similar to AA sequence:INSD:AAC76999.1"
                     /locus_tag="JE86ST02C_45620"
                     /note="DFAST-ECOLI:AAC76999.1 O-antigen capsule production
                     periplasmic protein [pid:99.6%, q_cov:100.0%, s_cov:100.0%,
                     Eval:0.0e+00]"
                     /note="MGA_4574"
                     /transl_table=11
BEGIN
        1 MKKRHLLSLL ALGISTACYG ETYPAPIGPS QSDFGGVGLL QTPTARMARE GELSLNYRDN
       61 DQYRYYSASV QLFPWLETTL RYTDVRTRQY SSVEAFSGDQ TYKDKAFDLK LRLWEESYWL
      121 PQVAVGARDI GGTGLFDAEY LVASKAWGPF DFTLGLGWGY LGTNGNVKNP LCSASDKYCY
      181 RDNSYKQAGS IDGSQMFHGP ASLFGGVEYQ TPWQPLRLKL EYEGNNYQQD FAGKLEQKSK
      241 FNVGAIYRVT DWADVNLSYE RGNTFMFGVT LRTNFNDLRP SYNDNARPQY QPQPQDAILQ
      301 HSVVANQLTL LKYNAGLADP QIQAKGDTLY VTGEQVKYRD SREGIIRANR IVMNDLPDGI
      361 KTIRITENRL NMPQVTTETD VASLKNHLAG EPLGHETKLA QKRVEPVVPQ STEQGWYIDK
      421 SRFDFHIDPV LNQSVGGPEN FYMYQLGVMG TADLWLTDHL LTTGSLFANL ANNYDKFNYT
      481 NPPQDSHLPR VRTHVREYVQ NDVYVNNLQA NYFQHLGNGF YGQVYGGYLE TMFGGAGAEV
      541 LYRPLDSNWA FGLDANYVKQ RDWRSAKDMM KFTDYSVKTG HLTAYWTPSF AQDVLVKASV
      601 GQYLAGDKGG TLEIAKRFDS GVVVGGYATI TNVSKEEYGE GDFTKGVYVS VPLDLFSSGP
      661 TRSRAAIGWT PLTRDGGQQL GRKFLLYDMT SDRSVNFR
//