LOCUS       BCA70126.1               407 aa    PRT    BCT 06-NOV-2020
DEFINITION  Escherichia coli general secretion pathway protein GspF protein.
ACCESSION   AP022811-3450
PROTEIN_ID  BCA70126.1
SOURCE      Escherichia coli
  ORGANISM  Escherichia coli
            Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales;
            Enterobacteriaceae; Escherichia.
REFERENCE   1  (bases 1 to 5283470)
  AUTHORS   Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases.
            Contact:Ken-ichi Lee
            National Institute of Infectious Diseases, Department of
            Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan
REFERENCE   2
  AUTHORS   Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S.
  TITLE     Global distribution of epidemic-related Shiga toxin 2 encoding
            phages among enteroaggregative Escherichia coli
  JOURNAL   Sci. Rep. 10, 11738 (2020)
  REMARK    Publication Status: Online-Only
            DOI:10.1038/s41598-020-68462-9
COMMENT     ##Genome-Assembly-Data-START##
            Assembly Method       :: Unicycler v. 0.4.7
            Genome Coverage       :: 117.0X
            Sequencing Technology :: Illumina MiSeq, PacBio RSII
            ##Genome-Assembly-Data-END##
FEATURES             Qualifiers
     source          /collection_date="1999"
                     /db_xref="taxon:562"
                     /host="Homo sapiens"
                     /mol_type="genomic DNA"
                     /organism="Escherichia coli"
                     /strain="JE86-ST02"
     protein         /gene="gspF"
                     /inference="COORDINATES:ab initio
                     prediction:MetaGeneAnnotator"
                     /inference="similar to AA sequence:INSD:BAG78767.1"
                     /locus_tag="JE86ST02C_34500"
                     /note="DFAST-ECOLI:BAG78767.1 general secretion pathway
                     protein GspF [pid:93.9%, q_cov:100.0%, s_cov:100.0%,
                     Eval:7.1e-208]"
                     /note="MGA_3455"
                     /transl_table=11
BEGIN
        1 MALFYYQALE RNGRKTKGMI EADSARHARQ LLRGKELIPV HIEARMNTSS GGMLQRRRHA
       61 HRRVAAADLA LFTRQLATLV QAAMPLETCL QAVSEQSEKL HVKSLGMALR SRIQEGYTLS
      121 DSLREHPRVF DSLFCSMVAA GEKSGHLDVV LNRQADYTEQ RQRLKSRLLQ AMLYPLVLLV
      181 VATGVVTILL TAVVPKIIEQ FDHLGHALPA STRTLIAMSD ALQASGVYWL AGLLGLLVLG
      241 QRLLKNPAMR LRWDQTVLRL PVTGRVARGL NTARFSRTLS ILTASSVPLL EGIQTAAAVS
      301 ANRYVEQQLL LAADRVREGS SLRAALAELR LFPPMMLYMI ASGEQSGELE TMLEQAAVNQ
      361 EREFDTQVGL ALGLFEPALV VMMAGVVLFI VIAILEPMLQ LNNMVGM
//