LOCUS       BCA70128.1   686 aa   PRT   BCT   06-NOV-2020
DEFINITION  Escherichia coli general secretion pathway protein GspD.
ACCESSION   AP022811-3452
PROTEIN_ID  BCA70128.1
SOURCE      Escherichia coli
  ORGANISM  Escherichia coli
            Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales;
            Enterobacteriaceae; Escherichia.
REFERENCE   1 (bases 1 to 5283470)
  AUTHORS   Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases.
            Contact: Ken-ichi Lee
            National Institute of Infectious Diseases, Department of
            Bacteriology I; 1-23-1, Toyama, Shinjuku, Tokyo 162-8640, Japan
REFERENCE   2
  AUTHORS   Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S.
  TITLE     Global distribution of epidemic-related Shiga toxin 2 encoding
            phages among enteroaggregative Escherichia coli
  JOURNAL   Sci. Rep. 10, 11738 (2020)
  REMARK    Publication Status: Online-Only
            DOI:10.1038/s41598-020-68462-9
COMMENT     ##Genome-Assembly-Data-START##
            Assembly Method       :: Unicycler v. 0.4.7
            Genome Coverage       :: 117.0X
            Sequencing Technology :: Illumina MiSeq, PacBio RSII
            ##Genome-Assembly-Data-END##
FEATURES             Qualifiers
     source          /collection_date="1999"
                     /db_xref="taxon:562"
                     /host="Homo sapiens"
                     /mol_type="genomic DNA"
                     /organism="Escherichia coli"
                     /strain="JE86-ST02"
     protein         /gene="gspD"
                     /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator"
                     /inference="similar to AA sequence:INSD:BAG78769.1"
                     /locus_tag="JE86ST02C_34520"
                     /note="DFAST-ECOLI:BAG78769.1 general secretion pathway protein GspD [pid:98.7%, q_cov:100.0%, s_cov:100.0%, Eval:0.0e+00]"
                     /note="MGA_3457"
                     /transl_table=11
BEGIN
        1 MFWRDITLSV WRKKTTGLKT KKRLLPLVLA AALCSSPVWA EEATFTANFK DTDLKSFIET
       61 VGANLNKTII MGPGVQGKVS IRTMTPLNER QYYQLFLNLL EAQGYAVVPM ENDVLKVVKS
      121 SAAKVEPLPL VGEGSDNYAG DEMVTKVVPV RNVSVRELAP ILRQMIDSAG SGNVVNYDPS
      181 NVIMLTGRAS VVERLTEVIQ RVDHAGNRTE EVIPLDNASA SEIARVLESL TKNSGENQPA
      241 TLKSQIVADE RTNSVIVSGD PATRDKMRRL IRRLDSEMER SGNSQVFYLK YSKAEDLVDV
      301 LKQVSGTLTA AKEEAEGTVG SGREIVSIAA SKHSNALIVT APQDIMQSLQ SVIEQLDIRR
      361 AQVHVEALIV EVAEGSNINF GVQWASKDAG LMQFANGTQI PIGTLGAAIS QAKPQKGSTV
      421 ISENGATTIN PDTNGDLSTL AQLLSGFSGT AVGVVKGDWM ALVQAVKNDS SSNVLSTPSI
      481 TTLDNQEAFF MVGQDVPVLT GSTVGSNNSN PFNTVERKKV GIMLKVTPQI NEGNAVQMVI
      541 EQEVSKVEGQ TSLDVVFGER KLKTTVLAND GELIVLGGLM DDQAGESVAK VPLLGDIPLI
      601 GNLFKSTADK KEKRNLMVFI RPTILRDGMA ADGVSQRKYN YMRAEQIYRD EQGLSLMPHT
      661 AQPVLPAQNQ ALPPEVRAFL NAGRTR
//