LOCUS BCA71157.1 1394 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli type IV secretion protein Rhs protein. ACCESSION AP022811-4481 PROTEIN_ID BCA71157.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5283470) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 117.0X Sequencing Technology :: Illumina MiSeq, PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="1999" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST02" protein /gene="rhsD_2" /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /inference="similar to AA sequence:INSD:CAV01146.1" /locus_tag="JE86ST02C_44800" /note="DFAST-ECOLI:CAV01146.1 type IV secretion protein Rhs [pid:97.4%, q_cov:100.0%, s_cov:100.0%, Eval:0.0e+00]" /note="MGA_4490" /transl_table=11 BEGIN 1 MSGKPAARQG DMTQYGGSIV QGSAGVRIGA PTGVACSVCP GGVTSGHPVN PLLGAKVLPG 61 ETDIALPGPL PFILSRTYSS YRTKTPAPVG SLGPGWKMPA DIRLQLRDNT LILSDNGGRS 121 LYFEHLFPGE DGYSRSESLW LVRGGVAKLD EGHRLAALWQ ALPEELRLSP HRYLATNSPQ 181 GPWWVLGWCQ RVPEADEVLP APLPPYRVLT GLVDRFGRTQ TFHHEAAGEF SGEITGVTDG 241 AGRHFRLVLT TQAQRAEEAR QQAISGGTEP SAFPDTLPGY TEYGRDNGIR LSAVWLTHDP 301 EYPENLPAAP LVRYGWTPRG ELAAVYDRSN TQVRSFTYDD KYRGRMVAHR HTGRPEIRYR 361 YDSDGRVTEQ LNPAGLSYTY QYEKDRITIT DSLDRREVLH TQGEAGLKRV VKKEHADGSV 421 TQSQFDAVGR LRAQTDAAGR TTEYSPDVVT GLITRITTPD GRASAFYYNH HSQLTSATGP 481 DGLEMRRKYD EYGRLIQETA PDGDITRYRY DNPHSDLPCA TDDATGSRKT MTWSRYGQLL 541 SFTDCSGYQT RYDHDRFGQM TAVHREEGLS QYRAYDSRGQ LIAVKDTQGH ETRYEYNIAG 601 DLTTVIAPDG SRNGTQYDAW GKAICTTQGG LTRSMEYDAA GRVIRLTSEN GSHTTFRYDV 661 LDRLIQETGF DGRTQRYHHD LTGKLIRSED EGLVTHWHYD EADRLTHRTV NGETAEQWQY 721 DERGWLTDIS HISKGHRVTV HYGYDEKGRL TGERQTVHHP QTEALLWQHE TRHAYNAQGL 781 ANRCIPDSLP AVEWLTYGSG YLAGMKLGDT PQVEYTRDRL HRETLRSFGR YELTTAYTPA 841 GQLQSQHLNS LLSDRDYTWN DNGELIRISS PRQTRSYSYS TTGRLTGVHT TAANLDIRIP 901 YATDPAGNRL PDPELHPDST LSMWPDNRIA RDAHYLYRYD RHGRLTEKTD LIPEGVIRTD 961 DERTHRYHYD SQHRLVHYTR TQYAEPLVES RYLYDPLGRR VAKRVWRRER DLTGWMSLSR 1021 KPQVTWYGWD GDRLTTIQND RSRIQTIYQP GSFTPLIRVE TATGELARTQ RRSLADALQQ 1081 SGGEDGGSVV FPPVLVQMLD RLESEILADR VSEESRRWLA SCGLTVEQMQ NQMDPVYTPA 1141 RKIHLYHCDH RGLPLALISK EGATAWCAEY DEWGNLLNEE NPHQLQQLIR LPGQQYDEES 1201 GLYYNRHRYY DPLQGRYITQ DPIGLKGGWN LYTYPLSPVN SMDPLGLYEF KSKNIDDIGI 1261 FALAMCNGES INENKEYGGL ICKKQGEYLP MNPISSNDND SVDLRNIKCP EGSERVGDYH 1321 THGFYSDDKG NKVTKENDVY DSLNFSSKDL TNSYMNGMEK KEYSSYLGTP NNTYLKYNPK 1381 AKGNGVTIIR QGSN //