LOCUS BCA73882.1 1402 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli type IV secretion protein Rhs protein. ACCESSION AP022815-1912 PROTEIN_ID BCA73882.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5327513) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 135.0X Sequencing Technology :: Illumina MiSeq, Oxford Nanopore MinION ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="2014" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST05" protein /gene="rhsC_3" /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /inference="similar to AA sequence:INSD:ABV18137.1" /locus_tag="JE86ST05C_19120" /note="DFAST-ECOLI:ABV18137.1 type IV secretion protein Rhs [pid:94.1%, q_cov:100.0%, s_cov:100.0%, Eval:0.0e+00]" /note="MGA_1913" /transl_table=11 BEGIN 1 MSGKPAARQG DMTQYGGPIV QGSAGVRIGA PTGVACSVCP GGMTSGNPVN PLLGAKVLPG 61 ETDLALPGPL PFILSRTYSS YRTKTPAPVG VFGPGWKAPS DIRLQLRDDG LILNDNGGRS 121 IHFEPLLPGE AVYSRSESMW LVRGGKAAQP DGHTLARLWG ALPPDIRLSP HLYLATNSAQ 181 GPWWILGWSE RVPGAEDVLP APLPPYRELT GLADRFGRTL TYRREAAGDL TGEITGVTDG 241 AGREFRLVLT TQAQRAEEAR TSSLSSSDSS RPLSASAFPD TLPGTEYGPD RGIRLSAVWL 301 MHDPAYPESL PAAPLVRYTY TEAGELLAVY DRSNTQVRAF TYDAQHPGRM VAHRYAGRPE 361 MRYRYDDAGR VVEQLNPAGL SYRYQYEQDR ITVTDSLNRR EVLHTEGGAG LKRVVKKELA 421 DGSVTHSGYD AAGRLTAQTD AAGRRTEYGL NVVSGDITDI TTPDGRETKF YYNDGNQLTA 481 VVSPDGLESR REYDEPGRLV SETSRSGETV RYRYDDAHSE LPATTTDATG STRQMTWSRY 541 GQLLAFTDCS GYQTRYEYDR FGQMTAVHRE EGISLYRHYD NRGRLTSVKD AQGRETQYEY 601 NAAGDLTAVI TPDGNRSETQ YDAWGKAVST TQGGLTRSME YDAAGRVISL TNENGSHSDF 661 SYDALDRLVQ QGGFDGRTQR YHYDLTGKLT QSEDEGLVTL WYYDESDRIT HRTVNGEPAE 721 QWQYDDHGWL TDISHLSEGH RVAVHYGYDD KGRLTGERQT VENPETGELL WQHETTHAYN 781 EQGLANRVTP DSLPPVEWLT YGSGYLAGMK LGDTPLLEYT RDRMHRETVR SFGSMAGSNA 841 AYKLTSTYTP AGQLQSQHLN SLVYDRDYGW NDNGDLVRIS GPRQTREYGY SATGRLESVR 901 TLAPDLDIRI PYATDPAGNR LPDPELHPDS TLTAWPDNRI AEDAHYVYHY DEYGRLTEKT 961 DLIPAGVIRT DDERTHHYHY DSQHRLVFYT RIQHGEPLVE SRYLYDPLGR RMAKRVWRRE 1021 RDLTGWMSLS RKPEMTWYGW DGDRLTTVQT DTTRIQTVYQ PGSFAPLIRI ETDNGEREKA 1081 QRRSLAEKLQ QEGSEDGHGV VFPAELVRLL DRLEEEIRAD RVSSESRAWL AQCGLTVEQL 1141 ARQVEPEYTP ARKVHFYHCD HRGLPLALIS EDGNTAWRGE YDEWGNQLNE ENPYYLHQPY 1201 RLPGQQHDEE SGLYYNRNRY YDPLQGRYIT QDPIGLAGGW NLYNYPLNPI IRMDPLGLYN 1261 LYQLLYDVWH DDSYGTSSID ITGSGDLISL GGHAGLGVAF AKKKGEMLSD ICIYATACGH 1321 AGIGGGINAA ITYSETKSLP TSGVSNSVGV TVGGGVGGHF AYTYVVDVDN PESSTESVGI 1381 GAGVDASVMT LACRTWQECW VN //