LOCUS       BCA67199.1 1403 aa PRT BCT 06-NOV-2020
DEFINITION  Escherichia coli type IV secretion protein Rhs protein.
ACCESSION   AP022811-523
PROTEIN_ID  BCA67199.1
SOURCE      Escherichia coli
  ORGANISM  Escherichia coli
            Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia.
REFERENCE   1 (bases 1 to 5283470)
  AUTHORS   Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases.
            Contact: Ken-ichi Lee
            National Institute of Infectious Diseases, Department of Bacteriology I; 1-23-1, Toyama, Shinjuku, Tokyo 162-8640, Japan
REFERENCE   2
  AUTHORS   Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S.
  TITLE     Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli
  JOURNAL   Sci. Rep. 10, 11738 (2020)
  REMARK    Publication Status: Online-Only
            DOI:10.1038/s41598-020-68462-9
COMMENT     ##Genome-Assembly-Data-START##
            Assembly Method :: Unicycler v. 0.4.7
            Genome Coverage :: 117.0X
            Sequencing Technology :: Illumina MiSeq, PacBio RSII
            ##Genome-Assembly-Data-END##
FEATURES             Qualifiers
     source          /collection_date="1999"
                     /db_xref="taxon:562"
                     /host="Homo sapiens"
                     /mol_type="genomic DNA"
                     /organism="Escherichia coli"
                     /strain="JE86-ST02"
     protein         /gene="rhsC_2"
                     /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator"
                     /inference="similar to AA sequence:INSD:ABV18137.1"
                     /locus_tag="JE86ST02C_05230"
                     /note="DFAST-ECOLI:ABV18137.1 type IV secretion protein Rhs [pid:95.6%, q_cov:89.9%, s_cov:89.8%, Eval:0.0e+00]"
                     /note="MGA_524"
                     /transl_table=11
BEGIN
1 MSGKPAARQG DMTQYGGPIV QGSAGVRIGA PTGVACSVCP GGMTSGNPVN PLLGAKVQPG
61 ETDLALPGPL PFILSRTYSS YRTKTPAPVG VFGPGWKAPS DIRLQIRDDA LVLNDNGGRS
121 IHFEPLLPGE AVYSRSESMW LVRGGKAAQP DGHTLARLWG ALPPDIRLSP HLYLATNSAQ
181 GPWWILGWSE RVPGAEDVLP APLPPYRVLT GMADRFGRTL AYRREAAGDL AGEITGVTDG
241 AGREFRLVLT TQAQRAEEAR KPHTASLSSP DSPRPLSAPS FPDTLPGTEY GADSGIRLSA
301 VWLMHDPEYP ENLPAAPLVC YDWTPRGELA AVYDRSGTQM RHFTYDDKYR GRMVGHRYAG
361 RPEMRYRYDD AGRVVEQLNP AGLSYRYQYE QDRITVTDSL NRREVLHTEG GAGLKRVVKK
421 ELADGSVTHS GYDAAGRLTA QTDAAGRRTE YGLNVVSGDI TDITTPDGRE TKFYYNDGNQ
481 LTAVVYPDGL ESRREYDEPG RLVSETSRSG ETVRYRYDDA HSELPATTTD ATGSTRQMTW
541 SRYGQLLAFT DCSGYQTRYE YDRFGQMTAV HREEGISLYR HYDNRGRLTS VKDAQGRETQ
601 YEYNAAGDLT AVITPDGNRS ETQYDAWGKA VSTTQGGLTR SMEYDAAGRV ISLTNENGSH
661 SDFSYDALDR LVQQGGFDGR TQRYHYDLTG KLTQSEDEGL VTLWHYDASD RITHRTVNGD
721 PAEQWQYDGH GWLREISHLS EGHRVAVHYG YDDKGRLTGE RQTVENPETG ELLWQHETKH
781 AYNEQGLANR VTPDSLPPVE WLTYGSGYLA GMKLGGTPLV EYTRDRLHRE TVRSFGSMAG
841 SNAAYELTST YTPAGQLQSQ HLNSLVYDRD YGWNDNGDLV RISGPRQTRE YGYSATGRLE
901 SVRTLAPDLD IRIPYATDPA GNRLPDPELH PDSTLTAWPD NRIAEDAHYV YHYDEYGRLT
961 EKTDRIPTGV IRTDDERTHH YHYDSQHRLV FHTRIQHGEP LVESRYLYDP LGRRMAKRVW
1021 RRERDLTGWM SLSRKPEETW YGWDGDRLTT VQTDTTRIQT VYQPGSFAPL IRIETDNGER
1081 EKAQCRSLAE KLQQEGSEDG HGVVFPAELV GLLDRLEGEI RANCVSSESR QWLAQCGLTV
1141 ERLAAQIEPV YLPERKIHLY HCDHRGLPLA LISEDGNTAW SAEYDEWGNQ LNEENPHHLH
1201 QPYRLPGQQY DKESGLYYNR NRYYDPLQGR YITQDPIGLE GGWSLYAYPL NPVNGIDPLG
1261 LSPADVALIR RKDQLNHQRA WDILSDTYED MKRLNLGGTD QFFHCMAFCR VSKLNDAGVS
1321 RSAKGLGYEK EIRDYGLNLF GMYGRKVKLS HSEMIEDNKK DLAVNDHGLT CPSTTDCSDR
1381 CSDYINPEHK KTIKALQDAG YLK
//