LOCUS       BCA66906.1              1417 aa            PRT       BCT 06-NOV-2020
DEFINITION  Escherichia coli type IV secretion protein Rhs protein.
ACCESSION   AP022811-230
PROTEIN_ID  BCA66906.1
SOURCE      Escherichia coli
  ORGANISM  Escherichia coli
            Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales;
            Enterobacteriaceae; Escherichia.
REFERENCE   1  (bases 1 to 5283470)
  AUTHORS   Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases.
            Contact:Ken-ichi Lee
            National Institute of Infectious Diseases, Department of
            Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan
REFERENCE   2
  AUTHORS   Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S.
  TITLE     Global distribution of epidemic-related Shiga toxin 2 encoding
            phages among enteroaggregative Escherichia coli
  JOURNAL   Sci. Rep. 10, 11738 (2020)
  REMARK    Publication Status: Online-Only
            DOI:10.1038/s41598-020-68462-9
COMMENT     ##Genome-Assembly-Data-START##
            Assembly Method       :: Unicycler v. 0.4.7
            Genome Coverage       :: 117.0X
            Sequencing Technology :: Illumina MiSeq, PacBio RSII
            ##Genome-Assembly-Data-END##
FEATURES             Qualifiers
     source
                     /collection_date="1999"
                     /db_xref="taxon:562"
                     /host="Homo sapiens"
                     /mol_type="genomic DNA"
                     /organism="Escherichia coli"
                     /strain="JE86-ST02"
     protein
                     /gene="rhsC_1"
                     /inference="COORDINATES:ab initio
                     prediction:MetaGeneAnnotator"
                     /inference="similar to AA sequence:INSD:ABV04635.1"
                     /locus_tag="JE86ST02C_02300"
                     /note="DFAST-ECOLI:ABV04635.1 type IV secretion protein
                     Rhs [pid:98.8%, q_cov:100.0%, s_cov:100.0%, Eval:0.0e+00]"
                     /note="MGA_231"
                     /transl_table=11
BEGIN
        1 MGGKPAARQG DMTRKGLDIV QGSAGVLIGA PTGVACSVCP GGITYANPVN PLLGAKVLPG
       61 ETDLALPGPL PFILSRAYSS YRTRTPAPVG VFGPGWKAPF DIRLQIRDEG LILNDSGGRS
      121 IHFEPLFPGE ISYSRSESLW LARGGVAAQH SSQPLSALWQ VLPEDVRLSP HVYLATNSLQ
      181 GPWWILSWPE RVPGADEVLP PEPPAYRVLT GVVDGFGRTL AFHRAAEGDV AGAVTGVTDG
      241 AGRRFHLVLT TQAQRAEVFR KQRATSLSSP AGPRSASSSL VFPDTLPAGT GYGTDNGIRL
      301 EAVWLTHDPA YPDEQPTAPL ARYTYTAGGE LRAVYDRSGT QMRGFTYDAE HAGRMVAHHY
      361 AGRPESCYRY DDTGRVTEQV NPEGLDYRFE YGESRVIITD SLNRREVLYT EGEGGLKRVV
      421 KKEHADGSIT RSEYDEAGRL KAQTDAAGRR TEYSLHMASG AVTAVTGPDG RTVRYGYNSQ
      481 RQVTSVTYPD GLRSSREYDE KGRLTAETSR SGETTRYSYD DPASELPTGI QDATGSTKQM
      541 AWSRYGQLLA FTDCSGYTTR YEYDRYGQQI AVHREEGIST YSSYNPRGQL VSQKDAQGRE
      601 IRYEYSAAGD LTATISPDGK RSTIEYDKRG RAVSVTEGGL TRSMGYDAAG RITVLTNENG
      661 SQSTFRYDPV DRLTEQRGFD GRTQRYHYDL TGKLTQSEDE GLITLWHYDA SDRITHRTVN
      721 GDPAEQWQYD EHGWLTTLSH TCEGHRVSVH YGYDDKGRLT GERQTVENPE TGEMLWEHET
      781 GHAYSEQGLA TRQEPDGLPP VEWLTYGSGY LAGMKLGGTP LVEYTRDRLH RETARSFGGA
      841 GSTAGYEQAT AYTLTVQLQS RHLNLPQLDC DYTWNDNGQL VRISGPQECR EYRYSGTGRL
      901 TGVHTTAANL DIDIPYATDP AGNRLPDPEL HPDSTLTAWP DNRIAEDAHY VYRYDEYGRL
      961 AEKTDRIPEG VIRMHDERTH HYHYDSQHRL VFYTRIQHGE PQVESRYLYD PLGRRTGKRV
     1021 WRRERDLTGW MSLSRKPEET WYGWDGDRLT TVQTQQTRIQ TVYQPGSFTP LLRIETENGE
     1081 QAKARHRSLA EVLQEDTGVT LPAELAVMLG RLERELRQGS VSEESQQWLA QCGLTAEQMA
     1141 AQLEAEYIPE RKLHLYHCDH RGLPLALISP EGETAWQGEY DEWGNLLGEE SAQHLQQSLR
     1201 LPGQQYDEES GLYYNRNRYY DPLQGRYITQ DPIGLRGEWN LYKYPLNPVR FIDSLGLKFH
     1261 VNGDPSDFNQ AVEYLKQDSQ MKETIDFLSS SEETINIEYI EGTNVRFNSN NMAIYWNSRA
     1321 SLFCSTELNS KSQSPALGLG HEFAHAQYYL LDKENFMALL SRTDKKYENK EEARVITIIE
     1381 SRAAKTLGEC TRGAHSGLPF YRVDGPLQTM KITGTPE
//