LOCUS AL391753 213494 bp DNA linear BCT 16-APR-2005 DEFINITION Shigella flexneri virulence plasmid pWR100: from 1 to 213494. ACCESSION AL391753 VERSION AL391753.1 KEYWORDS virulence plasmid, type III secretion. SOURCE Shigella flexneri ORGANISM Shigella flexneri Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Shigella. REFERENCE 1 (bases 1 to 213494) AUTHORS Buchrieser C., Glaser P., Rusniok C., Nedjari H., d'Hauteville H., Kunst F., Sansonetti P., Parsot C. TITLE The virulence plasmid pWR100 and the repertoire of proteins secreted by the type III secretion apparatus of Shigella flexneri JOURNAL Mol. Microbiol. 38(4), 760-771(2000). PUBMED 11115111 REFERENCE 2 (bases 1 to 213494) AUTHORS Glaser P., Buchrieser C., Rusniok C., Nedjari H., d'Hauteville H., Kunst F., Sansonetti P., Parsot C. JOURNAL Submitted (29-AUG-2000) to the INSDC. P. Glaser, Institut Pasteur, Genomique des Microorganismes Pathogenes, 25 rue du Docteur Roux, 75724 Paris Cedex 15, FRANCE. E-mail: pglaser@pasteur.fr Phone: +33 1 45 68 89 96, Fax: +33 (0)1 45 68 87 46 FEATURES Location/Qualifiers source 1..213494 /organism="Shigella flexneri" /strain="M90T" /mol_type="genomic DNA" /db_xref="taxon:623" CDS 991..1938 /transl_table=11 /gene="icsP (sopA)" /product="IcsP (SopA), outermembrane protease of the OmpP family, involved in cleavage of surface exposed IcsA" /note="ORF1, IcsP (SopA), length= 315 aa, id to SFU73461_1 and AF0016 33_1" /db_xref="GOA:O33641" /db_xref="InterPro:IPR000036" /db_xref="InterPro:IPR020079" /db_xref="InterPro:IPR020080" /db_xref="UniProtKB/Swiss-Prot:O33641" /protein_id="CAC05769.2" /translation="MKLKFFVLALCVPAIFTTHATTNYPLFIPDNISTDISLGSLSGK TKERVYHPKEGGRKISQLDWKYSNATIVRGGIDWKLIPKVSFGVSGWTTLGNQKASMV DKDWNNSNTPQVWTDQSWHPNTHLRDANEFELNLKGWLLNNLDYRLGLIAGYQESRYS FNAMGGSYIYSENGGSRNKKGAHPSGERTIGYKQLFKIPYIGLTANYRHENFEFGAEL KYSGWVLSSDTDKHYQTETIFKDEIKNQNYCSVAANIGYYVTPSAKFYIEGSRNYISN KKGDTSLYEQSTNISGTIKNSASIEYIGFLTSAGIKYIF" misc_feature complement(1975..2128) /note="IS629.01, 95% id over 154 nt with IS629, from 1 to 154" misc_feature 2489..2873 /note="IS2.01, 96% id over 385 nt with IS2, from 1 to 393" CDS 3485..4351 /transl_table=11 /gene="ospB" /product="OspB, protein secreted by the Mxi-Spa secretion machinery, function unknown" /note="ORF4, length= 288 aa" /db_xref="UniProtKB/TrEMBL:Q99Q23" /protein_id="CAC05770.1" /translation="MNLDGVRPYCRIVNKKNESISDIAFAHIIKRVKNSSCTHPKAAL VFLGEKGFCDSNDVLSIMGQQIPRVFKNKMLYDYVFKNEKSKNDFLKMAESWLPQSEP IVINNDDDALNAAAYFSVKKAKIKTVNDTDFKEYNKVYILGHGSPGSHQLGLGSELID VQTIISRMKDCGILNVKDIRFTSCGSADKVAPKNFNNAPAESLSCILNSLPFFKEKES LLEQIKKHLENDESLSDGLKISGYHGYGVHYGQELFPYSHYRSTSIPADPEHTVKRSS QKKTFIINKELD" CDS 4680..5420 /transl_table=11 /gene="phoN2 (apy)" /product="PhoN2 (Apy), periplasmic phosphatase, apyrase, ATP diphosphohydrolase" /note="ORF5, PhoN2 (Apy), length= 246 aa, id to SFU04539_1" /db_xref="GOA:Q99QG5" /db_xref="InterPro:IPR000326" /db_xref="InterPro:IPR001011" /db_xref="InterPro:IPR036938" /db_xref="UniProtKB/TrEMBL:Q99QG5" /protein_id="CAC05771.1" /translation="MKTKNFLLFCIATNMIFIPSANALKAEGFLTQQTSPDSLSILPP PPAEDSVVFLADKAHYEFGRSLRDANRVRLASEDAYYENFGLAFSDAYGMDISRENTP ILYQLLTQVLQDSHDYAVRNAKEYYKRVRPFVIYKDATCTPDKDEKMAITGSYPSGHA SFGWAVALILAEINPQRKAEILRRGYEFGESRVICGAHWQSDVEAGRLMGASVVAVLH NTPEFTKSLSEAKKEFEELNTPTNELTP" misc_feature 5527..5657 /note="IS629.02, 93% id over 130 nt with IS629, from 1180 to 0" misc_feature complement(5782..7051) /note="ospC4 gene, inactivated by frameshits" misc_feature 7573..7602 /note="IS630.01, 93% id over 30 nt with IS630, from 1 to 30" misc_feature complement(7603..8111) /note="ISShf5.01, fragment of the putative ISShf5, from 1220 to 1732" misc_feature complement(8133..8875) /note="IS600.01, 91% id over 743 nt with IS600, from 1 to 743" CDS complement(9501..11210) /transl_table=11 /gene="ospD2" /product="OspD2, probably secreted by the Mxi-Spa secretion machinery, function unknown" /note="ORF10, length= 569 aa, 39.7% id in 537 aa overlap with ECSHET2B_1 (S. flexneri SenA 565 aa)" /db_xref="InterPro:IPR012927" /db_xref="InterPro:IPR036770" /db_xref="UniProtKB/TrEMBL:Q9AJW9" /protein_id="CAC05772.1" /translation="MPLNKTFSSSIFSTKNSLSTDMSVNRDNRTITSSIMRVSNSSEL IQFKNKTAPYFSEKRNVKVNINGVAKDIYGRQIVCRHLASYWEMNFMETNGKVNYQLL STPDAIAKNVCLEKTEDFSKSPAYIYFVENKKWGTVITNFFYNMKKNGDFVRTLSACT LNHQMALGLKIKRVQESEKWVVQFFDPNRTVTHKRTVFTCDSHFELSQLSAKDFFDDF YWKIYGLEQPGQVIFEDRHNSPLTNTVKLLPDELINSRVIYHAITKNLTEVLFILMEK YKNGEISQSKLVNLLATRSSDGTPAFYIALQNGYSDIIQVYGKILNMCNLSQETILTL LAAVGANNVPGLCMSFMNGHVDTIKAYGEIVFKTPLTSDKRLYLLAAKDSHDLPGLFF ALQNGHADSIRMFGSLLNKKMLSSEQIKELLKVKHGLFMALQNGHTKAIMAYGDILKI LPPHQEYIDELLWIKNPNGTSGLFMAFYNGHTETIRAFCNILKNYSFTTRRLVEMLSA TNKDGIPGVFVSVVNRDKETILEYCRIIKENNLEPDTIAEQFSKKMKKTFIEIINRFN HFL" CDS complement(11642..12361) /transl_table=11 /gene="ospF" /product="ospF, secreted by the Mxi-Spa secretion machinery, function unknown" /note="ORF12, length= 239 aa, 63.3% id in 245 aa overlap with STMKFA_1 (Salmonella typhimurium virulence plasmid MkfA = 241 aa)" /db_xref="GOA:Q8VSP9" /db_xref="InterPro:IPR003519" /db_xref="InterPro:IPR038498" /db_xref="PDB:3I0U" /db_xref="UniProtKB/Swiss-Prot:Q8VSP9" /protein_id="CAC05773.1" /translation="MPIKKPCLKLNLDSLNVVKSEIPQMLSANERLKNNFNILYNQIR QYPAYYFKVASNVPTYSDICQFFSVMYQGFQIVNHSGDVFIHACRENPQSKGDFVGDK FHISIAREQVPLAFQILSGLLFSEDSPIDKWKITDMNRVSQQSRVGIGAQFTLYVKSD QECSQYSALLLHKIRQFIMCLESNLLRSKIAPGEYPASDVRPEDWKYVSYRNELRSDR DGSERQEQMLREEPFYRLMIE" CDS complement(13393..13971) /transl_table=11 /note="ORF13, length= 192 aa, unknown" /db_xref="UniProtKB/TrEMBL:Q99Q33" /protein_id="CAC05774.1" /translation="MKVSFKSLGYIFHDIYNKKHTIDEFNDVVRKAVLSGKINELNAC HKVAIFLAEKDNEITKKDKAKIIDTLTENYSIEFQQLMNISERTLNSSLYITPGESGF VSFVNREGKICHTAYVKSSDNSMTYYHANGSSIDKYITDMCGLICMRHIESTGIIFYM LDEKVLSAIAEFMNEKGWRAAFCSAKNLYKCV" misc_feature 14014..14241 /note="IS2.02, 90% id over 228 nt with IS2, from 4 to 230" misc_feature complement(14367..14629) /note="IS21.01, 97% id over 263 nt with IS21, from 1825 to 2098" misc_feature complement(14630..15558) /note="IS91.01, 97% id over 929 nt with IS91, from 191 to 0" misc_feature 15559..15924 /note="ISShf5.02a, id to ISShf5 from 1 to 366" misc_feature complement(15925..17350) /note="IS4.01, 99% id over 1426 nt with IS4, from 1 to 0" misc_feature 17351..19039 /note="ISSfh5.02b, id over 1689 nt with ISShf5, from 335 to 0" misc_feature 19326..19523 /note="IS3.01, 88% id over 198 nt with IS3, from 1062 to 0" misc_feature 20459..20601 /note="IS630.02, 84% id over 143 nt with IS630, from 999 to 1141" CDS 20964..21641 /transl_table=11 /gene="ospD1" /product="OspD1, secreted by the Mxi-Spa secretion machinery, function unknown" /note="ORF21, length = 225 aa, 34.3% id in 140 aa overlap with ECSHET2B_1 (S.flexneri SenA = 565 aa)" /db_xref="UniProtKB/TrEMBL:Q99QN6" /protein_id="CAC05775.1" /translation="MSINNYGLHPANNKNMHLIIGSNTANENKGMKNNIINVTNTAIS HAINEEKSGGGYSGVSFRKLAKIQNISIPTKNNKEYNRHNLFSLIWHGNADAARKYSE SLLAAEIPKEEKLEVLAARNNAGESALFIALQEGHSAAIQAYGDFIKTFDLSPKETIK LLDVRDNEGLPGLFLAAGKGNIEAMMAYINICHHSGIKLTEIADRLNNNEQDMFNIIS DKIQELF" misc_feature 21977..22105 /note="IS3.02, 89% id over 129 nt with IS3, from 1130 to 0" CDS 22153..22467 /transl_table=11 /note="ORF22,length = 104 aa, unknown" /db_xref="UniProtKB/TrEMBL:Q9AJW8" /protein_id="CAC05776.1" /translation="MLQRQRGKVGFAQLPVDFVAIEPDSVQGVGKRANLTNRCFIIRI NDSFKKRQGFIEFISNSGSGHTVTVYTKRRFQRGVFMNSLNTNVVKPVMYRLRSSADA RP" CDS 22749..23315 /transl_table=11 /gene="ipgB2" /product="IpgB2, probably secreted by the Mxi-Spa secretion machinery" /note="ORF23, length = 188 aa, 43.2% id in 183 aa overlap with AB016764_2 (Escherichia coli EvcA = 196 aa; virulence-specific chaperone-like protein), 28.1% id in 146 aa overlap with L0028 (E. coli pathogenicity island = 203 aa), and 28% id with IpgB1 from the S. flexneri virulence plasmid" /db_xref="GOA:Q9AJW7" /db_xref="InterPro:IPR004959" /db_xref="PDB:3LW8" /db_xref="PDB:3LWN" /db_xref="PDB:3LXR" /db_xref="PDB:3LYQ" /db_xref="UniProtKB/TrEMBL:Q9AJW7" /protein_id="CAC05777.1" /translation="MLGTSFNNFGISLSHKRYFSGKVDEIIRCTMGKRIVKISSTKIN TSILSSVSEQIGENITDWKNDEKKVYVSRVVNQCIDKFCAEHSRKIGDNLRKQIFKQV EKDYRISLDINAAQSSINHLVSGSSYFKKKMDELCEGMNRSVKNDTTSNVANLISDQF FEKNVQYIDLKKLRGNMSDYITNLESPF" misc_feature 23490..23821 /note="ISShf6.01, 94% id over 332 nt with ISShf6, from 1 to 332" misc_feature 23822..24426 /note="IS3.03, 95% id over 605 nt with IS3, from 1 to 0" misc_feature 24682..25096 /note="IS1N.01a, 88% id over 415 nt with IS1N, from 5 to 419" misc_feature 25097..25860 /note="IS100.01a, 99% id over 764 nt with IS100, from 1 to 764" misc_feature complement(25861..27161) /note="ISShf1.01, reference sequence for ISSHf1, from 1 to 0" misc_feature 27162..28359 /note="IS100.01b, 98% id over 1198 nt with IS100, from 756 to 0" misc_feature 28360..28712 /note="IS1.01b, 81% id over 353 nt with IS1N, from 414 to 0" CDS 29020..30219 /transl_table=11 /gene="parA" /note="ORF29, ParA, length = 399 aa; 75.1% id in 398 aa overlap with ECP1PARA_1 (E. coli bacteriophage P1 partition protein ParA = 398 aa)" /db_xref="InterPro:IPR025669" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR041250" /db_xref="UniProtKB/TrEMBL:Q99Q38" /protein_id="CAC05778.1" /translation="MTSFEQLSKVAQRADKMLLALTKQIQEQKQEFQADVFYQVYSKS AVAKLPKLTRASVDGAVGEMEAQGYQFEKRPAGTATKYALTIQNIIDIYAHRGIPKYR DRYSEAYSIFIGSLKGGVSKTVSSVSVAHALRAHPHLLSEDLRILLLDLDPQSSATMF LNYLHAVGLVDTTAPQAMLQNVSREELLEDFIVPSVIPGVYVMPASIDDAFIASNWDT LCEEHLLGQNKHAILRENIIDKLKHDFDFILIDTGPHLDAFLKNAIAAADIMFTPVPP AQVDFHSTLKYLARLPELVQIIEQDGCSCRLQANIGFMSKLANKSDHKYCHSLTKEIF GGDMLDVSMPRLDGFERSGESFDTVISANPVTYVGSGEALKNARMAAEDFAKAVFDRI EFIRANY" CDS 30219..31199 /transl_table=11 /gene="parB" /note="ORF30, ParB, length = 326 aa; 58.3% id in 314 aa overlap with P07621|PARB_ECOLI (bacteriophage P1 partition protein ParB = 343 aa)" /db_xref="GOA:Q99QN9" /db_xref="InterPro:IPR003115" /db_xref="InterPro:IPR004437" /db_xref="InterPro:IPR014884" /db_xref="UniProtKB/TrEMBL:Q99QN9" /protein_id="CAC05779.1" /translation="MENRKHRPTIGRTLNTNILNNTEEISAPVHVFTLNTGRKAKFTE IKVDHDKVDTQTFVVEEVNGREQTALTPDSLKDITRTIRLQQFYPCIGIRTGDLIEIL DGSRRRAAALLCKVGLRVLVTDDELTVSEAQHLAKDLQTSLEHNIREIGLRLVRLKEA GMNQKQIAEREGLSAAKVTRALQAASVPKDFVSLFPVQSELTYADYRQLAELSERLRL GDISIDEVVKNISPSIELITADDNLSEDEVKNSIMRLITKEMSSLLDSGVKDKAVVTL LWKFDSKDKFARKRVKGRTFSYEFGRLPLEVQDKLDRMIALVLKDNLNSL" misc_feature 31455..31817 /note="probable remnant of an IS element, contains an ORF that shares similarities to members of the IS1111 A/IS1328/IS1533 family of transposases (387 aa), uncharacterized" misc_feature 31933..32616 /note="ISShf6.02, id over 684 nt with ISShf6, from 1 to 684" misc_feature complement(32616..33404) /note="probable remnant of an IS element, contains an ORF that shares similarities with reverse transcriptases, uncharacterized" misc_feature complement(33931..34103) /note="ISShf5.03, 100% id over 173 nt with ISShf5, from 1798 to 1970" misc_feature complement(34104..35409) /note=" IS629.03, 98% id over 1306 nt with IS629, from 1 to 0" misc_feature complement(35410..35704) /note="IS150.01a, 97% id over 295 nt with IS150, from 804 to 1098" misc_feature 35705..37393 /note="IS1294.01, 94% id over 1689 nt with IS1294, from 1 to 0" misc_feature complement(37394..38195) /note="IS150.01b, 98% id over 802 nt with IS150, from 1 to 802" misc_feature 38196..38509 /note="IS100.02, 100% id over 314 nt with IS100, from 1173 to 1486" CDS complement(38511..39299) /transl_table=11 /gene="virF" /product="VirF, member of the AraC family of transcriptional activators, required for transcription of virB and icsA" /note="ORF38, VirF, length = 262 aa, id to X58464|SDVIRF_1 and Q04248|VIRF_SHIDY (S.dysenteriae), and X16661|SFVIRFP_1 (S. flexneri)" /db_xref="GOA:P0A2T1" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR018060" /db_xref="InterPro:IPR018062" /db_xref="InterPro:IPR020449" /db_xref="UniProtKB/Swiss-Prot:P0A2T1" /protein_id="CAC05780.1" /translation="MMDMGHKNKIDIKVRLHNYIILYAKRCSMTVSSGNETLTIDEGQ IAFIERNIQINVSIKKSDSINPFEIISLDRNLLLSIIRIMEPIYSFQHSYSEEKRGLN KKIFLLSEEEVSIDLFKSIKEMPFGKRKIYSLACLLSAVSDEEALYTSISIASSLSFS DQIRKIVEKNIEKRWRLSDISNNLNLSEIAVRKRLESEKLTFQQILLDIRMHHAAKLL LNSQSYINDVSRLIGISSPSYFIRKFNEYYGITPKKFYLYHKKF" misc_feature complement(40025..40103) /note="IS2/03, 86% id over 79 nt with IS2, from 65 to 143" CDS 40311..40577 /transl_table=11 /gene="ospE2" /note="ORF40, length = 88 aa, secreted by the Mxi-Spa machinery, function unknown" /db_xref="UniProtKB/TrEMBL:Q9AJW6" /protein_id="CAC05781.1" /translation="MLTQTIFPCLPQKQENIILEVSNPVLLSSTVTTDGYTVFNKKAA IYELQIPAANRTKTLKFTATEMQWLTKINEAGIDEKQSQRYSDF" misc_feature 40712..41479 /note="IS1F, 98% id over 768 nts with IS1F, from 1 to 0" misc_feature complement(41702..43152) /note="ISShf4.01, reference sequence for ISShf4, from 1 to 0" CDS 43257..44948 /transl_table=11 /gene="ipaH2.5" /product="IpaH2.5, member of the IpaH family, probably secreted by the Mxi-Spa machinery, function unknown" /note="ORF43, IpaH2.5, length = 563 aa" /db_xref="GOA:Q99Q42" /db_xref="InterPro:IPR001611" /db_xref="InterPro:IPR029487" /db_xref="InterPro:IPR032674" /db_xref="InterPro:IPR032675" /db_xref="UniProtKB/TrEMBL:Q99Q42" /protein_id="CAC05782.1" /translation="MIKSTNIQVIGSGIMHQINNIHSLTLFSLPVSLSPSCNEYYLKV WSEWEKNGTPGEQRNIAFNRLKICLQNQEAELNLSELDLKTLPDLPPQITTLEIRKNL LTHLPDLPPMLKVIHAQFNQLESLPALPETLEELNAGDNKIKELPFLPENLTHLRVHN NRLHILPLLPPELKLLVVSGNRLDSIPPFPDKLEGLALANNFIEQLPELPFSMNRAVL MNNNLTTLPESVLRLAQNAFVNVAGNPLSGHTMRTLQQITTGPDYSGPRIFFSMGNSA TISAPEHSLADAVTAWFPENKQSDVSQIWHAFEHEEHANTFSAFLDRLSDTVSARNTS GFREQVAAWLEKLSASAELRQQSFAVAADATESCEDRVALTWNNLRKTLLVHQASEGL FDNDTGALLSLGREMFRLEILEDIARDKVRTLHFVDEIEVYLAFQTMLAEKLQLSTAV KEMRFYGVSGVTANDLRTAEAMVRSREENEFTDWFSLWGPWHAVLKRTEADRWAQAEE QKYEMLENEYSQRVADRLKASGLSGDADAEREAGAQVMRETEQQIYRQLTDEVLA" misc_feature 44946..46146 /note="IS629.04, 98% id over 1204 nt with IS629, from 1 to 0" misc_feature complement(46147..46734) /note="IS600.02, 93% id over 588 nt with IS600, from 1 to 589" CDS complement(46738..47220) /transl_table=11 /note="ORF47, length = 160 aa, 47.3% identity in 148 aa overla with MTY21C12_13 Mycobacterium tuberculosis H37Rv hypothetical protein (166 aa)" /db_xref="GOA:Q99Q62" /db_xref="InterPro:IPR000182" /db_xref="InterPro:IPR016181" /db_xref="UniProtKB/TrEMBL:Q99Q62" /protein_id="CAC05783.1" /translation="MEINVTAPALLTDEHILQPFDCGNEVLSNWLRGRAMKNQMLNAS RTFVICLEDTLRIVGYYSLATGSVTHAELGRSLRHNMPNPVPVVLLGRLAVDVCTQGH GFGKWLLSDAIHRVVNLADQVGIKAVMVHAIDDDARAFYERFGFVQSVVAPNTLFYKV " CDS complement(47211..47513) /transl_table=11 /note="ORF48, length = 100 aa, 32.9% id in 85 aa overlap with MTY21C12_12 Mycobacterium tuberculosis hypothetical protein Rv0918 (158 aa)" /db_xref="GOA:Q99Q00" /db_xref="InterPro:IPR010985" /db_xref="InterPro:IPR014795" /db_xref="UniProtKB/TrEMBL:Q99Q00" /protein_id="CAC05784.1" /translation="MEVFMSTAASVRKTPREHQINIRATDEERAVIDYAASLVNKNRT DFIMELAYQEAKNIILDQRLFVLDNERYDSFITQLEAPVQNAEGRERLMAVKPEWK" misc_feature complement(47705..49418) /note="IS1294.02, 95% id over 1714 nt with IS1294, from 1 to 0" misc_feature complement(49419..49561) /note="ISShf7.01, 99% id over 143 nt with ISShf7, from 1 to 143" CDS complement(49695..51149) /transl_table=11 /gene="ospC2" /product="OspC2, probably secreted by the Mxi-Spa secretion machinery, function unknown" /note="ORF50, length = 484 aa" /db_xref="InterPro:IPR010366" /db_xref="UniProtKB/TrEMBL:Q99QI8" /protein_id="CAC05785.1" /translation="MKIPEAVNHINVQNNIDLVDGKINPNKDTKALQKNISCVTNSSS SGISEKHLDHCADTVKSFLRKSIAAQSYSKMFSQGTSFKSLNLSIEAPSGARSSFRSL EHLDKVSRHYLSEIIQKTHPLSSDERHLLSIIINSDFNFRHQSNANLSNNTLNIKSFD KIKSENIQTYKNTFSEDIEEIANHDFVFFGVEISNHQETLPLNKTHHTVDFGANAYII DHDSPYGYMTLTDHFDNAIPPVFYHEHQSFFLDNFKEVVDEVSRYVHGNQGKTDVPIF NTKDMRLGIGLHLIDFIRKSKDQRFREFCYNKNIDPVSLDRIINFVFQPEYHIPRMLS TDNFKKIRLRDISLEDAIKASNYEEINNKVTDKKMAHQALAYSLGNAKSDMALYLLSK FNFTKQDVAEMEKMNNNMYCELYDVEYLLSEDSANYKVLEYFINNGLVDVNKRFQKAN SGDTMLDNAMKSKDSKTIDFLLKNGAVSGKRFGR" misc_feature 51489..52167 /note="IS1F.02, 98% id over 679 nt with IS1F, from 1 to 679" misc_feature 52168..53368 /note="IS629.05, 98% id over 1201 nt with IS629, from 1 to 0" misc_feature 53376..53777 /note="probable remnant of an IS element, contains an ORF that shares sequence similarities with transposases, uncharacterized" misc_feature 53815..54490 /note="remnant of an IS element, contains an ORF that shares similarities with transposases, uncharacterized" CDS complement(54594..58688) /transl_table=11 /gene="sepA" /product="SepA, extracellular serine protease of the IgA1 protease family, secreted by a C-terminal autotransporter domain" /note="ORF55, SepA, length = 1364 aa, id to SFSEPAGN_1" /db_xref="GOA:Q8VSL2" /db_xref="InterPro:IPR000710" /db_xref="InterPro:IPR005546" /db_xref="InterPro:IPR006315" /db_xref="InterPro:IPR009003" /db_xref="InterPro:IPR011050" /db_xref="InterPro:IPR012332" /db_xref="InterPro:IPR024973" /db_xref="InterPro:IPR030396" /db_xref="InterPro:IPR036709" /db_xref="PDB:5J44" /db_xref="UniProtKB/Swiss-Prot:Q8VSL2" /protein_id="CAC05786.1" /translation="MNKIYYLKYCHITKSLIAVSELARRVTCKSHRRLSRRVILTSVA ALSLSSAWPALSATVSAEIPYQIFRDFAENKGQFTPGTTNISIYDKQGNLVGKLDKAP MADFSSATITTGSLPPGDHTLYSPQYVVTAKHVSGSDTMSFGYAKNTYTAVGTNNNSG LDIKTRRLSKLVTEVAPAEVSDIGAVSGAYQAGGRFTEFYRLGGGMQYVKDKNGNRTQ VYTNGGFLVGGTVSALNSYNNGQMITAQTGDIFNPANGPLANYLNMGDSGSPLFAYDS LQKKWVLIGVLSSGTNYGNNWVVTTQDFLGQQPQNDFDKTIAYTSGEGVLQWKYDAAN GTGTLTQGNTTWDMHGKKGNDLNAGKNLLFTGNNGEVVLQNSVNQGAGYLQFAGDYRV SALNGQTWMGGGIITDKGTHVLWQVNGVAGDNLHKTGEGTLTVNGTGVNAGGLKVGDG TVILNQQADADGKVQAFSSVGIASGRPTVVLSDSQQVNPDNISWGYRGGRLELNGNNL TFTRLQAADYGAIITNNSEKKSTVTLDLQTLKASDINVPVNTVSIFGGRGAPGDLYYD SSTKQYFILKASSYSPFFSDLNNSSVWQNVGKDHNKAIDTVKQQKIEASSQPYMYHGQ LNGNMDVNIPQLSGKDVLALDGSVNLPEGSITKKSGTLIFQGHPVIHAGTTTSSSQSD WETRQFTLEKLKLDAATFHLSRNGKMQGDINATNGSTVILGSSRVFTDRSDGTGNAVS SVEGSATATTVGDQSDYSGNVTLENKSSLQIMERFTGGIEAYDSTVSVTSQNAVFDRV GSFVNSSLTLGKGAKLTAQSGIFSTGAVDVKENASLTLTGMPSAQKQGYYSPVISTTE GINLEDNASFSVKNMGYLSSDIHAGTTAATINLGDSDADAGKTDSPLFSSLMKGYNAV LRGSITGAQSTVNMINALWYSDGKSEAGALKAKGSRIELGDGKHFATLQVKELSADNT TFLMHTNNSRADQLNVTDKLSGSNNSVLVDFLNKPASEMSVTLITAPKGSDEKTFTAG TQQIGFSNVTPVISTEKTDDATKWVLTGYQTTADAGASKAAKDFMASGYKSFLTEVNN LNKRMGDLRDTQGDAGVWARIMNGTGSADGDYSDNYTHVQIGVDRKHELDGVDLFTGA LLTYTDSNASSHAFSGKNKSVGGGLYASALFNSGAYFDLIGKYLHHDNQHTANFASLG TKDYSSHSWYAGAEVGYRYHLTKESWVEPQIELVYGSVSGKAFSWEDRGMALSMKDKD YNPLIGRTGVDVGRAFSGDDWKITARAGLGYQFDLLANGETVLQDASGEKRFEGEKDS RMLMTVGMNAEIKDNMRLGLELEKSAFGKYNVDNAINANFRYVF" misc_feature complement(58971..60659) /note="IS1294.03, 94% id over 1689 nt with IS1294, from 1 to 0" misc_feature complement(60660..61521) /note="ISShf7.02, 98% id over 862 nt with ISShf7, from 3 to 864" misc_feature 61522..62674 /note="IS630.03, 99% id over 1153 nt with IS630, from 1 to 0" misc_feature 62835..63548 /note="IS1N.02, 80% id over 713 nt with IS1N, from 44 to 759" CDS 64062..65759 /transl_table=11 /gene="ipaH7.8" /product="IpaH7.8, member of the IpaH family, secreted by the Mxi-Spa secretion machinery, function unknown" /note="ORF64, IpaH7.8, length = 565 aa, frameshift in sequence reported for IPA7_SHIFL" /db_xref="GOA:P18014" /db_xref="InterPro:IPR001611" /db_xref="InterPro:IPR029487" /db_xref="InterPro:IPR032674" /db_xref="InterPro:IPR032675" /db_xref="PDB:3CKD" /db_xref="UniProtKB/Swiss-Prot:P18014" /protein_id="CAC05787.1" /translation="MFSVNNTHSSVSCSPSINSNSTSNEHYLRILTEWEKNSSPGEER GIAFNRLSQCFQNQEAVLNLSDLNLTSLPELPKHISALIVENNKLTSLPKLPAFLKEL NADNNRLSVIPELPESLTTLSVRSNQLENLPVLPNHLTSLFVENNRLYNLPALPEKLK FLHVYYNRLTTLPDLPDKLEILCAQRNNLVTFPQFSDRNNIRQKEYYFHFNQITTLPE SFSQLDSSYRINISGNPLSTRVLQSLQRLTSSPDYHGPQIYFSMSDGQQNTLHRPLAD AVTAWFPENKQSDVSQIWHAFEHEEHANTFSAFLDRLSDTVSARNTSGFREQVAAWLE KLSASAELRQQSFAVAADATESCEDRVALTWNNLRKTLLVHQASEGLFDNDTGALLSL GREMFRLEILEDIARDKVRTLHFVDEIEVYLAFQTMLAEKLQLSTAVKEMRFYGVSGV TANDLRTAEAMVRSREENEFTDWFSLWGPWHAVLKRTEADRWAQAEEQKYEMLENEYS QRVADRLKASGLSGDADAEREAGAQVMRETEQQIYRQLTDEVLALRLSENGSRLHHS" misc_feature 66004..66064 /note="IS629.06, 85% id over 61 nt with IS629, from 11332 to 1192" CDS 66187..67911 /transl_table=11 /gene="ipaH4.5" /product="IpaH4.5, member of the IpaH family, probably secreted by the Mxi-Spa secretion machinery, function unknown" /note="ORF66, IpaH4.5, length = 574 aa, id to IPA4_SHIFL" /db_xref="GOA:P18009" /db_xref="InterPro:IPR001611" /db_xref="InterPro:IPR029487" /db_xref="InterPro:IPR032674" /db_xref="InterPro:IPR032675" /db_xref="UniProtKB/Swiss-Prot:P18009" /protein_id="CAC05788.1" /translation="MKPINNHSFFRSLCGLSCISRLSVEEQCTRDYHRIWDDWAREGT TTENRIQAVRLLKICLDTREPVLNLSLLKLRSLPPLPLHIRELNISNNELISLPENSP LLTELHVNGNNLNILPTLPSQLIKLNISFNRNLSCLPSLPPYLQSLSARFNSLETLPE LPSTLTILRIEGNRLTVLPELPHRLQELFVSGNRLQELPEFPQSLKYLKVGENQLRRL SRLPQELLALDVSNNLLTSLPENIITLPICTNVNISGNPLSTHVLQSLQRLTSSPDYH GPQIYFSMSDGQQNTLHRPLADAVTAWFPENKQSDVSQIWHAFEHEEHANTFSAFLDR LSDTVSARNTSGFREQVAAWLEKLSASAELRQQSFAVAADATESCEDRVALTWNNLRK TLLVHQASEGLFDNDTGALLSLGREMFRLEILEDIARDKVRTLHFVDEIEVYLAFQTM LAEKLQLSTAVKEMRFYGVSGVTANDLRTAEAMVRSREENEFTDWFSLWGPWHAVLKR TEADRWAQAEEQKYEMLENEYSQRVADRLKASGLSGDADAEREAGAQVMRETEQQIYR QLTDEVLA" misc_feature 67909..68370 /note="IS629.07, 97% id over 462 nt with IS629, from 1 to 462" misc_feature 68370..69287 /note="truncated gene; ORF69, length = 305 AA; 37.2% id in 301 aa overlap with the C-terminus of P08194|GLPT_ECOLI GLYCEROL-3-PHOSPHATE TRANSPORTER (452 aa)" misc_feature complement(69712..69917) /note="IS100.03, 95% id over 206 nt with IS100, from 1236 to 1440" misc_feature complement(69918..70498) /note="probable remnant of an IS element, contains an ORF that exhibits 62.2% id in 164 aa overlap with the C-terminal part of AF071034_14 from Escherichia coli (512 aa)" misc_feature 70499..70506 /note="IS600.03a, 100% id over 8 nt with IS600, from 152 to 158" misc_feature complement(70507..71811) /note="IS629.08, 99% id over 1305 nt with IS629, from 1 to 0" misc_feature 71812..71894 /note="IS600.03b, 96% id over 83 nt with IS600, from 159 to 240" misc_feature 71899..72737 /note="IS100.04, 98% id over 839 nt with IS100, from 1 to 0" misc_feature 72875..74588 /note="IS1294.04, 95% id over 1714 nt with IS1294, from 1 to 0" misc_feature 74589..74626 /note="IS62.09, 100% id over 38 nt with IS629, from 1273 to 0" misc_feature 74627..74671 /note="IS629.10, 97% id over 45 nt with IS629, from 193 to 237" misc_feature complement(74675..74878) /note="ISShf5.04, 98% id over 204 nt with ISShf5, from 1 to 204" CDS complement(74911..76608) /transl_table=11 /gene="ospD3" /product="OspD3 (SenA), probably secreted by the Mxi-Spa secretion machinery, function unknown" /note="ORF75, length = 565 aa, id to O47635, Shet2 enterotoxin" /db_xref="InterPro:IPR012927" /db_xref="UniProtKB/TrEMBL:Q99Q01" /protein_id="CAC05789.1" /translation="MPSVNLIPSRKICLQNMINKDNVSVETIQSLLHSKQLPYFSDKR SFLLNLNCQVTDHSGRLIVCRHLASYWIAQFNKSSGHVDYHHFAFPDEIKNYVSVSEE EKAINVPAIIYFVENGSWGDIIFYIFNEMIFHSEKSRALEISTSNHNMALGLKIKETK NGGDFVIQLYDPNHTATHLRAEFNKFNLAKIKKLTVDNFLDEKHQKCYGLISDGMSIF VDRHTPTSMSSIIRWPDNLLHPKVIYHAMRMGLTELIQKVTRVVQLSDLSDNTLELLL AAKNDDGLSGLLLALQNGHSDTILAYGELLETSGLNLDKTVELLTAEGMGGRISGLSQ ALQNGHAETIKTYGRLLKKRAINIEYNKLKNLLTAYYYDEVHRQIPGLMFALQNGHAD AIRAYGELILSPPLLNSEDIVNLLASRRYDNVPGLLLALNNGQADAILAYGDILNEAK LNLDKKAELLEAKDSNGLSGLFVALHNGCVETIIAYGKILHTADLTPHQASKLLAAEG PNGVSGLIIAFQNRNFEAIKTYMGIIKNENITPEEIAEHLDKKNGSDFLEIMKNIKS" CDS complement(76938..78350) /transl_table=11 /gene="ospC1" /product="OspC1, secreted by the Mxi-Spa secretion machinery, function unknown" /note="ORF77, length = 470 aa, unknown" /db_xref="InterPro:IPR010366" /db_xref="InterPro:IPR036770" /db_xref="UniProtKB/TrEMBL:Q99QR0" /protein_id="CAC05790.1" /translation="MNISETLNSANTQCNIDSMDNRLHTLFPKVTSVRNAAQQTMPDE KNLKDSANIIKDFFRKTIAAQSYSRMFSQGSNFKSLNIAIDAPSDAKASFKAIEHLDR LSKHYISEIREKLHPLSAEELNLLSLIINSDLIFRHQSNSDLSDKILNIKSFNKIQSE GICTKRNTYADDIKKIANHDFVFFGVEISNHQKKHPLNTKHHTVDFGANAYIIDHDSP YGYMTLTDHFDNAIPPVFYHEHQSFLDKFSEVNKEVSRYVHGSKGIIDVPIFNTKDMK LGLGLYLIDFIRKSEDQSFKEFCYGKNLAPVDLDRIINFVFQPEYHIPRMVSTENFKK VKIREISLEEAVTASNYEEINKQVTNKKIALQALFLSITNQKEDVALYILSNFEITRQ DVISIKHELYDIEYLLSAHNSSCKVLEYFINKGLVDVNTKFKKTNSGDCMLDNAIKYE NAEMIKLLLKYGATSDNKYI" misc_feature 78803..78883 /note="ISShf5.05, 90% id over 81 nt with ISShf5, from 373 to 453" misc_feature complement(78884..79165) /note="ISShf5.06, 95% id over 282 nt with ISShf5, from 1319 to 1038" misc_feature 79434..79650 /note="IS91.02, 88% id over 217 nt with IS91, from 1 to 217" misc_feature 79651..79674 /note="IS629.11, 100% id over 24 nt with IS629, from 1 to 24" misc_feature 79675..80498 /note="ISShf7.03, 98% id over 824 nt with ISShf7, from 1 to 824" CDS complement(80644..81534) /transl_table=11 /note="ORF81, length = 296 aa, similarities with transcriptional activators of the AraC family" /db_xref="GOA:Q9AJW5" /db_xref="InterPro:IPR009057" /db_xref="InterPro:IPR018060" /db_xref="UniProtKB/TrEMBL:Q9AJW5" /protein_id="CAC05791.1" /translation="MKKQIFINNKPPVVPYSGTHAKIFKYIEIPLPFFYFIYTSGEPF HISVQNTVIYVSKYNGIFINKLVPFSLLFDRDISVLQRRDICVVRFTSEEISEHNVLF DHDIERLKKISKAQLISPDYVLIDFSSVGGGGDEPYAMSWLISQCAHQCTDNKKTETD AIYDKVRSSYLLSCILKKNKNVGLILHAPSFVSVSEKIARIVMANYSRNWSNSELASA VLMSESSLKRRMYKEVGSISTFVHKIKLTEAIRKLRRTNTPISVISSELGYSSPSYFS KVFFKYLKTYPQNIRKKNGR" CDS complement(81762..82400) /transl_table=11 /note="ORF82, length = 212 aa, 54.7% id in 214 aa overlap with YBDM_ECOLI (hypothetical protein = 209 aa), function unknown" /db_xref="InterPro:IPR003115" /db_xref="InterPro:IPR036086" /db_xref="UniProtKB/TrEMBL:Q99Q60" /protein_id="CAC05792.1" /translation="MGDSVTPEILGNMIRQYFSQVRTEKESIQALNHLRRVLHEVSPF AQEPVDCVLWVKADEVVANDYNPNVMALGEKKLLKQSLEKDGFTQPVVVSEEKNHYLV VDGFHRQLLGREADTGKRLRGWLPVACINPERKGQAARIAATIRHNRARGKHQITLMS DIVRDLSRLGWTNERIGTELGMDQDEVLRLKQISGLTELFQEEDFSSAWTVR" misc_feature complement(82530..82554) /note="IS600.04a, 100% id over 25 nt with IS600, from 1240 to 0" misc_feature 82555..83885 /note="IS2.04, 99% id over 1331 nt with IS2, from 1 to 0" misc_feature complement(83886..85129) /note="IS600.04b, 98% id over 1244 nt with IS600, from 1 to 1244" misc_feature complement(85130..85438) /note="ISShf1.02, 100% id over 309 nt with ISShf1, from 1 to 309" misc_feature complement(85491..85581) /note="IS4.02, 91% id over 96 nt with IS4, from 1 to 91" CDS complement(85586..86071) /transl_table=11 /note="ORF85, length = 161 aa; 50.3% id in 157 aa overlap with MTY21C12_13 from Mycobacterium tuberculosis H37Rv (166 aa); 24.7% id in 162 aa overlap with AE000065_9 from Rhizobium sp. NGR234 plasmid pN (181 aa)" /db_xref="GOA:Q99QI3" /db_xref="InterPro:IPR000182" /db_xref="InterPro:IPR016181" /db_xref="UniProtKB/TrEMBL:Q99QI3" /protein_id="CAC05793.1" /translation="MGCVTAPEPLSSFHQVAEFVSGEAVLDDWLKQKELKNQAIGATR TFVVCRKGTQQIVGFYSLATGSVNHTEATGNLRRNMPDPIPVIILARLAVDVSFRGKG LGADLLHDAVRRCYRVAENIGVRAIMVHALTENAKQFYIHHGFKPSKTQVQTLFLKLP Q" CDS complement(86059..86343) /transl_table=11 /note="ORF86, length = 94 aa, 37.8% id in 82 aa overlap with MTY21C12_12 from Mycobacterium tuberculosis H37Rv (158 aa); 27.7% id in 83 aa overlap with U32725_7 from Haemophilus influenzae Rd section 4 (99 aa)" /db_xref="GOA:Q9AJW4" /db_xref="InterPro:IPR010985" /db_xref="InterPro:IPR014795" /db_xref="UniProtKB/TrEMBL:Q9AJW4" /protein_id="CAC05794.1" /translation="MIYGGFMKSGVQLNLRARESQRILIDAAAEILHKSRTDFILEMA CKAAEDVILDRRVFNFNDRQYEEFIEMLDAPVADDPAIEKLLARKPQWDV" misc_feature 87560..87630 /note="ISShf5.07, 95% id over 71 nt with ISShf5, from 1 to 71" misc_feature complement(87631..90359) /note="ISShf3.01, reference sequence for