LOCUS PGU42210 6889 bp DNA linear BCT 24-AUG-1996 DEFINITION Porphyromonas gingivalis porphypain (prtP) gene, complete cds. ACCESSION U42210 VERSION U42210.1 KEYWORDS . SOURCE Porphyromonas gingivalis ORGANISM Porphyromonas gingivalis Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Porphyromonadaceae; Porphyromonas. REFERENCE 1 (bases 1 to 6889) AUTHORS Barkocy-Gallagher,G.A., Han,N., Patti,J.M., Whitlock,J., Progulske-Fox,A. and Lantz,M.S. TITLE Analysis of the prtP gene encoding porphypain, a cysteine proteinase of Porphyromonas gingivalis JOURNAL J. Bacteriol. 178 (10), 2734-2741 (1996) PUBMED 8631659 REFERENCE 2 (bases 1 to 6889) AUTHORS Han,N. TITLE Direct Submission JOURNAL Submitted (05-DEC-1995) Naiming Han, Oral Biology, University of Florida, Box 100424, Gainesville, FL 32610, USA FEATURES Location/Qualifiers source 1..6889 /organism="Porphyromonas gingivalis" /mol_type="genomic DNA" /strain="W12" /db_xref="taxon:837" /note="pUC18, pBluescript II SK, pNH 5" gene 696..5894 /gene="prtP" CDS 696..5894 /gene="prtP" /function="cysteine protease/hemagglutinin" /codon_start=1 /transl_table=11 /product="porphypain" /protein_id="AAB06565.1" /translation="MRKLLLLIAASLLGVGLYAQSAKIKLDAPTTRTTCTNNSFKQFD ASFSFNEVELTKVETKGGTFASVSIPGAFPTGEVGSPEVPAVRKLIAVPVGATPVVRV KSFTEQVYSLNQYGSEKLMPHQPSMSKSDDPEKVPFVYNAAAYARKGFVGQELTQVEM LGTMRGVRIAALTINPVQYDVVANQLKVRNNIEIEVSFQGADEVATQRLYDASFSPYF ETAYKQLFNRDVYTDHGDLYNTPVRMLVVAGAKFKEALKPWLTWKAQKGFYLDVHYTD EAEVGTTNASIKAFIHKKYNDGLAASAAPVFLALVGDTDVISGEKGKKTKKVTDLYYS AVDGDYFPEMYTFRMSASSPEELTNIIDKVLMYEKATMPDKSYLEKVLLIAGADYSWN SQVGQPTIKYGMQYYYNQEHGYTDVYNYLKAPYTGCYSHLNTGVSFANYTAHGSETAW ADPLLTTSQLKALTNKDKYFLAIGNCCITAQFDYVQPCFGEVITRVKEKGAYAYIGSS PNSYWGEDYYWSVGANAVFGVQPTFEGTSMGSYDATFLEDSYNTVNSIMWAGNLAATH AGNIGNITHIGAHYYWEAYHVLGDGSVMPYRAMPKTNTYTLPASLPQNQASYSIQASA GSYVAISKDGVLYGTGVANASGVATVSMTKQITENGNYDVVITRSNYLPVIKQIQVGE PSPYQPVSNLTATTQGQKVTLKWEAPSAKKAEGSREVKRIGDGLFVTIEPANDVRANE AKVVLAADNVWGDNTGYQFLLDADHNTFGSVIPATGPLFTGTASSNLYSANFEYLVPA NADPVVTTQNIIVTGQGEVVIPGGVYDYCITNPEPASGKMWIAGDGGNQPARYDDFTF EAGKKYTFTMRRAGMGDGTDMEVEDDSPASYTYTVYRDGTKIKEGLTATTFEEDGVAA GNHEYCVEVKYTAGVSPKVCKDVTVEGSNEFAPVQNLTGSSVGQKVTLKWDAPNGTPN PNPNPNPNPGTTLSESFENGIPASWKTIDADGDGHGWKPGNAPGIAGYNSNGCVYSES FGLGGIGVLTPDNYLITPALDLPNGGKLTFWVCAQDANYASEHYAVYASSTGNDASNF TNALLEETITAKGVRSPKAIRGRIQGTWRQKTVDLPAGTKYVAFRHFQSTDMFYIDLD EVEIKANGKRADFTETFESSTHGEAPAEWTTIDADGDGQGWLCLSSGQLDWLTAHGGS NVVSSFSWNGMALNPDNYLISKDVTGATKVKYYYAVNDGFPGDHYAVMISKTGTNAGD FTVVFEETPNGINKGGARFGLSTEANGAKPQSVWIERTVDLPAGTKYVAFRHYNCSDL NYILLDDIQFTMGGSPTPTDYTYTVYRDGTKIKEGLTETTFEEDGVATGNHEYCVEVK YTAGVSPKKCVDVTVNSTQFNPVQNLTAEQAPNSMDAILKWNAPASKRAEVLNEDFEN GIPASWKTIDADGDGNNWTTTPPPGGSSFAGHNSAICVSSASHINFEGPQNPDNYLVT PELSLPGGGTLTFWVCAQDANYASEHYAVYASSTGNDASNFANALLEEVLTAKTVVTA PEAIRGTRAQGTWYQKTVQLPAGTKYVAFRHFGCTDFFWINLDDVVITSGNAPSYTYT IYRNNTQIASGVTETTYRDPDLATGFYTYGVKVVYPNGESAIETATLNITSLADVTAQ KPYTLTVVGKTITVTCQGEAMIYDMNGRRLAAGRNTVVYTAQGGHYAVMVVVDGKSYV EKLAVK" BASE COUNT 1828 a 1568 c 1622 g 1871 t ORIGIN 1 ggatcctacg cccgataccc atactcgaag cctttgctca gtaccatcct gcagaaggtt 61 actctttcgc atatagtgac cctcttttct ctcagcataa tggtacctat catatcagta 121 aggggcgtat tgtcttttcg aacaatgtac agcccgagaa ctctttactt ccacatcaca 181 cccccgactc cttagtcaag gatctttttt cccctttccc ctccgctctc ttcctcatgc 241 tggactgact taaccttggt ctgctctact tttcggttgt aaatacatgc aacacaataa 301 ctttaagtgt tgttagacaa cacttttaca agactctgac ttttaatgag gtggagcatg 361 aaccttttcc tctttcatct tctccttcag attacagtca atattttggc aaaaggctaa 421 ttgacagcct tttataaggg ttaatccctt gtggcttata ttgaaaacat gttctttata 481 atccgatact cttcttaaat cgaatttttt ctctaaattg cgccgcaaca aaactccttg 541 agaaaagtac caatagaaat agaaggtagc attttgcctt taaattcctt ttcttttctt 601 ggattgttct tgaaatgaat cttatttgtg gatttttttt gtttttttaa cccggccgtg 661 gttctctgaa tcacgaccat aaattgtttt aaagtatgag gaaattatta ttgctgatcg 721 cggcgtccct tttgggagtt ggtctttacg cccaaagcgc caagattaag cttgatgctc 781 cgactactcg aacgacatgt acgaacaata gcttcaagca gttcgatgca agcttttcgt 841 tcaatgaagt cgagctgaca aaggtggaga ccaaaggtgg tactttcgcc tcagtgtcaa 901 ttccgggtgc attcccgacc ggtgaggttg gttctcccga agtgccagca gttaggaagt 961 tgattgctgt gcctgtcgga gccacacctg ttgttcgcgt gaaaagtttt accgagcaag 1021 tttactctct gaaccaatac ggttccgaaa aactcatgcc acatcaaccc tctatgagca 1081 agagtgatga tcccgaaaag gttcccttcg tttacaatgc tgctgcttat gcacgcaaag 1141 gttttgtcgg acaagaactg acccaagtag aaatgttggg gacaatgcgt ggtgttcgca 1201 ttgcagctct taccattaat cctgttcagt atgatgtggt tgcaaaccaa ttgaaggtta 1261 gaaacaacat cgaaattgaa gtaagctttc aaggagctga tgaagtagct acacaacgtt 1321 tgtatgatgc ttcttttagc ccttatttcg aaacagctta taaacagctc ttcaatagag 1381 atgtttatac agatcatggc gacttgtata atacgccggt tcgtatgctt gttgttgcag 1441 gtgcaaaatt caaagaagct ctcaagcctt ggctcacttg gaaggctcaa aagggcttct 1501 atctggatgt gcattacaca gacgaagctg aagtaggaac gacaaacgcc tctatcaagg 1561 catttattca caagaaatac aatgatggat tggcagctag tgctgctccg gtcttcttgg 1621 ctttggttgg tgacactgac gttattagcg gagaaaaagg aaagaaaaca aaaaaagtta 1681 ccgacttgta ttacagtgca gtcgatggcg actatttccc tgaaatgtat actttccgta 1741 tgtctgcttc ttccccagaa gaactgacga acatcattga taaggtattg atgtatgaaa 1801 aggctactat gccagataag agttatttgg agaaagttct cttgattgca ggtgcagatt 1861 atagctggaa ttcccaggta ggtcagccaa ccattaaata cggtatgcag tactactaca 1921 accaagagca tggttatacc gacgtgtaca actatctcaa agccccttat acaggttgct 1981 acagtcattt gaataccgga gtcagctttg caaactatac agcgcatgga tctgagaccg 2041 catgggctga tccacttctg actacttctc aactgaaagc actcactaat aaggacaaat 2101 acttcttagc tattggcaac tgctgtatta cagctcaatt cgattatgta cagccttgct 2161 tcggagaggt aataactcgc gttaaggaga aaggggctta tgcctatatc ggttcatctc 2221 caaattctta ttggggcgag gactactatt ggagtgtggg tgctaatgcc gtatttggtg 2281 ttcagcctac ttttgaaggt acgtctatgg gttcttatga tgctacattc ttggaggatt 2341 cgtacaacac agtgaattct attatgtggg caggtaatct tgccgctact catgctggaa 2401 atatcggcaa tattacccat attggtgctc attactattg ggaagcttat catgtccttg 2461 gcgatggttc ggttatgcct tatcgtgcaa tgcctaagac caatacttat acgcttcctg 2521 cctctttgcc tcagaatcag gcttcttata gcattcaggc ttctgccggt tcttacgtag 2581 ctatttctaa agatggagtt ttgtatggaa caggtgttgc taatgccagc ggtgttgcga 2641 ctgtgagtat gactaagcag attacggaaa atggtaatta tgatgtagtt atcactcgct 2701 ctaattatct tcctgtgatc aagcaaattc aggtaggtga gcctagcccc taccagcccg 2761 tttccaactt gacagctaca acgcagggtc agaaagtaac gctcaagtgg gaagcaccga 2821 gcgcaaagaa ggcagaaggt tcccgtgaag taaaacggat cggagacggt cttttcgtta 2881 cgatcgaacc tgcaaacgat gtacgtgcca acgaagccaa ggttgtgctt gcggcagaca 2941 acgtatgggg agacaatacg ggttaccagt tcttgttgga tgccgatcac aatacattcg 3001 gaagtgtcat tccggcaacc ggtcctctct ttaccggaac agcttcttcc aatctttaca 3061 gtgcgaactt cgagtatttg gtcccggcca atgccgatcc tgttgttact acacagaata 3121 ttatcgttac aggacagggt gaagttgtaa tccccggtgg tgtttacgac tattgcatta 3181 cgaacccgga acctgcatcc ggaaagatgt ggatcgcagg agatggaggc aaccagcctg 3241 cacgttatga cgatttcaca ttcgaagcag gcaagaagta caccttcacg atgcgtcgcg 3301 ccggaatggg agatggaact gatatggaag tcgaagacga ttcacctgca agctatacct 3361 acacggtgta tcgtgacggc acgaagatca aggaaggtct gacagctacg acattcgaag 3421 aagacggtgt agctgcaggc aatcatgagt attgcgtgga agttaagtac acagccggcg 3481 tatctccgaa ggtatgtaaa gacgttacgg tagaaggatc caatgaattt gctcctgtac 3541 agaacctgac cggtagttca gtaggtcaga aagtaacgct taagtgggat gcacctaatg 3601 gtaccccgaa tccgaatcca aatccgaatc cgaatccggg aacaacactt tccgaatcat 3661 tcgaaaatgg tattccggca tcttggaaga cgatcgatgc agacggtgac gggcatggct 3721 ggaaacctgg aaatgctccc ggaatcgctg gctacaatag caatggttgt gtatattcag 3781 agtcattcgg tcttggtggt ataggagttc ttacccctga caactatctg ataacaccgg 3841 cattggattt gcctaacgga ggtaagttga ctttctgggt atgcgcacag gatgctaatt 3901 atgcatccga gcactatgcg gtgtatgcat cttcgaccgg taacgatgca tccaacttca 3961 cgaatgcttt gttggaagag acgattacgg caaaaggtgt tcgctcgccg aaagctattc 4021 gtggtcgtat acagggtact tggcgccaga agacggtaga ccttcccgca ggtacgaaat 4081 atgttgcttt ccgtcacttc caaagcacgg atatgttcta catcgacctt gatgaggttg 4141 agatcaaggc caatggcaag cgcgcagact tcacggaaac gttcgagtct tctactcatg 4201 gagaggcacc agcggaatgg actactatcg atgccgatgg cgatggtcag ggttggctct 4261 gtctgtcttc cggacaattg gactggctga cagctcatgg cggcagcaac gtagtaagct 4321 ctttctcatg gaatggaatg gctttgaatc ctgataacta tctcatctca aaggatgtta 4381 caggcgcaac gaaggtaaag tactactatg cagtcaacga cggttttccc ggggatcact 4441 atgcggtgat gatctccaag acgggcacga acgccggaga cttcacggtt gttttcgaag 4501 aaacgcctaa cggaataaat aagggcggag caagattcgg tctttccacg gaagccaatg 4561 gcgccaaacc tcaaagtgta tggatcgagc gtacggtaga tttgcctgca ggcacgaagt 4621 atgttgcttt ccgtcactac aattgctcgg atttgaacta cattcttttg gatgatattc 4681 agttcaccat gggtggcagc cccaccccga ccgattatac ctacacggtg tatcgtgatg 4741 gtacgaagat caaggaaggt ttgaccgaaa cgaccttcga agaagacggc gtagctacgg 4801 gcaatcatga gtattgcgtg gaagtgaagt acacagccgg cgtatctccg aagaaatgtg 4861 tagacgtaac tgttaattcg acacagttca atcctgtaca gaacctgacg gcagaacaag 4921 ctcctaacag catggatgca atccttaaat ggaatgcacc ggcatctaag cgtgcggaag 4981 ttctgaacga agacttcgaa aatggtattc ctgcctcatg gaagacgatc gatgcagacg 5041 gtgacggcaa caattggacg acgacccctc ctcccggagg ctcctctttt gcaggtcaca 5101 acagtgcgat ctgtgtctct tcagcttctc atatcaactt tgaaggtcct cagaaccctg 5161 ataactatct ggttacaccg gagctttctc ttcctggcgg aggaacgctt actttctggg 5221 tatgtgcaca agatgccaat tatgcatcag agcactatgc cgtgtacgca tcttctacgg 5281 gtaacgacgc ttccaacttc gccaacgctt tgttggaaga agtgctgacg gccaagacag 5341 ttgttacggc acctgaagcc attcgtggta ctcgtgctca gggcacctgg tatcaaaaga 5401 cggtacagtt gcctgcgggt actaagtatg ttgccttccg tcacttcggc tgtacggact 5461 tcttctggat caaccttgat gatgttgtaa tcacttcagg gaacgctccg tcttacacct 5521 atacgatcta tcgtaataat acacagatag catcaggcgt aacggagact acttaccgag 5581 atccggactt ggctaccggt ttttacacgt acggtgtaaa ggttgtttac ccgaacggag 5641 aatcagctat cgaaactgct acgttgaata tcacttcgtt ggcagacgta acggctcaga 5701 agccttacac gctgacagtt gtaggaaaga cgatcacggt aacttgccaa ggcgaagcta 5761 tgatctacga catgaacggt cgtcgtctgg cagcgggtcg caacacggtt gtttacacgg 5821 ctcagggcgg ccactatgca gtcatggttg tcgttgacgg caagtcttac gtagagaaac 5881 tcgctgtaaa gtaaatctgt cttggactcg gagactttgt gcagacactt ttaagatagg 5941 tctgtaattg tctcagagta tgaatcggtc gcccgacttc cttaaaagga ggtcgggcga 6001 cttcgttttt attattgctg tccggtaaac ttgtcaagag gagacctttg aaaaatgaga 6061 cctttgcacg gcgattggtg tgtattttgt ttgttaattc attgtataat agggagttat 6121 tttgtatatt tgagtattaa aaacagcata atattcctcc catggcatac caatccaaga 6181 ataccgatga gcatgtaaca tttgcagacg cactcctttc aaagcgttat cgcaaagcac 6241 aaaacgactt cctcaatcag gttgacaggc ttatcgattg gcgtccgatc aggacgctga 6301 tcaacaagaa atacacgaag cgacaaaatg ccatcggcgc cccggcttat gacgtgattc 6361 tcttattcaa gatgttgctt ccgaagacat ggtacaacct cagtgattgt gctttggagg 6421 agcgcatcaa tgattcaatc accttttccc gattcttggg gctatggaag aggtatctcc 6481 cgaccacagc accatcagtc gatttcgttc ggcactgaca gagttggggc tcatggacaa 6541 actattggcg cagtttaaca aacaactttt ccgccatcac atttcggtca gggaaagggt 6601 gcttgtcgat gcaagccttg tggagatacg gagcaccatc gaacgcacct ttggcagtat 6661 tcgccggtgg tttcatggcg gacgatgtcg ataccgggga cttgccaaga cccatactca 6721 aaacattctt gaaagcatcg cctttaattt atacagaacc ccggggataa ttatgtcctc 6781 atctctagga taaggtataa ccacccttga ggagctcgtg caagcagctc ctcaaggggg 6841 atttacaact actttcactc cttaccgcca cccttttccc tccctcccg //