LOCUS       TTHAQUI                 2173 bp    DNA     linear   BCT 25-APR-2009
DEFINITION  Thermus aquaticus aquI gene for aqualysin precursor, complete cds.
ACCESSION   D90108 J05414
VERSION     D90108.1
KEYWORDS    .
SOURCE      Thermus aquaticus
  ORGANISM  Thermus aquaticus
            Bacteria; Deinococcota; Deinococci; Thermales; Thermaceae;
            Thermus.
REFERENCE   1  (bases 528 to 1632)
  AUTHORS   Kwon,S.T., Terada,I., Matsuzawa,H. and Ohta,T.
  TITLE     Nucleotide sequence of the gene for aqualysin I (a thermophilic
            alkaline serine protease) of Thermus aquaticus YT-1 and
            characteristics of the deduced primary structure of the enzyme
  JOURNAL   Eur. J. Biochem. 173, 491-497 (1988)
REFERENCE   2
  AUTHORS   Terada,I., Kwon,S.T., Miyata,Y., Matsuzawa,H. and Ohta,T.
  TITLE     Unique precursor structure of an extracellular protease, aqualysin
            I, with NH2- and COOH-terminal pro-sequences and its processing in
            Escherichia coli
  JOURNAL   J. Biol. Chem. 265, 6576-6581 (1990)
COMMENT     These data kindly submitted in computer readable form by:
            Hiroshi Matsuzawa
            Department of Agricultural Chemistry, The University of Tokyo
            Bunkyo-ku, Tokyo 113
            Japan
FEATURES             Location/Qualifiers
     source          1..2173
                     /db_xref="taxon:271"
                     /mol_type="genomic DNA"
                     /organism="Thermus aquaticus"
                     /strain="YT-1"
     regulatory      248..253
                     /regulatory_class="minus_35_signal"
     regulatory      272..277
                     /regulatory_class="minus_10_signal"
     regulatory      297..302
                     /note="ribosome-binding sequence"
                     /regulatory_class="ribosome_binding_site"
     CDS             306..1847
                     /codon_start=1
                     /EC_number="3.4.21.62"
                     /gene="aquI"
                     /note="aa 1 to 513"
                     /product="aqualysin precursor"
                     /protein_id="BAA14135.1"
                     /transl_table=11
                     /translation="MRKTYWLMALFAVLVLGGCQMASRSDPTPTLAEAFWPKEAPVYG
                     LDDPEAIPGRYIVVFKKGKGQSLLQGGITTLQARLAPQGVVVTQAYTGALQGFAAEMA
                     PQALEAFRQSPDVEFIEADKVVRAWATQSPAPWGLDRIDQRDLPLSNSYTYTATGRGV
                     NVYVIDTGIRTTHREFGGRARVGYDALGGNGQDCNGHGTHVAGTIGGVTYGVAKAVNL
                     YAVRVLDCNGSGSTSGVIAGVDWVTRNHRRPAVANMSLGGGVSTALDNAVKNSIAAGV
                     VYAVAAGNDNANACNYSPARVAEALTVGATTSSDARASFSNYGSCVDLFAPGASIPSA
                     WYTSDTATQTLNGTSMATPHVAGVAALYLEQNPSATPASVASAILNGATTGRLSGIGS
                     GSPNRLLYSLLSSGSGSTAPCTSCSYYTGSLSGPGDYNFQPNGTYYYSPAGTHRAWLR
                     GPAGTDFDLYLWRWDGSRWLTVGSSTGPTSEESLSYSGTAGYYLWRIYAYSGSGMYEF
                     WLQRP"
     sig_peptide     306..347
                     /note="aa 1 to 14"
     mat_peptide     687..1529
                     /note="aa 128 to 408"
                     /product="aqualysin"
     regulatory      1862..1888
                     /note="transcription termination signal"
                     /regulatory_class="terminator"
BASE COUNT          361 a          739 c          680 g          393 t
ORIGIN
        1 cctaggacgg gatcgcccgg gccatccgcc ccgcccacac ccccctggac ggggacctgg
       61 tcttcgccct ggccctgggg gagggcaggg gggtggaccc ctacctcctc ctccgcctcg
      121 gggcctacgc cgccgacgcc ctcgcgcggg ccatgcccgg gcggtgctgt tggcggaggg
      181 gatgttgggg gtgcctgcat accggcagct tgtgcgtaaa aatgaagaat aactaataaa
      241 aacccccttg acacccgggc atccttaggg ttagctttgc cctcgtgaaa tccacaaagg
      301 agcgtatgag gaagacttat tggctgatgg cgcttttcgc ggtgctcgtt ttgggtggtt
      361 gtcagatggc ctcccgctcc gatccaaccc ctaccttggc tgaggccttc tggcccaagg
      421 aggctcccgt ctatggcctg gatgaccctg aagctatccc gggccggtac attgtggtct
      481 ttaagaaggg gaagggtcag tctctgctcc aaggtggcat cacaaccctg caggcacggc
      541 tggctcctca gggggtagtg gtgacccagg cctacacggg cgccctccag ggatttgcgg
      601 cggagatggc gccccaggcc ttagaggcct ttaggcagag tcccgacgtg gagttcatag
      661 aggcggacaa ggtggtacgg gcctgggcta cccagagccc ggctccttgg ggcctggacc
      721 ggattgacca gcgggacctt cccctttcca acagctacac ctacaccgct acgggaaggg
      781 gggttaacgt ctatgtgatt gacaccggaa tccgcacgac ccaccgggag ttcggcggcc
      841 gggcccgggt aggctatgac gccttagggg ggaacggcca ggactgcaac ggccacggta
      901 cccatgtggc gggaacgatc ggcggagtga cctatggggt agccaaagcg gttaacctct
      961 acgctgtgcg cgtcctggac tgcaacggtt ccggctccac ctctggggtc atcgctgggg
     1021 tggactgggt cacgcggaac caccgcaggc cggccgttgc caacatgagc ttaggaggcg
     1081 gagtctccac tgccctggac aacgccgtga agaactccat cgccgcggga gtggtctacg
     1141 ctgtggctgc ggggaacgac aacgccaacg cctgcaacta ctccccagcc cgggtggccg
     1201 aggcccttac cgtgggcgct accacatctt ccgacgcccg tgccagcttc tccaactacg
     1261 gtagttgcgt ggacctcttc gcccctgggg cttccattcc ctcggcgtgg tacacctcgg
     1321 acacggccac ccagaccctt aacggcacct ccatggccac cccccatgtg gccggggtgg
     1381 ctgctttgta tctagagcaa aatccttcgg ctacgccggc ctctgttgct agcgctatcc
     1441 tcaacggagc cactacgggg cggctttcgg ggatcggatc ggggtccccc aaccgtctcc
     1501 tttactccct gctctcctcg gggagtggtt ccacagcccc ctgtaccagc tgtagctact
     1561 acacgggcag cctttccggt cctggtgact ataacttcca acccaacggc acctactact
     1621 acagccctgc aggtacccat agggcctggc ttaggggccc cgccggaacg gactttgacc
     1681 tctacctctg gcggtgggac ggctcccgtt ggctgaccgt gggtagctct acggggccca
     1741 cctcggagga aagtctcagc tacagcggaa ctgctggcta ctacctctgg cgcatctacg
     1801 cctatagcgg ctcggggatg tacgagttct ggctccagcg cccctaggcg aaggagttct
     1861 tcctcccctg ggaagcgcct gggggaggtt ttccctttag cgtcctggga aaggggcgag
     1921 gaccgcttcc acctggaaga cctggccccc ccggcgcacg gtgagggcga tccggccccc
     1981 cacctggtgg cggcgcacct cgcggaggag gtcctcaaag gagttcaccg gcaccccgtt
     2041 cacctccagg atcacgtcgg gcacgcctcc cgcctcgagg cccctaagcc ccgcccggtg
     2101 ggccgccccg cccggcacca cctcccccac caggaccccc gcccggcggg aggcccagct
     2161 cccgggcgag ctc
//