LOCUS       HM051054                6671 bp    DNA     linear   BCT 13-JUN-2010
DEFINITION  Streptomyces sp. MK730-62F2 caprazamycin biosynthesis L-rhamnose
            gene cluster, complete sequence.
ACCESSION   HM051054
VERSION     HM051054.1
KEYWORDS    .
SOURCE      Streptomyces sp. MK730-62F2
  ORGANISM  Streptomyces sp. MK730-62F2
            Bacteria; Actinomycetota; Actinomycetes; Kitasatosporales;
            Streptomycetaceae; Streptomyces.
REFERENCE   1  (bases 1 to 6671)
  AUTHORS   Kaysser,L., Wemakor,E., Siebenberg,S., Salas,J.A., Sohng,J.K.,
            Kammerer,B. and Gust,B.
  TITLE     Formation and attachment of the deoxysugar moiety and assembly of
            the gene cluster for caprazamycin biosynthesis
  JOURNAL   Appl. Environ. Microbiol. 76 (12), 4008-4018 (2010)
   PUBMED   20418426
REFERENCE   2  (bases 1 to 6671)
  AUTHORS   Gust,B., Kaysser,L. and Heide,L.
  TITLE     Direct Submission
  JOURNAL   Submitted (01-APR-2010) Pharmaceutical Biology, University of
            Tuebingen, Auf der Morgenstelle 8, Tuebingen 72076, Germany
FEATURES             Location/Qualifiers
     source          1..6671
                     /organism="Streptomyces sp. MK730-62F2"
                     /mol_type="genomic DNA"
                     /strain="MK730-62F2"
                     /db_xref="taxon:643403"
                     /clone="Cosmid 4H11"
     misc_feature    1..6671
                     /note="caprazamycin biosynthesis L-rhamnose gene cluster;
                     involved in the biosynthesis of the caprazamycin
                     deoxysugar moiety"
     gene            1..174
                     /gene="cpzDI"
     CDS             1..174
                     /gene="cpzDI"
                     /codon_start=1
                     /transl_table=11
                     /product="unknown"
                     /protein_id="ADI50275.1"
                     /translation="MDTSEKPVVRQHETGWSKARRLRGKAALRTCLPAVEDRGTPGVT
                     HSGAEWNIVRGED"
     gene            279..1346
                     /gene="cpzDII"
     CDS             279..1346
                     /gene="cpzDII"
                     /codon_start=1
                     /transl_table=11
                     /product="dTDP-glucose synthase"
                     /protein_id="ADI50276.1"
                     /translation="MKALVLSGGAGTRLRPITHTSAKQLVPVANKAVLFYGLESIAEA
                     GITDVGMIVGETAEEIEEAVGDGSKFGLKVTYIPQERPLGLAHAVLIARDYLGDDDFV
                     MYLGDNFIVGGITGLVEEFRDNRPDAQILLTRVADPRAFGVAELDPSGQVIGLEEKPD
                     RPKSDLALVGVYMFTPLIHEAVRAIEPSWRGELEITHAIQHLIDTRADVRSTVIKGYW
                     KDTGNVGDMLEVNRTVLEAMERRIDGEVDDASETIGRVVLEEGARIVNSRVVGPVVIG
                     SGTVVSNSYVGPFTSVAENCRITDSELEFSIVLRDSSIQGVGRIEASLIGRHVEVTPA
                     PSVPSAHRLVLGDHSKVQITS"
     gene            1343..2323
                     /gene="cpzDIII"
     CDS             1343..2323
                     /gene="cpzDIII"
                     /codon_start=1
                     /transl_table=11
                     /product="dTDP-glucose dehydratase"
                     /protein_id="ADI50277.1"
                     /translation="MNLLVTGAAGFIGSRYVRALLASDAPDAPRVTVLDKLTYAGTLD
                     NLELTHPRLEFVQGDICDAELVDKLMADADQVVHFAAESHVDRSISGAADFVRTNVLG
                     TQTLLDAALRHGTGPFVHVSTDEVYGSIETGSWPEDHPLQPNSPYSASKASSDLLALA
                     YHRTHGLDVRVTRCSNNYGPHQFPEKVVPLFVTNLLDGHKVPLYGEGRNIRDWLHVDD
                     HCQGVDLARTKGRPGEVYNIGGGTELTNKELTGLLLDACGADWDRVEYVEDRKGHDLR
                     YSVDCSKARDELGYRPRHDFTTGLAETVAWYRDNRAWWEPLKQRVAQERA"
     gene            2320..3207
                     /gene="cpzDIV"
     CDS             2320..3207
                     /gene="cpzDIV"
                     /codon_start=1
                     /transl_table=11
                     /product="4-ketoreductase"
                     /protein_id="ADI50278.1"
                     /translation="MRWLITGASGMLGRDVVEELTRRGERVVGLDRAALDITSPPAVD
                     SAVREHRPDVVVNCAAYTAVDDAETDEARALEINGAGPRLLARACAAHEARLIHVSTD
                     YVFSGEARTTPYPEDHRTGPRTAYGRTKLAGEQAVLEELPGASAVLRTAWLYGVHGSN
                     FVRTMIGLEARRDTLDVVDDQRGQPTWSADVAQRIAELGPRLGPEAHGVFHATNSGEA
                     TWYDLAREVFSLIGADPDRVRPTSSAAFPRPAPRPAYSALAHRRWQEIGLPLPRDWRS
                     ALHEALPRIRKEGLPRETP"
     gene            3194..3997
                     /gene="cpzDV"
     CDS             3194..3997
                     /gene="cpzDV"
                     /codon_start=1
                     /transl_table=11
                     /product="unknown"
                     /protein_id="ADI50279.1"
                     /translation="MKRHEFLRELHKVSANRNYLEIGVNDGRSLTLSRVPSIAIDPAF
                     KVVSELRCDVHLVKATSDDFFARDNPLQHLKGGRHPLRNLRRGRSPIGYWRKTTLDLS
                     FIDGMHLFEFALRDFMNVEKYSDWGSVIVLDDMLPRSVDEAARDRHTNAWTGDVYKIT
                     EVLARYRPDLVTVQVDTAPTGQLVVFGADPDNRVLHDKYDEIMAEYKIPDPQKVPEAI
                     LERAGAVRPETLLEAGFWRPLAQARNRGLPRAVGWEPLRKALHQVGVSR"
     gene            complement(4121..4726)
                     /gene="cpzDVI"
     CDS             complement(4121..4726)
                     /gene="cpzDVI"
                     /codon_start=1
                     /transl_table=11
                     /product="sugar 3,5-epimerase"
                     /protein_id="ADI50280.1"
                     /translation="MRPLAIEGAWVLEPKIFPDDRGSFHEWYRGAEFREATGHDLSLA
                     QANCSVSRRGVLRGVHFADVPPGQAKYVTCVRGAVLDVVIDIRVGSPTYGTWEAVRLD
                     DDTRHAVFLAEGLGHAFMALTDDATVVYLCSEGYAPGREHGIHPLDPALGIEWPEGIT
                     PLLSPKDEQAPTLAEAERQGLLPSYEACLAYYEELKSGQGA"
     gene            complement(4731..6671)
                     /gene="cpzDVII"
     CDS             complement(4731..6671)
                     /gene="cpzDVII"
                     /codon_start=1
                     /transl_table=11
                     /product="glycosyltransferase"
                     /protein_id="ADI50281.1"
                     /translation="MPVKVSVIVPVYNPGIYIEDCISSLQRQSLPPDEFEVIFVDDGS
                     TDETPARLDALAAEDPRMKVIHQENSGWSGKPRNVGIEASRGEFVMFVDNDDYLGDEA
                     LERMYDYGTANGADVVVGKMAGKNRGVPVELFRRNHPRATVENAPLIDSLTPHKMVRR
                     AFLDRIGLRFPEGRRRLEDHVFIAEAYLRAENVSVLSDYVCYYHIRRDDGSNAGFERF
                     DPVGYFKNLREALDVVEQYTEPGPVRDRLFRRWLRVEMVERLRARRLLNLPDDYRREL
                     FGEIHEIVVERFGPGVAAGLQPTQQVIAALTAADRYDDVVAFAEWEAGVGPAAVPGDI
                     EWRDGSLRIGFTAEYLSGGEPMLFPADAEAAPLTDVPKDLTEAVRWVASETAARFGQA
                     TADLLLRERTSAAQYFQPVELTRETVPVGDGEEVRLVLRGTATVDPGALPRDGAWDAL
                     VRVKSGGWTKECRLGPAPVEDRPTPPAGLVGDRPVLPYWTTPHGNLSLELGARGKRLG
                     LGRVELGDVTVSDNRFRVLLPLHVPGDSQVRLRFVSSHRILETPGTLSPDAGRPGSVL
                     EAVLPEDLSDDVWRVAVCPNPGSDRARFAGLPFALRVGGGSVLVTPVPGPGVALRLAR
                     RARRVLGTARRKVNSRIRNGGR"
BASE COUNT     1051 a   2504 c   2209 g    907 t
ORIGIN      
        1 atggacacca gtgagaagcc cgtcgtccgt cagcacgaga ccggctggtc gaaggcacgc
       61 cgtctgcgcg gaaaggccgc gctgcgcacc tgcctccccg ccgtcgagga ccggggcacg
      121 cccggagtga cgcactccgg ggcggagtgg aacatcgtcc ggggtgagga ctgacgcctc
      181 cctcgtggac agcgcggtga cgatccgagc aactccgcgc atcgacaact aaaagaagtg
      241 atgttcacat ccgccggatc acgactaagg tgaaccgaat gaaggctctc gtgctgtccg
      301 gcggcgcagg aacaagactg aggccgatca cgcacacgtc ggccaagcaa ctggtgcccg
      361 tggccaacaa ggccgtgctg ttctacgggt tggagtcgat cgccgaggcc ggtatcaccg
      421 acgtcggcat gatcgtcggg gagaccgccg aggagatcga ggaagcggtc ggggacgggt
      481 cgaagttcgg cctcaaggtc acctacatcc cccaggagcg gcccctcggg ctggcccacg
      541 cggtgctgat cgcccgggac tacctcggag acgacgactt cgtgatgtac ctcggcgaca
      601 acttcatcgt cggcggcatc accggcctcg tcgaggagtt ccgcgacaac cggcccgacg
      661 cccagatcct gctcacgcgc gtggccgacc cgcgcgcctt cggtgtcgcc gaactcgacc
      721 cgtccggcca ggtgatcggc ctggaggaga agcccgaccg gcccaagagc gatctcgcgc
      781 tggtcggcgt ctacatgttc acccccctca ttcacgaggc ggtccgcgcc atcgaaccct
      841 cctggcgcgg cgaactggag atcacccacg ccatccagca cctgatcgac acccgcgccg
      901 acgtgcgctc cacggtcatc aagggctact ggaaggacac cggcaacgtc ggcgacatgc
      961 tcgaggtgaa ccgcacggtc ctcgaagcca tggagcgccg catcgacggc gaggtggacg
     1021 acgcgtccga gaccatcggg cgcgtcgtgc tcgaagaggg cgcgcggatc gtcaactccc
     1081 gtgtcgtcgg acccgtcgtc atcggctcgg gcaccgtcgt cagcaactcc tacgtcggcc
     1141 ccttcacctc cgtcgccgag aactgccgga tcaccgacag cgagctggag ttctccatcg
     1201 tgctgcggga ctcctcgatc cagggcgtcg gccgcatcga ggcctcgctg atcggccggc
     1261 acgtcgaggt gacccccgcc cccagcgttc ccagcgccca ccgtctcgtc ctcggagatc
     1321 acagcaaggt gcagatcact tcatgaacct cctcgtcacc ggcgccgccg ggttcatcgg
     1381 ctcccgctac gtccgggccc tgctcgcctc ggacgcgccc gacgcgccgc gcgtcaccgt
     1441 gctggacaag ctcacctacg ccggcaccct cgacaacctc gaactgaccc acccgcggct
     1501 ggagttcgtg cagggcgaca tctgcgacgc cgaactggtc gacaagctga tggccgacgc
     1561 ggaccaggtc gtgcacttcg ccgccgagtc ccatgtcgac cgctcgatca gcggcgccgc
     1621 cgacttcgtc cgcaccaacg tcctcggcac ccagacgctg ctggacgccg ccctgcgcca
     1681 cggcacgggc cccttcgtgc acgtctccac cgacgaggtc tacggctcca tcgagaccgg
     1741 ctcgtggccg gaggaccatc cgctccagcc gaactcgccg tactcggcct ccaaggcctc
     1801 ctccgacctc cttgcgttgg cctaccaccg cacccacggc ctggacgtgc gcgtcacgcg
     1861 ctgctccaac aactacggcc cgcaccagtt ccccgagaag gtcgtcccgc tgttcgtcac
     1921 caacctcctc gacggccaca aggtgccgct gtacggcgag ggccgcaaca tccgcgactg
     1981 gctgcacgtc gacgaccact gccagggcgt cgacctcgcc cgcaccaagg gccggcccgg
     2041 cgaggtctac aacatcggcg gcggcaccga actcaccaac aaggaactca ccggcctgct
     2101 cctggacgcc tgcggagccg actgggaccg ggtggagtac gtcgaggacc gcaagggcca
     2161 cgacctgcgc tactccgtgg actgctccaa ggcccgcgac gaactcggct accgcccccg
     2221 ccacgacttc accaccggcc tggccgagac cgtcgcctgg taccgcgaca accgggcctg
     2281 gtgggagccc ctgaagcagc gcgtcgccca ggagcgggca tgaggtggct catcaccggc
     2341 gcgagcggaa tgctcggccg tgacgtcgtc gaggaactca cgcgccgcgg cgagagagtc
     2401 gtcggcctcg accgcgcggc cctggacatc accagccctc cggccgtcga ctccgccgtg
     2461 cgggagcacc gacccgacgt ggtcgtcaac tgcgccgcct acacggccgt cgacgacgcc
     2521 gagaccgacg aggcccgcgc cctggagatc aacggcgccg gtccacgcct gctggcccgc
     2581 gcctgcgccg cgcacgaggc ccgcctgatc cacgtctcca cggactacgt cttctccgga
     2641 gaggcccgca ccacccccta cccggaggac catcggaccg gcccccgcac cgcctacggc
     2701 cgcaccaaac tggccgggga gcaggccgtg ctggaggaac tccccggggc gagcgcggtg
     2761 ctgcgcacgg cgtggctcta cggcgtccac ggctccaact tcgtccggac catgatcgga
     2821 ctggaagccc gccgggacac cctcgacgtc gtcgacgacc agcgcggaca gcccacctgg
     2881 agcgcggacg tcgcccagcg gatcgccgaa ctcggccccc ggctcggacc cgaggcgcac
     2941 ggcgtcttcc acgcgaccaa ctccggcgag gcgacctggt acgacctcgc acgggaggtg
     3001 ttctccctca tcggcgcgga cccggaccgg gtgcgcccca ccagcagcgc ggccttcccc
     3061 cggcccgcgc cccgcccggc gtacagcgcc ctcgcgcacc gccgctggca ggagatcggc
     3121 ctgccgctgc cgcgcgactg gcgctccgcc ctgcacgaag cactgccccg catccgcaag
     3181 gaaggtcttc ctcgtgaaac gccatgagtt cctccgggaa ctgcacaagg tcagcgccaa
     3241 tcgcaactac ctggagatcg gcgtcaacga cgggcgcagc ctgacgctgt cccgcgtccc
     3301 cagcatcgcg atcgaccccg ccttcaaggt ggtctcggag ctgaggtgcg acgtccacct
     3361 ggtgaaggcc accagcgacg acttcttcgc gcgtgacaat ccgctccagc acctcaaggg
     3421 cggccgtcac ccgctgcgca acctgcgtcg cggccgcagc ccgatcggct actggcgcaa
     3481 gaccacgctg gacctgtcgt tcatcgacgg catgcacctg ttcgagttcg cgctgcgcga
     3541 cttcatgaac gtcgagaagt actcggactg gggcagtgtg atcgtcctcg acgacatgct
     3601 gccgcgcagc gtcgacgagg cggcccggga ccggcacacc aacgcctgga ccggcgacgt
     3661 ctacaagatc accgaggtgc tggcgcgcta ccgcccggac ctggtcacgg tgcaggtgga
     3721 caccgccccc accggtcagc tggtcgtctt cggcgccgac ccggacaacc gggtcctgca
     3781 cgacaagtac gacgagatca tggccgagta caagatcccc gacccgcaga aggtccccga
     3841 ggcgatcctg gagcgggccg gcgcggtccg ccccgagacc ctcctggaag ccggcttctg
     3901 gcgacccctc gcccaggccc gcaaccgcgg cctgccccgt gccgtcggct gggagcccct
     3961 gcgcaaggcc ctgcaccagg tcggcgtgag ccgctgaacg acggcgccgg ccaccgcccg
     4021 ggggcggccg gcgccgacgc accttccggc cttgacggag cgcaggcccg gacgccgatc
     4081 gcgtccgggc ctgcggtgtg ttcccaggtc gtgccgaggg ctacgccccc tgaccgctct
     4141 tcagttcctc gtagtacgcc agacaggcct cgtacgaggg caggagcccc tggcgttccg
     4201 cttcggccag ggtcggggcc tgctcgtcct tgggggacag gagcggggtg atgccttcgg
     4261 gccactcgat gcccagagcc gggtcgaggg ggtggatgcc gtgctcacgg cccggggcgt
     4321 agccctccga gcacaggtag accaccgtgg cgtcgtccgt gagggccatg aaggcgtgac
     4381 cgaggccctc cgccaggaac accgcgtgcc gcgtgtcgtc gtcgagccgc acggcctccc
     4441 acgtcccgta cgtgggggag ccgacacgga tgtcgatcac cacgtccagg acggcaccgc
     4501 ggacgcacgt gacgtacttg gcctggccgg gcggcacgtc ggcgaagtgc acaccgcgca
     4561 gcacgccccg acgtgaaacc gagcagttgg cctgggccag ggacaggtcg tgtcccgtcg
     4621 cctcgcggaa ctccgcgccg cggtaccact cgtggaagct gccccggtcg tccgggaaga
     4681 tcttgggctc cagtacccag gcgccctcta tcgccagcgg tcgcatgccg tcaccgccct
     4741 ccgttcctga tacgggagtt caccttccgg cgggccgtac cgagcacccg ccgggcgcgc
     4801 cgggccagcc gcagcgcgac accgggcccc ggcaccggcg tcaccagcac gctcccgccc
     4861 ccgacccgca gcgcgaacgg cagaccggcg aaacgggccc ggtcggagcc cgggttcggg
     4921 cacacggcca cccgccagac gtcgtccgag agatcctcgg gcaggaccgc ctccagcacc
     4981 gagcccggcc ggccggcgtc gggggagagc gtgcccggcg tctccaggat ccggtgggag
     5041 gagacgaacc tcagccgcac ctgcgagtcg cccgggacgt gcagcgggag cagcacccgg
     5101 aaccggttgt cggagacggt gacgtccccc agttccacgc ggcccagccc gagccgcttg
     5161 ccgcgcgcgc cgagctccag ggagaggttg ccgtgcggtg tggtccagta cggcaggacc
     5221 gggcggtcgc cgacgagccc cgcagggggc gtgggccggt cctcgaccgg cgccggtccc
     5281 agccggcact ccttggtcca gccgcccgac ttcacccgga cgagtgcgtc ccaggcgccg
     5341 tcacgcggca gggcgcccgg gtcgaccgtg gcggttcccc gcagcaccag ccggacctct
     5401 tcgccgtccc cgaccggcac ggtctcgcgg gtgagctcca ccggctggaa gtactgcgcg
     5461 gcgctggtgc gctcccgcag cagcagatcg gcggtggcct gcccgaaccg cgcggcggtc
     5521 tccgaggcca cccagcgcac cgcctcggtc aggtccttcg gtacgtcggt cagcggcgcg
     5581 gcctcggcgt cggcggggaa gagcatcggc tcgccgccgg acaggtactc ggccgtgaag
     5641 ccgatacgga gcgaaccgtc acgccactcg atgtcccctg ggaccgccgc gggtccgacg
     5701 cccgcctccc actcggcgaa ggccaccacg tcgtcgtagc ggtcggccgc cgtcagcgcg
     5761 gcgatgacct gctgcgtcgg ctgcagaccg gcggcgacac cgggtccgaa gcgttcgacg
     5821 acgatctcgt ggatctcccc gaacagctcg cggcggtagt cgtccggcag gttcagcagg
     5881 cgccgggccc gcagccgctc caccatctcc acgcgcagcc accgccggaa cagccggtcg
     5941 cgcaccggac cgggctccgt gtactgctcg acgacgtcga gggcctcgcg caggttcttg
     6001 aagtagccga cgggatcgaa gcgttcgaag cccgcgttgg aaccgtcgtc ccgccggatg
     6061 tggtagtagc agacgtagtc gctgagcacc gagacgttct ccgcgcgcag atacgcctcc
     6121 gcgatgaaga cgtggtcctc caggcgccgt ctgccctcgg ggaagcgcag gccgatgcgg
     6181 tccaggaagg cccggcggac catcttgtgc ggggtgaggc tgtcgatgag cggggcgttc
     6241 tcgacggtgg cacgcgggtg gttgcggcgg aacagctcca ccggcacccc gcggttcttg
     6301 ccggccatct tgcccacgac gacatcggcg ccgttggccg tgccgtagtc gtacatccgc
     6361 tccagggcct cgtcgcccag gtagtcgtcg ttgtcgacga acatcacgaa ctcgccccgg
     6421 gaggcctcga tgccgacgtt gcggggcttg cccgaccagc cggagttctc ctggtggatg
     6481 accttcatcc gggggtcctc ggctgcgagc gcgtcgagcc gggccggggt ctcgtcggtc
     6541 gagccgtcgt cgacgaagat cacctcgaac tcgtcggggg gcagcgactg ccgctgcagc
     6601 gaggagatgc agtcctcgat gtagatcccc gggttgtaca cggggacgat gacgctgacc
     6661 ttgaccggca t
//