LOCUS       Z84817                  4004 bp    DNA     linear   BCT 18-APR-2005
DEFINITION  Sphingomonas sp. cmpC, cmpF, cmpE and cmpX genes.
ACCESSION   Z84817
VERSION     Z84817.1
KEYWORDS    2-hydroxymuconic semialdehyde dehydrogenase; 2-hydroxymuconic
            semialdehyde hydrolase; cmpC gene; cmpE gene; cmpF gene; cmpX gene.
SOURCE      Sphingomonas sp.
  ORGANISM  Sphingomonas sp.
            Bacteria; Proteobacteria; Alphaproteobacteria; Sphingomonadales;
            Sphingomonadaceae; Sphingomonas.
REFERENCE   1  (bases 1 to 4004)
  AUTHORS   Yrjala K., Paulin L., Romantschuk M.
  TITLE     Novel organization of catechol meta-pathway genes in Sphingomonas
            sp. HV3 pSKY4 plasmid
  JOURNAL   FEMS Microbiol. Lett. 154(2), 403-408(1997).
   PUBMED   9311141
REFERENCE   2  (bases 1033 to 1956)
  AUTHORS   Yrjala K., Paulin L., Kilpi S., Romantschuk M.
  TITLE     Cloning of cmpE, a plasmid-borne catechol 2,3-dioxygenase-encoding
            gene from the aromatic- and chloroaromatic-degrading Pseudomonas
            sp. HV3
  JOURNAL   Gene 138(1-2), 119-121(1994).
   PUBMED   8125288
REFERENCE   3
  AUTHORS   Kilpi S., Backstrom V., Korhola M.
  TITLE     Degradation of 2-methyl-4-chlorophenoxy acetic acid (MCPA),
            2,4-dichlorophenoxyacetic acid (2,4-D), benzoic acid and salicylic
            acid by Pseudomonas sp. HV3
  JOURNAL   FEMS Microbiol. Lett. 8, 177-182(1980).
  REMARK    (sites)
REFERENCE   4  (bases 1 to 4004)
  AUTHORS   Paulin L.G.
  JOURNAL   Submitted (04-FEB-1997) to the INSDC. L.G. Paulin, Institute of
            Biotechnology, DNA Synthesis and Sequencing Lab., Viikinkaari 9,
            P.O.Box 56, FIN-00014 Helsinki, FINLAND
FEATURES             Location/Qualifiers
     source          1..4004
                     /organism="Sphingomonas sp."
                     /strain="HV3"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:28214"
     CDS             164..1042
                     /transl_table=11
                     /gene="cmpF"
                     /product="2-hydroxymuconic semialdehyde hydrolase"
                     /db_xref="GOA:O33766"
                     /db_xref="InterPro:IPR000073"
                     /db_xref="InterPro:IPR029058"
                     /db_xref="UniProtKB/TrEMBL:O33766"
                     /citation=[1]
                     /experiment="experimental evidence, no additional details
                     recorded"
                     /protein_id="CAB06612.1"
                     /translation="MASATLDTSRPEIANSFVIGGSSTNYHDVGEGDPVLLVHGSGPG
                     VTAWANWRLNIPVLAQDFRVIAPDMFGFGYSDSKGRIEDKQVWVDQLASFLDGLGIDK
                     ISMVGNSFGGGITLAFMIAHPDRVERAVLMGPAGLNFPITPALDKVWGYVPSVEAMRE
                     SLKYLAWDHSRLTEDLIQSRYVASARPEAHEPYHATFGGADRQANVAMLASREEDIAA
                     IAHETLILHGIADQVIPLDSTVRLATLMQRADLHLFAECGHWVQIERMASFNRMVAEF
                     FKHGLKADRRGDSSWL"
     CDS             1033..1956
                     /transl_table=11
                     /gene="cmpE"
                     /product="catechol 2,3-dioxygenase"
                     /note="already in database under accession number L10655"
                     /db_xref="GOA:Q798L3"
                     /db_xref="InterPro:IPR000486"
                     /db_xref="InterPro:IPR004360"
                     /db_xref="InterPro:IPR017624"
                     /db_xref="InterPro:IPR029068"
                     /db_xref="InterPro:IPR037523"
                     /db_xref="UniProtKB/TrEMBL:Q798L3"
                     /citation=[2]
                     /citation=[1]
                     /experiment="experimental evidence, no additional details
                     recorded"
                     /protein_id="CAB06613.1"
                     /translation="MALTGVIRPGYVQLRVLDLDEAIIHYRDRIGLNFVNREGDRAFF
                     QAFDEFDRHSIILREADQAGMDVMGFKVAKDADLDHFTERLLDIGVHVDVIPAGEDPG
                     VGRKIRFNTPTQHVFELYAEMALSATGPAVKNPDVWVVEPRGMRATRFDHCALNGVDI
                     ASSAKIFVDALDFSVAEELVDETSGARLGIFLSCSNKAHDVAFLGYPEDGKIHHTSFN
                     LESWHDVGHAADIISRYDISLDIGPTRHGITRGQTIYFFDPSGNRNETFSGGYIYYPD
                     NPQRLWQAENAGKAIFYYEKALNDRFMTVNT"
     CDS             1959..2390
                     /transl_table=11
                     /gene="cmpX"
                     /product="Unknown"
                     /db_xref="InterPro:IPR005624"
                     /db_xref="InterPro:IPR038084"
                     /db_xref="UniProtKB/TrEMBL:O34515"
                     /citation=[1]
                     /protein_id="CAB06614.1"
                     /translation="MDGVVSVRTVGHELALKAAAAAVAMGAAAGCPVVAAVVGAGGDL
                     VAFVRASGSPSPSAKIAQDKAYSAASFRVPTPDLYAMVSGNPALRDGIVAQPGIAMFG
                     GGLPIEIAGEFVGAIGISGGSEAMDVEYANAGLAAIGARQF"
     CDS             2431..3915
                     /transl_table=11
                     /gene="cmpC"
                     /product="2-hydroxymuconic semialdehyde dehydrogenase"
                     /db_xref="GOA:O33767"
                     /db_xref="InterPro:IPR015590"
                     /db_xref="InterPro:IPR016160"
                     /db_xref="InterPro:IPR016161"
                     /db_xref="InterPro:IPR016162"
                     /db_xref="InterPro:IPR016163"
                     /db_xref="InterPro:IPR017628"
                     /db_xref="InterPro:IPR029510"
                     /db_xref="UniProtKB/TrEMBL:O33767"
                     /citation=[1]
                     /experiment="experimental evidence, no additional details
                     recorded"
                     /protein_id="CAB06615.1"
                     /translation="MTSVSSSPITDTILNFIDGSYREGSEGKSFSNVNPATGAEIGVV
                     HEASQADVADAVAAAKAALTGPWGKMTTAERVKLITAVATEIERRADDFLAAEVADTG
                     KPRHVASHIDIPRGAANFRMFADVVSTMPGESFNTPTPDGGQAFNYTVRKPKGVVAVV
                     CPWNFPLLLMTWKVGPALACGNTVVVKPSEETPRTAALLGEVMNAVGMPKGVYNVVHG
                     FGPGSAGEFLTSNPDVDAITFTGETGTGQAIMQKAATGVRDISFELGGKNPAIVFADA
                     DLDKAVEGLSRSVFLNTGQVCLGTERVYVERPIFDAFVARMAAAAQDFKPGVTGDRAY
                     LGPLISAEHREKVLGYYRRAVEDGATVVTGGGVPEISGAEAGGFFVEPTLWIDVAHGD
                     TVMREEIFGPCCGILPFDSEDEVIALANDTVYGLCASVWTENLSRGHRVAAAMEVGVC
                     WVNSWFLRDLRTAFGGSGHSGIGREGGVHSLEFYTEITNICVKL"
BASE COUNT      751 a   1208 c   1256 g    789 t
ORIGIN
        1 gcatgcgcga ggcggcgatt tctgccgcct cgccgcattc tttatttgcc aatgcgtttg
       61 aaatagtgca atgaaagcta ctgtaaggtg tcagttcgcc gactcggtcc cgacaccatg
      121 tggatacaaa atcaacgaag cgggaaagga ctggttggct atcatggcaa gcgcgacatt
      181 ggacacgagc cgaccggaaa tcgccaattc gttcgtcatc ggcggaagct cgaccaatta
      241 tcatgacgtc ggcgagggcg atccggtcct gttggttcat ggttccgggc ctggcgtgac
      301 tgcatgggcc aactggcgcc tgaacattcc ggttcttgct caggatttcc gggtcatcgc
      361 gcccgacatg ttcggtttcg gctattcgga cagcaagggc cggatcgaag acaagcaggt
      421 ctgggtcgat cagctcgcca gttttctcga cggattgggc atcgacaaga tttcgatggt
      481 cggcaactcg ttcggtggcg ggatcacgct ggcattcatg attgcccacc ccgatcgggt
      541 ggaacgcgcc gttctgatgg ggcctgcagg gctcaacttt ccgattactc ctgcgctcga
      601 caaggtttgg ggctacgtgc cgtcagttga ggccatgcgc gaatcgctca agtaccttgc
      661 ctgggatcac agccgcctga cggaggatct gatccagtcg cgctatgtcg ccagcgcccg
      721 gcccgaagca catgaaccct atcatgcaac gttcggcggc gccgaccgcc aggccaacgt
      781 ggcgatgctc gccagccgcg aggaggacat cgccgcaatc gcgcacgaga ccttgatcct
      841 gcacgggatt gccgatcagg tgatcccgct cgattctacc gtacgcttgg ccacgctgat
      901 gcagcgggcc gatctgcacc tgttcgccga atgcggacac tgggtgcaga tcgagcggat
      961 ggcaagtttc aaccggatgg ttgccgagtt ctttaagcat ggcctcaagg ctgatcggag
     1021 aggagattct tcatggcttt gactggtgta attcgtcctg gctatgtcca gctcagggtt
     1081 ctggacttgg acgaggccat tatccactac cgcgaccgga ttggtctaaa cttcgtcaat
     1141 cgcgaggggg atcgggcctt tttccaggcg ttcgacgaat tcgatcgtca cagtatcatc
     1201 cttcgcgagg ccgatcaggc gggcatggat gtgatgggct tcaaggtcgc caaggacgcg
     1261 gacttggacc attttaccga gcgcttgctc gatatcggtg tccatgtcga cgtgatcccg
     1321 gcgggggaag atcccggtgt aggccgcaag attcggttta acacgccgac acagcacgtc
     1381 ttcgaacttt acgccgagat ggcgctgtcg gccaccggtc cggccgtcaa gaaccccgat
     1441 gtctgggtcg tggagccacg tggcatgcgt gccacccgct ttgatcactg tgcgctcaac
     1501 ggcgtggata tagccagttc ggccaagatt tttgtcgatg cgcttgattt ctcagtcgcc
     1561 gaggaactgg tcgatgaaac cagcggcgcc cggctcggca tctttcttag ctgcagcaac
     1621 aaagcacacg atgtcgcctt cttaggctat cccgaagacg gtaagatcca ccatacctcg
     1681 ttcaacctgg aatcctggca cgatgttggc catgccgccg acatcatcag ccgctacgat
     1741 atttcgctgg atatcgggcc gacccgtcat gggatcaccc gcgggcagac gatctacttc
     1801 ttcgatccct cgggcaaccg caacgaaacc ttcagcggcg gttacattta ttatccggac
     1861 aatccgcagc gcctgtggca ggcagagaac gccggcaagg ccatcttcta ctacgaaaag
     1921 gcgctcaacg accgcttcat gacggtgaac acctgatcat ggacggggtg gtttcggtcc
     1981 ggacagtagg ccacgaattg gcgctcaagg cagcggctgc cgccgttgcc atgggtgcgg
     2041 cggcgggctg tccggtcgtt gccgcggtgg tcggcgcggg gggagacctg gtcgcgttcg
     2101 tgcgcgccag cggctcccct tcgccgtctg caaagatcgc gcaggacaag gcttacagcg
     2161 ccgccagctt ccgcgtcccg acgccggatc tctatgcgat ggtttcgggc aatccggcct
     2221 tgcgcgacgg gatcgtcgcg cagccgggaa tagcgatgtt cggcggcggc ctaccgatcg
     2281 aaattgccgg tgaattcgtc ggtgcgattg gtatttccgg cgggagcgaa gcgatggacg
     2341 tcgagtacgc caacgccggc ctcgctgcga tcggcgcaag gcaattctga cgccccatgg
     2401 gttctggcaa aacgtaggta atatgcacca atgacttccg tctcctcctc tccaatcacg
     2461 gatactatcc tgaacttcat cgacgggtcc tatcgcgaag gcagcgaggg caagtcgttt
     2521 tccaacgtca atccggccac cggggccgag atcggggttg tgcacgaagc aagccaggca
     2581 gacgtcgccg acgccgtggc tgcggccaag gccgcgctta ccgggccgtg gggcaagatg
     2641 accacggccg aacgggtcaa gctgatcacc gccgtggcga ccgagatcga acgccgagcg
     2701 gatgatttcc tggctgccga agtggccgac accggcaagc cgcgtcatgt tgcgtcgcat
     2761 atcgatattc cgcgcggagc cgctaacttc cgcatgttcg ccgatgtcgt ctcgacgatg
     2821 ccgggcgaaa gcttcaacac gccaaccccc gatggcggcc aggcgttcaa ctataccgtg
     2881 cgcaagccca agggtgtggt cgccgtcgtc tgcccgtgga acttcccgct gctgctgatg
     2941 acctggaagg ttggcccggc gcttgcctgc ggcaataccg tggtggtcaa gccgtccgag
     3001 gaaacgcctc gtaccgctgc cctactgggc gaagtgatga acgcggtggg catgcccaag
     3061 ggtgtctaca acgtcgtcca tggattcggt ccgggttcgg ccggcgaatt cctcacgtcc
     3121 aaccccgatg tcgatgccat caccttcacc ggcgaaaccg gcaccggaca ggccatcatg
     3181 cagaaggccg cgaccggcgt tcgcgacatc tcgttcgaac tcggtggcaa gaacccggcg
     3241 atcgtgttcg ccgatgccga cctcgacaaa gcggtcgagg gtctgtcgcg ctcggtcttc
     3301 ctgaacaccg ggcaggtctg cctcggaacc gagcgggtct atgtcgaacg gccgatcttc
     3361 gatgccttcg tggcgcggat ggcggcggcg gcgcaggact tcaagccggg cgtgaccggt
     3421 gatcgcgcct atctcggccc tctgatcagc gccgagcacc gcgagaaagt gctgggctac
     3481 tatcgccgtg cggtcgagga cggggccacc gtggtcaccg gcggcggcgt tcctgaaatc
     3541 tcgggcgcgg aagccggcgg cttcttcgtg gaaccgacgt tgtggatcga cgtcgcccac
     3601 ggcgacaccg tgatgcgcga ggaaatcttc gggccgtgct gcggcatctt accgttcgac
     3661 agcgaggacg aggtgatcgc gctggcaaac gatacggtat acggcctgtg cgcctcagtc
     3721 tggaccgaaa acctgtcccg cggacaccgc gtggcggcgg cgatggaggt gggggtgtgc
     3781 tgggtcaatt cctggttcct gcgcgatctg cgcacggctt tcggcgggtc cggccattcc
     3841 ggcataggcc gggaaggcgg ggtgcacagc ctcgaattct acaccgagat caccaacatc
     3901 tgcgtaaagc tttaagacga tgactatcga ccctaaaacc atcgaacaag ccggctctgg
     3961 cgctgcgcgg cgccgccgaa agcggcacac cggtcagtcc gatc
//