LOCUS YSCCORA 8241 bp DNA linear PLN 06-APR-1996 DEFINITION Yeast COR gene cluster encoding iso-1-cytochrome c (CYC1), dispensible protein UTR3, osmotic growth protein (OSM1), Gly-tRNA, and UV-induced damage repair protein (RAD7), complete cds. ACCESSION M37696 J01319 M12920 M27552 VERSION M37696.1 KEYWORDS COR gene cluster; CYC1 gene; OSM1 gene; RAD7 gene; UTR1 gene; UTR3 gene; UV-induced damage repair protein; dispensible protein; iso-1-cytochrome c; osmotic growth protein; transfer RNA; transfer RNA-Gly. SOURCE Saccharomyces cerevisiae (brewer's yeast) ORGANISM Saccharomyces cerevisiae Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; Saccharomycetes; Saccharomycetales; Saccharomycetaceae; Saccharomyces. REFERENCE 1 (bases 249 to 352) AUTHORS Montgomery,D.L., Hall,B.D., Gillam,S. and Smith,M. TITLE Identification and isolation of the yeast cytochrome c gene JOURNAL Cell 14 (3), 673-680 (1978) PUBMED 210956 REFERENCE 2 (bases 1 to 857) AUTHORS Smith,M., Leung,D.W., Gillam,S., Astell,C.R., Montgomery,D.L. and Hall,B.D. TITLE Sequence of the gene for iso-1-cytochrome c in Saccharomyces cerevisiae JOURNAL Cell 16 (4), 753-761 (1979) PUBMED 222467 REFERENCE 3 (bases 190 to 299) AUTHORS Stiles,J.I., Szostak,J.W., Young,A.T., Wu,R., Consaul,S. and Sherman,F. TITLE DNA sequence of a mutation in the leader region of the yeast iso-1-cytochrome c mRNA JOURNAL Cell 25 (1), 277-284 (1981) PUBMED 6268305 REFERENCE 4 (bases 188 to 280; 576 to 753) AUTHORS Boss,J.M., Gillam,S., Zitomer,R.S. and Smith,M. TITLE Sequence of the yeast iso-1-cytochrome c mRNA JOURNAL J. Biol. Chem. 256 (24), 12958-12961 (1981) PUBMED 6273415 REFERENCE 5 (bases 1 to 260) AUTHORS Faye,G., Leung,D.W., Tatchell,K., Hall,B.D. and Smith,M. TITLE Deletion mapping of sequences essential for in vivo transcription of the iso-1-cytochrome c gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 78 (4), 2258-2262 (1981) PUBMED 6264471 REFERENCE 6 (bases 576 to 765) AUTHORS Zaret,K.S. and Sherman,F. TITLE DNA sequence required for efficient transcription termination in yeast JOURNAL Cell 28 (3), 563-573 (1982) PUBMED 6280875 REFERENCE 7 (sites) AUTHORS Ernst,J.F., Hampsey,D.M. and Sherman,F. TITLE DNA sequences of frameshift and other mutations induced by ICR-170 in yeast JOURNAL Genetics 111 (2), 233-241 (1985) PUBMED 2414151 REFERENCE 8 (bases 249 to 578) AUTHORS Hampsey,D.M., Das,G. and Sherman,F. TITLE Amino acid replacements in yeast iso-1-cytochrome c. Comparison with the phylogenetic series and the tertiary structure of related cytochromes c JOURNAL J. Biol. Chem. 261 (7), 3259-3271 (1986) PUBMED 3005287 REFERENCE 9 (bases 771 to 8241) AUTHORS Melnick,L. and Sherman,F. TITLE Nucleotide sequence of the COR region: a cluster of six genes in the yeast Saccharomyces cerevisiae JOURNAL Gene 87 (2), 157-166 (1990) PUBMED 2158927 COMMENT Original source text: Saccharomyces cerevisiae (clone: pAB40) DNA. Dr. Graeme A. Reid (ebmo21@castle.edinburgh.ac.uk) writes us about the possibility of a sequence error, a deletion of a c at position 4617-4618 (cc->ccc), which would extend the coding region to position 5225 instead of 4683. He made the following observations: This sequence is homologous to Shewanella putrefaciens flavocytochrome c (L04283) and other fumarate reductases. The present GenBank entry indicates that OSM1 encodes a protein of 301 residues. Extremely close sequence similarity between OSM1 protein and flavocytochrome c beyond the end point was found in another reading frame. It is possible that a sequencing error accounts for this observation, but it has not been checked experimentally as yet. The putative signal sequence of the OSM1 protein does not fit the consensus at all well and there is no evidence to suggest that this is a membrane protein or enters the secretory pathway. The proposed correction would restore the sequebnce QxHPTG that is well conserved throughout the Frd family. with the histidine in the sequence having been identified at the active site in E.coli frdA (Schryder et al., 1991). Wild-type yeast cells have two cytochrome c proteins, iso-1 and iso-2, encoded by the two distinct genetic loci cyc1 and cyc7, respectively. The iso-1 cytochrome c protein is the predominant cellular cytochrome c, normally representing about 95% of the material of that kind. The cyc1-512 mutant has a 38 bp deletion in the 3' nontranslated region which prevents proper transcription termination. Authors suggest that polyadenylation may be coupled to transcription termination in yeast. They also report a consensus sequence from many yeast genes. FEATURES Location/Qualifiers source 1..8241 /organism="Saccharomyces cerevisiae" /mol_type="genomic DNA" /db_xref="taxon:4932" /clone="pAB40" gene 188..752 /gene="CYC1" mRNA 188..752 /gene="CYC1" /citation=[4] gene 249..578 /gene="COX5b" /db_xref="SGD:L0000448" CDS 249..578 /gene="COX5b" /codon_start=1 /product="iso-1-cytochrome c" /protein_id="AAB59344.1" /db_xref="SGD:L0000448" /translation="MTEFKAGSAKKGATLFKTRCLQCHTVEKGGPHKVGPNLHGIFGR HSGQAEGYSYTDANIKKNVLWDENNMSEYLTNPKKYIPGTKMAFGGLKKEKDRNDLIT YLKKACE" misc_difference 597 /gene="CYC1" /note="conflict" /citation=[4] /replace="" misc_difference 641..642 /gene="CYC1" /note="conflict" /citation=[4] /replace="" variation 708..745 /gene="CYC1" /note="cyc1 deletion in mutant strain cyc1-512 [6]" gene 2909..3316 /gene="UTR3" /db_xref="SGD:L0002730" CDS 2909..3316 /gene="UTR3" /codon_start=1 /product="dispensable protein" /protein_id="AAB59345.1" /db_xref="SGD:L0002730" /translation="MEKKTKRKRLEDSHVLMNSGKLINGKRYFGRALELPEVKEWLKQ SQRQNDGGSINTKCIPKDRNDFYYHGKVTAALTEFEANWTSILKAHYNVPVNEDEEEM SRQTQEIHVPTLADMEHWLVQRRKKKLMDELNL" gene 3778..4683 /gene="OSM1" /db_xref="SGD:L0001317" CDS 3778..4683 /gene="OSM1" /note="precursor" /codon_start=1 /product="osmotic growth protein" /protein_id="AAB59346.1" /db_xref="SGD:L0001317" /translation="MIRSVRRVFIYVSIFVLIIVLKRTLSGTDQTSMKQPVVVIGSGL AGLTTSNRLISKYRIPVVLLDKAASIGGNSIKASSGINGAHTDTQQNLKVMDTPELFL KDTFDSAKGRGVPSLMDKLTKESKSAIRWLQTEFDLKLDLLAQLGGHSVPRTHRSSGK LPPGFEIVQARLSKKLKDISSKDSNLVQIMLEVVDIELDNQGHVTGVVYMDENGNRKI MKSHHVVFCSGGFGYSKEMLKEYSPNLIHLPTTNGKQTTGDGQKILSKLGAELIDMDQ VQVHLPASLIQMTVKITGSFWLQRH" sig_peptide 3778..3840 /gene="OSM1" mat_peptide 3841..4680 /gene="OSM1" /product="osmotic growth protein" gene 5743..5811 /gene="tRNA-Gly" tRNA 5743..5811 /gene="tRNA-Gly" /product="tRNA-Gly" gene 5976..7673 /gene="RAD7" /db_xref="SGD:L0001561" CDS 5976..7673 /gene="RAD7" /note="UV-induced" /codon_start=1 /product="damage-repair protein" /protein_id="AAB59347.1" /db_xref="SGD:L0001561" /translation="MYRSRNRPKRGGENEVKGPNSALTQFLREEGISAENIKQKWYQR QSKKQEDATDEKKGKAEDDSFTAEISRVVEDEEIDEIGTGSGTETERAQVSYDARMKL VPADSDEEEYETSHISDTPVSLSSANDRESLTKKRQNTAKIIQNRRRKRKRAADLLDR RVNKVSSLQSLCITKISENISKWQKEADESSKLVFNKLRDVLGGVSTANLNNLAKALS KNRALNDHTLQLFLKTDLKRLTFSDCSKISFDGYKTLAIFSPHLTELSLQMCGQLNHE SFVYIAEKLPNLKSLNLDGPFLINEDTWEKFFVIMKGRLEEFHISNTHRFTDKSLSNL LINCGSTLVSLGLSRLDSISNYALLPQYLVNDEFHSLCIEYPFNEEDVNDEIIINLLG QIGRTLRKLVLNGCIDLTDSMIINGLTAFIPEKCPLEVLSLEESDQITTDSLSYFFSK VELNNLIECSFRRCLQLGDMAIIELLLNGARDSLRSLNLNSLKELTKEAFVALRPPNL TYLDLGFVRCVDDSVIQMLGEQNPNLTVIDVFGDNLVTEKATMRPGLTLIGRQSDSI" BASE COUNT 2624 a 1530 c 1645 g 2441 t ORIGIN 1 ctcgagcaga tccgccaggc gtgtatatag cgtggatggc caggcaactt tagtgctgac 61 acatacaggc atatatatat gtgtgcgacg acacatgatc atatggcatg catgtgctct 121 gtatgtatat aaaactcttg ttttcttctt ttctctaaat attctttcct tatacattag 181 gtcctttgta gcataaatta ctatacttct atagacacgc aaacacaaat acacacacta 241 aattaataat gactgaattc aaggccggtt ctgctaagaa aggtgctaca cttttcaaga 301 ctagatgtct acaatgccac accgtggaaa agggtggccc acataaggtt ggtccaaact 361 tgcatggtat ctttggcaga cactctggtc aagctgaagg gtattcgtac acagatgcca 421 atatcaagaa aaacgtgttg tgggacgaaa ataacatgtc agagtacttg actaacccaa 481 agaaatatat tcctggtacc aagatggcct ttggtgggtt gaagaaggaa aaagacagaa 541 acgacttaat tacctacttg aaaaaagcct gtgagtaaac aggccccttt tcctttgtcg 601 atatcatgta attagttatg tcacgcttac attcacgccc tccccccaca tccgctctaa 661 ccgaaaagga aggagttaga caacctgaag tctaggtccc tatttatttt tttatagtta 721 tgttagtatt aagaacgtta tttatatttc aaatttttct tttttttctg tacagacgcg 781 tgtacgcatg taacattata ctgaaaacct tgcttgagaa ggttttggga cgctcgaagg 841 ctttaatttg caagcttcgc agttttcact ctcatcgtcg ctctcatcat cgcttccgtt 901 gttgttttcc ttaatagcgt ctgcttccag agagtattta tctcttatta cctctaaagg 961 ttctgcttga tttctgactt tgttcgcctc atgtgcatat ttttcttggt tctttgggac 1021 aaaatatgcg taaaggactt ttgttgttcc ctcacattcc cagtttagtt gtcgactgat 1081 actgttaata aactcatcgg gcgaggcttc cacggttgga aaagcatatg ggctggcgca 1141 tatggttata aaatcacctt tttgcaattc aattctatgt ttcccatcaa aagccgccca 1201 tgctggagcc cttgacttca tcgagacttt cacttttaaa tttatacttt ctggtaagat 1261 gatgggtctg aaactcaatg catgtggaca aatgggtgtt aaagcgattg cattgacggt 1321 tgggcatacc aatgacccac ctgcactcaa agaataggcc gtggacccag tcggagtagc 1381 agcaatcagt ccgtccgcct gcgcaacggt cattaatgag ccgtcaccat acaattctaa 1441 catggataga aaaggacttg gaccacgatc gatggtcact tcgttcaaaa tgtggtgtgt 1501 gcttagtttt tccaccacac atattttctt ccccgtgttt gggtctactt cagggcggtg 1561 tctacgataa attgtgcact ccaacctcaa ccgtaaattt gtcttgattt tatgattcat 1621 aatccgaggt aaatcctccc tgaaatgttc aaacttaaaa tttgttaaaa atcctagaga 1681 ccctaatgaa aacgacataa cgggtggtac atgtctctga aaaatggaac ttacaaaaag 1741 aacagtaccg tcgccaccca aagtcactac caaatcgaag aaaaacatca tgttccctga 1801 tgaaatcctt tgtccaatac ttgatccttg attctctaca tttactatct tcacataact 1861 cgccagcgcg aattttttgc tgtttttcaa ttcggaatcc acataaacag tcacacgtgg 1921 aaaatgtacc aaaacccatt ctaccaactc tcttgttaag aaatacagtg agacatcgtt 1981 gagtttcgta acaatcatca aattttccac atccagttcc actttggtat tagatatatc 2041 tttacttaac attcttacac catacgccgt ggaggcatac tgaaagtgtg tattactact 2101 cttagaactt cctcttgttc cgttaatgct cgtaccaccg ccactgtttg cattgccgtt 2161 cacgagagat gaatctttat tcaacaggga gcttctgcgc gagctgcttt cacttgatat 2221 tttcctcaac atttctttgg cgttgtcgat atcctgagtt ctatcaattt gctcattgtt 2281 catcatggcc ttcttcatca agttattatt gttgttatga tgatcatttc gaccatcttc 2341 ctcatttacc catttatcta cgccattatt catgtcattc tccttcatct caagagcttt 2401 tactctactt atttagggag cctattgtat ttttttttgg cttacgatca cgatgtcaat 2461 gatcatttgc cctattatac ctttttctcc gttctctttg cgtagtgcgg tgaaataatt 2521 tcaggatgca tgaaagagca aataaaagtg tcacaccgtt atatcgcaag gcgcaccaca 2581 tcagtaatca gatacctgtg cattcaaaat gagtagaaat gtagataagg ccaactcagt 2641 tctggtacga tttcaagagc agcaagcaga gtcggcaggt ggatacaagg attattcacg 2701 ttaccagagg cccaggagcg tgtctaaggt aaaatccata aaagaggcca acgaatggaa 2761 gcgacaagta agtaaagaga taaaacaaaa aagcacaaga atatatgatc cgtctttaaa 2821 tgaaatgcag attgcggaac ttaacgacga acttaataat cttttcaaac aatggaagag 2881 atggcagtgg cacattgacc atacacttat ggaaaaaaaa accaagagga aaaggttaga 2941 agacagtcat gtgctgatga attcaggaaa gctgataaat ggtaagagat attttggaag 3001 agccctggaa ttgcctgaag taaaagaatg gctcaagcag tctcaaaggc agaatgatgg 3061 gggttctata aacaccaaat gcataccgaa ggatagaaat gatttttatt accatggcaa 3121 agtcacagcc gccttaaccg aatttgaggc taactggaca tccattttga aagcacatta 3181 taatgtgcct gtgaatgaag atgaagaaga aatgtcaagg cagacacaag aaatccatgt 3241 accaactctg gcagatatgg agcattggtt agtgcaaaga agaaaaaaga aactaatgga 3301 tgaacttaac ctttagctat ggaaaattat tcaaacattt agaagaggca ctatttcgaa 3361 tgaccatcac aagcattata atcttcaaac atgtccgagc tgcagatatg aagtaaaata 3421 gtgatcggga ctcatttttt tttttcttct atgatatcat attgtcccgt tcgttattaa 3481 atgtaattct tccaccgtac tagttatcga aggcaagatc tcctttttac gcttaggtag 3541 attacataga tttcataatg tacaccaata caaatgtatc ttatacactc gtcaaatctt 3601 atatgaatca atgcaggagt agctctcgtt ttctttttac gagctacttg aaaaaaaaaa 3661 aactagcgaa aatcctttgc ttagaaatcc agatttaaaa ccaaaaccgc cgaaaaggga 3721 aagattttat gataggaaat accttaataa ttctatatca tcccgagtct taggaaaatg 3781 attagatctg tgagaagggt tttcatttac gtctcaatat tcgtattgat aatagttttg 3841 aaaagaacat taagtggcac agatcaaacg tcaatgaaac aaccagtggt ggtcattggc 3901 tctggtttgg caggcttaac cacaagtaat cgtctcatta gtaaatacag aattcctgtt 3961 gtgcttttgg ataaggcggc ttctattggt gggaattcta taaaggcttc tagtggtatt 4021 aatggtgctc acacagacac tcaacaaaat ttaaaggtaa tggacactcc cgaattgttt 4081 ttgaaagata ctttcgattc ggctaaaggc agaggggttc catcactgat ggataagttg 4141 actaaggaat ccaagagtgc tatcaggtgg ttgcaaacag aattcgattt gaaattagac 4201 ctccttgcgc aattgggcgg tcactctgtt ccaaggaccc atagatcttc tggcaaatta 4261 ccaccaggtt ttgaaatcgt gcaagcgcgt ttatcaaaaa aactaaagga tatctcttcc 4321 aaagattcca atctcgtgca gattatgctt gaagtagtgg atatcgagct tgataatcaa 4381 ggtcatgtta ctggtgtagt atatatggac gagaacggaa accgtaaaat catgaagtca 4441 caccatgtcg tgttttgctc aggtggattt ggttactcta aggaaatgtt gaaagagtac 4501 tcaccaaatt tgattcactt gccaactact aatggcaaac agactacagg tgatggtcaa 4561 aaaatccttt caaagttggg tgccgaattg attgatatgg atcaagtgca ggtacaccta 4621 ccggcttcat tgatccaaat gaccgtgaaa ataactggaa gtttttggct gcagaggcat 4681 tgaggggttt aggcggcatc ttattgcatc ccaccactgg aagaaggttt acaaatgaat 4741 tgagcaccag agatacagta accatggaaa tacagtctaa atgtccgaaa aatgataata 4801 gagcactttt ggtaatgagc gacaaagtct acgagaacta tacgaataac ataaactttt 4861 atatgtccaa aaacttaatc aaaaaagtgt caatcaacga tctgatccga caatatgacc 4921 tacaaactac agcttctgaa ctggtaactg aactgaagag ctattccgat gttaatacta 4981 aggatacgtt tgataggcca ttgattatca atgcctttga taaagatatt tcgactgaat 5041 caactgttta tgttggggaa gttacaccag ttgttcattt cacaatgagt ggtgtgaaaa 5101 ttaatgagaa atctcaggta attaagaaaa attcggaaac gcttctatct aatgggatat 5161 ttgctgctgg tgaagtttcg ggtggtgttc atggagccaa cagattgggt ggatctagtt 5221 ttgtttagag tgtgttgtct ttggaaagac agctgcggat aacatagcaa aattgtactg 5281 agaatttata gggaaataaa agatattttt gcaacgtact ttacatttca ataattaatg 5341 cgacatgata ggatatactg tttgctattt ctttccgtaa cgatatacga ttttttcttt 5401 tctttggagt tccaaatgaa gaatatatct tgttgggaaa agctatagta ttaggtttac 5461 agaatatggt agaggtttct ccccgacgat ataggaatct acaagagaga gcatgattct 5521 acacgataat actgcaattt cttctttgcg ttgaagtgtc accattgcct atccttttac 5581 attatcaatg tcaatgttcc ctcttctaac aaactgaatg acttttctac acgattcata 5641 ttatcttctt gcgcagtaaa ttatgttaac gtaatagtag ggtctcttgt gcttcattat 5701 tttgtaactg tacacataat tgacacgttt aacaattagt gagcgcaatg gtttagtggt 5761 aaaatccaac gttgccatcg ttgggccccc ggttcgattc cgggcttgcg cagtttcttt 5821 tctttttttc tggaaaataa gaaaacttat ttatggaagc aaaaatggaa taaaggattg 5881 ggacgaagtt agtgaaaaaa aactgaaata gtcctagagt aactccgaag tgtctttgta 5941 taagctatta tctagaaatc acgagagggg aagaaatgta tcgcagtaga aaccgaccaa 6001 aaagaggtgg agaaaatgaa gttaagggac caaattctgc cttgactcaa tttttaagag 6061 aagaagggat cagtgctgaa aatatcaaac aaaaatggta ccagcgacag tcgaagaagc 6121 aagaagatgc aacagacgaa aaaaaaggta aagcggagga tgatagcttt actgccgaga 6181 tatctcgagt agttgaagat gaagaaattg atgaaattgg aacaggtagt ggtaccgaga 6241 cagaaagagc tcaggtttcc tacgatgcca ggatgaaatt agtccctgct gatagtgatg 6301 aagaagaata tgaaactagc cacatttctg acacgccagt cagtttaagt tcggctaatg 6361 accgggaatc attgactaaa aaaaggcaaa atactgcaaa aattatccaa aatcgtcgca 6421 gaaagcgtaa aagagcagct gacctattgg atagacgcgt caacaaagta tccagcttac 6481 aaagtctttg tattacgaaa attagtgaaa atatatccaa gtggcaaaaa gaggctgatg 6541 aatcatcaaa gttggtattt aacaaattga gagatgtcct tggtggcgta tcaaccgcta 6601 atttgaataa tttggcaaaa gcactatcga agaatagggc cctgaatgat catactttgc 6661 aacttttctt gaagacagat ctaaaaaggt taactttcag cgattgttct aaaatttcat 6721 ttgatggtta caaaacgcta gccatttttt cgccacacct aaccgaatta tccctacaaa 6781 tgtgtgggca gttgaaccat gaatcattcg tttacattgc tgaaaagcta ccgaacttga 6841 aatcgctgaa tttagatgga ccatttctga tcaacgagga cacatgggag aagttctttg 6901 taataatgaa aggtagatta gaagagttcc acatttctaa tacgcaccgc ttcaccgaca 6961 aatcattatc taatttattg atcaactgcg gttctacctt ggtatcctta gggttatcca 7021 gactagattc tatatcaaat tacgctttat taccgcagta cctagtcaac gacgaatttc 7081 acagtctctg tattgaatat ccattcaatg aagaggatgt taacgacgag atcatcataa 7141 atctgctagg tcaaatcggg cgcacgttac gtaaattggt tttgaatggc tgtattgact 7201 tgacagattc aatgataatc aatggtctga ctgcattcat tcctgagaaa tgtccattgg 7261 aggtattgag cttggaagaa tcagatcaga tcactacaga ttcactgtcg tactttttca 7321 gcaaagtaga actgaataat ttgattgaat gcagctttag aagatgtcta caattgggcg 7381 atatggcaat tatagagcta ttgcttaacg gagcaagaga tagtctgaga agcttaaatc 7441 tcaattcatt aaaagagcta actaaggagg catttgtggc gttacgtcct cctaatctga 7501 cgtatcttga tcttggtttt gtacgttgtg ttgatgactc ggtgattcaa atgttgggtg 7561 agcaaaatcc gaatttaact gtaattgatg ttttcggaga caatttggtt actgaaaagg 7621 ccacaatgag gcctggactt acgttgatag ggagacagag tgacagtata taataaaata 7681 aagacatata aaacgtgtat tagatagaaa acggaaaatt gactaatgtt ttattcctat 7741 tatacccaga tcgtcaaatg ttgcctttat tttctattga agggatttta tggcatctcg 7801 cgaataataa acaaacaata atagtaaaaa aaaaagtgta aacttatgta ttcagcgcag 7861 ataaaaggaa aaaaacgaga agtttgacgt aaagaaaaaa gttcgaaaac ttttgtagca 7921 gttgaaagtt ttgtatgcta tgtcaattag gcctctcacg ttaaacggtt tagatgagcc 7981 agaaacctct tttgaagaac tgaatacaac tctacctcgc tttcaatccc atgaaacatt 8041 aactttggna gaaaacgtgc caccattgag tacatcaact tatataccgc ctccatcctc 8101 ggtaggtgac ttctgacact ggcgacagta ttttccaaca gtaccacgcg ctttttggtg 8161 ctaataatgc aagcagaacg atgatcagga catggaggta gaccacggat gatgagtttc 8221 taaatgactt ccacggaatt c //