LOCUS AOW26001.1 1065 aa PRT PLN 14-OCT-2016 DEFINITION Candida albicans SC5314 Aro80p protein. ACCESSION CP017623-297 PROTEIN_ID AOW26001.1 SOURCE Candida albicans SC5314 ORGANISM Candida albicans SC5314 Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; Saccharomycetes; Saccharomycetales; Debaryomycetaceae; Candida/Lodderomyces clade; Candida. REFERENCE 1 (bases 1 to 3188341) AUTHORS Jones,T., Federspiel,N.A., Chibana,H., Dungan,J., Kalman,S., Magee,B.B., Newport,G., Thorstenson,Y.R., Agabian,N., Magee,P.T., Davis,R.W. and Scherer,S. TITLE The diploid genome sequence of Candida albicans JOURNAL Proc. Natl. Acad. Sci. U.S.A. 101 (19), 7329-7334 (2004) PUBMED 15123810 REFERENCE 2 (bases 1 to 3188341) AUTHORS van het Hoog,M., Rast,T.J., Martchenko,M., Grindle,S., Dignard,D., Hogues,H., Cuomo,C., Berriman,M., Scherer,S., Magee,B.B., Whiteway,M., Chibana,H., Nantel,A. and Magee,P.T. TITLE Assembly of the Candida albicans genome into sixteen supercontigs aligned on the eight chromosomes JOURNAL Genome Biol. 8 (4), R52 (2007) PUBMED 17419877 REFERENCE 3 (bases 1 to 3188341) AUTHORS Muzzey,D., Schwartz,K., Weissman,J.S. and Sherlock,G. TITLE Assembly of a phased diploid Candida albicans genome facilitates allele-specific measurements and provides a simple model for repeat and indel structure JOURNAL Genome Biol. 14 (9), R97 (2013) PUBMED 24025428 REFERENCE 4 (bases 1 to 3188341) CONSRTM Candida Genome Database TITLE Direct Submission JOURNAL Submitted (30-SEP-2016) Department of Genetics, Candida Genome Database, Stanford University, Mail Stop-5120, Stanford, CA 94305, USA REMARK Sequence and annotation update by submitter REFERENCE 5 (bases 1 to 3188341) AUTHORS Muzzey,D., Schwartz,K., Weissman,J.S. and Sherlock,G. TITLE Direct Submission JOURNAL Submitted (04-OCT-2016) Department of Genetics, Candida Genome Database, Stanford University, Mail Stop-5120, Stanford, CA 94305, USA COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Bowtie v. 1.0 Assembly Name :: C_albicans_SC5314_A22 Long Assembly Name :: Candida albicans SC5314 Assembly 22 Genome Coverage :: 700x Sequencing Technology :: Illumina GAIIx ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: CGD Annotation Status :: Full annotation Annotation Version :: A22-s07-m01-r01 URL :: http://www.candidagenome.org/ ##Genome-Annotation-Data-END## FEATURES Qualifiers source /organism="Candida albicans SC5314" /mol_type="genomic DNA" /strain="SC5314" /host="Homo sapiens" /culture_collection="ATCC:MYA-2876" /db_xref="taxon:237561" /chromosome="1" /haplotype="A" /country="USA: New York" /collected_by="Margarita Silva-Hutner" protein /gene="ARO80" /locus_tag="CAALFM_C103200CA" /note="Zn(II)2Cys6 transcription factor; transcriptional activator of aromatic amino acid catabolism; regulator of aromatic alcohol biosynthesis via the Ehrlich pathway; mutant is viable" /transl_table=12 /db_xref="CGD:CAL0000193732" BEGIN 1 MSIVEPEVAE STGGDAPQTS VKSPEPSYQQ SSQAQGKYKR NYRACLNCRL RKVKCDLGPV 61 DNPHDGKCAR CLRERKDCVF VESKRGGTTN VVNGKRKRQR ASSRAMSRSE TQSPDSNNVS 121 STTSTIASQN DNTNVPLINM SINNIMQKLP GVESILPKDN TKNPPHSAFS NPSTSSPSSS 181 LSSMPMGQPP QLPPISNKPS IQNQSLSANS KSLSNEFATM ESALVFLANA AGEIAKADER 241 DNIDAQSKYD QIEASLSGAN SHRVSIDESM NHQQQQEQSH PHYQQKSQQN HHHHKPPQQE 301 LHHSQPLNIP NNAPEGFTNN HRIQTEQSNI PPYIKPTTSR RMFVPPAESG NAVRPKGSNK 361 LSSIDYIGPA PRGILTEDEA KRLINLFFAT MHPYFPHIPK FLHSPKVLSN YPILLCAILT 421 ISSRYHPFET DMANQTNGNG AVPRHIEVHD RLWLYVQRLI SQTVWAEAST RSIGTVFAFL 481 LFTEWNPRAI HWRWSDYANK AEEINDTDQQ QQQPQPHQQQ TQTNSNTNSS TALSSAAAAA 541 ATASTAATTA SVFGQPNDEN NSGLAGLGAM RRSHRMAWML IGSAVRLAQD MGFMEISSKA 601 FLATHIAEIN AVMNMSRRSM LANSLSEVDL DEDEITVEDM EQAEEKDDDS KIMQMNEEEL 661 KKISTHHVLK FTKSQKATIE LLQIMSLGHE SLYGYKAQLG QLTHRQTLSV LNILSPLINN 721 WGRKYKEFLV PSSTTTNNKL VKLAPNLPQH WLDPESKICR EIADTIERET FIIEFNYVKL 781 YIYSLALNQS PKSMLEKGTK IKLDELSKSA KYIEQAFHAA NETLNAAHRI HRFKMLRFMP 841 VRLLTRFIRA AAFIVRCHLT MTAQENISLS VSKITIDEII KSTHRAAMTL RECSPDELHL 901 LSRYSTILMV LYSEMKSKRN RKEEVDDENN ESAVVTETVM QDNSSGNASE PKQVPGGSTV 961 NASGTHFLSA SSGASNNGIP GMGQSTATQQ SQPSHTPVGL TPGTYNNSHN NSGPAAINTG 1021 QMEMNENSSF PSVPPPVEDF DFDFNIDELL GDGFQNIVDL WSFLN //