LOCUS AOW25793.1 1040 aa PRT PLN 14-OCT-2016 DEFINITION Candida albicans SC5314 histone methyltransferase protein. ACCESSION CP017623-89 PROTEIN_ID AOW25793.1 SOURCE Candida albicans SC5314 ORGANISM Candida albicans SC5314 Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; Saccharomycetes; Saccharomycetales; Debaryomycetaceae; Candida/Lodderomyces clade; Candida. REFERENCE 1 (bases 1 to 3188341) AUTHORS Jones,T., Federspiel,N.A., Chibana,H., Dungan,J., Kalman,S., Magee,B.B., Newport,G., Thorstenson,Y.R., Agabian,N., Magee,P.T., Davis,R.W. and Scherer,S. TITLE The diploid genome sequence of Candida albicans JOURNAL Proc. Natl. Acad. Sci. U.S.A. 101 (19), 7329-7334 (2004) PUBMED 15123810 REFERENCE 2 (bases 1 to 3188341) AUTHORS van het Hoog,M., Rast,T.J., Martchenko,M., Grindle,S., Dignard,D., Hogues,H., Cuomo,C., Berriman,M., Scherer,S., Magee,B.B., Whiteway,M., Chibana,H., Nantel,A. and Magee,P.T. TITLE Assembly of the Candida albicans genome into sixteen supercontigs aligned on the eight chromosomes JOURNAL Genome Biol. 8 (4), R52 (2007) PUBMED 17419877 REFERENCE 3 (bases 1 to 3188341) AUTHORS Muzzey,D., Schwartz,K., Weissman,J.S. and Sherlock,G. TITLE Assembly of a phased diploid Candida albicans genome facilitates allele-specific measurements and provides a simple model for repeat and indel structure JOURNAL Genome Biol. 14 (9), R97 (2013) PUBMED 24025428 REFERENCE 4 (bases 1 to 3188341) CONSRTM Candida Genome Database TITLE Direct Submission JOURNAL Submitted (30-SEP-2016) Department of Genetics, Candida Genome Database, Stanford University, Mail Stop-5120, Stanford, CA 94305, USA REMARK Sequence and annotation update by submitter REFERENCE 5 (bases 1 to 3188341) AUTHORS Muzzey,D., Schwartz,K., Weissman,J.S. and Sherlock,G. TITLE Direct Submission JOURNAL Submitted (04-OCT-2016) Department of Genetics, Candida Genome Database, Stanford University, Mail Stop-5120, Stanford, CA 94305, USA COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Bowtie v. 1.0 Assembly Name :: C_albicans_SC5314_A22 Long Assembly Name :: Candida albicans SC5314 Assembly 22 Genome Coverage :: 700x Sequencing Technology :: Illumina GAIIx ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: CGD Annotation Status :: Full annotation Annotation Version :: A22-s07-m01-r01 URL :: http://www.candidagenome.org/ ##Genome-Annotation-Data-END## FEATURES Qualifiers source /organism="Candida albicans SC5314" /mol_type="genomic DNA" /strain="SC5314" /host="Homo sapiens" /culture_collection="ATCC:MYA-2876" /db_xref="taxon:237561" /chromosome="1" /haplotype="A" /country="USA: New York" /collected_by="Margarita Silva-Hutner" protein /gene="SET1" /locus_tag="CAALFM_C100960CA" /note="Lysine histone methyltransferase; methylates histone H3 K4; regulates of white-opaque switch, epithelial cell adhesion, agar-embedded filamentation, virulence in mice; unique N-terminus immunogenic in human; rat catheter biofilm repressed" /transl_table=12 /db_xref="CGD:CAL0000198993" BEGIN 1 MSYNNRSGGG ASGGYSRRGY HGSHRGGYRT GRSKYPEDRY LVGGMLSLNK GSHYESSDNR 61 YIPNEIGSKS PENRSHRSST KDGRTPSGLS TPLSSSDKVS TPISIESING SDRNTGVNNK 121 DSEFPKLSHH SDFTSTIPFS RSINPQKNFM VINDSHTPKT DKGIQSKKIR YNGEGVNHVS 181 DPRIAQSNSN LQKPTKKTKK TPYKQLPQPK FVYNSDSLGP APMSTIIIWD LPISTSEPFL 241 RNFVSRYGNP LEEMTFITDP TTAVPLGIVT FKFQGNPQKA SELAKNFIKT VRQDELKIDG 301 ATLKIALNDN ENQLLNRKLE SAKKKMLQQR LQREQEEEKR RQKLVEEQKK QELLKKKEKE 361 HQESVKKEKS VEHESTIVST RDKNLVYKPN STVLSMRHNH KIISSVILPK DLEKYIKSRP 421 YILIRDKYVP TKKISSHDIK RALKKYDWTR VLSDKSGFFI VFNSLNECER CFLNEDNKKF 481 FEYKLVMEMA IPEGFTNNIR ENESKSTNDV LDEATNILIK EFQTFLAKDI RERIIAPNIL 541 DLLAHDKYPE LVEELKSREQ AAKPKVLVTN NQLKENALSI LEKQRQLFQQ RLPSFRMSHD 601 RTQQHKPKRR NSIIPMQHAL NFDDDEDSES HSQSESEDED EDETTASRPL TPVVSTMKRE 661 RSSTITSIED DIELEEREIK KQKVKVPAIE AEIAPESSPE EGEEEEKEEV EIKQEAEEVD 721 IKFQPTEESP RTVYPEIPFS GDFDLNALQH TIKDSEDLLL AQEVLSETTP SGLSNIEYWS 781 WKSKNRKDVQ EISQEEEYIE ELPESLQSTT GSFKSEGVRK IPEIEKIGYL PHRKRTNKPI 841 KTIQYEDEDE EKPNENTNAV QSSRVNRANN RRFAADITAQ IGSESDVLSL NALTKRKKPV 901 TFARSAIHNW GLYAMEPIAA KEMIIEYVGE RIRQQVAEHR EKSYLKTGIG SSYLFRIDDN 961 TVIDATKKGG IARFINHCCS PSCTAKIIKV EGKKRIVIYA LRDIEANEEL TYDYKFERET 1021 NDEERIRCLC GAPGCKGYLN //