LOCUS AEE83565.1 2335 aa PRT PLN 23-MAR-2023 DEFINITION Arabidopsis thaliana SET domain protein 2 protein. ACCESSION CP002687-2429 PROTEIN_ID AEE83565.1 SOURCE Arabidopsis thaliana (thale cress) ORGANISM Arabidopsis thaliana Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis. REFERENCE 1 (bases 1 to 18585056) AUTHORS Mayer,K., Schuller,C., Wambutt,R., Murphy,G., Volckaert,G., Pohl,T., Dusterhoft,A., Stiekema,W., Entian,K.D., Terryn,N., Harris,B., Ansorge,W., Brandt,P., Grivell,L., Rieger,M., Weichselgartner,M., de Simone,V., Obermaier,B., Mache,R., Muller,M., Kreis,M., Delseny,M., Puigdomenech,P., Watson,M., Schmidtheini,T., Reichert,B., Portatelle,D., Perez-Alonso,M., Boutry,M., Bancroft,I., Vos,P., Hoheisel,J., Zimmermann,W., Wedler,H., Ridley,P., Langham,S.A., McCullagh,B., Bilham,L., Robben,J., Van der Schueren,J., Grymonprez,B., Chuang,Y.J., Vandenbussche,F., Braeken,M., Weltjens,I., Voet,M., Bastiaens,I., Aert,R., Defoor,E., Weitzenegger,T., Bothe,G., Ramsperger,U., Hilbert,H., Braun,M., Holzer,E., Brandt,A., Peters,S., van Staveren,M., Dirske,W., Mooijman,P., Klein Lankhorst,R., Rose,M., Hauf,J., Kotter,P., Berneiser,S., Hempel,S., Feldpausch,M., Lamberth,S., Van den Daele,H., De Keyser,A., Buysshaert,C., Gielen,J., Villarroel,R., De Clercq,R., Van Montagu,M., Rogers,J., Cronin,A., Quail,M., Bray-Allen,S., Clark,L., Doggett,J., Hall,S., Kay,M., Lennard,N., McLay,K., Mayes,R., Pettett,A., Rajandream,M.A., Lyne,M., Benes,V., Rechmann,S., Borkova,D., Blocker,H., Scharfe,M., Grimm,M., Lohnert,T.H., Dose,S., de Haan,M., Maarse,A., Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M., Fartmann,B., Granderath,K., Dauner,D., Herzl,A., Neumann,S., Argiriou,A., Vitale,D., Liguori,R., Piravandi,E., Massenet,O., Quigley,F., Clabauld,G., Mundlein,A., Felber,R., Schnabl,S., Hiller,R., Schmidt,W., Lecharny,A., Aubourg,S., Chefdor,F., Cooke,R., Berger,C., Montfort,A., Casacuberta,E., Gibbons,T., Weber,N., Vandenbol,M., Bargues,M., Terol,J., Torres,A., Perez-Perez,A., Purnelle,B., Bent,E., Johnson,S., Tacon,D., Jesse,T., Heijnen,L., Schwarz,S., Scholler,P., Heber,S., Francs,P., Bielke,C., Frishman,D., Haase,D., Lemcke,K., Mewes,H.W., Stocker,S., Zaccaria,P., Bevan,M., Wilson,R.K., de la Bastide,M., Habermann,K., Parnell,L., Dedhia,N., Gnoj,L., Schutz,K., Huang,E., Spiegel,L., Sehkon,M., Murray,J., Sheet,P., Cordes,M., Abu-Threideh,J., Stoneking,T., Kalicki,J., Graves,T., Harmon,G., Edwards,J., Latreille,P., Courtney,L., Cloud,J., Abbott,A., Scott,K., Johnson,D., Minx,P., Bentley,D., Fulton,B., Miller,N., Greco,T., Kemp,K., Kramer,J., Fulton,L., Mardis,E., Dante,M., Pepin,K., Hillier,L., Nelson,J., Spieth,J., Ryan,E., Andrews,S., Geisel,C., Layman,D., Du,H., Ali,J., Berghoff,A., Jones,K., Drone,K., Cotton,M., Joshu,C., Antonoiu,B., Zidanic,M., Strong,C., Sun,H., Lamar,B., Yordan,C., Ma,P., Zhong,J., Preston,R., Vil,D., Shekher,M., Matero,A., Shah,R., Swaby,I.K., O'Shaughnessy,A., Rodriguez,M., Hoffmann,J., Till,S., Granat,S., Shohdy,N., Hasegawa,A., Hameed,A., Lodhi,M., Johnson,A., Chen,E., Marra,M., Martienssen,R. and McCombie,W.R. TITLE Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana JOURNAL Nature 402 (6763), 769-777 (1999) PUBMED 10617198 REFERENCE 2 (bases 1 to 18585056) AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E. CONSRTM TAIR TITLE Direct Submission JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie Institution, 260 Panama Street, Stanford, CA, USA REFERENCE 3 (bases 1 to 18585056) AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M., Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R., Vaughn,M. and Town,C.D. TITLE Direct Submission JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute, 9704 Medical Center Dr, Rockville, MD 20850, USA REMARK Protein update by submitter FEATURES Qualifiers source /organism="Arabidopsis thaliana" /mol_type="genomic DNA" /db_xref="taxon:3702" /chromosome="4" /ecotype="Columbia" protein /gene="SDG2" /locus_tag="AT4G15180" /gene_synonym="ATXR3" /gene_synonym="DL3635W" /gene_synonym="FCAALL.214" /gene_synonym="SET domain protein 2" /inference="Similar to RNA sequence, EST:INSD:AV546152.1,INSD:ES102813.1,INSD:ES019843.1, INSD:ES110605.1,INSD:CB518275.1,INSD:BP790849.1, INSD:ES094218.1,INSD:EL046033.1,INSD:BX838418.1, INSD:EG474442.1,INSD:EH974021.1,INSD:EL216094.1, INSD:EH889558.1,INSD:BP779656.1,INSD:EH935591.1, INSD:EL995417.1,INSD:ES187425.1,INSD:ES063619.1, INSD:EG474431.1" /inference="Similar to RNA sequence, mRNA:INSD:AK226663.1,INSD:AK226725.1" /note="SET domain protein 2 (SDG2); EXPRESSED IN: male gametophyte, cultured cell; EXPRESSED DURING: M germinated pollen stage; CONTAINS InterPro DOMAIN/s: SET domain (InterPro:IPR001214); BEST Arabidopsis thaliana protein match is: histone methyltransferases(H3-K4 specific);histone methyltransferases(H3-K36 specific) (TAIR:AT1G77300.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink)." /db_xref="TAIR:AT4G15180" /db_xref="Araport:AT4G15180" intron_pos 672:0 (1/19) intron_pos 752:1 (2/19) intron_pos 771:1 (3/19) intron_pos 1132:1 (4/19) intron_pos 1256:0 (5/19) intron_pos 1434:0 (6/19) intron_pos 1467:2 (7/19) intron_pos 1781:0 (8/19) intron_pos 1805:0 (9/19) intron_pos 1842:0 (10/19) intron_pos 1875:2 (11/19) intron_pos 1909:0 (12/19) intron_pos 1941:0 (13/19) intron_pos 1997:0 (14/19) intron_pos 2039:0 (15/19) intron_pos 2057:0 (16/19) intron_pos 2139:2 (17/19) intron_pos 2176:0 (18/19) intron_pos 2277:0 (19/19) BEGIN 1 MSDGGVACMP LLNIMEKLPI VEKTTLCGGN ESKTAATTEN GHTSIATKVP ESQPANKPSA 61 SSQPVKKKRI VKVIRKVVKR RPKQPQKQAD EQLKDQPPSQ VVQLPAESQL QIKEQDKKSE 121 FKGGTSGVKE VENGGDSGFK DEVEEGELGT LKLHEDLENG EISPVKSLQK SEIEKGEIVG 181 ESWKKDEPTK GEFSHLKYHK GYVERRDFSA DKNWKGGKEE REFRSWRDPS DEIEKGEFIP 241 DRWQKMDTGK DDHSYIRSRR NGVDREKTWK YEYEYERTPP GGRFVNEDIY HQREFRSGLD 301 RTTRISSKIV IEENLHKNEY NNSSNFVKEY SSTGNRLKRH GAEPDSIERK HSYADYGDYG 361 SSKCRKLSDD CSRSLHSDHY SQHSAERLYR DSYPSKNSSL EKYPRKHQDA SFPAKAFSDK 421 HGHSPSRSDW SPHDRSRYHE NRDRSPYARE RSPYIFEKSS HARKRSPRDR RHHDYRRSPS 481 YSEWSPHDRS RPSDRRDYIP NFMEDTQSDR NRRNGHREIS RKSGVRERRD CQTGTELEIK 541 HKYKESNGKE STSSSKELQG KNILYNNSLL VEKNSVCDSS KIPVPCATGK EPVQVGEAPT 601 EELPSMEVDM DICDTPPHEP MASDSSLGKW FYLDYYGTEH GPARLSDLKA LMEQGILFSD 661 HMIKHSDNNR WLVNPPEAPG NLLEDIADTT EAVCIEQGAG DSLPELVSVR TLPDGKEIFV 721 ENREDFQIDM RVENLLDGRT ITPGREFETL GEALKVNVEF EETRRCVTSE GVVGMFRPMK 781 RAIEEFKSDD AYGSESDEIG SWFSGRWSCK GGDWIRQDEA SQDRYYKKKI VLNDGFPLCL 841 MQKSGHEDPR WHHKDDLYYP LSSSRLELPL WAFSVVDERN QTRGVKASLL SVVRLNSLVV 901 NDQVPPIPDP RAKVRSKERC PSRPARPSPA SSDSKRESVE SHSQSTASTG QDSQGLWKTD 961 TSVNTPRDRL CTVDDLQLHI GDWFYTDGAG QEQGPLSFSE LQKLVEKGFI KSHSSVFRKS 1021 DKIWVPVTSI TKSPETIAML RGKTPALPSA CQGLVVSETQ DFKYSEMDTS LNSFHGVHPQ 1081 FLGYFRGKLH QLVMKTFKSR DFSAAINDVV DSWIHARQPK KESEKYMYQS SELNSCYTKR 1141 ARLMAGESGE DSEMEDTQMF QKDELTFEDL CGDLTFNIEG NRSAGTVGIY WGLLDGHALA 1201 RVFHMLRYDV KSLAFASMTC RHWKATINSY KDISRQVDLS SLGPSCTDSR LRSIMNTYNK 1261 EKIDSIILVG CTNVTASMLE EILRLHPRIS SVDITGCSQF GDLTVNYKNV SWLRCQNTRS 1321 GELHSRIRSL KQTTDVAKSK GLGGDTDDFG NLKDYFDRVE KRDSANQLFR RSLYKRSKLY 1381 DARRSSAILS RDARIRRWAI KKSEHGYKRV EEFLASSLRG IMKQNTFDFF ALKVSQIEEK 1441 MKNGYYVSHG LRSVKEDISR MCREAIKDEL MKSWQDGSGL SSATKYNKKL SKTVAEKKYM 1501 SRTSDTFGVN GASDYGEYAS DREIKRRLSK LNRKSFSSES DTSSELSDNG KSDNYSSASA 1561 SESESDIRSE GRSQDLRIEK YFTADDSFDS VTEEREWGAR MTKASLVPPV TRKYEVIEKY 1621 AIVADEEEVQ RKMRVSLPED YGEKLNAQRN GIEELDMELP EVKEYKPRKL LGDEVLEQEV 1681 YGIDPYTHNL LLDSMPGELD WSLQDKHSFI EDVVLRTLNR QVRLFTGSGS TPMVFPLRPV 1741 IEELKESARE ECDIRTMKMC QGVLKEIESR SDDKYVSYRK GLGVVCNKEG GFGEEDFVVE 1801 FLGEVYPVWK WFEKQDGIRS LQENKTDPAP EFYNIYLERP KGDADGYDLV VVDAMHMANY 1861 ASRICHSCRP NCEAKVTAVD GHYQIGIYSV RAIEYGEEIT FDYNSVTESK EEYEASVCLC 1921 GSQVCRGSYL NLTGEGAFQK VLKDWHGLLE RHRLMLEACV LNSVSEEDYL ELGRAGLGSC 1981 LLGGLPDWMI AYSARLVRFI NFERTKLPEE ILKHNLEEKR KYFSDIHLDV EKSDAEVQAE 2041 GVYNQRLQNL AVTLDKVRYV MRHVFGDPKN APPPLERLTP EETVSFVWNG DGSLVDELLQ 2101 SLSPHLEEGP LNELRSKIHG HDPSGSADVL KELQRSLLWL RDEIRDLPCT YKCRNDAAAD 2161 LIHIYAYTKC FFKVREYQSF ISSPVHISPL DLGAKYADKL GESIKEYRKT YGENYCLGQL 2221 IYWYNQTNTD PDLTLVKATR GCLSLPDVAS FYAKAQKPSK HRVYGPKTVK TMVSQMSKQP 2281 QRPWPKDKIW TFKSTPRVFG SPMFDAVLNN SSSLDRELLQ WLRNRRHVFQ ATWDS //