LOCUS AEE83565.1 2335 aa PRT PLN 23-MAR-2023
DEFINITION Arabidopsis thaliana SET domain protein 2 protein.
ACCESSION CP002687-2429
PROTEIN_ID AEE83565.1
SOURCE Arabidopsis thaliana (thale cress)
ORGANISM Arabidopsis thaliana
Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
Camelineae; Arabidopsis.
REFERENCE 1 (bases 1 to 18585056)
AUTHORS Mayer,K., Schuller,C., Wambutt,R., Murphy,G., Volckaert,G.,
Pohl,T., Dusterhoft,A., Stiekema,W., Entian,K.D., Terryn,N.,
Harris,B., Ansorge,W., Brandt,P., Grivell,L., Rieger,M.,
Weichselgartner,M., de Simone,V., Obermaier,B., Mache,R.,
Muller,M., Kreis,M., Delseny,M., Puigdomenech,P., Watson,M.,
Schmidtheini,T., Reichert,B., Portatelle,D., Perez-Alonso,M.,
Boutry,M., Bancroft,I., Vos,P., Hoheisel,J., Zimmermann,W.,
Wedler,H., Ridley,P., Langham,S.A., McCullagh,B., Bilham,L.,
Robben,J., Van der Schueren,J., Grymonprez,B., Chuang,Y.J.,
Vandenbussche,F., Braeken,M., Weltjens,I., Voet,M., Bastiaens,I.,
Aert,R., Defoor,E., Weitzenegger,T., Bothe,G., Ramsperger,U.,
Hilbert,H., Braun,M., Holzer,E., Brandt,A., Peters,S., van
Staveren,M., Dirske,W., Mooijman,P., Klein Lankhorst,R., Rose,M.,
Hauf,J., Kotter,P., Berneiser,S., Hempel,S., Feldpausch,M.,
Lamberth,S., Van den Daele,H., De Keyser,A., Buysshaert,C.,
Gielen,J., Villarroel,R., De Clercq,R., Van Montagu,M., Rogers,J.,
Cronin,A., Quail,M., Bray-Allen,S., Clark,L., Doggett,J., Hall,S.,
Kay,M., Lennard,N., McLay,K., Mayes,R., Pettett,A.,
Rajandream,M.A., Lyne,M., Benes,V., Rechmann,S., Borkova,D.,
Blocker,H., Scharfe,M., Grimm,M., Lohnert,T.H., Dose,S., de
Haan,M., Maarse,A., Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M.,
Fartmann,B., Granderath,K., Dauner,D., Herzl,A., Neumann,S.,
Argiriou,A., Vitale,D., Liguori,R., Piravandi,E., Massenet,O.,
Quigley,F., Clabauld,G., Mundlein,A., Felber,R., Schnabl,S.,
Hiller,R., Schmidt,W., Lecharny,A., Aubourg,S., Chefdor,F.,
Cooke,R., Berger,C., Montfort,A., Casacuberta,E., Gibbons,T.,
Weber,N., Vandenbol,M., Bargues,M., Terol,J., Torres,A.,
Perez-Perez,A., Purnelle,B., Bent,E., Johnson,S., Tacon,D.,
Jesse,T., Heijnen,L., Schwarz,S., Scholler,P., Heber,S., Francs,P.,
Bielke,C., Frishman,D., Haase,D., Lemcke,K., Mewes,H.W.,
Stocker,S., Zaccaria,P., Bevan,M., Wilson,R.K., de la Bastide,M.,
Habermann,K., Parnell,L., Dedhia,N., Gnoj,L., Schutz,K., Huang,E.,
Spiegel,L., Sehkon,M., Murray,J., Sheet,P., Cordes,M.,
Abu-Threideh,J., Stoneking,T., Kalicki,J., Graves,T., Harmon,G.,
Edwards,J., Latreille,P., Courtney,L., Cloud,J., Abbott,A.,
Scott,K., Johnson,D., Minx,P., Bentley,D., Fulton,B., Miller,N.,
Greco,T., Kemp,K., Kramer,J., Fulton,L., Mardis,E., Dante,M.,
Pepin,K., Hillier,L., Nelson,J., Spieth,J., Ryan,E., Andrews,S.,
Geisel,C., Layman,D., Du,H., Ali,J., Berghoff,A., Jones,K.,
Drone,K., Cotton,M., Joshu,C., Antonoiu,B., Zidanic,M., Strong,C.,
Sun,H., Lamar,B., Yordan,C., Ma,P., Zhong,J., Preston,R., Vil,D.,
Shekher,M., Matero,A., Shah,R., Swaby,I.K., O'Shaughnessy,A.,
Rodriguez,M., Hoffmann,J., Till,S., Granat,S., Shohdy,N.,
Hasegawa,A., Hameed,A., Lodhi,M., Johnson,A., Chen,E., Marra,M.,
Martienssen,R. and McCombie,W.R.
TITLE Sequence and analysis of chromosome 4 of the plant Arabidopsis
thaliana
JOURNAL Nature 402 (6763), 769-777 (1999)
PUBMED 10617198
REFERENCE 2 (bases 1 to 18585056)
AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
CONSRTM TAIR
TITLE Direct Submission
JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE 3 (bases 1 to 18585056)
AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
Vaughn,M. and Town,C.D.
TITLE Direct Submission
JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
9704 Medical Center Dr, Rockville, MD 20850, USA
REMARK Protein update by submitter
FEATURES Qualifiers
source /organism="Arabidopsis thaliana"
/mol_type="genomic DNA"
/db_xref="taxon:3702"
/chromosome="4"
/ecotype="Columbia"
protein /gene="SDG2"
/locus_tag="AT4G15180"
/gene_synonym="ATXR3"
/gene_synonym="DL3635W"
/gene_synonym="FCAALL.214"
/gene_synonym="SET domain protein 2"
/inference="Similar to RNA sequence,
EST:INSD:AV546152.1,INSD:ES102813.1,INSD:ES019843.1,
INSD:ES110605.1,INSD:CB518275.1,INSD:BP790849.1,
INSD:ES094218.1,INSD:EL046033.1,INSD:BX838418.1,
INSD:EG474442.1,INSD:EH974021.1,INSD:EL216094.1,
INSD:EH889558.1,INSD:BP779656.1,INSD:EH935591.1,
INSD:EL995417.1,INSD:ES187425.1,INSD:ES063619.1,
INSD:EG474431.1"
/inference="Similar to RNA sequence,
mRNA:INSD:AK226663.1,INSD:AK226725.1"
/note="SET domain protein 2 (SDG2); EXPRESSED IN: male
gametophyte, cultured cell; EXPRESSED DURING: M germinated
pollen stage; CONTAINS InterPro DOMAIN/s: SET domain
(InterPro:IPR001214); BEST Arabidopsis thaliana protein
match is: histone methyltransferases(H3-K4
specific);histone methyltransferases(H3-K36 specific)
(TAIR:AT1G77300.1); Has 30201 Blast hits to 17322 proteins
in 780 species: Archae - 12; Bacteria - 1396; Metazoa -
17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other
Eukaryotes - 2996 (source: NCBI BLink)."
/db_xref="TAIR:AT4G15180"
/db_xref="Araport:AT4G15180"
intron_pos 672:0 (1/19)
intron_pos 752:1 (2/19)
intron_pos 771:1 (3/19)
intron_pos 1132:1 (4/19)
intron_pos 1256:0 (5/19)
intron_pos 1434:0 (6/19)
intron_pos 1467:2 (7/19)
intron_pos 1781:0 (8/19)
intron_pos 1805:0 (9/19)
intron_pos 1842:0 (10/19)
intron_pos 1875:2 (11/19)
intron_pos 1909:0 (12/19)
intron_pos 1941:0 (13/19)
intron_pos 1997:0 (14/19)
intron_pos 2039:0 (15/19)
intron_pos 2057:0 (16/19)
intron_pos 2139:2 (17/19)
intron_pos 2176:0 (18/19)
intron_pos 2277:0 (19/19)
BEGIN
1 MSDGGVACMP LLNIMEKLPI VEKTTLCGGN ESKTAATTEN GHTSIATKVP ESQPANKPSA
61 SSQPVKKKRI VKVIRKVVKR RPKQPQKQAD EQLKDQPPSQ VVQLPAESQL QIKEQDKKSE
121 FKGGTSGVKE VENGGDSGFK DEVEEGELGT LKLHEDLENG EISPVKSLQK SEIEKGEIVG
181 ESWKKDEPTK GEFSHLKYHK GYVERRDFSA DKNWKGGKEE REFRSWRDPS DEIEKGEFIP
241 DRWQKMDTGK DDHSYIRSRR NGVDREKTWK YEYEYERTPP GGRFVNEDIY HQREFRSGLD
301 RTTRISSKIV IEENLHKNEY NNSSNFVKEY SSTGNRLKRH GAEPDSIERK HSYADYGDYG
361 SSKCRKLSDD CSRSLHSDHY SQHSAERLYR DSYPSKNSSL EKYPRKHQDA SFPAKAFSDK
421 HGHSPSRSDW SPHDRSRYHE NRDRSPYARE RSPYIFEKSS HARKRSPRDR RHHDYRRSPS
481 YSEWSPHDRS RPSDRRDYIP NFMEDTQSDR NRRNGHREIS RKSGVRERRD CQTGTELEIK
541 HKYKESNGKE STSSSKELQG KNILYNNSLL VEKNSVCDSS KIPVPCATGK EPVQVGEAPT
601 EELPSMEVDM DICDTPPHEP MASDSSLGKW FYLDYYGTEH GPARLSDLKA LMEQGILFSD
661 HMIKHSDNNR WLVNPPEAPG NLLEDIADTT EAVCIEQGAG DSLPELVSVR TLPDGKEIFV
721 ENREDFQIDM RVENLLDGRT ITPGREFETL GEALKVNVEF EETRRCVTSE GVVGMFRPMK
781 RAIEEFKSDD AYGSESDEIG SWFSGRWSCK GGDWIRQDEA SQDRYYKKKI VLNDGFPLCL
841 MQKSGHEDPR WHHKDDLYYP LSSSRLELPL WAFSVVDERN QTRGVKASLL SVVRLNSLVV
901 NDQVPPIPDP RAKVRSKERC PSRPARPSPA SSDSKRESVE SHSQSTASTG QDSQGLWKTD
961 TSVNTPRDRL CTVDDLQLHI GDWFYTDGAG QEQGPLSFSE LQKLVEKGFI KSHSSVFRKS
1021 DKIWVPVTSI TKSPETIAML RGKTPALPSA CQGLVVSETQ DFKYSEMDTS LNSFHGVHPQ
1081 FLGYFRGKLH QLVMKTFKSR DFSAAINDVV DSWIHARQPK KESEKYMYQS SELNSCYTKR
1141 ARLMAGESGE DSEMEDTQMF QKDELTFEDL CGDLTFNIEG NRSAGTVGIY WGLLDGHALA
1201 RVFHMLRYDV KSLAFASMTC RHWKATINSY KDISRQVDLS SLGPSCTDSR LRSIMNTYNK
1261 EKIDSIILVG CTNVTASMLE EILRLHPRIS SVDITGCSQF GDLTVNYKNV SWLRCQNTRS
1321 GELHSRIRSL KQTTDVAKSK GLGGDTDDFG NLKDYFDRVE KRDSANQLFR RSLYKRSKLY
1381 DARRSSAILS RDARIRRWAI KKSEHGYKRV EEFLASSLRG IMKQNTFDFF ALKVSQIEEK
1441 MKNGYYVSHG LRSVKEDISR MCREAIKDEL MKSWQDGSGL SSATKYNKKL SKTVAEKKYM
1501 SRTSDTFGVN GASDYGEYAS DREIKRRLSK LNRKSFSSES DTSSELSDNG KSDNYSSASA
1561 SESESDIRSE GRSQDLRIEK YFTADDSFDS VTEEREWGAR MTKASLVPPV TRKYEVIEKY
1621 AIVADEEEVQ RKMRVSLPED YGEKLNAQRN GIEELDMELP EVKEYKPRKL LGDEVLEQEV
1681 YGIDPYTHNL LLDSMPGELD WSLQDKHSFI EDVVLRTLNR QVRLFTGSGS TPMVFPLRPV
1741 IEELKESARE ECDIRTMKMC QGVLKEIESR SDDKYVSYRK GLGVVCNKEG GFGEEDFVVE
1801 FLGEVYPVWK WFEKQDGIRS LQENKTDPAP EFYNIYLERP KGDADGYDLV VVDAMHMANY
1861 ASRICHSCRP NCEAKVTAVD GHYQIGIYSV RAIEYGEEIT FDYNSVTESK EEYEASVCLC
1921 GSQVCRGSYL NLTGEGAFQK VLKDWHGLLE RHRLMLEACV LNSVSEEDYL ELGRAGLGSC
1981 LLGGLPDWMI AYSARLVRFI NFERTKLPEE ILKHNLEEKR KYFSDIHLDV EKSDAEVQAE
2041 GVYNQRLQNL AVTLDKVRYV MRHVFGDPKN APPPLERLTP EETVSFVWNG DGSLVDELLQ
2101 SLSPHLEEGP LNELRSKIHG HDPSGSADVL KELQRSLLWL RDEIRDLPCT YKCRNDAAAD
2161 LIHIYAYTKC FFKVREYQSF ISSPVHISPL DLGAKYADKL GESIKEYRKT YGENYCLGQL
2221 IYWYNQTNTD PDLTLVKATR GCLSLPDVAS FYAKAQKPSK HRVYGPKTVK TMVSQMSKQP
2281 QRPWPKDKIW TFKSTPRVFG SPMFDAVLNN SSSLDRELLQ WLRNRRHVFQ ATWDS
//