LOCUS       AEE83565.1              2335 aa    PRT              PLN 23-MAR-2023
DEFINITION  Arabidopsis thaliana SET domain protein 2.
ACCESSION   CP002687-2429
PROTEIN_ID  AEE83565.1
SOURCE      Arabidopsis thaliana (thale cress)
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
            Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
            Camelineae; Arabidopsis.
REFERENCE   1  (bases 1 to 18585056)
  AUTHORS   Mayer,K., Schuller,C., Wambutt,R., Murphy,G., Volckaert,G.,
            Pohl,T., Dusterhoft,A., Stiekema,W., Entian,K.D., Terryn,N.,
            Harris,B., Ansorge,W., Brandt,P., Grivell,L., Rieger,M.,
            Weichselgartner,M., de Simone,V., Obermaier,B., Mache,R.,
            Muller,M., Kreis,M., Delseny,M., Puigdomenech,P., Watson,M.,
            Schmidtheini,T., Reichert,B., Portatelle,D., Perez-Alonso,M.,
            Boutry,M., Bancroft,I., Vos,P., Hoheisel,J., Zimmermann,W.,
            Wedler,H., Ridley,P., Langham,S.A., McCullagh,B., Bilham,L.,
            Robben,J., Van der Schueren,J., Grymonprez,B., Chuang,Y.J.,
            Vandenbussche,F., Braeken,M., Weltjens,I., Voet,M., Bastiaens,I.,
            Aert,R., Defoor,E., Weitzenegger,T., Bothe,G., Ramsperger,U.,
            Hilbert,H., Braun,M., Holzer,E., Brandt,A., Peters,S., van
            Staveren,M., Dirske,W., Mooijman,P., Klein Lankhorst,R., Rose,M.,
            Hauf,J., Kotter,P., Berneiser,S., Hempel,S., Feldpausch,M.,
            Lamberth,S., Van den Daele,H., De Keyser,A., Buysshaert,C.,
            Gielen,J., Villarroel,R., De Clercq,R., Van Montagu,M., Rogers,J.,
            Cronin,A., Quail,M., Bray-Allen,S., Clark,L., Doggett,J., Hall,S.,
            Kay,M., Lennard,N., McLay,K., Mayes,R., Pettett,A.,
            Rajandream,M.A., Lyne,M., Benes,V., Rechmann,S., Borkova,D.,
            Blocker,H., Scharfe,M., Grimm,M., Lohnert,T.H., Dose,S., de
            Haan,M., Maarse,A., Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M.,
            Fartmann,B., Granderath,K., Dauner,D., Herzl,A., Neumann,S.,
            Argiriou,A., Vitale,D., Liguori,R., Piravandi,E., Massenet,O.,
            Quigley,F., Clabauld,G., Mundlein,A., Felber,R., Schnabl,S.,
            Hiller,R., Schmidt,W., Lecharny,A., Aubourg,S., Chefdor,F.,
            Cooke,R., Berger,C., Montfort,A., Casacuberta,E., Gibbons,T.,
            Weber,N., Vandenbol,M., Bargues,M., Terol,J., Torres,A.,
            Perez-Perez,A., Purnelle,B., Bent,E., Johnson,S., Tacon,D.,
            Jesse,T., Heijnen,L., Schwarz,S., Scholler,P., Heber,S., Francs,P.,
            Bielke,C., Frishman,D., Haase,D., Lemcke,K., Mewes,H.W.,
            Stocker,S., Zaccaria,P., Bevan,M., Wilson,R.K., de la Bastide,M.,
            Habermann,K., Parnell,L., Dedhia,N., Gnoj,L., Schutz,K., Huang,E.,
            Spiegel,L., Sehkon,M., Murray,J., Sheet,P., Cordes,M.,
            Abu-Threideh,J., Stoneking,T., Kalicki,J., Graves,T., Harmon,G.,
            Edwards,J., Latreille,P., Courtney,L., Cloud,J., Abbott,A.,
            Scott,K., Johnson,D., Minx,P., Bentley,D., Fulton,B., Miller,N.,
            Greco,T., Kemp,K., Kramer,J., Fulton,L., Mardis,E., Dante,M.,
            Pepin,K., Hillier,L., Nelson,J., Spieth,J., Ryan,E., Andrews,S.,
            Geisel,C., Layman,D., Du,H., Ali,J., Berghoff,A., Jones,K.,
            Drone,K., Cotton,M., Joshu,C., Antonoiu,B., Zidanic,M., Strong,C.,
            Sun,H., Lamar,B., Yordan,C., Ma,P., Zhong,J., Preston,R., Vil,D.,
            Shekher,M., Matero,A., Shah,R., Swaby,I.K., O'Shaughnessy,A.,
            Rodriguez,M., Hoffmann,J., Till,S., Granat,S., Shohdy,N.,
            Hasegawa,A., Hameed,A., Lodhi,M., Johnson,A., Chen,E., Marra,M.,
            Martienssen,R. and McCombie,W.R.
  TITLE     Sequence and analysis of chromosome 4 of the plant Arabidopsis
            thaliana
  JOURNAL   Nature 402 (6763), 769-777 (1999)
   PUBMED   10617198
REFERENCE   2  (bases 1 to 18585056)
  AUTHORS   Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
  CONSRTM   TAIR
  TITLE     Direct Submission
  JOURNAL   Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
            Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE   3  (bases 1 to 18585056)
  AUTHORS   Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
            Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
            Vaughn,M. and Town,C.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
            9704 Medical Center Dr, Rockville, MD 20850, USA
  REMARK    Protein update by submitter
FEATURES             Qualifiers
     source          /organism="Arabidopsis thaliana"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:3702"
                     /chromosome="4"
                     /ecotype="Columbia"
     protein         /gene="SDG2"
                     /locus_tag="AT4G15180"
                     /gene_synonym="ATXR3"
                     /gene_synonym="DL3635W"
                     /gene_synonym="FCAALL.214"
                     /gene_synonym="SET domain protein 2"
                     /inference="Similar to RNA sequence,
                     EST:INSD:AV546152.1,INSD:ES102813.1,INSD:ES019843.1,
                     INSD:ES110605.1,INSD:CB518275.1,INSD:BP790849.1,
                     INSD:ES094218.1,INSD:EL046033.1,INSD:BX838418.1,
                     INSD:EG474442.1,INSD:EH974021.1,INSD:EL216094.1,
                     INSD:EH889558.1,INSD:BP779656.1,INSD:EH935591.1,
                     INSD:EL995417.1,INSD:ES187425.1,INSD:ES063619.1,
                     INSD:EG474431.1"
                     /inference="Similar to RNA sequence,
                     mRNA:INSD:AK226663.1,INSD:AK226725.1"
                     /note="SET domain protein 2 (SDG2); EXPRESSED IN: male
                     gametophyte, cultured cell; EXPRESSED DURING: M germinated
                     pollen stage; CONTAINS InterPro DOMAIN/s: SET domain
                     (InterPro:IPR001214); BEST Arabidopsis thaliana protein
                     match is: histone methyltransferases(H3-K4
                     specific);histone methyltransferases(H3-K36 specific)
                     (TAIR:AT1G77300.1); Has 30201 Blast hits to 17322 proteins
                     in 780 species: Archaea - 12; Bacteria - 1396; Metazoa -
                     17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other
                     Eukaryotes - 2996 (source: NCBI BLink)."
                     /db_xref="TAIR:AT4G15180"
                     /db_xref="Araport:AT4G15180"
     intron_pos      672:0 (1/19)
     intron_pos      752:1 (2/19)
     intron_pos      771:1 (3/19)
     intron_pos      1132:1 (4/19)
     intron_pos      1256:0 (5/19)
     intron_pos      1434:0 (6/19)
     intron_pos      1467:2 (7/19)
     intron_pos      1781:0 (8/19)
     intron_pos      1805:0 (9/19)
     intron_pos      1842:0 (10/19)
     intron_pos      1875:2 (11/19)
     intron_pos      1909:0 (12/19)
     intron_pos      1941:0 (13/19)
     intron_pos      1997:0 (14/19)
     intron_pos      2039:0 (15/19)
     intron_pos      2057:0 (16/19)
     intron_pos      2139:2 (17/19)
     intron_pos      2176:0 (18/19)
     intron_pos      2277:0 (19/19)
BEGIN
        1 MSDGGVACMP LLNIMEKLPI VEKTTLCGGN ESKTAATTEN GHTSIATKVP ESQPANKPSA
       61 SSQPVKKKRI VKVIRKVVKR RPKQPQKQAD EQLKDQPPSQ VVQLPAESQL QIKEQDKKSE
      121 FKGGTSGVKE VENGGDSGFK DEVEEGELGT LKLHEDLENG EISPVKSLQK SEIEKGEIVG
      181 ESWKKDEPTK GEFSHLKYHK GYVERRDFSA DKNWKGGKEE REFRSWRDPS DEIEKGEFIP
      241 DRWQKMDTGK DDHSYIRSRR NGVDREKTWK YEYEYERTPP GGRFVNEDIY HQREFRSGLD
      301 RTTRISSKIV IEENLHKNEY NNSSNFVKEY SSTGNRLKRH GAEPDSIERK HSYADYGDYG
      361 SSKCRKLSDD CSRSLHSDHY SQHSAERLYR DSYPSKNSSL EKYPRKHQDA SFPAKAFSDK
      421 HGHSPSRSDW SPHDRSRYHE NRDRSPYARE RSPYIFEKSS HARKRSPRDR RHHDYRRSPS
      481 YSEWSPHDRS RPSDRRDYIP NFMEDTQSDR NRRNGHREIS RKSGVRERRD CQTGTELEIK
      541 HKYKESNGKE STSSSKELQG KNILYNNSLL VEKNSVCDSS KIPVPCATGK EPVQVGEAPT
      601 EELPSMEVDM DICDTPPHEP MASDSSLGKW FYLDYYGTEH GPARLSDLKA LMEQGILFSD
      661 HMIKHSDNNR WLVNPPEAPG NLLEDIADTT EAVCIEQGAG DSLPELVSVR TLPDGKEIFV
      721 ENREDFQIDM RVENLLDGRT ITPGREFETL GEALKVNVEF EETRRCVTSE GVVGMFRPMK
      781 RAIEEFKSDD AYGSESDEIG SWFSGRWSCK GGDWIRQDEA SQDRYYKKKI VLNDGFPLCL
      841 MQKSGHEDPR WHHKDDLYYP LSSSRLELPL WAFSVVDERN QTRGVKASLL SVVRLNSLVV
      901 NDQVPPIPDP RAKVRSKERC PSRPARPSPA SSDSKRESVE SHSQSTASTG QDSQGLWKTD
      961 TSVNTPRDRL CTVDDLQLHI GDWFYTDGAG QEQGPLSFSE LQKLVEKGFI KSHSSVFRKS
     1021 DKIWVPVTSI TKSPETIAML RGKTPALPSA CQGLVVSETQ DFKYSEMDTS LNSFHGVHPQ
     1081 FLGYFRGKLH QLVMKTFKSR DFSAAINDVV DSWIHARQPK KESEKYMYQS SELNSCYTKR
     1141 ARLMAGESGE DSEMEDTQMF QKDELTFEDL CGDLTFNIEG NRSAGTVGIY WGLLDGHALA
     1201 RVFHMLRYDV KSLAFASMTC RHWKATINSY KDISRQVDLS SLGPSCTDSR LRSIMNTYNK
     1261 EKIDSIILVG CTNVTASMLE EILRLHPRIS SVDITGCSQF GDLTVNYKNV SWLRCQNTRS
     1321 GELHSRIRSL KQTTDVAKSK GLGGDTDDFG NLKDYFDRVE KRDSANQLFR RSLYKRSKLY
     1381 DARRSSAILS RDARIRRWAI KKSEHGYKRV EEFLASSLRG IMKQNTFDFF ALKVSQIEEK
     1441 MKNGYYVSHG LRSVKEDISR MCREAIKDEL MKSWQDGSGL SSATKYNKKL SKTVAEKKYM
     1501 SRTSDTFGVN GASDYGEYAS DREIKRRLSK LNRKSFSSES DTSSELSDNG KSDNYSSASA
     1561 SESESDIRSE GRSQDLRIEK YFTADDSFDS VTEEREWGAR MTKASLVPPV TRKYEVIEKY
     1621 AIVADEEEVQ RKMRVSLPED YGEKLNAQRN GIEELDMELP EVKEYKPRKL LGDEVLEQEV
     1681 YGIDPYTHNL LLDSMPGELD WSLQDKHSFI EDVVLRTLNR QVRLFTGSGS TPMVFPLRPV
     1741 IEELKESARE ECDIRTMKMC QGVLKEIESR SDDKYVSYRK GLGVVCNKEG GFGEEDFVVE
     1801 FLGEVYPVWK WFEKQDGIRS LQENKTDPAP EFYNIYLERP KGDADGYDLV VVDAMHMANY
     1861 ASRICHSCRP NCEAKVTAVD GHYQIGIYSV RAIEYGEEIT FDYNSVTESK EEYEASVCLC
     1921 GSQVCRGSYL NLTGEGAFQK VLKDWHGLLE RHRLMLEACV LNSVSEEDYL ELGRAGLGSC
     1981 LLGGLPDWMI AYSARLVRFI NFERTKLPEE ILKHNLEEKR KYFSDIHLDV EKSDAEVQAE
     2041 GVYNQRLQNL AVTLDKVRYV MRHVFGDPKN APPPLERLTP EETVSFVWNG DGSLVDELLQ
     2101 SLSPHLEEGP LNELRSKIHG HDPSGSADVL KELQRSLLWL RDEIRDLPCT YKCRNDAAAD
     2161 LIHIYAYTKC FFKVREYQSF ISSPVHISPL DLGAKYADKL GESIKEYRKT YGENYCLGQL
     2221 IYWYNQTNTD PDLTLVKATR GCLSLPDVAS FYAKAQKPSK HRVYGPKTVK TMVSQMSKQP
     2281 QRPWPKDKIW TFKSTPRVFG SPMFDAVLNN SSSLDRELLQ WLRNRRHVFQ ATWDS
//