LOCUS       AEE82112.1               856 aa    PRT              PLN 23-MAR-2023
DEFINITION  Arabidopsis thaliana SET domain-containing protein.
ACCESSION   CP002687-420
PROTEIN_ID  AEE82112.1
SOURCE      Arabidopsis thaliana (thale cress)
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
            Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
            Camelineae; Arabidopsis.
REFERENCE   1  (bases 1 to 18585056)
  AUTHORS   Mayer,K., Schuller,C., Wambutt,R., Murphy,G., Volckaert,G.,
            Pohl,T., Dusterhoft,A., Stiekema,W., Entian,K.D., Terryn,N.,
            Harris,B., Ansorge,W., Brandt,P., Grivell,L., Rieger,M.,
            Weichselgartner,M., de Simone,V., Obermaier,B., Mache,R.,
            Muller,M., Kreis,M., Delseny,M., Puigdomenech,P., Watson,M.,
            Schmidtheini,T., Reichert,B., Portatelle,D., Perez-Alonso,M.,
            Boutry,M., Bancroft,I., Vos,P., Hoheisel,J., Zimmermann,W.,
            Wedler,H., Ridley,P., Langham,S.A., McCullagh,B., Bilham,L.,
            Robben,J., Van der Schueren,J., Grymonprez,B., Chuang,Y.J.,
            Vandenbussche,F., Braeken,M., Weltjens,I., Voet,M., Bastiaens,I.,
            Aert,R., Defoor,E., Weitzenegger,T., Bothe,G., Ramsperger,U.,
            Hilbert,H., Braun,M., Holzer,E., Brandt,A., Peters,S., van
            Staveren,M., Dirske,W., Mooijman,P., Klein Lankhorst,R., Rose,M.,
            Hauf,J., Kotter,P., Berneiser,S., Hempel,S., Feldpausch,M.,
            Lamberth,S., Van den Daele,H., De Keyser,A., Buysshaert,C.,
            Gielen,J., Villarroel,R., De Clercq,R., Van Montagu,M., Rogers,J.,
            Cronin,A., Quail,M., Bray-Allen,S., Clark,L., Doggett,J., Hall,S.,
            Kay,M., Lennard,N., McLay,K., Mayes,R., Pettett,A.,
            Rajandream,M.A., Lyne,M., Benes,V., Rechmann,S., Borkova,D.,
            Blocker,H., Scharfe,M., Grimm,M., Lohnert,T.H., Dose,S., de
            Haan,M., Maarse,A., Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M.,
            Fartmann,B., Granderath,K., Dauner,D., Herzl,A., Neumann,S.,
            Argiriou,A., Vitale,D., Liguori,R., Piravandi,E., Massenet,O.,
            Quigley,F., Clabauld,G., Mundlein,A., Felber,R., Schnabl,S.,
            Hiller,R., Schmidt,W., Lecharny,A., Aubourg,S., Chefdor,F.,
            Cooke,R., Berger,C., Montfort,A., Casacuberta,E., Gibbons,T.,
            Weber,N., Vandenbol,M., Bargues,M., Terol,J., Torres,A.,
            Perez-Perez,A., Purnelle,B., Bent,E., Johnson,S., Tacon,D.,
            Jesse,T., Heijnen,L., Schwarz,S., Scholler,P., Heber,S., Francs,P.,
            Bielke,C., Frishman,D., Haase,D., Lemcke,K., Mewes,H.W.,
            Stocker,S., Zaccaria,P., Bevan,M., Wilson,R.K., de la Bastide,M.,
            Habermann,K., Parnell,L., Dedhia,N., Gnoj,L., Schutz,K., Huang,E.,
            Spiegel,L., Sehkon,M., Murray,J., Sheet,P., Cordes,M.,
            Abu-Threideh,J., Stoneking,T., Kalicki,J., Graves,T., Harmon,G.,
            Edwards,J., Latreille,P., Courtney,L., Cloud,J., Abbott,A.,
            Scott,K., Johnson,D., Minx,P., Bentley,D., Fulton,B., Miller,N.,
            Greco,T., Kemp,K., Kramer,J., Fulton,L., Mardis,E., Dante,M.,
            Pepin,K., Hillier,L., Nelson,J., Spieth,J., Ryan,E., Andrews,S.,
            Geisel,C., Layman,D., Du,H., Ali,J., Berghoff,A., Jones,K.,
            Drone,K., Cotton,M., Joshu,C., Antonoiu,B., Zidanic,M., Strong,C.,
            Sun,H., Lamar,B., Yordan,C., Ma,P., Zhong,J., Preston,R., Vil,D.,
            Shekher,M., Matero,A., Shah,R., Swaby,I.K., O'Shaughnessy,A.,
            Rodriguez,M., Hoffmann,J., Till,S., Granat,S., Shohdy,N.,
            Hasegawa,A., Hameed,A., Lodhi,M., Johnson,A., Chen,E., Marra,M.,
            Martienssen,R. and McCombie,W.R.
  TITLE     Sequence and analysis of chromosome 4 of the plant Arabidopsis
            thaliana
  JOURNAL   Nature 402 (6763), 769-777 (1999)
   PUBMED   10617198
REFERENCE   2  (bases 1 to 18585056)
  AUTHORS   Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
  CONSRTM   TAIR
  TITLE     Direct Submission
  JOURNAL   Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
            Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE   3  (bases 1 to 18585056)
  AUTHORS   Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
            Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
            Vaughn,M. and Town,C.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
            9704 Medical Center Dr, Rockville, MD 20850, USA
  REMARK    Protein update by submitter
FEATURES             Qualifiers
     source          /organism="Arabidopsis thaliana"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:3702"
                     /chromosome="4"
                     /ecotype="Columbia"
     protein         /gene="SWN"
                     /locus_tag="AT4G02020"
                     /gene_synonym="EZA1"
                     /gene_synonym="SDG10"
                     /gene_synonym="SET DOMAIN-CONTAINING PROTEIN 10"
                     /gene_synonym="SWINGER"
                     /gene_synonym="T10M13.3"
                     /gene_synonym="T10M13_3"
                     /inference="Similar to RNA sequence,
                     EST:INSD:EH867143.1,INSD:ES103472.1,INSD:DR362519.1,
                     INSD:ES098738.1,INSD:ES195890.1,INSD:ES062978.1,
                     INSD:EL999574.1,INSD:BP778316.1,INSD:BP561910.2,
                     INSD:AV530013.1,INSD:AV790728.1,INSD:BP795810.1,
                     INSD:AV526319.1,INSD:ES011398.1,INSD:BE529399.1"
                     /inference="Similar to RNA sequence,
                     mRNA:INSD:AY090293.1,INSD:AY057477.1,INSD:AF100163.1"
                     /note="SWINGER (SWN); CONTAINS InterPro DOMAIN/s: SANT,
                     DNA-binding (InterPro:IPR001005), SET domain
                     (InterPro:IPR001214); BEST Arabidopsis thaliana protein
                     match is: SET domain-containing protein
                     (TAIR:AT2G23380.1); Has 5041 Blast hits to 4734 proteins
                     in 465 species: Archaea - 0; Bacteria - 399; Metazoa -
                     2132; Fungi - 472; Plants - 1030; Viruses - 0; Other
                     Eukaryotes - 1008 (source: NCBI BLink)."
                     /db_xref="TAIR:AT4G02020"
                     /db_xref="Araport:AT4G02020"
     intron_pos      13:0 (1/16)
     intron_pos      53:0 (2/16)
     intron_pos      141:2 (3/16)
     intron_pos      194:2 (4/16)
     intron_pos      223:0 (5/16)
     intron_pos      271:0 (6/16)
     intron_pos      287:0 (7/16)
     intron_pos      311:0 (8/16)
     intron_pos      512:2 (9/16)
     intron_pos      562:0 (10/16)
     intron_pos      635:2 (11/16)
     intron_pos      679:2 (12/16)
     intron_pos      709:0 (13/16)
     intron_pos      725:0 (14/16)
     intron_pos      768:0 (15/16)
     intron_pos      794:0 (16/16)
BEGIN
        1 MVTDDSNSSG RIKSHVDDDD DGEEEEDRLE GLENRLSELK RKIQGERVRS IKEKFEANRK
       61 KVDAHVSPFS SAASSRATAE DNGNSNMLSS RMRMPLCKLN GFSHGVGDRD YVPTKDVISA
      121 SVKLPIAERI PPYTTWIFLD RNQRMAEDQS VVGRRQIYYE QHGGETLICS DSEEEPEPEE
      181 EKREFSEGED SIIWLIGQEY GMGEEVQDAL CQLLSVDASD ILERYNELKL KDKQNTEEFS
      241 NSGFKLGISL EKGLGAALDS FDNLFCRRCL VFDCRLHGCS QPLISASEKQ PYWSDYEGDR
      301 KPCSKHCYLQ LKAVREVPET CSNFASKAEE KASEEECSKA VSSDVPHAAA SGVSLQVEKT
      361 DIGIKNVDSS SGVEQEHGIR GKREVPILKD SNDLPNLSNK KQKTAASDTK MSFVNSVPSL
      421 DQALDSTKGD QGGTTDNKVN RDSEADAKEV GEPIPDNSVH DGGSSICQPH HGSGNGAIII
      481 AEMSETSRPS TEWNPIEKDL YLKGVEIFGR NSCLIARNLL SGLKTCLDVS NYMRENEVSV
      541 FRRSSTPNLL LDDGRTDPGN DNDEVPPRTR LFRRKGKTRK LKYSTKSAGH PSVWKRIAGG
      601 KNQSCKQYTP CGCLSMCGKD CPCLTNETCC EKYCGCSKSC KNRFRGCHCA KSQCRSRQCP
      661 CFAAGRECDP DVCRNCWVSC GDGSLGEAPR RGEGQCGNMR LLLRQQQRIL LGKSDVAGWG
      721 AFLKNSVSKN EYLGEYTGEL ISHHEADKRG KIYDRANSSF LFDLNDQYVL DAQRKGDKLK
      781 FANHSAKPNC YAKVMFVAGD HRVGIFANER IEASEELFYD YRYGPDQAPV WARKPEGSKK
      841 DDSAITHRRA RKHQSH
//