LOCUS       AEE78472.1               600 aa    PRT              PLN 23-MAR-2023
DEFINITION  Arabidopsis thaliana single-stranded DNA endonuclease
            family protein.
ACCESSION   CP002686-6726
PROTEIN_ID  AEE78472.1
SOURCE      Arabidopsis thaliana (thale cress)
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
            Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
            Camelineae; Arabidopsis.
REFERENCE   1  (bases 1 to 23459830)
  AUTHORS   Salanoubat,M., Lemcke,K., Rieger,M., Ansorge,W., Unseld,M.,
            Fartmann,B., Valle,G., Blocker,H., Perez-Alonso,M., Obermaier,B.,
            Delseny,M., Boutry,M., Grivell,L.A., Mache,R., Puigdomenech,P., De
            Simone,V., Choisne,N., Artiguenave,F., Robert,C., Brottier,P.,
            Wincker,P., Cattolico,L., Weissenbach,J., Saurin,W., Quetier,F.,
            Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M., Benes,V.,
            Wurmbach,E., Drzonek,H., Erfle,H., Jordan,N., Bangert,S.,
            Wiedelmann,R., Kranz,H., Voss,H., Holland,R., Brandt,P.,
            Nyakatura,G., Vezzi,A., D'Angelo,M., Pallavicini,A., Toppo,S.,
            Simionati,B., Conrad,A., Hornischer,K., Kauer,G., Lohnert,T.H.,
            Nordsiek,G., Reichelt,J., Scharfe,M., Schon,O., Bargues,M.,
            Terol,J., Climent,J., Navarro,P., Collado,C., Perez-Perez,A.,
            Ottenwalder,B., Duchemin,D., Cooke,R., Laudie,M., Berger-Llauro,C.,
            Purnelle,B., Masuy,D., de Haan,M., Maarse,A.C., Alcaraz,J.P.,
            Cottet,A., Casacuberta,E., Monfort,A., Argiriou,A., Flores,M.,
            Liguori,R., Vitale,D., Mannhaupt,G., Haase,D., Schoof,H., Rudd,S.,
            Zaccaria,P., Mewes,H.W., Mayer,K.F., Kaul,S., Town,C.D., Koo,H.L.,
            Tallon,L.J., Jenkins,J., Rooney,T., Rizzo,M., Walts,A.,
            Utterback,T., Fujii,C.Y., Shea,T.P., Creasy,T.H., Haas,B.,
            Maiti,R., Wu,D., Peterson,J., Van Aken,S., Pai,G., Militscher,J.,
            Sellers,P., Gill,J.E., Feldblyum,T.V., Preuss,D., Lin,X.,
            Nierman,W.C., Salzberg,S.L., White,O., Venter,J.C., Fraser,C.M.,
            Kaneko,T., Nakamura,Y., Sato,S., Kato,T., Asamizu,E., Sasamoto,S.,
            Kimura,T., Idesawa,K., Kawashima,K., Kishida,Y., Kiyokawa,C.,
            Kohara,M., Matsumoto,M., Matsuno,A., Muraki,A., Nakayama,S.,
            Nakazaki,N., Shinpo,S., Takeuchi,C., Wada,T., Watanabe,A.,
            Yamada,M., Yasuda,M. and Tabata,S.
  CONSRTM   European Union Chromosome 3 Arabidopsis Sequencing Consortium;
            Institute for Genomic Research; Kazusa DNA Research Institute
  TITLE     Sequence and analysis of chromosome 3 of the plant Arabidopsis
            thaliana
  JOURNAL   Nature 408 (6814), 820-822 (2000)
   PUBMED   11130713
REFERENCE   2  (bases 1 to 23459830)
  AUTHORS   Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
  CONSRTM   TAIR
  TITLE     Direct Submission
  JOURNAL   Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
            Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE   3  (bases 1 to 23459830)
  AUTHORS   Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
            Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
            Vaughn,M. and Town,C.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
            9704 Medical Center Dr, Rockville, MD 20850, USA
  REMARK    Protein update by submitter
FEATURES             Qualifiers
     source          /organism="Arabidopsis thaliana"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:3702"
                     /chromosome="3"
                     /ecotype="Columbia"
     protein         /locus_tag="AT3G48900"
                     /gene_synonym="AtSEND1"
                     /gene_synonym="SEND1"
                     /inference="Similar to RNA sequence, EST:INSD:AV815134.1"
                     /note="single-stranded DNA endonuclease family protein;
                     FUNCTIONS IN: chromatin binding, DNA binding, catalytic
                     activity, nuclease activity; INVOLVED IN: DNA repair,
                     chromatin assembly or disassembly; LOCATED IN: chromatin,
                     nucleus; EXPRESSED IN: 6 plant structures; EXPRESSED
                     DURING: F mature embryo stage, petal differentiation and
                     expansion stage, E expanded cotyledon stage, D bilateral
                     stage; CONTAINS InterPro DOMAIN/s: XPG N-terminal
                     (InterPro:IPR006085), DNA repair protein (XPGC)/yeast Rad
                     (InterPro:IPR006084), Chromo domain-like
                     (InterPro:IPR016197), 5'-3' exonuclease, C-terminal
                     subdomain (InterPro:IPR020045), Chromo domain
                     (InterPro:IPR000953), XPG/RAD2 endonuclease
                     (InterPro:IPR006086); BEST Arabidopsis thaliana protein
                     match is: 5'-3' exonuclease family protein
                     (TAIR:AT1G01880.1); Has 30201 Blast hits to 17322 proteins
                     in 780 species: Archaea - 12; Bacteria - 1396; Metazoa -
                     17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other
                     Eukaryotes - 2996 (source: NCBI BLink)."
                     /db_xref="TAIR:AT3G48900"
                     /db_xref="Araport:AT3G48900"
     intron_pos      23:2 (1/13)
     intron_pos      76:1 (2/13)
     intron_pos      97:0 (3/13)
     intron_pos      139:2 (4/13)
     intron_pos      157:0 (5/13)
     intron_pos      181:1 (6/13)
     intron_pos      206:0 (7/13)
     intron_pos      226:0 (8/13)
     intron_pos      280:1 (9/13)
     intron_pos      313:2 (10/13)
     intron_pos      341:1 (11/13)
     intron_pos      375:0 (12/13)
     intron_pos      420:2 (13/13)
BEGIN
        1 MGVKYLWDVL EPCKKTFPLD HLQNKRVCVD LSCWMVELHK VNKSYCATKE KVYLRGFFHR
       61 LRALIALNCS IILVSDGAIP GIKVPTYKRR LKARFEIADD GVEPSKETSL KRNMGSEFSC
      121 IIKEAKVIAS TLGILCLDGI EEAEAQCALL NSESLCDACF SFDSDIFLFG AKTVYREICL
      181 GEGGYVVCYE MDDIKKKLGL GRNSLIALAL LLGSDYSQGV RGLRQEKACE LVRSIGDNVI
      241 LEKVASEGLS FAEKPRKSKK QVRPSVCSKK GTLPLVVING NNRDPERLEE IKQVIDAFMN
      301 PKCHQADSNT VSRALAEFSF QRTKLQEICH QFFEWPPEKT DEYILPKVAE RNLRRFANLQ
      361 SRSTEVEVNL PLHKPQMPEK CPVSEIIKTR KVQGRECFEV SWNDLEGLES SIVPADLVER
      421 ACPEKIIEFK EKMAAKKKKP KPKQKQKETS SPTKSSSLVE LSLELQHLDL NSTSLVSRST
      481 LEEAEQENEQ QNSKKHDYLR LIDSPDRENC NNAWSNRDRL GVGMSSFPLY PETEVIDLIS
      541 PCPEARSRSV SRSYQEQKSH DHQLETVIEL SDSETDDEEH CKKARELRIF LQNIRKDIIL
//