LOCUS       AEE76957.1              1957 aa    PRT              PLN 23-MAR-2023
DEFINITION  Arabidopsis thaliana Helicase/SANT-associated, DNA binding
            protein.
ACCESSION   CP002686-4642
PROTEIN_ID  AEE76957.1
SOURCE      Arabidopsis thaliana (thale cress)
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
            Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
            Camelineae; Arabidopsis.
REFERENCE   1  (bases 1 to 23459830)
  AUTHORS   Salanoubat,M., Lemcke,K., Rieger,M., Ansorge,W., Unseld,M.,
            Fartmann,B., Valle,G., Blocker,H., Perez-Alonso,M., Obermaier,B.,
            Delseny,M., Boutry,M., Grivell,L.A., Mache,R., Puigdomenech,P., De
            Simone,V., Choisne,N., Artiguenave,F., Robert,C., Brottier,P.,
            Wincker,P., Cattolico,L., Weissenbach,J., Saurin,W., Quetier,F.,
            Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M., Benes,V.,
            Wurmbach,E., Drzonek,H., Erfle,H., Jordan,N., Bangert,S.,
            Wiedelmann,R., Kranz,H., Voss,H., Holland,R., Brandt,P.,
            Nyakatura,G., Vezzi,A., D'Angelo,M., Pallavicini,A., Toppo,S.,
            Simionati,B., Conrad,A., Hornischer,K., Kauer,G., Lohnert,T.H.,
            Nordsiek,G., Reichelt,J., Scharfe,M., Schon,O., Bargues,M.,
            Terol,J., Climent,J., Navarro,P., Collado,C., Perez-Perez,A.,
            Ottenwalder,B., Duchemin,D., Cooke,R., Laudie,M., Berger-Llauro,C.,
            Purnelle,B., Masuy,D., de Haan,M., Maarse,A.C., Alcaraz,J.P.,
            Cottet,A., Casacuberta,E., Monfort,A., Argiriou,A., Flores,M.,
            Liguori,R., Vitale,D., Mannhaupt,G., Haase,D., Schoof,H., Rudd,S.,
            Zaccaria,P., Mewes,H.W., Mayer,K.F., Kaul,S., Town,C.D., Koo,H.L.,
            Tallon,L.J., Jenkins,J., Rooney,T., Rizzo,M., Walts,A.,
            Utterback,T., Fujii,C.Y., Shea,T.P., Creasy,T.H., Haas,B.,
            Maiti,R., Wu,D., Peterson,J., Van Aken,S., Pai,G., Militscher,J.,
            Sellers,P., Gill,J.E., Feldblyum,T.V., Preuss,D., Lin,X.,
            Nierman,W.C., Salzberg,S.L., White,O., Venter,J.C., Fraser,C.M.,
            Kaneko,T., Nakamura,Y., Sato,S., Kato,T., Asamizu,E., Sasamoto,S.,
            Kimura,T., Idesawa,K., Kawashima,K., Kishida,Y., Kiyokawa,C.,
            Kohara,M., Matsumoto,M., Matsuno,A., Muraki,A., Nakayama,S.,
            Nakazaki,N., Shinpo,S., Takeuchi,C., Wada,T., Watanabe,A.,
            Yamada,M., Yasuda,M. and Tabata,S.
  CONSRTM   European Union Chromosome 3 Arabidopsis Sequencing Consortium;
            Institute for Genomic Research; Kazusa DNA Research Institute
  TITLE     Sequence and analysis of chromosome 3 of the plant Arabidopsis
            thaliana
  JOURNAL   Nature 408 (6814), 820-822 (2000)
   PUBMED   11130713
REFERENCE   2  (bases 1 to 23459830)
  AUTHORS   Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
  CONSRTM   TAIR
  TITLE     Direct Submission
  JOURNAL   Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
            Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE   3  (bases 1 to 23459830)
  AUTHORS   Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
            Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
            Vaughn,M. and Town,C.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
            9704 Medical Center Dr, Rockville, MD 20850, USA
  REMARK    Protein update by submitter
FEATURES             Qualifiers
     source          /organism="Arabidopsis thaliana"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:3702"
                     /chromosome="3"
                     /ecotype="Columbia"
     protein         /locus_tag="AT3G24880"
                     /inference="Similar to RNA sequence,
                     EST:INSD:ES123301.1,INSD:AV442458.1,INSD:ES048537.1,
                     INSD:ES169606.1,INSD:ES100711.1,INSD:ES174226.1,
                     INSD:ES104010.1,INSD:ES094195.1,INSD:ES138947.1,
                     INSD:CD533799.1,INSD:ES149727.1,INSD:ES078790.1"
                     /note="Helicase/SANT-associated, DNA binding protein;
                     FUNCTIONS IN: DNA binding; EXPRESSED IN: cultured cell;
                     CONTAINS InterPro DOMAIN/s: SANT, DNA-binding
                     (InterPro:IPR001005), HSA (InterPro:IPR006562),
                     Homeodomain-like (InterPro:IPR009057), HAS subgroup
                     (InterPro:IPR013999), Helicase/SANT-associated, DNA
                     binding (InterPro:IPR014012), MYB-like
                     (InterPro:IPR017877); BEST Arabidopsis thaliana protein
                     match is: Helicase/SANT-associated, DNA binding protein
                     (TAIR:AT3G24870.1); Has 17312 Blast hits to 12172 proteins
                     in 594 species: Archaea - 4; Bacteria - 677; Metazoa -
                     8001; Fungi - 2909; Plants - 1838; Viruses - 51; Other
                     Eukaryotes - 3832 (source: NCBI BLink)."
                     /db_xref="TAIR:AT3G24880"
                     /db_xref="Araport:AT3G24880"
     intron_pos      47:2 (1/22)
     intron_pos      66:0 (2/22)
     intron_pos      98:2 (3/22)
     intron_pos      553:0 (4/22)
     intron_pos      593:0 (5/22)
     intron_pos      661:0 (6/22)
     intron_pos      724:0 (7/22)
     intron_pos      751:0 (8/22)
     intron_pos      770:1 (9/22)
     intron_pos      943:0 (10/22)
     intron_pos      961:0 (11/22)
     intron_pos      976:1 (12/22)
     intron_pos      1042:0 (13/22)
     intron_pos      1060:0 (14/22)
     intron_pos      1087:0 (15/22)
     intron_pos      1133:0 (16/22)
     intron_pos      1172:0 (17/22)
     intron_pos      1194:1 (18/22)
     intron_pos      1220:0 (19/22)
     intron_pos      1253:2 (20/22)
     intron_pos      1318:2 (21/22)
     intron_pos      1445:0 (22/22)
BEGIN
        1 MHGSVSGYLL VNAEVDSMGG VIDSGGGIGV KTSPRRTAIE KAQAELRQEY DVREERRREL
       61 EFLEKGGNPL DFKFGIATSH SVQSTSLTDQ QAEHFVNSEV KDSFALTASP HGDSVESSGR
      121 PGVPTISEPN TADNLLLFDS ENKSVEGERN LRHPNRQNRT SESERSSKAH TNQNTKETED
      181 SAIFRPYARR NRSKISRDPA RSSSTDLVQN RGGLATSISI RRGSVEGKGC IPEAANQKDM
      241 HTTSVSCPVF ANSNGNIVPK NRVSSNSLNT KVDGEPVVRE STAGSKTSLL KDEADISYSK
      301 SSAYLPVGES GLAGEKAQLV STGGSPKAAT IAGQKNSSTQ LNGLRDSTVE EESLTNRGAT
      361 GTNGLESESS HANNVEVNVD NERDLYKVDK LDSDEISMQK TLRVEGLLDQ TVGEMTKTKI
      421 EDETGQSTTI ISECIPECEM QMKSVKIENQ SHRSTAEMQT KEKSSETEKR LQDGLVVLEN
      481 DSKVGSILSE NPSSTLCSGI PQASVDTSSC TVGNSLLSGT DIEALKHQPS SDAVMLDTVK
      541 EDAILEEARI IQAKKKRIAE LSCGTAPVEV REKSQWDFVL EEMAWLANDF AQERLWKMTA
      601 AAQICHRVAL TCQLRFEERN QHRKLKKIAS VLSNAILQFW SSVEAEVPGE LEETSLGIVK
      661 ETCQESNCLN GRRCLAAGVK EYASRFLKYN NSSISYHSAA PSTPDNMCDP EILDISMVDQ
      721 LTEASLFYSV PSGAMEVYLK SIESHLTRCE KSGSSMQEEV DTSAYDTAGD IGYNVTAFDE
      781 DEGETSTYYL PGAFESSRSF NISHKKRKNL MKSHSARSYD LGDDLPYVNN TGGSNSSSLM
      841 AKRPDSNINA GSVPTRRVRT ASRQRVVSPF GCATTGNLPV PSKTDASSGD TSSFQDEYSS
      901 LHGGSAVQKG TEVESSVNFE KLLPYDMAET SGRPKKKKKT HQGSAYDQTW HLDPSVHVEQ
      961 KDHWKKRPEN NFDMNGLYGP HSAKKQKTTK QLVENNFDMA IPHTGSIPSP AASQMSNMSN
     1021 PNKSIKFIGG RDRGRKIKGL KISPGQHGSG NPWSLFEDQA LVVLVHDMGP NWELISDAMN
     1081 STLKIKCIYR NPTECKDRHK ILMDKTAGDG ADSAEDSGNS QSYPSTLPGI PKGSARQLFQ
     1141 RLQGPMEEDT LKSHFEKICL IGKKLHYRKT QSVIGVSVVS FVHGIQFSSC TGAGISQSLD
     1201 IPGLHVSKYS CKSWLGFPEN DGRDSKQIVP VHNSQVMALS QVFPNNLNGG VLTPLDVCDA
     1261 STSGQDVFSL ENPGLPMLNQ GTPVLPTSGA HPSTPGSSGV VLSNNLPTTS GLQSASVRDG
     1321 RFNVPRGSLP LDEQHRLQQF NQTLSGRNLQ QPSLSTPAAV SGSDRGHRMV PGGNAMGVSG
     1381 MNRNTPMSRP GFQGMASSAM PNTGSMLSSG MVEIPNTGNI HSGGGASQGN SMIRPREAVQ
     1441 HMMRMQAAQG NSPGIPAFSN LSSGFTNQTT PVQAYPGHLS QQHQMSPQSH VLGNSHHPHL
     1501 QSPSQATGAQ QEAFAIRQRQ IHQRYLQQQQ QQQQFPASGS MMPHVQQPQG SSVSSSPQNS
     1561 PQTQPPVSPQ PLSMPPVSPS PNINAMAQQK PQKSQLALHG LGRSPQSGTS GVNNQAGKQR
     1621 QRQLQQSARQ HPHQRQPTQG QQLNKQLKGM GRGNMIHQNI TVDQSHLNGL TMPQGNQATE
     1681 KGEIAVPVRP DQQSSVGTTT STNLQSKPFV SPLSSNHSQQ LPKSFPGALP PSPQQQMQLH
     1741 SDNSIQGQSS PATPCNILST SSPSIAPAVA PSNHQHLLIH QKQRNQVQST AQRVVQHNHL
     1801 GNSELSKKSQ AERMPRVPQS VTNTTQTVSM GTTKGMPQAS NDLKNIKAVG STAVPALEPP
     1861 SCVASVQITA SKVVNSSNTD SAGNDPVSTP NQGLAQKHGI KGVTQRQQQS LPSEEKRPKL
     1921 PEKPTVQNQK HLASEEQPHL EEAQELSSSK PPDTKVE
//