LOCUS       AEE76826.1              1473 aa    PRT              PLN 23-MAR-2023
DEFINITION  Arabidopsis thaliana topoisomerase II protein.
ACCESSION   CP002686-4459
PROTEIN_ID  AEE76826.1
SOURCE      Arabidopsis thaliana (thale cress)
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
            Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
            Camelineae; Arabidopsis.
REFERENCE   1  (bases 1 to 23459830)
  AUTHORS   Salanoubat,M., Lemcke,K., Rieger,M., Ansorge,W., Unseld,M.,
            Fartmann,B., Valle,G., Blocker,H., Perez-Alonso,M., Obermaier,B.,
            Delseny,M., Boutry,M., Grivell,L.A., Mache,R., Puigdomenech,P., De
            Simone,V., Choisne,N., Artiguenave,F., Robert,C., Brottier,P.,
            Wincker,P., Cattolico,L., Weissenbach,J., Saurin,W., Quetier,F.,
            Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M., Benes,V.,
            Wurmbach,E., Drzonek,H., Erfle,H., Jordan,N., Bangert,S.,
            Wiedelmann,R., Kranz,H., Voss,H., Holland,R., Brandt,P.,
            Nyakatura,G., Vezzi,A., D'Angelo,M., Pallavicini,A., Toppo,S.,
            Simionati,B., Conrad,A., Hornischer,K., Kauer,G., Lohnert,T.H.,
            Nordsiek,G., Reichelt,J., Scharfe,M., Schon,O., Bargues,M.,
            Terol,J., Climent,J., Navarro,P., Collado,C., Perez-Perez,A.,
            Ottenwalder,B., Duchemin,D., Cooke,R., Laudie,M., Berger-Llauro,C.,
            Purnelle,B., Masuy,D., de Haan,M., Maarse,A.C., Alcaraz,J.P.,
            Cottet,A., Casacuberta,E., Monfort,A., Argiriou,A., Flores,M.,
            Liguori,R., Vitale,D., Mannhaupt,G., Haase,D., Schoof,H., Rudd,S.,
            Zaccaria,P., Mewes,H.W., Mayer,K.F., Kaul,S., Town,C.D., Koo,H.L.,
            Tallon,L.J., Jenkins,J., Rooney,T., Rizzo,M., Walts,A.,
            Utterback,T., Fujii,C.Y., Shea,T.P., Creasy,T.H., Haas,B.,
            Maiti,R., Wu,D., Peterson,J., Van Aken,S., Pai,G., Militscher,J.,
            Sellers,P., Gill,J.E., Feldblyum,T.V., Preuss,D., Lin,X.,
            Nierman,W.C., Salzberg,S.L., White,O., Venter,J.C., Fraser,C.M.,
            Kaneko,T., Nakamura,Y., Sato,S., Kato,T., Asamizu,E., Sasamoto,S.,
            Kimura,T., Idesawa,K., Kawashima,K., Kishida,Y., Kiyokawa,C.,
            Kohara,M., Matsumoto,M., Matsuno,A., Muraki,A., Nakayama,S.,
            Nakazaki,N., Shinpo,S., Takeuchi,C., Wada,T., Watanabe,A.,
            Yamada,M., Yasuda,M. and Tabata,S.
  CONSRTM   European Union Chromosome 3 Arabidopsis Sequencing Consortium;
            Institute for Genomic Research; Kazusa DNA Research Institute
  TITLE     Sequence and analysis of chromosome 3 of the plant Arabidopsis
            thaliana
  JOURNAL   Nature 408 (6814), 820-822 (2000)
   PUBMED   11130713
REFERENCE   2  (bases 1 to 23459830)
  AUTHORS   Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
  CONSRTM   TAIR
  TITLE     Direct Submission
  JOURNAL   Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
            Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE   3  (bases 1 to 23459830)
  AUTHORS   Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
            Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
            Vaughn,M. and Town,C.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
            9704 Medical Center Dr, Rockville, MD 20850, USA
  REMARK    Protein update by submitter
FEATURES             Qualifiers
     source          /organism="Arabidopsis thaliana"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:3702"
                     /chromosome="3"
                     /ecotype="Columbia"
     protein         /gene="TOPII"
                     /locus_tag="AT3G23890"
                     /gene_synonym="ATTOPII"
                     /gene_synonym="topoisomerase II"
                     /gene_synonym="TOPOISOMERASE II"
                     /inference="Similar to RNA sequence,
                     EST:INSD:ES157796.1,INSD:EG517053.1,INSD:EL983326.1,
                     INSD:EH974714.1,INSD:EL056760.1,INSD:BP659888.1,
                     INSD:ES053244.1,INSD:EL983488.1,INSD:ES177213.1,
                     INSD:EL099567.1,INSD:CF773290.1,INSD:AV820203.1,
                     INSD:EG517086.1,INSD:ES006679.1,INSD:EH896711.1,
                     INSD:R83960.1,INSD:EL997926.1"
                     /inference="Similar to RNA sequence, mRNA:INSD:L21015.1"
                     /note="topoisomerase II (TOPII); CONTAINS InterPro
                     DOMAIN/s: DNA topoisomerase, type IIA, subunit
                     A/C-terminal (InterPro:IPR002205), DNA topoisomerase, type
                     IIA, conserved site (InterPro:IPR018522), DNA
                     topoisomerase, type IIA, subunit A/C-terminal, alpha-beta
                     (InterPro:IPR013758), DNA topoisomerase, type IIA, subunit
                     A, alpha-helical (InterPro:IPR013757), DNA topoisomerase,
                     type IIA, subunit B, domain 2 (InterPro:IPR013506),
                     ATPase-like, ATP-binding domain (InterPro:IPR003594), DNA
                     topoisomerase, type IIA, subunit B/N-terminal, alpha-beta
                     (InterPro:IPR013759), DNA topoisomerase, type IIA, subunit
                     B/N-terminal (InterPro:IPR001241), Ribosomal protein S5
                     domain 2-type fold (InterPro:IPR020568), DNA topoisomerase
                     II, eukaryotic-type (InterPro:IPR001154), DNA
                     topoisomerase, type IIA, central (InterPro:IPR013760);
                     BEST Arabidopsis thaliana protein match is: DNA GYRASE B2
                     (TAIR:AT5G04130.1); Has 46226 Blast hits to 43639 proteins
                     in 5982 species: Archaea - 169; Bacteria - 28615; Metazoa -
                     2871; Fungi - 1392; Plants - 616; Viruses - 201; Other
                     Eukaryotes - 12362 (source: NCBI BLink)."
                     /db_xref="TAIR:AT3G23890"
                     /db_xref="Araport:AT3G23890"
     intron_pos      193:0 (1/19)
     intron_pos      291:2 (2/19)
     intron_pos      314:0 (3/19)
     intron_pos      404:1 (4/19)
     intron_pos      470:0 (5/19)
     intron_pos      540:0 (6/19)
     intron_pos      578:0 (7/19)
     intron_pos      616:0 (8/19)
     intron_pos      671:0 (9/19)
     intron_pos      785:0 (10/19)
     intron_pos      829:2 (11/19)
     intron_pos      946:0 (12/19)
     intron_pos      1008:1 (13/19)
     intron_pos      1025:0 (14/19)
     intron_pos      1172:0 (15/19)
     intron_pos      1234:1 (16/19)
     intron_pos      1269:1 (17/19)
     intron_pos      1294:0 (18/19)
     intron_pos      1321:1 (19/19)
BEGIN
        1 MATKLPLQNS NAANVAKAPA KSRAAAGGKT IEEMYQKKSQ LEHILLRPDT YIGSIEKHTQ
       61 TLWVYEKDEM VQRPVTYVPG LYKIFDEILV NAADNKQRDA KMDSVQVVID VEQNLISVCN
      121 SGAGVPVEIH QEEGIYVPEM IFGHLLTSSN YDDNVKKTTG GRNGYGAKLT NIFSTEFIIE
      181 TADGKRLKKY KQVFENNMGK KSEPVITKCN KSENWTKVTF KPDLKKFNMT ELEDDVVALM
      241 SKRVFDIAGC LGKSVKVELN GKQIPVKSFT DYVDLYLSAA NKSRTEDPLP RLTEKVNDRW
      301 EVCVSLSEGQ FQQVSFVNSI ATIKGGTHVD YVTSQITNHI VAAVNKKNKN ANVKAHNVKN
      361 HLWVFVNALI DNPAFDSQTK ETLTLRQSSF GSKCELSEDF LKKVGKSGVV ENLLSWADFK
      421 QNKDLKKSDG AKTGRVLVEK LDDAAEAGGK NSRLCTLILT EGDSAKSLAL AGRSVLGNNY
      481 CGVFPLRGKL LNVREASTTQ ITNNKEIENL KKILGLKQNM KYENVNSLRY GQMMIMTDQD
      541 HDGSHIKGLL INFIHSFWPS LLQVPSFLVE FITPIVKATR KGTKKVLSFY SMPEYEEWKE
      601 SLKGNATGWD IKYYKGLGTS TAEEGKEYFS NLGLHKKDFV WEDEQDGEAI ELAFSKKKIE
      661 ARKNWLSSYV PGNHLDQRQP KVTYSDFVNK ELILFSMADL QRSIPSMVDG LKPGQRKILF
      721 VAFKKIARKE MKVAQLVGYV SLLSAYHHGE QSLASAIIGM AQDYVGSNNI NLLLPNGQFG
      781 TRTSGGKDSA SARYIFTKLS PVTRILFPKD DDLLLDYLNE DGQRIEPTWY MPIIPTVLVN
      841 GAEGIGTGWS TFIPNYNPRE IVANVRRLLN GESMVPMDPW YRGFKGTIEK TASKEGGCTY
      901 TITGLYEEVD ETTIRITELP IRRWNDDYKN FLQSLKTDNG APFFQDVKAY NDEKSVDFDL
      961 ILSEENMLAA RQEGFLKKFK LTTTIATSNM HLFDKKGVIK KYVTPEQILE EFFDLRFEYY
     1021 EKRKETVVKN MEIELLKLEN KARFILAVLS GEIIVNKRKK ADIVEDLRQK GFTPFPRKAE
     1081 SVEAAIAGAV DDDAAEEPEE ILVDPESSSS YIPGSEYDYL LAMAIASLTI EKVEELLADR
     1141 DKMIIAVADM KKTTPKSLWL SDLESLDKEL EKLDLKDAQV QQAIEAAQKK IRAKSGAAVK
     1201 VKRQAPKKPA PKKTTKKASE SETTEASYSA MDTDNNVAEV VKPKARQGAK KKASESETTE
     1261 ASHSAMDTDN NVAEVVKPKG RQGAKKKAPA AAKEVEEDEM LDLAQRLAQY NFGSAPADSS
     1321 KTAETSKAIA VDDDDDDVVV EVAPVKKGGR KPAATKAAKP PAAPRKRGKQ TVASTEVLAI
     1381 GVSPEKKVRK MRSSPFNKKS SSVMSRLADN KEEESSENVA GNSSSEKSGG DVSAISRPQR
     1441 ANRRKMTYVL SDSESESAND SEFDDIEDDE DDE
//