LOCUS AEE85878.1 1284 aa PRT PLN 23-MAR-2023 DEFINITION Arabidopsis thaliana DNA topoisomerase, type IA, core protein. ACCESSION CP002687-5685 PROTEIN_ID AEE85878.1 SOURCE Arabidopsis thaliana (thale cress) ORGANISM Arabidopsis thaliana Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis. REFERENCE 1 (bases 1 to 18585056) AUTHORS Mayer,K., Schuller,C., Wambutt,R., Murphy,G., Volckaert,G., Pohl,T., Dusterhoft,A., Stiekema,W., Entian,K.D., Terryn,N., Harris,B., Ansorge,W., Brandt,P., Grivell,L., Rieger,M., Weichselgartner,M., de Simone,V., Obermaier,B., Mache,R., Muller,M., Kreis,M., Delseny,M., Puigdomenech,P., Watson,M., Schmidtheini,T., Reichert,B., Portatelle,D., Perez-Alonso,M., Boutry,M., Bancroft,I., Vos,P., Hoheisel,J., Zimmermann,W., Wedler,H., Ridley,P., Langham,S.A., McCullagh,B., Bilham,L., Robben,J., Van der Schueren,J., Grymonprez,B., Chuang,Y.J., Vandenbussche,F., Braeken,M., Weltjens,I., Voet,M., Bastiaens,I., Aert,R., Defoor,E., Weitzenegger,T., Bothe,G., Ramsperger,U., Hilbert,H., Braun,M., Holzer,E., Brandt,A., Peters,S., van Staveren,M., Dirske,W., Mooijman,P., Klein Lankhorst,R., Rose,M., Hauf,J., Kotter,P., Berneiser,S., Hempel,S., Feldpausch,M., Lamberth,S., Van den Daele,H., De Keyser,A., Buysshaert,C., Gielen,J., Villarroel,R., De Clercq,R., Van Montagu,M., Rogers,J., Cronin,A., Quail,M., Bray-Allen,S., Clark,L., Doggett,J., Hall,S., Kay,M., Lennard,N., McLay,K., Mayes,R., Pettett,A., Rajandream,M.A., Lyne,M., Benes,V., Rechmann,S., Borkova,D., Blocker,H., Scharfe,M., Grimm,M., Lohnert,T.H., Dose,S., de Haan,M., Maarse,A., Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M., Fartmann,B., Granderath,K., Dauner,D., Herzl,A., Neumann,S., Argiriou,A., Vitale,D., Liguori,R., Piravandi,E., Massenet,O., Quigley,F., Clabauld,G., Mundlein,A., Felber,R., Schnabl,S., Hiller,R., Schmidt,W., Lecharny,A., Aubourg,S., Chefdor,F., Cooke,R., Berger,C., Montfort,A., Casacuberta,E., Gibbons,T., Weber,N., Vandenbol,M., Bargues,M., Terol,J., Torres,A., Perez-Perez,A., Purnelle,B., Bent,E., Johnson,S., Tacon,D., Jesse,T., Heijnen,L., Schwarz,S., Scholler,P., Heber,S., Francs,P., Bielke,C., Frishman,D., Haase,D., Lemcke,K., Mewes,H.W., Stocker,S., Zaccaria,P., Bevan,M., Wilson,R.K., de la Bastide,M., Habermann,K., Parnell,L., Dedhia,N., Gnoj,L., Schutz,K., Huang,E., Spiegel,L., Sehkon,M., Murray,J., Sheet,P., Cordes,M., Abu-Threideh,J., Stoneking,T., Kalicki,J., Graves,T., Harmon,G., Edwards,J., Latreille,P., Courtney,L., Cloud,J., Abbott,A., Scott,K., Johnson,D., Minx,P., Bentley,D., Fulton,B., Miller,N., Greco,T., Kemp,K., Kramer,J., Fulton,L., Mardis,E., Dante,M., Pepin,K., Hillier,L., Nelson,J., Spieth,J., Ryan,E., Andrews,S., Geisel,C., Layman,D., Du,H., Ali,J., Berghoff,A., Jones,K., Drone,K., Cotton,M., Joshu,C., Antonoiu,B., Zidanic,M., Strong,C., Sun,H., Lamar,B., Yordan,C., Ma,P., Zhong,J., Preston,R., Vil,D., Shekher,M., Matero,A., Shah,R., Swaby,I.K., O'Shaughnessy,A., Rodriguez,M., Hoffmann,J., Till,S., Granat,S., Shohdy,N., Hasegawa,A., Hameed,A., Lodhi,M., Johnson,A., Chen,E., Marra,M., Martienssen,R. and McCombie,W.R. TITLE Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana JOURNAL Nature 402 (6763), 769-777 (1999) PUBMED 10617198 REFERENCE 2 (bases 1 to 18585056) AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E. CONSRTM TAIR TITLE Direct Submission JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie Institution, 260 Panama Street, Stanford, CA, USA REFERENCE 3 (bases 1 to 18585056) AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M., Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R., Vaughn,M. and Town,C.D. TITLE Direct Submission JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute, 9704 Medical Center Dr, Rockville, MD 20850, USA REMARK Protein update by submitter FEATURES Qualifiers source /organism="Arabidopsis thaliana" /mol_type="genomic DNA" /db_xref="taxon:3702" /chromosome="4" /ecotype="Columbia" protein /locus_tag="AT4G31210" /gene_synonym="F8F16.30" /gene_synonym="F8F16_30" /inference="Similar to RNA sequence, EST:INSD:EL021107.1,INSD:ES041104.1,INSD:AV441179.1, INSD:ES104562.1" /inference="Similar to RNA sequence, mRNA:INSD:BX826794.1,INSD:BX841517.1" /note="DNA topoisomerase, type IA, core; FUNCTIONS IN: DNA topoisomerase activity, DNA topoisomerase type I activity, DNA binding, nucleic acid binding; INVOLVED IN: DNA topological change, DNA unwinding involved in replication, DNA metabolic process; LOCATED IN: chromosome; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s: DNA topoisomerase, type IA, zn finger (InterPro:IPR013498), DNA topoisomerase, type IA, core (InterPro:IPR000380), DNA topoisomerase, type IA, DNA-binding (InterPro:IPR003602), DNA topoisomerase, type IA, domain 2 (InterPro:IPR003601), DNA topoisomerase, type IA, central (InterPro:IPR013497), DNA topoisomerase, type IA, central region, subdomain 3 (InterPro:IPR013826), DNA topoisomerase I, bacterial-type (InterPro:IPR005733), Toprim domain, subgroup (InterPro:IPR006154), DNA topoisomerase, type IA, central region, subdomain 1 (InterPro:IPR013824), Toprim domain (InterPro:IPR006171); BEST Arabidopsis thaliana protein match is: topoisomerase 3alpha (TAIR:AT5G63920.1); Has 21441 Blast hits to 18441 proteins in 2923 species: Archae - 440; Bacteria - 10015; Metazoa - 1777; Fungi - 750; Plants - 256; Viruses - 35; Other Eukaryotes - 8168 (source: NCBI BLink)." /db_xref="TAIR:AT4G31210" /db_xref="Araport:AT4G31210" intron_pos 28:0 (1/22) intron_pos 242:0 (2/22) intron_pos 409:0 (3/22) intron_pos 544:2 (4/22) intron_pos 758:0 (5/22) intron_pos 787:0 (6/22) intron_pos 802:2 (7/22) intron_pos 839:1 (8/22) intron_pos 873:0 (9/22) intron_pos 904:0 (10/22) intron_pos 933:0 (11/22) intron_pos 961:0 (12/22) intron_pos 984:0 (13/22) intron_pos 1004:0 (14/22) intron_pos 1028:0 (15/22) intron_pos 1064:0 (16/22) intron_pos 1089:2 (17/22) intron_pos 1117:2 (18/22) intron_pos 1155:0 (19/22) intron_pos 1185:0 (20/22) intron_pos 1207:0 (21/22) intron_pos 1239:0 (22/22) BEGIN 1 MQRTISLAAA KSSSSTSVLS LHPLMAKLQC RAIQNFPASS SSSVVRVDRV YRNVSQLQFK 61 RENSSCLKLA CALPSHLSLL GSLSYATHWS SSTSRAFGYS FRPFARRYFS QVASTEIKDS 121 IVGGVEKFGG NKIGFKKFNK KWKKHRVLAS TKAEVVASTE PVIGDVNSGI KAKLSTAASP 181 ASNAKQASTV KTKRQPKSKK FEDKSSPTVS VLETVSVDES LQSFPKPRHS GSGNRKSSSA 241 EYSSQKEVVK KTNVEGPKSS TPSNSMSEQQ HWTSTKASNA PKQEQDNIVG GDEKAGGNKV 301 GFKKFNKNRK KHNVLASSEA EVVTSTEPVI GDGSSGIKAE LSTAASPASN GNQATTVKSK 361 RRPKNKKVED KSSSVVPVLE AVSLDESPIS VPKPKHSGSG NRKSSSAKKE VAKNHPVEEP 421 KSPAPSNSKS EQQHLKSTKA SKAPKQKLVP QHMKNSIEHR GQNASKPLYP PSGKSVIVVE 481 SMTKAKIIQG YLGDMYEVLP SYGHIRDLAT RSGSVRPDDD FSMVWEVPSS AWTHIKSIKV 541 ALNGAENLIL ASDPDREGEA IAWHIIEMLQ QQGALHESMT VARVVFHEIT ESAIKSALQS 601 PREIDGDLVH AYLARRALDY LIGFNISPLL WRKLPGCPSA GRVQSAALAL VCDRESEIDG 661 FKPQEYWTVG IKVKGKDNSA TFSAHLTSLN SKRLNQLSIS SEANAQDIEQ RIKSEGFLVK 721 GTKTSTTRKN PPTPYITSTL QQDAANKLHF STAHTMKLAQ KLYEGVQLSD GKSAGLITYM 781 RTDGLHIADE AIKDIQSLVA ERYGKNFTSD SPRKYFKKVK NAQEAHEAIR PTDIRRLPST 841 IASLLDADSL KLYTLIWSRA VACQMEPASI AQIQLDIGNA SESIIFRSSC SKVEFLGYQA 901 VYEDPEAKAI KNNDNDQSSE REETFKTLSS LKDGDLLQIG EVELKQHHTQ HPPRYSEGSL 961 VKKLEELGIG RPSTYASIFR VLQHRKYVTI KNRVLYPEFR ARMVSAFLTH YFTEITDYSF 1021 TADMETELDN VSGGVTEWKG LLRDYWTRFS AYCKRVENVQ RSQVEKMLEK KYEDVLFSLL 1081 PYPSRTCPSC MEGTLSFKAS KFGAGYFIGC DQHPSCKFIA KTLYGEDEDE DDPPKNNCVE 1141 EPKLLGLHPN TSEKVILKCG PYGHYVQLGE DKKGHTPKRA NAAHIKDVNS ITLESALELL 1201 RYPLTLGTHP EDGQPVVLKL SKSGFTIKHR RNMATVPKNT EPGEVTLERA MKLLSGKNVR 1261 KSGRPPKSIL PEEEKGDEEV VVAI //