LOCUS       AEE84493.1              1188 aa    PRT              PLN 23-MAR-2023
DEFINITION  Arabidopsis thaliana DNA-directed RNA polymerase family
            protein.
ACCESSION   CP002687-3802
PROTEIN_ID  AEE84493.1
SOURCE      Arabidopsis thaliana (thale cress)
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
            Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
            Camelineae; Arabidopsis.
REFERENCE   1  (bases 1 to 18585056)
  AUTHORS   Mayer,K., Schuller,C., Wambutt,R., Murphy,G., Volckaert,G.,
            Pohl,T., Dusterhoft,A., Stiekema,W., Entian,K.D., Terryn,N.,
            Harris,B., Ansorge,W., Brandt,P., Grivell,L., Rieger,M.,
            Weichselgartner,M., de Simone,V., Obermaier,B., Mache,R.,
            Muller,M., Kreis,M., Delseny,M., Puigdomenech,P., Watson,M.,
            Schmidtheini,T., Reichert,B., Portatelle,D., Perez-Alonso,M.,
            Boutry,M., Bancroft,I., Vos,P., Hoheisel,J., Zimmermann,W.,
            Wedler,H., Ridley,P., Langham,S.A., McCullagh,B., Bilham,L.,
            Robben,J., Van der Schueren,J., Grymonprez,B., Chuang,Y.J.,
            Vandenbussche,F., Braeken,M., Weltjens,I., Voet,M., Bastiaens,I.,
            Aert,R., Defoor,E., Weitzenegger,T., Bothe,G., Ramsperger,U.,
            Hilbert,H., Braun,M., Holzer,E., Brandt,A., Peters,S., van
            Staveren,M., Dirske,W., Mooijman,P., Klein Lankhorst,R., Rose,M.,
            Hauf,J., Kotter,P., Berneiser,S., Hempel,S., Feldpausch,M.,
            Lamberth,S., Van den Daele,H., De Keyser,A., Buysshaert,C.,
            Gielen,J., Villarroel,R., De Clercq,R., Van Montagu,M., Rogers,J.,
            Cronin,A., Quail,M., Bray-Allen,S., Clark,L., Doggett,J., Hall,S.,
            Kay,M., Lennard,N., McLay,K., Mayes,R., Pettett,A.,
            Rajandream,M.A., Lyne,M., Benes,V., Rechmann,S., Borkova,D.,
            Blocker,H., Scharfe,M., Grimm,M., Lohnert,T.H., Dose,S., de
            Haan,M., Maarse,A., Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M.,
            Fartmann,B., Granderath,K., Dauner,D., Herzl,A., Neumann,S.,
            Argiriou,A., Vitale,D., Liguori,R., Piravandi,E., Massenet,O.,
            Quigley,F., Clabauld,G., Mundlein,A., Felber,R., Schnabl,S.,
            Hiller,R., Schmidt,W., Lecharny,A., Aubourg,S., Chefdor,F.,
            Cooke,R., Berger,C., Montfort,A., Casacuberta,E., Gibbons,T.,
            Weber,N., Vandenbol,M., Bargues,M., Terol,J., Torres,A.,
            Perez-Perez,A., Purnelle,B., Bent,E., Johnson,S., Tacon,D.,
            Jesse,T., Heijnen,L., Schwarz,S., Scholler,P., Heber,S., Francs,P.,
            Bielke,C., Frishman,D., Haase,D., Lemcke,K., Mewes,H.W.,
            Stocker,S., Zaccaria,P., Bevan,M., Wilson,R.K., de la Bastide,M.,
            Habermann,K., Parnell,L., Dedhia,N., Gnoj,L., Schutz,K., Huang,E.,
            Spiegel,L., Sehkon,M., Murray,J., Sheet,P., Cordes,M.,
            Abu-Threideh,J., Stoneking,T., Kalicki,J., Graves,T., Harmon,G.,
            Edwards,J., Latreille,P., Courtney,L., Cloud,J., Abbott,A.,
            Scott,K., Johnson,D., Minx,P., Bentley,D., Fulton,B., Miller,N.,
            Greco,T., Kemp,K., Kramer,J., Fulton,L., Mardis,E., Dante,M.,
            Pepin,K., Hillier,L., Nelson,J., Spieth,J., Ryan,E., Andrews,S.,
            Geisel,C., Layman,D., Du,H., Ali,J., Berghoff,A., Jones,K.,
            Drone,K., Cotton,M., Joshu,C., Antonoiu,B., Zidanic,M., Strong,C.,
            Sun,H., Lamar,B., Yordan,C., Ma,P., Zhong,J., Preston,R., Vil,D.,
            Shekher,M., Matero,A., Shah,R., Swaby,I.K., O'Shaughnessy,A.,
            Rodriguez,M., Hoffmann,J., Till,S., Granat,S., Shohdy,N.,
            Hasegawa,A., Hameed,A., Lodhi,M., Johnson,A., Chen,E., Marra,M.,
            Martienssen,R. and McCombie,W.R.
  TITLE     Sequence and analysis of chromosome 4 of the plant Arabidopsis
            thaliana
  JOURNAL   Nature 402 (6763), 769-777 (1999)
   PUBMED   10617198
REFERENCE   2  (bases 1 to 18585056)
  AUTHORS   Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
  CONSRTM   TAIR
  TITLE     Direct Submission
  JOURNAL   Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
            Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE   3  (bases 1 to 18585056)
  AUTHORS   Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
            Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
            Vaughn,M. and Town,C.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
            9704 Medical Center Dr, Rockville, MD 20850, USA
  REMARK    Protein update by submitter
FEATURES             Qualifiers
     source          /organism="Arabidopsis thaliana"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:3702"
                     /chromosome="4"
                     /ecotype="Columbia"
     protein         /gene="NRPB2"
                     /locus_tag="AT4G21710"
                     /gene_synonym="EMB1989"
                     /gene_synonym="EMBRYO DEFECTIVE 1989"
                     /gene_synonym="F17L22.170"
                     /gene_synonym="F17L22_170"
                     /gene_synonym="RPB2"
                     /inference="Similar to RNA sequence,
                     EST:INSD:BE520966.1,INSD:BE520964.1,INSD:EL328439.1,
                     INSD:ES097031.1,INSD:EH890089.1,INSD:AW004411.1,
                     INSD:BE520965.1,INSD:EG516180.1,INSD:EH885597.1,
                     INSD:ES166698.1,INSD:EL026289.1,INSD:Z26186.1,
                     INSD:AV551447.1,INSD:EL112916.1,INSD:EH951122.1,
                     INSD:ES179015.1,INSD:EG494783.1,INSD:ES098675.1,
                     INSD:EH840269.1,INSD:EL978530.1,INSD:AV539095.1,
                     INSD:CB252523.1,INSD:EH835940.1,INSD:EG494785.1,
                     INSD:EL272382.1,INSD:ES204207.1,INSD:EH936781.1,
                     INSD:ES165860.1"
                     /inference="Similar to RNA sequence, mRNA:INSD:Z19120.1"
                     /note="NRPB2; CONTAINS InterPro DOMAIN/s: DNA-directed RNA
                     polymerase, subunit 2, domain 6 (InterPro:IPR007120), RNA
                     polymerase Rpb2, domain 7 (InterPro:IPR007641), RNA
                     polymerase, beta subunit, protrusion (InterPro:IPR007644),
                     RNA polymerase Rpb2, domain 3 (InterPro:IPR007645),
                     DNA-directed RNA polymerase, subunit 2
                     (InterPro:IPR015712), RNA polymerase Rpb2, domain 2
                     (InterPro:IPR007642), RNA polymerase Rpb2, domain 4
                     (InterPro:IPR007646), RNA polymerase, beta subunit,
                     conserved site (InterPro:IPR007121), RNA polymerase Rpb2,
                     domain 5 (InterPro:IPR007647); BEST Arabidopsis thaliana
                     protein match is: nuclear RNA polymerase C2
                     (TAIR:AT5G45140.1); Has 37546 Blast hits to 27868 proteins
                     in 9192 species: Archae - 496; Bacteria - 17572; Metazoa -
                     623; Fungi - 7193; Plants - 3397; Viruses - 232; Other
                     Eukaryotes - 8033 (source: NCBI BLink)."
                     /db_xref="TAIR:AT4G21710"
                     /db_xref="Araport:AT4G21710"
     intron_pos      84:0 (1/24)
     intron_pos      158:0 (2/24)
     intron_pos      255:0 (3/24)
     intron_pos      321:0 (4/24)
     intron_pos      342:2 (5/24)
     intron_pos      371:2 (6/24)
     intron_pos      412:0 (7/24)
     intron_pos      427:0 (8/24)
     intron_pos      473:0 (9/24)
     intron_pos      497:1 (10/24)
     intron_pos      523:0 (11/24)
     intron_pos      559:0 (12/24)
     intron_pos      598:0 (13/24)
     intron_pos      650:0 (14/24)
     intron_pos      685:0 (15/24)
     intron_pos      727:0 (16/24)
     intron_pos      753:0 (17/24)
     intron_pos      785:0 (18/24)
     intron_pos      823:2 (19/24)
     intron_pos      847:0 (20/24)
     intron_pos      865:0 (21/24)
     intron_pos      913:0 (22/24)
     intron_pos      1013:0 (23/24)
     intron_pos      1154:0 (24/24)
BEGIN
        1 MEYNEYEPEP QYVEDDDDEE ITQEDAWAVI SAYFEEKGLV RQQLDSFDEF IQNTMQEIVD
       61 ESADIEIRPE SQHNPGHQSD FAETIYKISF GQIYLSKPMM TESDGETATL FPKAARLRNL
      121 TYSAPLYVDV TKRVIKKGHD GEEVTETQDF TKVFIGKVPI MLRSSYCTLF QNSEKDLTEL
      181 GECPYDQGGY FIINGSEKVL IAQEKMSTNH VYVFKKRQPN KYAYVGEVRS MAENQNRPPS
      241 TMFVRMLARA SAKGGSSGQY IRCTLPYIRT EIPIIIVFRA LGFVADKDIL EHICYDFADT
      301 QMMELLRPSL EEAFVIQNQL VALDYIGKRG ATVGVTKEKR IKYARDILQK EMLPHVGIGE
      361 HCETKKAYYF GYIIHRLLLC ALGRRPEDDR DHYGNKRLDL AGPLLGGLFR MLFRKLTRDV
      421 RSYVQKCVDN GKEVNLQFAI KAKTITSGLK YSLATGNWGQ ANAAGTRAGV SQVLNRLTYA
      481 STLSHLRRLN SPIGREGKLA KPRQLHNSQW GMMCPAETPE GQACGLVKNL ALMVYITVGS
      541 AAYPILEFLE EWGTENFEEI SPSVIPQATK IFVNGMWVGV HRDPDMLVKT LRRLRRRVDV
      601 NTEVGVVRDI RLKELRIYTD YGRCSRPLFI VDNQKLLIKK RDIYALQQRE SAEEDGWHHL
      661 VAKGFIEYID TEEEETTMIS MTISDLVQAR LRPEEAYTEN YTHCEIHPSL ILGVCASIIP
      721 FPDHNQSPRN TYQSAMGKQA MGIYVTNYQF RMDTLAYVLY YPQKPLVTTR AMEHLHFRQL
      781 PAGINAIVAI SCYSGYNQED SVIMNQSSID RGFFRSLFFR SYRDEEKKMG TLVKEDFGRP
      841 DRGSTMGMRH GSYDKLDDDG LAPPGTRVSG EDVIIGKTTP ISQDEAQGQS SRYTRRDHSI
      901 SLRHSETGMV DQVLLTTNAD GLRFVKVRVR SVRIPQIGDK FSSRHGQKGT VGMTYTQEDM
      961 PWTIEGVTPD IIVNPHAIPS RMTIGQLIEC IMGKVAAHMG KEGDATPFTD VTVDNISKAL
     1021 HKCGYQMRGF ERMYNGHTGR PLTAMIFLGP TYYQRLKHMV DDKIHSRGRG PVQILTRQPA
     1081 EGRSRDGGLR FGEMERDCMI AHGAAHFLKE RLFDQSDAYR VHVCEVCGLI AIANLKKNSF
     1141 ECRGCKNKTD IVQVYIPYAC KLLFQELMSM AIAPRMLTKH LKSAKGRQ
//