LOCUS       ANM67520.1              1839 aa    PRT              PLN 23-MAR-2023
DEFINITION  Arabidopsis thaliana RNA polymerase II large subunit protein.
ACCESSION   CP002687-6653
PROTEIN_ID  ANM67520.1
SOURCE      Arabidopsis thaliana (thale cress)
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
            Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
            Camelineae; Arabidopsis.
REFERENCE   1  (bases 1 to 18585056)
  AUTHORS   Mayer,K., Schuller,C., Wambutt,R., Murphy,G., Volckaert,G.,
            Pohl,T., Dusterhoft,A., Stiekema,W., Entian,K.D., Terryn,N.,
            Harris,B., Ansorge,W., Brandt,P., Grivell,L., Rieger,M.,
            Weichselgartner,M., de Simone,V., Obermaier,B., Mache,R.,
            Muller,M., Kreis,M., Delseny,M., Puigdomenech,P., Watson,M.,
            Schmidtheini,T., Reichert,B., Portatelle,D., Perez-Alonso,M.,
            Boutry,M., Bancroft,I., Vos,P., Hoheisel,J., Zimmermann,W.,
            Wedler,H., Ridley,P., Langham,S.A., McCullagh,B., Bilham,L.,
            Robben,J., Van der Schueren,J., Grymonprez,B., Chuang,Y.J.,
            Vandenbussche,F., Braeken,M., Weltjens,I., Voet,M., Bastiaens,I.,
            Aert,R., Defoor,E., Weitzenegger,T., Bothe,G., Ramsperger,U.,
            Hilbert,H., Braun,M., Holzer,E., Brandt,A., Peters,S., van
            Staveren,M., Dirske,W., Mooijman,P., Klein Lankhorst,R., Rose,M.,
            Hauf,J., Kotter,P., Berneiser,S., Hempel,S., Feldpausch,M.,
            Lamberth,S., Van den Daele,H., De Keyser,A., Buysshaert,C.,
            Gielen,J., Villarroel,R., De Clercq,R., Van Montagu,M., Rogers,J.,
            Cronin,A., Quail,M., Bray-Allen,S., Clark,L., Doggett,J., Hall,S.,
            Kay,M., Lennard,N., McLay,K., Mayes,R., Pettett,A.,
            Rajandream,M.A., Lyne,M., Benes,V., Rechmann,S., Borkova,D.,
            Blocker,H., Scharfe,M., Grimm,M., Lohnert,T.H., Dose,S., de
            Haan,M., Maarse,A., Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M.,
            Fartmann,B., Granderath,K., Dauner,D., Herzl,A., Neumann,S.,
            Argiriou,A., Vitale,D., Liguori,R., Piravandi,E., Massenet,O.,
            Quigley,F., Clabauld,G., Mundlein,A., Felber,R., Schnabl,S.,
            Hiller,R., Schmidt,W., Lecharny,A., Aubourg,S., Chefdor,F.,
            Cooke,R., Berger,C., Montfort,A., Casacuberta,E., Gibbons,T.,
            Weber,N., Vandenbol,M., Bargues,M., Terol,J., Torres,A.,
            Perez-Perez,A., Purnelle,B., Bent,E., Johnson,S., Tacon,D.,
            Jesse,T., Heijnen,L., Schwarz,S., Scholler,P., Heber,S., Francs,P.,
            Bielke,C., Frishman,D., Haase,D., Lemcke,K., Mewes,H.W.,
            Stocker,S., Zaccaria,P., Bevan,M., Wilson,R.K., de la Bastide,M.,
            Habermann,K., Parnell,L., Dedhia,N., Gnoj,L., Schutz,K., Huang,E.,
            Spiegel,L., Sehkon,M., Murray,J., Sheet,P., Cordes,M.,
            Abu-Threideh,J., Stoneking,T., Kalicki,J., Graves,T., Harmon,G.,
            Edwards,J., Latreille,P., Courtney,L., Cloud,J., Abbott,A.,
            Scott,K., Johnson,D., Minx,P., Bentley,D., Fulton,B., Miller,N.,
            Greco,T., Kemp,K., Kramer,J., Fulton,L., Mardis,E., Dante,M.,
            Pepin,K., Hillier,L., Nelson,J., Spieth,J., Ryan,E., Andrews,S.,
            Geisel,C., Layman,D., Du,H., Ali,J., Berghoff,A., Jones,K.,
            Drone,K., Cotton,M., Joshu,C., Antonoiu,B., Zidanic,M., Strong,C.,
            Sun,H., Lamar,B., Yordan,C., Ma,P., Zhong,J., Preston,R., Vil,D.,
            Shekher,M., Matero,A., Shah,R., Swaby,I.K., O'Shaughnessy,A.,
            Rodriguez,M., Hoffmann,J., Till,S., Granat,S., Shohdy,N.,
            Hasegawa,A., Hameed,A., Lodhi,M., Johnson,A., Chen,E., Marra,M.,
            Martienssen,R. and McCombie,W.R.
  TITLE     Sequence and analysis of chromosome 4 of the plant Arabidopsis
            thaliana
  JOURNAL   Nature 402 (6763), 769-777 (1999)
   PUBMED   10617198
REFERENCE   2  (bases 1 to 18585056)
  AUTHORS   Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
  CONSRTM   TAIR
  TITLE     Direct Submission
  JOURNAL   Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
            Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE   3  (bases 1 to 18585056)
  AUTHORS   Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
            Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
            Vaughn,M. and Town,C.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
            9704 Medical Center Dr, Rockville, MD 20850, USA
  REMARK    Protein update by submitter
FEATURES             Qualifiers
     source          /organism="Arabidopsis thaliana"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:3702"
                     /chromosome="4"
                     /ecotype="Columbia"
     protein         /gene="NRPB1"
                     /locus_tag="AT4G35800"
                     /gene_synonym="F4B14.70"
                     /gene_synonym="F4B14_70"
                     /gene_synonym="RNA polymerase II large subunit"
                     /gene_synonym="RNA POLYMERASE II LARGE SUBUNIT"
                     /gene_synonym="RNA_POL_II_LS"
                     /gene_synonym="RNA_POL_II_LSRNA_POL_II_LS"
                     /gene_synonym="RPB1"
                     /db_xref="Araport:AT4G35800"
                     /db_xref="TAIR:AT4G35800"
     intron_pos      29:0 (1/12)
     intron_pos      117:0 (2/12)
     intron_pos      218:0 (3/12)
     intron_pos      271:0 (4/12)
     intron_pos      325:0 (5/12)
     intron_pos      398:2 (6/12)
     intron_pos      446:0 (7/12)
     intron_pos      558:0 (8/12)
     intron_pos      653:2 (9/12)
     intron_pos      740:0 (10/12)
     intron_pos      1760:2 (11/12)
     intron_pos      1784:2 (12/12)
BEGIN
        1 MDTRFPFSPA EVSKVRVVQF GILSPDEIRQ MSVIHVEHSE TTEKGKPKVG GLSDTRLGTI
       61 DRKVKCETCM ANMAECPGHF GYLELAKPMY HVGFMKTVLS IMRCVCFNCS KILADEEEHK
      121 FKQAMKIKNP KNRLKKILDA CKNKTKCDGG DDIDDVQSHS TDEPVKKSRG GCGAQQPKLT
      181 IEGMKMIAEY KIQRKKNDEP DQLPEPAERK QTLGADRVLS VLKRISDADC QLLGFNPKFA
      241 RPDWMILEVL PIPPPPVRPS VMMDATSRSE DDLTHQLAMI IRHNENLKRQ EKNGAPAHII
      301 SEFTQLLQFH IATYFDNELP GQPRATQKSG RPIKSICSRL KAKEGRIRGN LMGKRVDFSA
      361 RTVITPDPTI NIDELGVPWS IALNLTYPET VTPYNIERLK ELVDYGPHPP PGKTGAKYII
      421 RDDGQRLDLR YLKKSSDQHL ELGYKVERHL QDGDFVLFNR QPSLHKMSIM GHRIRIMPYS
      481 TFRLNLSVTS PYNADFDGDE MNMHVPQSFE TRAEVLELMM VPKCIVSPQA NRPVMGIVQD
      541 TLLGCRKITK RDTFIEKDVF MNTLMWWEDF DGKVPAPAIL KPRPLWTGKQ VFNLIIPKQI
      601 NLLRYSAWHA DTETGFITPG DTQVRIERGE LLAGTLCKKT LGTSNGSLVH VIWEEVGPDA
      661 ARKFLGHTQW LVNYWLLQNG FTIGIGDTIA DSSTMEKINE TISNAKTAVK DLIRQFQGKE
      721 LDPEPGRTMR DTFENRVNQV LNKARDDAGS SAQKSLAETN NLKAMVTAGS KGSFINISQM
      781 TACVGQQNVE GKRIPFGFDG RTLPHFTKDD YGPESRGFVE NSYLRGLTPQ EFFFHAMGGR
      841 EGLIDTAVKT SETGYIQRRL VKAMEDIMVK YDGTVRNSLG DVIQFLYGED GMDAVWIESQ
      901 KLDSLKMKKS EFDRTFKYEI DDENWNPTYL SDEHLEDLKG IRELRDVFDA EYSKLETDRF
      961 QLGTEIATNG DSTWPLPVNI KRHIWNAQKT FKIDLRKISD MHPVEIVDAV DKLQERLLVV
     1021 PGDDALSVEA QKNATLFFNI LLRSTLASKR VLEEYKLSRE AFEWVIGEIE SRFLQSLVAP
     1081 GEMIGCVAAQ SIGEPATQMT LNTFHYAGVS AKNVTLGVPR LREIINVAKR IKTPSLSVYL
     1141 TPEASKSKEG AKTVQCALEY TTLRSVTQAT EVWYDPDPMS TIIEEDFEFV RSYYEMPDED
     1201 VSPDKISPWL LRIELNREMM VDKKLSMADI AEKINLEFDD DLTCIFNDDN AQKLILRIRI
     1261 MNDEGPKGEL QDESAEDDVF LKKIESNMLT EMALRGIPDI NKVFIKQVRK SRFDEEGGFK
     1321 TSEEWMLDTE GVNLLAVMCH EDVDPKRTTS NHLIEIIEVL GIEAVRRALL DELRVVISFD
     1381 GSYVNYRHLA ILCDTMTYRG HLMAITRHGI NRNDTGPLMR CSFEETVDIL LDAAAYAETD
     1441 CLRGVTENIM LGQLAPIGTG DCELYLNDEM LKNAIELQLP SYMDGLEFGM TPARSPVSGT
     1501 PYHEGMMSPN YLLSPNMRLS PMSDAQFSPY VGGMAFSPSS SPGYSPSSPG YSPTSPGYSP
     1561 TSPGYSPTSP GYSPTSPTYS PSSPGYSPTS PAYSPTSPSY SPTSPSYSPT SPSYSPTSPS
     1621 YSPTSPSYSP TSPSYSPTSP AYSPTSPAYS PTSPAYSPTS PSYSPTSPSY SPTSPSYSPT
     1681 SPSYSPTSPS YSPTSPAYSP TSPGYSPTSP SYSPTSPSYG PTSPSYNPQS AKYSPSIAYS
     1741 PSNARLSPAS PYSPTSPNYS PTSPSYSPTS PSYSPSSPTY SPSSPYSSGA SPDYSPSAGY
     1801 SPTLPGYSPS STGQYTPHEG DKKDKTGKKD ASKDDKGNP
//