LOCUS ANM67520.1 1839 aa PRT PLN 23-MAR-2023 DEFINITION Arabidopsis thaliana RNA polymerase II large subunit protein. ACCESSION CP002687-6653 PROTEIN_ID ANM67520.1 SOURCE Arabidopsis thaliana (thale cress) ORGANISM Arabidopsis thaliana Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis. REFERENCE 1 (bases 1 to 18585056) AUTHORS Mayer,K., Schuller,C., Wambutt,R., Murphy,G., Volckaert,G., Pohl,T., Dusterhoft,A., Stiekema,W., Entian,K.D., Terryn,N., Harris,B., Ansorge,W., Brandt,P., Grivell,L., Rieger,M., Weichselgartner,M., de Simone,V., Obermaier,B., Mache,R., Muller,M., Kreis,M., Delseny,M., Puigdomenech,P., Watson,M., Schmidtheini,T., Reichert,B., Portatelle,D., Perez-Alonso,M., Boutry,M., Bancroft,I., Vos,P., Hoheisel,J., Zimmermann,W., Wedler,H., Ridley,P., Langham,S.A., McCullagh,B., Bilham,L., Robben,J., Van der Schueren,J., Grymonprez,B., Chuang,Y.J., Vandenbussche,F., Braeken,M., Weltjens,I., Voet,M., Bastiaens,I., Aert,R., Defoor,E., Weitzenegger,T., Bothe,G., Ramsperger,U., Hilbert,H., Braun,M., Holzer,E., Brandt,A., Peters,S., van Staveren,M., Dirske,W., Mooijman,P., Klein Lankhorst,R., Rose,M., Hauf,J., Kotter,P., Berneiser,S., Hempel,S., Feldpausch,M., Lamberth,S., Van den Daele,H., De Keyser,A., Buysshaert,C., Gielen,J., Villarroel,R., De Clercq,R., Van Montagu,M., Rogers,J., Cronin,A., Quail,M., Bray-Allen,S., Clark,L., Doggett,J., Hall,S., Kay,M., Lennard,N., McLay,K., Mayes,R., Pettett,A., Rajandream,M.A., Lyne,M., Benes,V., Rechmann,S., Borkova,D., Blocker,H., Scharfe,M., Grimm,M., Lohnert,T.H., Dose,S., de Haan,M., Maarse,A., Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M., Fartmann,B., Granderath,K., Dauner,D., Herzl,A., Neumann,S., Argiriou,A., Vitale,D., Liguori,R., Piravandi,E., Massenet,O., Quigley,F., Clabauld,G., Mundlein,A., Felber,R., Schnabl,S., Hiller,R., Schmidt,W., Lecharny,A., Aubourg,S., Chefdor,F., Cooke,R., Berger,C., Montfort,A., Casacuberta,E., Gibbons,T., Weber,N., Vandenbol,M., Bargues,M., Terol,J., Torres,A., Perez-Perez,A., Purnelle,B., Bent,E., Johnson,S., Tacon,D., Jesse,T., Heijnen,L., Schwarz,S., Scholler,P., Heber,S., Francs,P., Bielke,C., Frishman,D., Haase,D., Lemcke,K., Mewes,H.W., Stocker,S., Zaccaria,P., Bevan,M., Wilson,R.K., de la Bastide,M., Habermann,K., Parnell,L., Dedhia,N., Gnoj,L., Schutz,K., Huang,E., Spiegel,L., Sehkon,M., Murray,J., Sheet,P., Cordes,M., Abu-Threideh,J., Stoneking,T., Kalicki,J., Graves,T., Harmon,G., Edwards,J., Latreille,P., Courtney,L., Cloud,J., Abbott,A., Scott,K., Johnson,D., Minx,P., Bentley,D., Fulton,B., Miller,N., Greco,T., Kemp,K., Kramer,J., Fulton,L., Mardis,E., Dante,M., Pepin,K., Hillier,L., Nelson,J., Spieth,J., Ryan,E., Andrews,S., Geisel,C., Layman,D., Du,H., Ali,J., Berghoff,A., Jones,K., Drone,K., Cotton,M., Joshu,C., Antonoiu,B., Zidanic,M., Strong,C., Sun,H., Lamar,B., Yordan,C., Ma,P., Zhong,J., Preston,R., Vil,D., Shekher,M., Matero,A., Shah,R., Swaby,I.K., O'Shaughnessy,A., Rodriguez,M., Hoffmann,J., Till,S., Granat,S., Shohdy,N., Hasegawa,A., Hameed,A., Lodhi,M., Johnson,A., Chen,E., Marra,M., Martienssen,R. and McCombie,W.R. TITLE Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana JOURNAL Nature 402 (6763), 769-777 (1999) PUBMED 10617198 REFERENCE 2 (bases 1 to 18585056) AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E. CONSRTM TAIR TITLE Direct Submission JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie Institution, 260 Panama Street, Stanford, CA, USA REFERENCE 3 (bases 1 to 18585056) AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M., Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R., Vaughn,M. and Town,C.D. TITLE Direct Submission JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute, 9704 Medical Center Dr, Rockville, MD 20850, USA REMARK Protein update by submitter FEATURES Qualifiers source /organism="Arabidopsis thaliana" /mol_type="genomic DNA" /db_xref="taxon:3702" /chromosome="4" /ecotype="Columbia" protein /gene="NRPB1" /locus_tag="AT4G35800" /gene_synonym="F4B14.70" /gene_synonym="F4B14_70" /gene_synonym="RNA polymerase II large subunit" /gene_synonym="RNA POLYMERASE II LARGE SUBUNIT" /gene_synonym="RNA_POL_II_LS" /gene_synonym="RNA_POL_II_LSRNA_POL_II_LS" /gene_synonym="RPB1" /db_xref="Araport:AT4G35800" /db_xref="TAIR:AT4G35800" intron_pos 29:0 (1/12) intron_pos 117:0 (2/12) intron_pos 218:0 (3/12) intron_pos 271:0 (4/12) intron_pos 325:0 (5/12) intron_pos 398:2 (6/12) intron_pos 446:0 (7/12) intron_pos 558:0 (8/12) intron_pos 653:2 (9/12) intron_pos 740:0 (10/12) intron_pos 1760:2 (11/12) intron_pos 1784:2 (12/12) BEGIN 1 MDTRFPFSPA EVSKVRVVQF GILSPDEIRQ MSVIHVEHSE TTEKGKPKVG GLSDTRLGTI 61 DRKVKCETCM ANMAECPGHF GYLELAKPMY HVGFMKTVLS IMRCVCFNCS KILADEEEHK 121 FKQAMKIKNP KNRLKKILDA CKNKTKCDGG DDIDDVQSHS TDEPVKKSRG GCGAQQPKLT 181 IEGMKMIAEY KIQRKKNDEP DQLPEPAERK QTLGADRVLS VLKRISDADC QLLGFNPKFA 241 RPDWMILEVL PIPPPPVRPS VMMDATSRSE DDLTHQLAMI IRHNENLKRQ EKNGAPAHII 301 SEFTQLLQFH IATYFDNELP GQPRATQKSG RPIKSICSRL KAKEGRIRGN LMGKRVDFSA 361 RTVITPDPTI NIDELGVPWS IALNLTYPET VTPYNIERLK ELVDYGPHPP PGKTGAKYII 421 RDDGQRLDLR YLKKSSDQHL ELGYKVERHL QDGDFVLFNR QPSLHKMSIM GHRIRIMPYS 481 TFRLNLSVTS PYNADFDGDE MNMHVPQSFE TRAEVLELMM VPKCIVSPQA NRPVMGIVQD 541 TLLGCRKITK RDTFIEKDVF MNTLMWWEDF DGKVPAPAIL KPRPLWTGKQ VFNLIIPKQI 601 NLLRYSAWHA DTETGFITPG DTQVRIERGE LLAGTLCKKT LGTSNGSLVH VIWEEVGPDA 661 ARKFLGHTQW LVNYWLLQNG FTIGIGDTIA DSSTMEKINE TISNAKTAVK DLIRQFQGKE 721 LDPEPGRTMR DTFENRVNQV LNKARDDAGS SAQKSLAETN NLKAMVTAGS KGSFINISQM 781 TACVGQQNVE GKRIPFGFDG RTLPHFTKDD YGPESRGFVE NSYLRGLTPQ EFFFHAMGGR 841 EGLIDTAVKT SETGYIQRRL VKAMEDIMVK YDGTVRNSLG DVIQFLYGED GMDAVWIESQ 901 KLDSLKMKKS EFDRTFKYEI DDENWNPTYL SDEHLEDLKG IRELRDVFDA EYSKLETDRF 961 QLGTEIATNG DSTWPLPVNI KRHIWNAQKT FKIDLRKISD MHPVEIVDAV DKLQERLLVV 1021 PGDDALSVEA QKNATLFFNI LLRSTLASKR VLEEYKLSRE AFEWVIGEIE SRFLQSLVAP 1081 GEMIGCVAAQ SIGEPATQMT LNTFHYAGVS AKNVTLGVPR LREIINVAKR IKTPSLSVYL 1141 TPEASKSKEG AKTVQCALEY TTLRSVTQAT EVWYDPDPMS TIIEEDFEFV RSYYEMPDED 1201 VSPDKISPWL LRIELNREMM VDKKLSMADI AEKINLEFDD DLTCIFNDDN AQKLILRIRI 1261 MNDEGPKGEL QDESAEDDVF LKKIESNMLT EMALRGIPDI NKVFIKQVRK SRFDEEGGFK 1321 TSEEWMLDTE GVNLLAVMCH EDVDPKRTTS NHLIEIIEVL GIEAVRRALL DELRVVISFD 1381 GSYVNYRHLA ILCDTMTYRG HLMAITRHGI NRNDTGPLMR CSFEETVDIL LDAAAYAETD 1441 CLRGVTENIM LGQLAPIGTG DCELYLNDEM LKNAIELQLP SYMDGLEFGM TPARSPVSGT 1501 PYHEGMMSPN YLLSPNMRLS PMSDAQFSPY VGGMAFSPSS SPGYSPSSPG YSPTSPGYSP 1561 TSPGYSPTSP GYSPTSPTYS PSSPGYSPTS PAYSPTSPSY SPTSPSYSPT SPSYSPTSPS 1621 YSPTSPSYSP TSPSYSPTSP AYSPTSPAYS PTSPAYSPTS PSYSPTSPSY SPTSPSYSPT 1681 SPSYSPTSPS YSPTSPAYSP TSPGYSPTSP SYSPTSPSYG PTSPSYNPQS AKYSPSIAYS 1741 PSNARLSPAS PYSPTSPNYS PTSPSYSPTS PSYSPSSPTY SPSSPYSSGA SPDYSPSAGY 1801 SPTLPGYSPS STGQYTPHEG DKKDKTGKKD ASKDDKGNP //