LOCUS       AEC09767.1 1976 aa PRT PLN 23-MAR-2023
DEFINITION  Arabidopsis thaliana nuclear RNA polymerase D1B protein.
ACCESSION   CP002685-5979
PROTEIN_ID  AEC09767.1
SOURCE      Arabidopsis thaliana (thale cress)
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
            Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
            Camelineae; Arabidopsis.
REFERENCE   1 (bases 1 to 19698289)
  AUTHORS   Lin,X., Kaul,S., Rounsley,S., Shea,T.P., Benito,M.I., Town,C.D.,
            Fujii,C.Y., Mason,T., Bowman,C.L., Barnstead,M., Feldblyum,T.V.,
            Buell,C.R., Ketchum,K.A., Lee,J., Ronning,C.M., Koo,H.L.,
            Moffat,K.S., Cronin,L.A., Shen,M., Pai,G., Van Aken,S., Umayam,L.,
            Tallon,L.J., Gill,J.E., Adams,M.D., Carrera,A.J., Creasy,T.H.,
            Goodman,H.M., Somerville,C.R., Copenhaver,G.P., Preuss,D.,
            Nierman,W.C., White,O., Eisen,J.A., Salzberg,S.L., Fraser,C.M. and
            Venter,J.C.
  TITLE     Sequence and analysis of chromosome 2 of the plant Arabidopsis
            thaliana
  JOURNAL   Nature 402 (6763), 761-768 (1999)
  PUBMED    10617197
REFERENCE   2 (bases 1 to 19698289)
  AUTHORS   Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
  CONSRTM   TAIR
  TITLE     Direct Submission
  JOURNAL   Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
            Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE   3 (bases 1 to 19698289)
  AUTHORS   Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
            Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
            Vaughn,M. and Town,C.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
            9704 Medical Center Dr, Rockville, MD 20850, USA
  REMARK    Protein update by submitter
FEATURES             Qualifiers
     source
                     /organism="Arabidopsis thaliana"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:3702"
                     /chromosome="2"
                     /ecotype="Columbia"
     protein
                     /gene="NRPD1B"
                     /locus_tag="AT2G40030"
                     /gene_synonym="ATNRPD1B"
                     /gene_synonym="DEFECTIVE IN MERISTEM SILENCING 5"
                     /gene_synonym="DMS5"
                     /gene_synonym="DRD3"
                     /gene_synonym="NRPE1"
                     /gene_synonym="nuclear RNA polymerase D1B"
                     /gene_synonym="T28M21.19"
                     /gene_synonym="T28M21_19"
                     /inference="Similar to RNA sequence,
                     EST:INSD:Z34832.1,INSD:ES080084.1,INSD:EL967379.1,
                     INSD:ES042172.1,INSD:ES041018.1,INSD:EH821470.1,
                     INSD:EH966998.1,INSD:ES176080.1,INSD:EL144975.1,
                     INSD:EH986213.1,INSD:ES055069.1,INSD:Z34069.1,
                     INSD:EH813698.1"
                     /inference="similar to RNA sequence,
                     mRNA:INSD:AY927744.1,INSD:DQ020656.1,INSD:AY826516.1"
                     /note="nuclear RNA polymerase D1B (NRPD1B); CONTAINS
                     InterPro DOMAIN/s: Protein of unknown function DUF3223
                     (InterPro:IPR021602), RNA polymerase, N-terminal
                     (InterPro:IPR006592), RNA polymerase, alpha subunit
                     (InterPro:IPR000722), RNA polymerase Rpb1, domain 3
                     (InterPro:IPR007066), RNA polymerase Rpb1, domain 1
                     (InterPro:IPR007080), RNA polymerase Rpb1, domain 5
                     (InterPro:IPR007081); BEST Arabidopsis thaliana protein
                     match is: nuclear RNA polymerase D1A (TAIR:AT1G63020.2);
                     Has 52919 Blast hits to 31940 proteins in 6835 species:
                     Archae - 366; Bacteria - 10380; Metazoa - 13235; Fungi -
                     6920; Plants - 7147; Viruses - 757; Other Eukaryotes -
                     14114 (source: NCBI BLink)."
                     /db_xref="Araport:AT2G40030"
                     /db_xref="TAIR:AT2G40030"
     intron_pos      28:0 (1/16)
     intron_pos      70:1 (2/16)
     intron_pos      109:0 (3/16)
     intron_pos      125:0 (4/16)
     intron_pos      180:0 (5/16)
     intron_pos      233:0 (6/16)
     intron_pos      860:0 (7/16)
     intron_pos      918:2 (8/16)
     intron_pos      944:0 (9/16)
     intron_pos      987:2 (10/16)
     intron_pos      1039:1 (11/16)
     intron_pos      1131:0 (12/16)
     intron_pos      1189:0 (13/16)
     intron_pos      1236:0 (14/16)
     intron_pos      1758:2 (15/16)
     intron_pos      1797:0 (16/16)
BEGIN
        1 MEEESTSEIL DGEIVGITFA LASHHEICIQ SISESAINHP SQLTNAFLGL PLEFGKCESC
       61 GATEPDKCEG HFGYIQLPVP IYHPAHVNEL KQMLSLLCLK CLKIKKAKGT SGGLADRLLG
      121 VCCEEASQIS IKDRASDGAS YLELKLPSRS RLQPGCWNFL ERYGYRYGSD YTRPLLAREV
      181 KEILRRIPEE SRKKLTAKGH IPQEGYILEY LPVPPNCLSV PEASDGFSTM SVDPSRIELK
      241 DVLKKVIAIK SSRSGETNFE SHKAEASEMF RVVDTYLQVR GTAKAARNID MRYGVSKISD
      301 SSSSKAWTEK MRTLFIRKGS GFSSRSVITG DAYRHVNEVG IPIEIAQRIT FEERVSVHNR
      361 GYLQKLVDDK LCLSYTQGST TYSLRDGSKG HTELKPGQVV HRRVMDGDVV FINRPPTTHK
      421 HSLQALRVYV HEDNTVKINP LMCSPLSADF DGDCVHLFYP QSLSAKAEVM ELFSVEKQLL
      481 SSHTGQLILQ MGSDSLLSLR VMLERVFLDK ATAQQLAMYG SLSLPPPALR KSSKSGPAWT
      541 VFQILQLAFP ERLSCKGDRF LVDGSDLLKF DFGVDAMGSI INEIVTSIFL EKGPKETLGF
      601 FDSLQPLLME SLFAEGFSLS LEDLSMSRAD MDVIHNLIIR EISPMVSRLR LSYRDELQLE
      661 NSIHKVKEVA ANFMLKSYSI RNLIDIKSNS AITKLVQQTG FLGLQLSDKK KFYTKTLVED
      721 MAIFCKRKYG RISSSGDFGI VKGCFFHGLD PYEEMAHSIA AREVIVRSSR GLAEPGTLFK
      781 NLMAVLRDIV ITNDGTVRNT CSNSVIQFKY GVDSERGHQG LFEAGEPVGV LAATAMSNPA
      841 YKAVLDSSPN SNSSWELMKE VLLCKVNFQN TTNDRRVILY LNECHCGKRF CQENAACTVR
      901 NKLNKVSLKD TAVEFLVEYR KQPTISEIFG IDSCLHGHIH LNKTLLQDWN ISMQDIHQKC
      961 EDVINSLGQK KKKKATDDFK RTSLSVSECC SFRDPCGSKG SDMPCLTFSY NATDPDLERT
     1021 LDVLCNTVYP VLLEIVIKGD SRICSANIIW NSSDMTTWIR NRHASRRGEW VLDVTVEKSA
     1081 VKQSGDAWRV VIDSCLSVLH LIDTKRSIPY SVKQVQELLG LSCAFEQAVQ RLSASVRMVS
     1141 KGVLKEHIIL LANNMTCSGT MLGFNSGGYK ALTRSLNIKA PFTEATLIAP RKCFEKAAEK
     1201 CHTDSLSTVV GSCSWGKRVD VGTGSQFELL WNQKETGLDD KEETDVYSFL QMVISTTNAD
     1261 AFVSSPGFDV TEEEMAEWAE SPERDSALGE PKFEDSADFQ NLHDEGKPSG ANWEKSSSWD
     1321 NGCSGGSEWG VSKSTGGEAN PESNWEKTTN VEKEDAWSSW NTRKDAQESS KSDSGGAWGI
     1381 KTKDADADTT PNWETSPAPK DSIVPENNEP TSDVWGHKSV SDKSWDKKNW GTESAPAAWG
     1441 STDAAVWGSS DKKNSETESD AAAWGSRDKN NSDVGSGAGV LGPWNKKSSE TESNGATWGS
     1501 SDKTKSGAAA WNSWDKKNIE TDSEPAAWGS QGKKNSETES GPAAWGAWDK KKSETEPGPA
     1561 GWGMGDKKNS ETELGPAAMG NWDKKKSDTK SGPAAWGSTD AAAWGSSDKN NSETESDAAA
     1621 WGSRNKKTSE IESGAGAWGS WGQPSPTAED KDTNEDDRNP WVSLKETKSR EKDDKERSQW
     1681 GNPAKKFPSS GGWSNGGGAD WKGNRNHTPR PPRSEDNLAP MFTATRQRLD SFTSEEQELL
     1741 SDVEPVMRTL RKIMHPSAYP DGDPISDDDK TFVLEKILNF HPQKETKLGS GVDFITVDKH
     1801 TIFSDSRCFF VVSTDGAKQD FSYRKSLNNY LMKKYPDRAE EFIDKYFTKP RPSGNRDRNN
     1861 QDATPPGEEQ SQPPNQSIGN GGDDFQTQTQ SQSPSQTRAQ SPSQAQAQSP SQTQSQSQSQ
     1921 SQSQSQSQSQ SQSQSQSQSQ SQSQSQSPSQ TQTQSPSQTQ AQAQSPSSQS PSQTQT
//