LOCUS AEC09767.1 1976 aa PRT PLN 23-MAR-2023
DEFINITION Arabidopsis thaliana nuclear RNA polymerase D1B protein.
ACCESSION CP002685-5979
PROTEIN_ID AEC09767.1
SOURCE Arabidopsis thaliana (thale cress)
ORGANISM Arabidopsis thaliana
Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
Camelineae; Arabidopsis.
REFERENCE 1 (bases 1 to 19698289)
AUTHORS Lin,X., Kaul,S., Rounsley,S., Shea,T.P., Benito,M.I., Town,C.D.,
Fujii,C.Y., Mason,T., Bowman,C.L., Barnstead,M., Feldblyum,T.V.,
Buell,C.R., Ketchum,K.A., Lee,J., Ronning,C.M., Koo,H.L.,
Moffat,K.S., Cronin,L.A., Shen,M., Pai,G., Van Aken,S., Umayam,L.,
Tallon,L.J., Gill,J.E., Adams,M.D., Carrera,A.J., Creasy,T.H.,
Goodman,H.M., Somerville,C.R., Copenhaver,G.P., Preuss,D.,
Nierman,W.C., White,O., Eisen,J.A., Salzberg,S.L., Fraser,C.M. and
Venter,J.C.
TITLE Sequence and analysis of chromosome 2 of the plant Arabidopsis
thaliana
JOURNAL Nature 402 (6763), 761-768 (1999)
PUBMED 10617197
REFERENCE 2 (bases 1 to 19698289)
AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
CONSRTM TAIR
TITLE Direct Submission
JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE 3 (bases 1 to 19698289)
AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
Vaughn,M. and Town,C.D.
TITLE Direct Submission
JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
9704 Medical Center Dr, Rockville, MD 20850, USA
REMARK Protein update by submitter
FEATURES Qualifiers
source /organism="Arabidopsis thaliana"
/mol_type="genomic DNA"
/db_xref="taxon:3702"
/chromosome="2"
/ecotype="Columbia"
protein /gene="NRPD1B"
/locus_tag="AT2G40030"
/gene_synonym="ATNRPD1B"
/gene_synonym="DEFECTIVE IN MERISTEM SILENCING 5"
/gene_synonym="DMS5"
/gene_synonym="DRD3"
/gene_synonym="NRPE1"
/gene_synonym="nuclear RNA polymerase D1B"
/gene_synonym="T28M21.19"
/gene_synonym="T28M21_19"
/inference="Similar to RNA sequence,
EST:INSD:Z34832.1,INSD:ES080084.1,INSD:EL967379.1,
INSD:ES042172.1,INSD:ES041018.1,INSD:EH821470.1,
INSD:EH966998.1,INSD:ES176080.1,INSD:EL144975.1,
INSD:EH986213.1,INSD:ES055069.1,INSD:Z34069.1,
INSD:EH813698.1"
/inference="similar to RNA sequence,
mRNA:INSD:AY927744.1,INSD:DQ020656.1,INSD:AY826516.1"
/note="nuclear RNA polymerase D1B (NRPD1B); CONTAINS
InterPro DOMAIN/s: Protein of unknown function DUF3223
(InterPro:IPR021602), RNA polymerase, N-terminal
(InterPro:IPR006592), RNA polymerase, alpha subunit
(InterPro:IPR000722), RNA polymerase Rpb1, domain 3
(InterPro:IPR007066), RNA polymerase Rpb1, domain 1
(InterPro:IPR007080), RNA polymerase Rpb1, domain 5
(InterPro:IPR007081); BEST Arabidopsis thaliana protein
match is: nuclear RNA polymerase D1A (TAIR:AT1G63020.2);
Has 52919 Blast hits to 31940 proteins in 6835 species:
Archae - 366; Bacteria - 10380; Metazoa - 13235; Fungi -
6920; Plants - 7147; Viruses - 757; Other Eukaryotes -
14114 (source: NCBI BLink)."
/db_xref="Araport:AT2G40030"
/db_xref="TAIR:AT2G40030"
intron_pos 28:0 (1/16)
intron_pos 70:1 (2/16)
intron_pos 109:0 (3/16)
intron_pos 125:0 (4/16)
intron_pos 180:0 (5/16)
intron_pos 233:0 (6/16)
intron_pos 860:0 (7/16)
intron_pos 918:2 (8/16)
intron_pos 944:0 (9/16)
intron_pos 987:2 (10/16)
intron_pos 1039:1 (11/16)
intron_pos 1131:0 (12/16)
intron_pos 1189:0 (13/16)
intron_pos 1236:0 (14/16)
intron_pos 1758:2 (15/16)
intron_pos 1797:0 (16/16)
BEGIN
1 MEEESTSEIL DGEIVGITFA LASHHEICIQ SISESAINHP SQLTNAFLGL PLEFGKCESC
61 GATEPDKCEG HFGYIQLPVP IYHPAHVNEL KQMLSLLCLK CLKIKKAKGT SGGLADRLLG
121 VCCEEASQIS IKDRASDGAS YLELKLPSRS RLQPGCWNFL ERYGYRYGSD YTRPLLAREV
181 KEILRRIPEE SRKKLTAKGH IPQEGYILEY LPVPPNCLSV PEASDGFSTM SVDPSRIELK
241 DVLKKVIAIK SSRSGETNFE SHKAEASEMF RVVDTYLQVR GTAKAARNID MRYGVSKISD
301 SSSSKAWTEK MRTLFIRKGS GFSSRSVITG DAYRHVNEVG IPIEIAQRIT FEERVSVHNR
361 GYLQKLVDDK LCLSYTQGST TYSLRDGSKG HTELKPGQVV HRRVMDGDVV FINRPPTTHK
421 HSLQALRVYV HEDNTVKINP LMCSPLSADF DGDCVHLFYP QSLSAKAEVM ELFSVEKQLL
481 SSHTGQLILQ MGSDSLLSLR VMLERVFLDK ATAQQLAMYG SLSLPPPALR KSSKSGPAWT
541 VFQILQLAFP ERLSCKGDRF LVDGSDLLKF DFGVDAMGSI INEIVTSIFL EKGPKETLGF
601 FDSLQPLLME SLFAEGFSLS LEDLSMSRAD MDVIHNLIIR EISPMVSRLR LSYRDELQLE
661 NSIHKVKEVA ANFMLKSYSI RNLIDIKSNS AITKLVQQTG FLGLQLSDKK KFYTKTLVED
721 MAIFCKRKYG RISSSGDFGI VKGCFFHGLD PYEEMAHSIA AREVIVRSSR GLAEPGTLFK
781 NLMAVLRDIV ITNDGTVRNT CSNSVIQFKY GVDSERGHQG LFEAGEPVGV LAATAMSNPA
841 YKAVLDSSPN SNSSWELMKE VLLCKVNFQN TTNDRRVILY LNECHCGKRF CQENAACTVR
901 NKLNKVSLKD TAVEFLVEYR KQPTISEIFG IDSCLHGHIH LNKTLLQDWN ISMQDIHQKC
961 EDVINSLGQK KKKKATDDFK RTSLSVSECC SFRDPCGSKG SDMPCLTFSY NATDPDLERT
1021 LDVLCNTVYP VLLEIVIKGD SRICSANIIW NSSDMTTWIR NRHASRRGEW VLDVTVEKSA
1081 VKQSGDAWRV VIDSCLSVLH LIDTKRSIPY SVKQVQELLG LSCAFEQAVQ RLSASVRMVS
1141 KGVLKEHIIL LANNMTCSGT MLGFNSGGYK ALTRSLNIKA PFTEATLIAP RKCFEKAAEK
1201 CHTDSLSTVV GSCSWGKRVD VGTGSQFELL WNQKETGLDD KEETDVYSFL QMVISTTNAD
1261 AFVSSPGFDV TEEEMAEWAE SPERDSALGE PKFEDSADFQ NLHDEGKPSG ANWEKSSSWD
1321 NGCSGGSEWG VSKSTGGEAN PESNWEKTTN VEKEDAWSSW NTRKDAQESS KSDSGGAWGI
1381 KTKDADADTT PNWETSPAPK DSIVPENNEP TSDVWGHKSV SDKSWDKKNW GTESAPAAWG
1441 STDAAVWGSS DKKNSETESD AAAWGSRDKN NSDVGSGAGV LGPWNKKSSE TESNGATWGS
1501 SDKTKSGAAA WNSWDKKNIE TDSEPAAWGS QGKKNSETES GPAAWGAWDK KKSETEPGPA
1561 GWGMGDKKNS ETELGPAAMG NWDKKKSDTK SGPAAWGSTD AAAWGSSDKN NSETESDAAA
1621 WGSRNKKTSE IESGAGAWGS WGQPSPTAED KDTNEDDRNP WVSLKETKSR EKDDKERSQW
1681 GNPAKKFPSS GGWSNGGGAD WKGNRNHTPR PPRSEDNLAP MFTATRQRLD SFTSEEQELL
1741 SDVEPVMRTL RKIMHPSAYP DGDPISDDDK TFVLEKILNF HPQKETKLGS GVDFITVDKH
1801 TIFSDSRCFF VVSTDGAKQD FSYRKSLNNY LMKKYPDRAE EFIDKYFTKP RPSGNRDRNN
1861 QDATPPGEEQ SQPPNQSIGN GGDDFQTQTQ SQSPSQTRAQ SPSQAQAQSP SQTQSQSQSQ
1921 SQSQSQSQSQ SQSQSQSQSQ SQSQSQSPSQ TQTQSPSQTQ AQAQSPSSQS PSQTQT
//