LOCUS AEE83669.2 844 aa PRT PLN 23-MAR-2023
DEFINITION Arabidopsis thaliana RNA polymerase II, Rpb4, core protein protein.
ACCESSION CP002687-2611
PROTEIN_ID AEE83669.2
SOURCE Arabidopsis thaliana (thale cress)
ORGANISM Arabidopsis thaliana
Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
Camelineae; Arabidopsis.
REFERENCE 1 (bases 1 to 18585056)
AUTHORS Mayer,K., Schuller,C., Wambutt,R., Murphy,G., Volckaert,G.,
Pohl,T., Dusterhoft,A., Stiekema,W., Entian,K.D., Terryn,N.,
Harris,B., Ansorge,W., Brandt,P., Grivell,L., Rieger,M.,
Weichselgartner,M., de Simone,V., Obermaier,B., Mache,R.,
Muller,M., Kreis,M., Delseny,M., Puigdomenech,P., Watson,M.,
Schmidtheini,T., Reichert,B., Portatelle,D., Perez-Alonso,M.,
Boutry,M., Bancroft,I., Vos,P., Hoheisel,J., Zimmermann,W.,
Wedler,H., Ridley,P., Langham,S.A., McCullagh,B., Bilham,L.,
Robben,J., Van der Schueren,J., Grymonprez,B., Chuang,Y.J.,
Vandenbussche,F., Braeken,M., Weltjens,I., Voet,M., Bastiaens,I.,
Aert,R., Defoor,E., Weitzenegger,T., Bothe,G., Ramsperger,U.,
Hilbert,H., Braun,M., Holzer,E., Brandt,A., Peters,S., van
Staveren,M., Dirske,W., Mooijman,P., Klein Lankhorst,R., Rose,M.,
Hauf,J., Kotter,P., Berneiser,S., Hempel,S., Feldpausch,M.,
Lamberth,S., Van den Daele,H., De Keyser,A., Buysshaert,C.,
Gielen,J., Villarroel,R., De Clercq,R., Van Montagu,M., Rogers,J.,
Cronin,A., Quail,M., Bray-Allen,S., Clark,L., Doggett,J., Hall,S.,
Kay,M., Lennard,N., McLay,K., Mayes,R., Pettett,A.,
Rajandream,M.A., Lyne,M., Benes,V., Rechmann,S., Borkova,D.,
Blocker,H., Scharfe,M., Grimm,M., Lohnert,T.H., Dose,S., de
Haan,M., Maarse,A., Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M.,
Fartmann,B., Granderath,K., Dauner,D., Herzl,A., Neumann,S.,
Argiriou,A., Vitale,D., Liguori,R., Piravandi,E., Massenet,O.,
Quigley,F., Clabauld,G., Mundlein,A., Felber,R., Schnabl,S.,
Hiller,R., Schmidt,W., Lecharny,A., Aubourg,S., Chefdor,F.,
Cooke,R., Berger,C., Montfort,A., Casacuberta,E., Gibbons,T.,
Weber,N., Vandenbol,M., Bargues,M., Terol,J., Torres,A.,
Perez-Perez,A., Purnelle,B., Bent,E., Johnson,S., Tacon,D.,
Jesse,T., Heijnen,L., Schwarz,S., Scholler,P., Heber,S., Francs,P.,
Bielke,C., Frishman,D., Haase,D., Lemcke,K., Mewes,H.W.,
Stocker,S., Zaccaria,P., Bevan,M., Wilson,R.K., de la Bastide,M.,
Habermann,K., Parnell,L., Dedhia,N., Gnoj,L., Schutz,K., Huang,E.,
Spiegel,L., Sehkon,M., Murray,J., Sheet,P., Cordes,M.,
Abu-Threideh,J., Stoneking,T., Kalicki,J., Graves,T., Harmon,G.,
Edwards,J., Latreille,P., Courtney,L., Cloud,J., Abbott,A.,
Scott,K., Johnson,D., Minx,P., Bentley,D., Fulton,B., Miller,N.,
Greco,T., Kemp,K., Kramer,J., Fulton,L., Mardis,E., Dante,M.,
Pepin,K., Hillier,L., Nelson,J., Spieth,J., Ryan,E., Andrews,S.,
Geisel,C., Layman,D., Du,H., Ali,J., Berghoff,A., Jones,K.,
Drone,K., Cotton,M., Joshu,C., Antonoiu,B., Zidanic,M., Strong,C.,
Sun,H., Lamar,B., Yordan,C., Ma,P., Zhong,J., Preston,R., Vil,D.,
Shekher,M., Matero,A., Shah,R., Swaby,I.K., O'Shaughnessy,A.,
Rodriguez,M., Hoffmann,J., Till,S., Granat,S., Shohdy,N.,
Hasegawa,A., Hameed,A., Lodhi,M., Johnson,A., Chen,E., Marra,M.,
Martienssen,R. and McCombie,W.R.
TITLE Sequence and analysis of chromosome 4 of the plant Arabidopsis
thaliana
JOURNAL Nature 402 (6763), 769-777 (1999)
PUBMED 10617198
REFERENCE 2 (bases 1 to 18585056)
AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
CONSRTM TAIR
TITLE Direct Submission
JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE 3 (bases 1 to 18585056)
AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
Vaughn,M. and Town,C.D.
TITLE Direct Submission
JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
9704 Medical Center Dr, Rockville, MD 20850, USA
REMARK Protein update by submitter
FEATURES Qualifiers
source /organism="Arabidopsis thaliana"
/mol_type="genomic DNA"
/db_xref="taxon:3702"
/chromosome="4"
/ecotype="Columbia"
protein /gene="NRPD4"
/locus_tag="AT4G15950"
/gene_synonym="DL4012W"
/gene_synonym="FCAALL.240"
/gene_synonym="NRPE4"
/gene_synonym="RDM2"
/gene_synonym="RNA-DIRECTED DNA METHYLATION 2"
/inference="similar to RNA sequence,
EST:INSD:ES197708.1,INSD:EL316587.1,INSD:ES205252.1,
INSD:ES196420.1"
/note="NRPD4; FUNCTIONS IN: DNA-directed RNA polymerase
activity, nucleotide binding, catalytic activity; INVOLVED
IN: RNA interference, interphase, production of siRNA
involved in RNA interference, DNA methylation on cytosine
within a CHH sequence; LOCATED IN: DNA-directed RNA
polymerase V complex, nucleus, DNA-directed RNA polymerase
IV complex; EXPRESSED IN: 23 plant structures; EXPRESSED
DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s:
HRDC-like (InterPro:IPR010997), RNA polymerase II, Rpb4
(InterPro:IPR005574), RNA polymerase II, Rpb4, core
(InterPro:IPR006590); BEST Arabidopsis thaliana protein
match is: RNA polymerase II, Rpb4, core protein
(TAIR:AT5G09920.1); Has 23 Blast hits to 23 proteins in 3
species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0;
Plants - 23; Viruses - 0; Other Eukaryotes - 0 (source:
NCBI BLink)."
/db_xref="TAIR:AT4G15950"
/db_xref="Araport:AT4G15950"
intron_pos 18:1 (1/7)
intron_pos 38:1 (2/7)
intron_pos 84:1 (3/7)
intron_pos 122:0 (4/7)
intron_pos 148:0 (5/7)
intron_pos 159:1 (6/7)
intron_pos 182:1 (7/7)
BEGIN
1 MSVIAHVDHG KSTLTDSLVA AAGIIAQETA GDVRMTDTRA DEAERGITIK STGISLYYEM
61 TDASLKSFTG ARDGNEYLIN LIDSPGHVDF SSEVTAALRI TDGALVVVDC IEGVCVQTET
121 VLRQSLGERI RPVLTVNKMD RCFLELKVDG EEAYQNFQRV IENANVIMAT HEDPLLGDVQ
181 VYPEKGTVAF SAGLHGWAFT LTNFAKMYAS KFGVSESKMM ERLWGENFFD SATRKWTTKT
241 GSPTCKRGFV QFCYEPIKIM INTCMNDQKD KLWPMLEKLG IQMKPDEKEL MGKPLMKRVM
301 QAWLPASTAL LEMMIFHLPS PYTAQRYRVE NLYEGPLDDK YAAAIRNCDP DGPLMLYVSK
361 MIPASDKGRF FAFGRVFSGT VSTGMKVRIM GPNYVPGEKK DLYVKSVQRT VIWMGKKQET
421 VEDVPCGNTV AMVGLDQFIT KNGTLTNEKE VDAHPLRAMK FSVSPVVRVA VKCKLASDLP
481 KLVEGLKRLA KSDPMVLCTM EESGEHIVAG AGELHIEICV KDLQDFMGGA DIIVSDPVVS
541 LRETVFERSC RTVMSKSPNK HNRLYMEARP MEDGLAEAID EGRIGPSDDP KIRSKILAEE
601 FGWDKDLAKK IWAFGPDTTG PNMVVDMCKG VQYLNEIKDS VVAGFQWASK EGPLAEENMR
661 GVCYEVCDVV LHADAIHRGC GQMISTARRA IYASQLTAKP RLLEPVYMVE IQAPEGALGG
721 IYSVLNQKRG HVFEEMQRPG TPLYNIKAYL PVVESFGFSG QLRAATSGQA FPQCVFDHWD
781 MMSSDPLETG SQAATLVADI RKRKGLKLQM TPLSDYEDKL GNLJCVIMRN AATGGNLJCV
841 IATG
//