LOCUS AEC10458.1 809 aa PRT PLN 23-MAR-2023
DEFINITION Arabidopsis thaliana RNA-binding (RRM/RBD/RNP motifs)
family protein protein.
ACCESSION CP002685-6917
PROTEIN_ID AEC10458.1
SOURCE Arabidopsis thaliana (thale cress)
ORGANISM Arabidopsis thaliana
Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
Camelineae; Arabidopsis.
REFERENCE 1 (bases 1 to 19698289)
AUTHORS Lin,X., Kaul,S., Rounsley,S., Shea,T.P., Benito,M.I., Town,C.D.,
Fujii,C.Y., Mason,T., Bowman,C.L., Barnstead,M., Feldblyum,T.V.,
Buell,C.R., Ketchum,K.A., Lee,J., Ronning,C.M., Koo,H.L.,
Moffat,K.S., Cronin,L.A., Shen,M., Pai,G., Van Aken,S., Umayam,L.,
Tallon,L.J., Gill,J.E., Adams,M.D., Carrera,A.J., Creasy,T.H.,
Goodman,H.M., Somerville,C.R., Copenhaver,G.P., Preuss,D.,
Nierman,W.C., White,O., Eisen,J.A., Salzberg,S.L., Fraser,C.M. and
Venter,J.C.
TITLE Sequence and analysis of chromosome 2 of the plant Arabidopsis
thaliana
JOURNAL Nature 402 (6763), 761-768 (1999)
PUBMED 10617197
REFERENCE 2 (bases 1 to 19698289)
AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
CONSRTM TAIR
TITLE Direct Submission
JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE 3 (bases 1 to 19698289)
AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
Vaughn,M. and Town,C.D.
TITLE Direct Submission
JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
9704 Medical Center Dr, Rockville, MD 20850, USA
REMARK Protein update by submitter
FEATURES Qualifiers
source /organism="Arabidopsis thaliana"
/mol_type="genomic DNA"
/db_xref="taxon:3702"
/chromosome="2"
/ecotype="Columbia"
protein /locus_tag="AT2G44710"
/gene_synonym="F16B22.20"
/inference="Similar to RNA sequence,
EST:INSD:AV828892.1,INSD:AV525087.1,INSD:AV828909.1,
INSD:BP598727.1,INSD:AV803242.1,INSD:EL100692.1,
INSD:BP609980.1,INSD:BP609255.1,INSD:EH878588.1,
INSD:DR355744.1,INSD:BP784452.1,INSD:EG463035.1,
INSD:AV803308.1,INSD:BP805940.1,INSD:AV543614.1,
INSD:EG463034.1,INSD:BP598496.1,INSD:DR354209.1,
INSD:EL277376.1,INSD:BP611790.1,INSD:AV518322.1,
INSD:BP621098.1,INSD:CB263486.1,INSD:EH816008.1,
INSD:ES087346.1,INSD:EL244631.1,INSD:ES129935.1,
INSD:BP785331.1"
/inference="similar to RNA sequence,
mRNA:INSD:AK230292.1,INSD:BT000637.1,INSD:AY091781.1,
INSD:BX819714.1"
/note="RNA-binding (RRM/RBD/RNP motifs) family protein;
FUNCTIONS IN: RNA binding, nucleotide binding, nucleic
acid binding; EXPRESSED IN: 22 plant structures; EXPRESSED
DURING: 12 growth stages; CONTAINS InterPro DOMAIN/s: RNA
recognition motif, RNP-1 (InterPro:IPR000504),
Nucleotide-binding, alpha-beta plait (InterPro:IPR012677);
BEST Arabidopsis thaliana protein match is: RNA-binding
(RRM/RBD/RNP motifs) family protein (TAIR:AT4G00830.2);
Has 31429 Blast hits to 23398 proteins in 2969 species:
Archae - 812; Bacteria - 21040; Metazoa - 874; Fungi -
1027; Plants - 329; Viruses - 0; Other Eukaryotes - 7347
(source: NCBI BLink)."
/db_xref="Araport:AT2G44710"
/db_xref="TAIR:AT2G44710"
intron_pos 94:1 (1/10)
intron_pos 280:0 (2/10)
intron_pos 310:0 (3/10)
intron_pos 390:0 (4/10)
intron_pos 464:0 (5/10)
intron_pos 583:1 (6/10)
intron_pos 688:0 (7/10)
intron_pos 722:2 (8/10)
intron_pos 757:1 (9/10)
intron_pos 781:0 (10/10)
BEGIN
1 MPPKVVKRGG AARRGGRLTR SALKAQNPHV ESSHDESVNI GELSGSDALE AKEVTPEVDK
61 TVEEENPLDV PKSSDSIDDS EAAANPHVDV PSKKETEVEE SVDDFGKDER LDLDDNEPEY
121 EAEEYGGEEF EERELGQEDH ELVNEEGEEL EEEIEVEEEA GEFADEIGDG AEDLDSEDDD
181 DDHAIEEVKH GETVDVEEEE HHDVLHERRK RKEFEIFVGS LDKGASEEDL KKVFGHVGEV
241 TEVRILKNPQ TKKSKGSAFL RFATVEQAKR AVKELKSPMI NGKKCGVTAS QDNDTLFVGN
301 ICKIWTPEAL REKLKHYGVE NMDDITLVED SNNVNMNRGY AFLEFSSRSD AMDAHKRLVK
361 KDVMFGVEKP AKVSFTDSFL DLEDEIMAQV KTIFIDGLLP SWNEERVRDL LKPYGKLEKV
421 ELARNMPSAR RKDFGFVTFD THEAAVSCAK FINNSELGEG EDKAKVRARL SRPLQKAGKG
481 RQSSRSDQRS RHGAGRSGRS SFARLPPRSL ASSRSARGAG SRAPSSSAKR ASGSRGRRPR
541 PPLPPPARAR PLPPPARARP MPPPARARPL PPPARSYDRR PPVPLYPKAS LKRDYDRRDE
601 LPPPRSRPAV SYSSRLSPER HLSYRDDYPP RGSGYSDLPR SSSRSEIRRP FVDDLYSPRF
661 ERPPSYSEGR PRAYEPLPGS KRPYAALDDL PPRYADVDVR HSRPRLDYDV GPSQYGESYG
721 DRIPRSSLGY GSSRNSMSNH DSRGPYSSRQ GMDYGGGSYS GSDVGGMYSS SYGGDLPRRD
781 GGGSSYSSIY SSRGLGGSSY SGGGPGSYY
//