LOCUS AEE86973.1 2332 aa PRT PLN 23-MAR-2023
DEFINITION Arabidopsis thaliana pre-mRNA-processing-splicing factor-
like protein.
ACCESSION CP002687-7178
PROTEIN_ID AEE86973.1
SOURCE Arabidopsis thaliana (thale cress)
ORGANISM Arabidopsis thaliana
Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
Camelineae; Arabidopsis.
REFERENCE 1 (bases 1 to 18585056)
AUTHORS Mayer,K., Schuller,C., Wambutt,R., Murphy,G., Volckaert,G.,
Pohl,T., Dusterhoft,A., Stiekema,W., Entian,K.D., Terryn,N.,
Harris,B., Ansorge,W., Brandt,P., Grivell,L., Rieger,M.,
Weichselgartner,M., de Simone,V., Obermaier,B., Mache,R.,
Muller,M., Kreis,M., Delseny,M., Puigdomenech,P., Watson,M.,
Schmidtheini,T., Reichert,B., Portatelle,D., Perez-Alonso,M.,
Boutry,M., Bancroft,I., Vos,P., Hoheisel,J., Zimmermann,W.,
Wedler,H., Ridley,P., Langham,S.A., McCullagh,B., Bilham,L.,
Robben,J., Van der Schueren,J., Grymonprez,B., Chuang,Y.J.,
Vandenbussche,F., Braeken,M., Weltjens,I., Voet,M., Bastiaens,I.,
Aert,R., Defoor,E., Weitzenegger,T., Bothe,G., Ramsperger,U.,
Hilbert,H., Braun,M., Holzer,E., Brandt,A., Peters,S., van
Staveren,M., Dirske,W., Mooijman,P., Klein Lankhorst,R., Rose,M.,
Hauf,J., Kotter,P., Berneiser,S., Hempel,S., Feldpausch,M.,
Lamberth,S., Van den Daele,H., De Keyser,A., Buysshaert,C.,
Gielen,J., Villarroel,R., De Clercq,R., Van Montagu,M., Rogers,J.,
Cronin,A., Quail,M., Bray-Allen,S., Clark,L., Doggett,J., Hall,S.,
Kay,M., Lennard,N., McLay,K., Mayes,R., Pettett,A.,
Rajandream,M.A., Lyne,M., Benes,V., Rechmann,S., Borkova,D.,
Blocker,H., Scharfe,M., Grimm,M., Lohnert,T.H., Dose,S., de
Haan,M., Maarse,A., Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M.,
Fartmann,B., Granderath,K., Dauner,D., Herzl,A., Neumann,S.,
Argiriou,A., Vitale,D., Liguori,R., Piravandi,E., Massenet,O.,
Quigley,F., Clabauld,G., Mundlein,A., Felber,R., Schnabl,S.,
Hiller,R., Schmidt,W., Lecharny,A., Aubourg,S., Chefdor,F.,
Cooke,R., Berger,C., Montfort,A., Casacuberta,E., Gibbons,T.,
Weber,N., Vandenbol,M., Bargues,M., Terol,J., Torres,A.,
Perez-Perez,A., Purnelle,B., Bent,E., Johnson,S., Tacon,D.,
Jesse,T., Heijnen,L., Schwarz,S., Scholler,P., Heber,S., Francs,P.,
Bielke,C., Frishman,D., Haase,D., Lemcke,K., Mewes,H.W.,
Stocker,S., Zaccaria,P., Bevan,M., Wilson,R.K., de la Bastide,M.,
Habermann,K., Parnell,L., Dedhia,N., Gnoj,L., Schutz,K., Huang,E.,
Spiegel,L., Sehkon,M., Murray,J., Sheet,P., Cordes,M.,
Abu-Threideh,J., Stoneking,T., Kalicki,J., Graves,T., Harmon,G.,
Edwards,J., Latreille,P., Courtney,L., Cloud,J., Abbott,A.,
Scott,K., Johnson,D., Minx,P., Bentley,D., Fulton,B., Miller,N.,
Greco,T., Kemp,K., Kramer,J., Fulton,L., Mardis,E., Dante,M.,
Pepin,K., Hillier,L., Nelson,J., Spieth,J., Ryan,E., Andrews,S.,
Geisel,C., Layman,D., Du,H., Ali,J., Berghoff,A., Jones,K.,
Drone,K., Cotton,M., Joshu,C., Antonoiu,B., Zidanic,M., Strong,C.,
Sun,H., Lamar,B., Yordan,C., Ma,P., Zhong,J., Preston,R., Vil,D.,
Shekher,M., Matero,A., Shah,R., Swaby,I.K., O'Shaughnessy,A.,
Rodriguez,M., Hoffmann,J., Till,S., Granat,S., Shohdy,N.,
Hasegawa,A., Hameed,A., Lodhi,M., Johnson,A., Chen,E., Marra,M.,
Martienssen,R. and McCombie,W.R.
TITLE Sequence and analysis of chromosome 4 of the plant Arabidopsis
thaliana
JOURNAL Nature 402 (6763), 769-777 (1999)
PUBMED 10617198
REFERENCE 2 (bases 1 to 18585056)
AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
CONSRTM TAIR
TITLE Direct Submission
JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE 3 (bases 1 to 18585056)
AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
Vaughn,M. and Town,C.D.
TITLE Direct Submission
JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
9704 Medical Center Dr, Rockville, MD 20850, USA
REMARK Protein update by submitter
FEATURES Qualifiers
source /organism="Arabidopsis thaliana"
/mol_type="genomic DNA"
/db_xref="taxon:3702"
/chromosome="4"
/ecotype="Columbia"
protein /locus_tag="AT4G38780"
/gene_synonym="T9A14.60"
/gene_synonym="T9A14_60"
/inference="Similar to RNA sequence,
EST:INSD:T14115.1,INSD:ES066444.1"
/inference="Similar to RNA sequence, mRNA:INSD:BX828607.1"
/note="Pre-mRNA-processing-splicing factor; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN: nuclear mRNA
splicing, via spliceosome; LOCATED IN: spliceosomal
complex; EXPRESSED IN: 19 plant structures; EXPRESSED
DURING: 8 growth stages; CONTAINS InterPro DOMAIN/s:
Mov34/MPN/PAD-1 (InterPro:IPR000555),
Pre-mRNA-processing-splicing factor 8, U5-snRNA-binding
(InterPro:IPR019581), Pre-mRNA-processing-splicing factor
8 (InterPro:IPR012591), Pre-mRNA-processing-splicing
factor 8, U6-snRNA-binding (InterPro:IPR019580), PROCN
(InterPro:IPR012592), PRP8 domain IV core
(InterPro:IPR021983), PRO, C-terminal
(InterPro:IPR012984), RNA recognition motif, spliceosomal
PrP8 (InterPro:IPR019582); BEST Arabidopsis thaliana
protein match is: Pre-mRNA-processing-splicing factor
(TAIR:AT1G80070.1); Has 803 Blast hits to 702 proteins in
309 species: Archae - 0; Bacteria - 0; Metazoa - 318;
Fungi - 227; Plants - 60; Viruses - 0; Other Eukaryotes -
198 (source: NCBI BLink)."
/db_xref="TAIR:AT4G38780"
/db_xref="Araport:AT4G38780"
intron_pos 66:2 (1/23)
intron_pos 108:0 (2/23)
intron_pos 139:0 (3/23)
intron_pos 269:1 (4/23)
intron_pos 433:2 (5/23)
intron_pos 697:1 (6/23)
intron_pos 787:0 (7/23)
intron_pos 892:0 (8/23)
intron_pos 993:2 (9/23)
intron_pos 1145:1 (10/23)
intron_pos 1187:2 (11/23)
intron_pos 1268:1 (12/23)
intron_pos 1309:2 (13/23)
intron_pos 1409:0 (14/23)
intron_pos 1493:2 (15/23)
intron_pos 1699:1 (16/23)
intron_pos 1735:0 (17/23)
intron_pos 1820:0 (18/23)
intron_pos 1978:0 (19/23)
intron_pos 2032:0 (20/23)
intron_pos 2098:0 (21/23)
intron_pos 2191:0 (22/23)
intron_pos 2272:0 (23/23)
BEGIN
1 MWNIDGTSLA PPGTDGSRMQ TPSHPADHPS YTAPSNRNTP TVPTPEDAEA KLEKKARTWM
61 QLNSKRDHGD MSSKKHRLDK RVYLGALKFV PHAVFKLLEN MPMPWEQVRD VKVLYHITGA
121 ITFVNEVRWV VEPIYMAQWG SMWIMMRREK RDRRHFKRMR FPPFDDEEPP LDYADNLLDV
181 DPLEAIQLEL DEEEDSAVYS WFYDHKPLVK TKMINGPSYQ TWNLSLPIMS TLHRLAAQLL
241 SDLVDRNYFY LFDMPSFFTA KALNMCIPGG PKFEPLHRDM EKGDEDWNEF NDINKLIIRS
301 PLRTEYKVAF PHLYNNRPRK VKLCVYHTPM VMYIKTEDPD LPAFYYDPLI HPISNSNNTN
361 KEQRKSNGYD DDGDDFVLPE GLEPLLNNSP LYTDTTAPGI SLLFAPRPFN MRSGRTRRAE
421 DIPLVAEWFK EHCPPAYPVK VRVSYQKLLK CYLLNELHHR PPKAQKKKHL FRSLAATKFF
481 QSTELDWVEV GLQVCRQGYN MLNLLIHRKN LNYLHLDYNF NLKPVKTLTT KERKKSRFGN
541 AFHLCREILR LTKLVVDANV QFRLGNVDAF QLADGLQYIF SHVGQLTGMY RYKYRLMRQI
601 RMCKDLKHLI YYRFNTGPVG KGPGCGFWAP MWRVWLFFLR GIVPLLERWL GNLLARQFEG
661 RHSKGVAKTV TKQRVESHFD LELRAAVMHD VVDAMPEGIK QNKARTILQH LSEAWRCWKA
721 NIPWKVPGLP VAIENMILRY VKSKADWWTN VAHYNRERIR RGATVDKTVC RKNLGRLTRL
781 WLKAEQERQH NFQKDGPYVT ADEGIAIYST TVNWLESRKF SAIPFPPLSY KHDTKLLILA
841 LERLKESYSA AVKLNQQQRE ELGLIEQAYD NPHEALMRIK RHLLTQHSFK EVGIEFMDLY
901 SHLIPVYQID PLEKITDAYL DQYLWYEGDK RHLFPNWIKP ADSEPPPLLV YKWCQGINNL
961 QGIWDTSDGQ CVVMLQTKFE KLFEKIDLTV LNSLLRLVLD PKLANYVTGK NNVVLSYKDM
1021 SYTNTYGLIR GLQFASFVVQ FYGLVLDLLL LGLTRASEIA GPPQRPNEFM TYWDTKVETR
1081 HPIRLYSRYI DKVHIMFKFT HEEARDLIQR HLTERPDPNN ENMVGYNNKK CWPRDARMRL
1141 MKHDVNLGRS VFWDMKNRLP RSITTLEWEN GFVSVYSKDN PNLLFSMCGF EVRVLPKIRM
1201 GQEAFSSTRD GVWNLQNEQT KERTAVAFLR ADDEHMKVFE NRVRQILMSS GSTTFTKIVN
1261 KWNTALIGLM TYFREATVHT QELLDLLVKC ENKIQTRVKI GLNSKMPSRF PPVIFYTPKE
1321 IGGLGMLSMG HILIPQSDLR YSNQTDVGVS HFRSGMSHEE DQLIPNLYRY IQPWESEFID
1381 SQRVWAEYAL KRQEAQAQNR RLTLEDLEDS WDRGIPRINT LFQKDRHTLA YDKGWRVRTD
1441 FKQYQALKQN PFWWTHQRHD GKLWNLNNYR TDVIQALGGV EGILEHTLFK GTYFPTWEGL
1501 FWEKASGFEE SMKYKKLTNA QRSGLNQIPN RRFTLWWSPT INRANVYVGF QVQLDLTGIY
1561 MHGKIPTLKI SLIQIFRAHL WQKIHESVVM DLCQVLDQEL EPLEIETVQK ETIHPRKSYK
1621 MNSSCADVLL FAAHKWPMSK PSLIAESKDV FDQKASNKYW IDVQLRWGDY DSHDIERYTK
1681 AKFMDYTTDN MSIYPSPTGV IIGLDLAYNL HSAFGNWFPG SKPLLAQAMN KIMKSNPALY
1741 VLRERIRKGL QLYSSEPTEP YLSSQNYGEI FSNQIIWFVD DTNVYRVTIH KTFEGNLTTK
1801 PINGVIFIFN PRTGQLFLKI IHTSVWAGQK RLGQLAKWKT AEEVAALVRS LPVEEQPKQV
1861 IVTRKGMLDP LEVHLLDFPN IVIKGSELQL PFQACLKIEK FGDLILKATE PQMALFNIYD
1921 DWLMTVSSYT AFQRLILILR ALHVNNEKAK MLLKPDMSVV TEPNHIWPSL TDDQWMKVEV
1981 ALRDLILSDY AKKNKVNTSA LTQSEIRDII LGAEITPPSQ QRQQIAEIEK QAKEASQLTA
2041 VTTRTTNVHG DELISTTISP YEQSAFGSKT DWRVRAISAT NLYLRVNHIY VNSDDIKETG
2101 YTYIMPKNIL KKFICIADLR TQIAGYLYGI SPPDNPQVKE IRCVVMVPQC GNHQQVQLPS
2161 SLPEHQFLDD LEPLGWIHTQ PNELPQLSPQ DVTFHTRVLE NNKQWDAEKC IILTCSFTPG
2221 SCSLTSYKLT QAGYEWGRLN KDTGSNPHGY LPTHYEKVQM LLSDRFFGFY MVPENGPWNY
2281 NFMGANHTVS INYSLTLGTP KEYYHQVHRP THFLQFSKME EDGDLDRDDS FA
//