LOCUS AEE86973.1 2332 aa PRT PLN 23-MAR-2023 DEFINITION Arabidopsis thaliana pre-mRNA-processing-splicing factor- like protein protein. ACCESSION CP002687-7178 PROTEIN_ID AEE86973.1 SOURCE Arabidopsis thaliana (thale cress) ORGANISM Arabidopsis thaliana Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis. REFERENCE 1 (bases 1 to 18585056) AUTHORS Mayer,K., Schuller,C., Wambutt,R., Murphy,G., Volckaert,G., Pohl,T., Dusterhoft,A., Stiekema,W., Entian,K.D., Terryn,N., Harris,B., Ansorge,W., Brandt,P., Grivell,L., Rieger,M., Weichselgartner,M., de Simone,V., Obermaier,B., Mache,R., Muller,M., Kreis,M., Delseny,M., Puigdomenech,P., Watson,M., Schmidtheini,T., Reichert,B., Portatelle,D., Perez-Alonso,M., Boutry,M., Bancroft,I., Vos,P., Hoheisel,J., Zimmermann,W., Wedler,H., Ridley,P., Langham,S.A., McCullagh,B., Bilham,L., Robben,J., Van der Schueren,J., Grymonprez,B., Chuang,Y.J., Vandenbussche,F., Braeken,M., Weltjens,I., Voet,M., Bastiaens,I., Aert,R., Defoor,E., Weitzenegger,T., Bothe,G., Ramsperger,U., Hilbert,H., Braun,M., Holzer,E., Brandt,A., Peters,S., van Staveren,M., Dirske,W., Mooijman,P., Klein Lankhorst,R., Rose,M., Hauf,J., Kotter,P., Berneiser,S., Hempel,S., Feldpausch,M., Lamberth,S., Van den Daele,H., De Keyser,A., Buysshaert,C., Gielen,J., Villarroel,R., De Clercq,R., Van Montagu,M., Rogers,J., Cronin,A., Quail,M., Bray-Allen,S., Clark,L., Doggett,J., Hall,S., Kay,M., Lennard,N., McLay,K., Mayes,R., Pettett,A., Rajandream,M.A., Lyne,M., Benes,V., Rechmann,S., Borkova,D., Blocker,H., Scharfe,M., Grimm,M., Lohnert,T.H., Dose,S., de Haan,M., Maarse,A., Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M., Fartmann,B., Granderath,K., Dauner,D., Herzl,A., Neumann,S., Argiriou,A., Vitale,D., Liguori,R., Piravandi,E., Massenet,O., Quigley,F., Clabauld,G., Mundlein,A., Felber,R., Schnabl,S., Hiller,R., Schmidt,W., Lecharny,A., Aubourg,S., Chefdor,F., Cooke,R., Berger,C., Montfort,A., Casacuberta,E., Gibbons,T., Weber,N., Vandenbol,M., Bargues,M., Terol,J., Torres,A., Perez-Perez,A., Purnelle,B., Bent,E., Johnson,S., Tacon,D., Jesse,T., Heijnen,L., Schwarz,S., Scholler,P., Heber,S., Francs,P., Bielke,C., Frishman,D., Haase,D., Lemcke,K., Mewes,H.W., Stocker,S., Zaccaria,P., Bevan,M., Wilson,R.K., de la Bastide,M., Habermann,K., Parnell,L., Dedhia,N., Gnoj,L., Schutz,K., Huang,E., Spiegel,L., Sehkon,M., Murray,J., Sheet,P., Cordes,M., Abu-Threideh,J., Stoneking,T., Kalicki,J., Graves,T., Harmon,G., Edwards,J., Latreille,P., Courtney,L., Cloud,J., Abbott,A., Scott,K., Johnson,D., Minx,P., Bentley,D., Fulton,B., Miller,N., Greco,T., Kemp,K., Kramer,J., Fulton,L., Mardis,E., Dante,M., Pepin,K., Hillier,L., Nelson,J., Spieth,J., Ryan,E., Andrews,S., Geisel,C., Layman,D., Du,H., Ali,J., Berghoff,A., Jones,K., Drone,K., Cotton,M., Joshu,C., Antonoiu,B., Zidanic,M., Strong,C., Sun,H., Lamar,B., Yordan,C., Ma,P., Zhong,J., Preston,R., Vil,D., Shekher,M., Matero,A., Shah,R., Swaby,I.K., O'Shaughnessy,A., Rodriguez,M., Hoffmann,J., Till,S., Granat,S., Shohdy,N., Hasegawa,A., Hameed,A., Lodhi,M., Johnson,A., Chen,E., Marra,M., Martienssen,R. and McCombie,W.R. TITLE Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana JOURNAL Nature 402 (6763), 769-777 (1999) PUBMED 10617198 REFERENCE 2 (bases 1 to 18585056) AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E. CONSRTM TAIR TITLE Direct Submission JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie Institution, 260 Panama Street, Stanford, CA, USA REFERENCE 3 (bases 1 to 18585056) AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M., Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R., Vaughn,M. and Town,C.D. TITLE Direct Submission JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute, 9704 Medical Center Dr, Rockville, MD 20850, USA REMARK Protein update by submitter FEATURES Qualifiers source /organism="Arabidopsis thaliana" /mol_type="genomic DNA" /db_xref="taxon:3702" /chromosome="4" /ecotype="Columbia" protein /locus_tag="AT4G38780" /gene_synonym="T9A14.60" /gene_synonym="T9A14_60" /inference="Similar to RNA sequence, EST:INSD:T14115.1,INSD:ES066444.1" /inference="Similar to RNA sequence, mRNA:INSD:BX828607.1" /note="Pre-mRNA-processing-splicing factor; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: nuclear mRNA splicing, via spliceosome; LOCATED IN: spliceosomal complex; EXPRESSED IN: 19 plant structures; EXPRESSED DURING: 8 growth stages; CONTAINS InterPro DOMAIN/s: Mov34/MPN/PAD-1 (InterPro:IPR000555), Pre-mRNA-processing-splicing factor 8, U5-snRNA-binding (InterPro:IPR019581), Pre-mRNA-processing-splicing factor 8 (InterPro:IPR012591), Pre-mRNA-processing-splicing factor 8, U6-snRNA-binding (InterPro:IPR019580), PROCN (InterPro:IPR012592), PRP8 domain IV core (InterPro:IPR021983), PRO, C-terminal (InterPro:IPR012984), RNA recognition motif, spliceosomal PrP8 (InterPro:IPR019582); BEST Arabidopsis thaliana protein match is: Pre-mRNA-processing-splicing factor (TAIR:AT1G80070.1); Has 803 Blast hits to 702 proteins in 309 species: Archae - 0; Bacteria - 0; Metazoa - 318; Fungi - 227; Plants - 60; Viruses - 0; Other Eukaryotes - 198 (source: NCBI BLink)." /db_xref="TAIR:AT4G38780" /db_xref="Araport:AT4G38780" intron_pos 66:2 (1/23) intron_pos 108:0 (2/23) intron_pos 139:0 (3/23) intron_pos 269:1 (4/23) intron_pos 433:2 (5/23) intron_pos 697:1 (6/23) intron_pos 787:0 (7/23) intron_pos 892:0 (8/23) intron_pos 993:2 (9/23) intron_pos 1145:1 (10/23) intron_pos 1187:2 (11/23) intron_pos 1268:1 (12/23) intron_pos 1309:2 (13/23) intron_pos 1409:0 (14/23) intron_pos 1493:2 (15/23) intron_pos 1699:1 (16/23) intron_pos 1735:0 (17/23) intron_pos 1820:0 (18/23) intron_pos 1978:0 (19/23) intron_pos 2032:0 (20/23) intron_pos 2098:0 (21/23) intron_pos 2191:0 (22/23) intron_pos 2272:0 (23/23) BEGIN 1 MWNIDGTSLA PPGTDGSRMQ TPSHPADHPS YTAPSNRNTP TVPTPEDAEA KLEKKARTWM 61 QLNSKRDHGD MSSKKHRLDK RVYLGALKFV PHAVFKLLEN MPMPWEQVRD VKVLYHITGA 121 ITFVNEVRWV VEPIYMAQWG SMWIMMRREK RDRRHFKRMR FPPFDDEEPP LDYADNLLDV 181 DPLEAIQLEL DEEEDSAVYS WFYDHKPLVK TKMINGPSYQ TWNLSLPIMS TLHRLAAQLL 241 SDLVDRNYFY LFDMPSFFTA KALNMCIPGG PKFEPLHRDM EKGDEDWNEF NDINKLIIRS 301 PLRTEYKVAF PHLYNNRPRK VKLCVYHTPM VMYIKTEDPD LPAFYYDPLI HPISNSNNTN 361 KEQRKSNGYD DDGDDFVLPE GLEPLLNNSP LYTDTTAPGI SLLFAPRPFN MRSGRTRRAE 421 DIPLVAEWFK EHCPPAYPVK VRVSYQKLLK CYLLNELHHR PPKAQKKKHL FRSLAATKFF 481 QSTELDWVEV GLQVCRQGYN MLNLLIHRKN LNYLHLDYNF NLKPVKTLTT KERKKSRFGN 541 AFHLCREILR LTKLVVDANV QFRLGNVDAF QLADGLQYIF SHVGQLTGMY RYKYRLMRQI 601 RMCKDLKHLI YYRFNTGPVG KGPGCGFWAP MWRVWLFFLR GIVPLLERWL GNLLARQFEG 661 RHSKGVAKTV TKQRVESHFD LELRAAVMHD VVDAMPEGIK QNKARTILQH LSEAWRCWKA 721 NIPWKVPGLP VAIENMILRY VKSKADWWTN VAHYNRERIR RGATVDKTVC RKNLGRLTRL 781 WLKAEQERQH NFQKDGPYVT ADEGIAIYST TVNWLESRKF SAIPFPPLSY KHDTKLLILA 841 LERLKESYSA AVKLNQQQRE ELGLIEQAYD NPHEALMRIK RHLLTQHSFK EVGIEFMDLY 901 SHLIPVYQID PLEKITDAYL DQYLWYEGDK RHLFPNWIKP ADSEPPPLLV YKWCQGINNL 961 QGIWDTSDGQ CVVMLQTKFE KLFEKIDLTV LNSLLRLVLD PKLANYVTGK NNVVLSYKDM 1021 SYTNTYGLIR GLQFASFVVQ FYGLVLDLLL LGLTRASEIA GPPQRPNEFM TYWDTKVETR 1081 HPIRLYSRYI DKVHIMFKFT HEEARDLIQR HLTERPDPNN ENMVGYNNKK CWPRDARMRL 1141 MKHDVNLGRS VFWDMKNRLP RSITTLEWEN GFVSVYSKDN PNLLFSMCGF EVRVLPKIRM 1201 GQEAFSSTRD GVWNLQNEQT KERTAVAFLR ADDEHMKVFE NRVRQILMSS GSTTFTKIVN 1261 KWNTALIGLM TYFREATVHT QELLDLLVKC ENKIQTRVKI GLNSKMPSRF PPVIFYTPKE 1321 IGGLGMLSMG HILIPQSDLR YSNQTDVGVS HFRSGMSHEE DQLIPNLYRY IQPWESEFID 1381 SQRVWAEYAL KRQEAQAQNR RLTLEDLEDS WDRGIPRINT LFQKDRHTLA YDKGWRVRTD 1441 FKQYQALKQN PFWWTHQRHD GKLWNLNNYR TDVIQALGGV EGILEHTLFK GTYFPTWEGL 1501 FWEKASGFEE SMKYKKLTNA QRSGLNQIPN RRFTLWWSPT INRANVYVGF QVQLDLTGIY 1561 MHGKIPTLKI SLIQIFRAHL WQKIHESVVM DLCQVLDQEL EPLEIETVQK ETIHPRKSYK 1621 MNSSCADVLL FAAHKWPMSK PSLIAESKDV FDQKASNKYW IDVQLRWGDY DSHDIERYTK 1681 AKFMDYTTDN MSIYPSPTGV IIGLDLAYNL HSAFGNWFPG SKPLLAQAMN KIMKSNPALY 1741 VLRERIRKGL QLYSSEPTEP YLSSQNYGEI FSNQIIWFVD DTNVYRVTIH KTFEGNLTTK 1801 PINGVIFIFN PRTGQLFLKI IHTSVWAGQK RLGQLAKWKT AEEVAALVRS LPVEEQPKQV 1861 IVTRKGMLDP LEVHLLDFPN IVIKGSELQL PFQACLKIEK FGDLILKATE PQMALFNIYD 1921 DWLMTVSSYT AFQRLILILR ALHVNNEKAK MLLKPDMSVV TEPNHIWPSL TDDQWMKVEV 1981 ALRDLILSDY AKKNKVNTSA LTQSEIRDII LGAEITPPSQ QRQQIAEIEK QAKEASQLTA 2041 VTTRTTNVHG DELISTTISP YEQSAFGSKT DWRVRAISAT NLYLRVNHIY VNSDDIKETG 2101 YTYIMPKNIL KKFICIADLR TQIAGYLYGI SPPDNPQVKE IRCVVMVPQC GNHQQVQLPS 2161 SLPEHQFLDD LEPLGWIHTQ PNELPQLSPQ DVTFHTRVLE NNKQWDAEKC IILTCSFTPG 2221 SCSLTSYKLT QAGYEWGRLN KDTGSNPHGY LPTHYEKVQM LLSDRFFGFY MVPENGPWNY 2281 NFMGANHTVS INYSLTLGTP KEYYHQVHRP THFLQFSKME EDGDLDRDDS FA //