LOCUS AEE87072.1 2513 aa PRT PLN 23-MAR-2023 DEFINITION Arabidopsis thaliana spatacsin carboxy-terminus protein protein. ACCESSION CP002687-7308 PROTEIN_ID AEE87072.1 SOURCE Arabidopsis thaliana (thale cress) ORGANISM Arabidopsis thaliana Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis. REFERENCE 1 (bases 1 to 18585056) AUTHORS Mayer,K., Schuller,C., Wambutt,R., Murphy,G., Volckaert,G., Pohl,T., Dusterhoft,A., Stiekema,W., Entian,K.D., Terryn,N., Harris,B., Ansorge,W., Brandt,P., Grivell,L., Rieger,M., Weichselgartner,M., de Simone,V., Obermaier,B., Mache,R., Muller,M., Kreis,M., Delseny,M., Puigdomenech,P., Watson,M., Schmidtheini,T., Reichert,B., Portatelle,D., Perez-Alonso,M., Boutry,M., Bancroft,I., Vos,P., Hoheisel,J., Zimmermann,W., Wedler,H., Ridley,P., Langham,S.A., McCullagh,B., Bilham,L., Robben,J., Van der Schueren,J., Grymonprez,B., Chuang,Y.J., Vandenbussche,F., Braeken,M., Weltjens,I., Voet,M., Bastiaens,I., Aert,R., Defoor,E., Weitzenegger,T., Bothe,G., Ramsperger,U., Hilbert,H., Braun,M., Holzer,E., Brandt,A., Peters,S., van Staveren,M., Dirske,W., Mooijman,P., Klein Lankhorst,R., Rose,M., Hauf,J., Kotter,P., Berneiser,S., Hempel,S., Feldpausch,M., Lamberth,S., Van den Daele,H., De Keyser,A., Buysshaert,C., Gielen,J., Villarroel,R., De Clercq,R., Van Montagu,M., Rogers,J., Cronin,A., Quail,M., Bray-Allen,S., Clark,L., Doggett,J., Hall,S., Kay,M., Lennard,N., McLay,K., Mayes,R., Pettett,A., Rajandream,M.A., Lyne,M., Benes,V., Rechmann,S., Borkova,D., Blocker,H., Scharfe,M., Grimm,M., Lohnert,T.H., Dose,S., de Haan,M., Maarse,A., Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M., Fartmann,B., Granderath,K., Dauner,D., Herzl,A., Neumann,S., Argiriou,A., Vitale,D., Liguori,R., Piravandi,E., Massenet,O., Quigley,F., Clabauld,G., Mundlein,A., Felber,R., Schnabl,S., Hiller,R., Schmidt,W., Lecharny,A., Aubourg,S., Chefdor,F., Cooke,R., Berger,C., Montfort,A., Casacuberta,E., Gibbons,T., Weber,N., Vandenbol,M., Bargues,M., Terol,J., Torres,A., Perez-Perez,A., Purnelle,B., Bent,E., Johnson,S., Tacon,D., Jesse,T., Heijnen,L., Schwarz,S., Scholler,P., Heber,S., Francs,P., Bielke,C., Frishman,D., Haase,D., Lemcke,K., Mewes,H.W., Stocker,S., Zaccaria,P., Bevan,M., Wilson,R.K., de la Bastide,M., Habermann,K., Parnell,L., Dedhia,N., Gnoj,L., Schutz,K., Huang,E., Spiegel,L., Sehkon,M., Murray,J., Sheet,P., Cordes,M., Abu-Threideh,J., Stoneking,T., Kalicki,J., Graves,T., Harmon,G., Edwards,J., Latreille,P., Courtney,L., Cloud,J., Abbott,A., Scott,K., Johnson,D., Minx,P., Bentley,D., Fulton,B., Miller,N., Greco,T., Kemp,K., Kramer,J., Fulton,L., Mardis,E., Dante,M., Pepin,K., Hillier,L., Nelson,J., Spieth,J., Ryan,E., Andrews,S., Geisel,C., Layman,D., Du,H., Ali,J., Berghoff,A., Jones,K., Drone,K., Cotton,M., Joshu,C., Antonoiu,B., Zidanic,M., Strong,C., Sun,H., Lamar,B., Yordan,C., Ma,P., Zhong,J., Preston,R., Vil,D., Shekher,M., Matero,A., Shah,R., Swaby,I.K., O'Shaughnessy,A., Rodriguez,M., Hoffmann,J., Till,S., Granat,S., Shohdy,N., Hasegawa,A., Hameed,A., Lodhi,M., Johnson,A., Chen,E., Marra,M., Martienssen,R. and McCombie,W.R. TITLE Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana JOURNAL Nature 402 (6763), 769-777 (1999) PUBMED 10617198 REFERENCE 2 (bases 1 to 18585056) AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E. CONSRTM TAIR TITLE Direct Submission JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie Institution, 260 Panama Street, Stanford, CA, USA REFERENCE 3 (bases 1 to 18585056) AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M., Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R., Vaughn,M. and Town,C.D. TITLE Direct Submission JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute, 9704 Medical Center Dr, Rockville, MD 20850, USA REMARK Protein update by submitter FEATURES Qualifiers source /organism="Arabidopsis thaliana" /mol_type="genomic DNA" /db_xref="taxon:3702" /chromosome="4" /ecotype="Columbia" protein /locus_tag="AT4G39420" /gene_synonym="F23K16.50" /gene_synonym="F23K16_50" /inference="Similar to RNA sequence, EST:INSD:EG468594.1,INSD:EG468593.1,INSD:EG468591.1, INSD:EG468630.1,INSD:EG468597.1,INSD:EH898310.1, INSD:EG468590.1,INSD:EH888347.1,INSD:BP844495.1, INSD:EG468599.1,INSD:EG468600.1,INSD:EG468620.1" /note="unknown protein; Has 46 Blast hits to 40 proteins in 10 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 44; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink)." /db_xref="TAIR:AT4G39420" /db_xref="Araport:AT4G39420" intron_pos 58:1 (1/24) intron_pos 99:1 (2/24) intron_pos 124:2 (3/24) intron_pos 165:2 (4/24) intron_pos 247:0 (5/24) intron_pos 402:0 (6/24) intron_pos 464:0 (7/24) intron_pos 562:0 (8/24) intron_pos 776:0 (9/24) intron_pos 909:1 (10/24) intron_pos 949:0 (11/24) intron_pos 1006:0 (12/24) intron_pos 1073:0 (13/24) intron_pos 1260:0 (14/24) intron_pos 1280:0 (15/24) intron_pos 1388:2 (16/24) intron_pos 1527:0 (17/24) intron_pos 1682:0 (18/24) intron_pos 1721:0 (19/24) intron_pos 1761:2 (20/24) intron_pos 2005:0 (21/24) intron_pos 2090:0 (22/24) intron_pos 2151:0 (23/24) intron_pos 2265:0 (24/24) BEGIN 1 MEKLVKEGPT LLQLHKWEPS QFQLKLSEFR EAFISPSRQL LLLLSYHSEA LLLPLVAGRS 61 IGSEVSLSGD NEELNSPSCS GGSDPEKIES PCGSGVGSGW DLKIVRLRRL QMALDYLKYD 121 DINESLKMLG NVKLAEEGML RVLFSAVYLL SRKDRNDNEI SAVSRLLGLA TMFATEMIRR 181 YGLLEYRKDV YMFDSKPRTQ ILSLPAVSLN IDVMENSRRL SEMGYLLEIT RNIQSRITRK 241 FKKLGKGNNE KSLNLVDPNS LQDDSQLEIV PDPASAESRQ LDTSLFDTNE ELALTPMGMM 301 TAGQIIDERS YASGLVPQGI VEEKKVLPLE NPKEMMARWK ANNLDLKTVV KDALLSGRLP 361 LAVLQLHLQH SKDVVEDGEH HDTFTEVRDI GRAIAYDLFL KGEPGVAIAT LQRLGEDVEA 421 CLNQLVFGTV RRSLRYQIAE EMRKLGFLRP YEDNVLERIS LIERLYPSSH FWETYLARRK 481 ELLKAALPFD SSEISLHLGG SSLFQHLKIE CGEVDGVVLG SWTKINESAS EHAPDETDAV 541 AGYWAAAAVW SNAWDQRTFD HIVLDQPLVM GVHVPWDSQL EYYMCHNDWD EVLKLLDLIP 601 EDVLYDGSLQ IALDGPKQSS GVNYSVSSRS EYICSIEEVD AVLMDVPYIK IFRLPGDIRC 661 SLWLTTLMEQ ELARKLIFLK EYWENALDVV YLLARAGVIL GNCEVSFKEE TCTPSLDLCL 721 SIKKGGANVD TLNAVHKLFI HYCTQYNLPN LLDLYLDHHE LVLDNDSLSS LQEAVGDSHW 781 AKWLLLSRIK GREYDASFSN ARSIMSRNGA PNSEPSVPEI DEMVCTVDDI ADGAGEMAAL 841 ATMMCAPVPI QKSLSTGSVN RHTNSSAQCT LENLRSFLQR FPTLWSKLVS ACLGEDISGN 901 LLRTKTKNEY LNWRDGVFFS TARDTSLLQM LPCWFPKAVR RLVQLYIQGP LGWLSFSGYP 961 TGEYLLHRGV EFFINVDDPT EISAISWEAI IQKHIEEELH HTKTEGTELG LEHFLHRGRP 1021 LAAFNAFLEH RVEKLKLEDQ SGSSIHGQRN MQSDVPMLLA PLTQSDESLL SSVIPLAITH 1081 FGDSVLVASC AFLLELCGLS ASMLRIDVAS LRRISSFYKS NGNADMAHQK SLKRSMFHSV 1141 SSEDDLMGSL ARALANEYAY PDISSVPKQK QNPSISGSQP GLPLMLVLHH LEQASLPEIG 1201 VGRKTSGYWL LTGDGDGSEL RSQQTSASLH WSLVTLFCQM HKIPLSTKYL AMLARDNDWV 1261 GFLSEAQLGG YPFDTVLNVA SKEFGDQRLK AHILTVLRYA NSKKKATTSF SDDPSRGLSC 1321 SPSEGGAYVS AELFRVLAYS EKLKNPGEYL LSKAKEFSWS ILALIASCFP DVSPLSCLTI 1381 WLEITAARET SSIKVNDITT KIAENIGAAV VSTNSLPTDA RGVQFHYNRR NPKRRRLTAH 1441 TSVDLLASAN SLNISAGKTF CSHRTEAAED EKAEDSSVID DSSDEHASLS KMVAVLCEQR 1501 LFLPLLKAFD LFLPSCSLLP FFRALQAFSQ MRLSEASAHL GSFWGRVKEE SMHFQSNTAK 1561 DVNFGASWIS RTAVKAADAV LSACPSPYEK RCLLQLLAAT DFGDGGSAAT YYRRLYWKVN 1621 LAEPSLREND LDLGNESLDD GSLLTALEKN RQWEQARNWA KQLETIGATW TSSVHHVTET 1681 QAESMVAEWK EFLWDVPEER IALWGHCQTL FIRYSFPALQ AGLFFLRHAE VVEKDLPARE 1741 IYELLLLSLQ WLSGLTTLSH PVYPLHLLRE IETRVWLLAV EAESHVKNVG AFSPSSIGKD 1801 MVNGYSSNLI DRTASIITKM DSHISSATKN RIGEKHDARA AGQGNQRNQD TSTSIFGAST 1861 KPKRRAKGNV PQIRHFVDSS DRNTDFEDSS SLINIKSEFQ LQEESTGLEI SLSKWEESIE 1921 PAELERAVLS LLEFGQVTAA KQLQLKLAPG NLPSELIILD AVMKLAMLST PCRQVLLSML 1981 DDEVRSVIQS HSLKIDQPMI EPLQILENLS TILNEGSGRG LARKIIAVIK AANILGLTFT 2041 EAYQKQPIEL LRLLSLKAQD SFEEACLLVQ THSMPAASIA QILAESFLKG LLAAHRGGYI 2101 DSQKEEGPAP LLWRFSDFLK WAELCPSEQE IGHALMRLVI TGQEIPHACE VELLILSHHF 2161 YKSSTCLDGV DVLVALAATR VEAYVAEGDF SCLARLITGV GNFHALNFIL NILIENGQLD 2221 LLLQKFSAAA DANTGTAQAV RSFRMAVLTS LNLYNPNDHD AFAMVYKHFD MKHETATLLE 2281 ARADQAAQQW FLRYDKDQNE DLLDSMRYYI EAAEVHTSID AGNKARKACG QASLVSLQIR 2341 MPDSKWLCLS ETNARRALVD QSRFQEALIV AEAYGLNQPS EWALVLWNLM LKPELAEDFV 2401 AEFVAVLPLQ ASMLLELARF YRAEMAARGD QSQFSVWLTG GGLPAEWAKY MWRSFRCLLK 2461 RTRDLRLRLQ LATTATGFAD MVDVCMNALD KVPENAGPLV LKKGHGGGYL PLM //