LOCUS AEE87073.1 3184 aa PRT PLN 23-MAR-2023 DEFINITION Arabidopsis thaliana spatacsin carboxy-terminus protein protein. ACCESSION CP002687-7306 PROTEIN_ID AEE87073.1 SOURCE Arabidopsis thaliana (thale cress) ORGANISM Arabidopsis thaliana Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis. REFERENCE 1 (bases 1 to 18585056) AUTHORS Mayer,K., Schuller,C., Wambutt,R., Murphy,G., Volckaert,G., Pohl,T., Dusterhoft,A., Stiekema,W., Entian,K.D., Terryn,N., Harris,B., Ansorge,W., Brandt,P., Grivell,L., Rieger,M., Weichselgartner,M., de Simone,V., Obermaier,B., Mache,R., Muller,M., Kreis,M., Delseny,M., Puigdomenech,P., Watson,M., Schmidtheini,T., Reichert,B., Portatelle,D., Perez-Alonso,M., Boutry,M., Bancroft,I., Vos,P., Hoheisel,J., Zimmermann,W., Wedler,H., Ridley,P., Langham,S.A., McCullagh,B., Bilham,L., Robben,J., Van der Schueren,J., Grymonprez,B., Chuang,Y.J., Vandenbussche,F., Braeken,M., Weltjens,I., Voet,M., Bastiaens,I., Aert,R., Defoor,E., Weitzenegger,T., Bothe,G., Ramsperger,U., Hilbert,H., Braun,M., Holzer,E., Brandt,A., Peters,S., van Staveren,M., Dirske,W., Mooijman,P., Klein Lankhorst,R., Rose,M., Hauf,J., Kotter,P., Berneiser,S., Hempel,S., Feldpausch,M., Lamberth,S., Van den Daele,H., De Keyser,A., Buysshaert,C., Gielen,J., Villarroel,R., De Clercq,R., Van Montagu,M., Rogers,J., Cronin,A., Quail,M., Bray-Allen,S., Clark,L., Doggett,J., Hall,S., Kay,M., Lennard,N., McLay,K., Mayes,R., Pettett,A., Rajandream,M.A., Lyne,M., Benes,V., Rechmann,S., Borkova,D., Blocker,H., Scharfe,M., Grimm,M., Lohnert,T.H., Dose,S., de Haan,M., Maarse,A., Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M., Fartmann,B., Granderath,K., Dauner,D., Herzl,A., Neumann,S., Argiriou,A., Vitale,D., Liguori,R., Piravandi,E., Massenet,O., Quigley,F., Clabauld,G., Mundlein,A., Felber,R., Schnabl,S., Hiller,R., Schmidt,W., Lecharny,A., Aubourg,S., Chefdor,F., Cooke,R., Berger,C., Montfort,A., Casacuberta,E., Gibbons,T., Weber,N., Vandenbol,M., Bargues,M., Terol,J., Torres,A., Perez-Perez,A., Purnelle,B., Bent,E., Johnson,S., Tacon,D., Jesse,T., Heijnen,L., Schwarz,S., Scholler,P., Heber,S., Francs,P., Bielke,C., Frishman,D., Haase,D., Lemcke,K., Mewes,H.W., Stocker,S., Zaccaria,P., Bevan,M., Wilson,R.K., de la Bastide,M., Habermann,K., Parnell,L., Dedhia,N., Gnoj,L., Schutz,K., Huang,E., Spiegel,L., Sehkon,M., Murray,J., Sheet,P., Cordes,M., Abu-Threideh,J., Stoneking,T., Kalicki,J., Graves,T., Harmon,G., Edwards,J., Latreille,P., Courtney,L., Cloud,J., Abbott,A., Scott,K., Johnson,D., Minx,P., Bentley,D., Fulton,B., Miller,N., Greco,T., Kemp,K., Kramer,J., Fulton,L., Mardis,E., Dante,M., Pepin,K., Hillier,L., Nelson,J., Spieth,J., Ryan,E., Andrews,S., Geisel,C., Layman,D., Du,H., Ali,J., Berghoff,A., Jones,K., Drone,K., Cotton,M., Joshu,C., Antonoiu,B., Zidanic,M., Strong,C., Sun,H., Lamar,B., Yordan,C., Ma,P., Zhong,J., Preston,R., Vil,D., Shekher,M., Matero,A., Shah,R., Swaby,I.K., O'Shaughnessy,A., Rodriguez,M., Hoffmann,J., Till,S., Granat,S., Shohdy,N., Hasegawa,A., Hameed,A., Lodhi,M., Johnson,A., Chen,E., Marra,M., Martienssen,R. and McCombie,W.R. TITLE Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana JOURNAL Nature 402 (6763), 769-777 (1999) PUBMED 10617198 REFERENCE 2 (bases 1 to 18585056) AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E. CONSRTM TAIR TITLE Direct Submission JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie Institution, 260 Panama Street, Stanford, CA, USA REFERENCE 3 (bases 1 to 18585056) AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M., Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R., Vaughn,M. and Town,C.D. TITLE Direct Submission JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute, 9704 Medical Center Dr, Rockville, MD 20850, USA REMARK Protein update by submitter FEATURES Qualifiers source /organism="Arabidopsis thaliana" /mol_type="genomic DNA" /db_xref="taxon:3702" /chromosome="4" /ecotype="Columbia" protein /locus_tag="AT4G39420" /gene_synonym="F23K16.50" /gene_synonym="F23K16_50" /note="unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: leaf; EXPRESSED DURING: LP.04 four leaves visible, LP.02 two leaves visible; Has 20 Blast hits to 19 proteins in 8 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 20; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink)." /db_xref="TAIR:AT4G39420" /db_xref="Araport:AT4G39420" intron_pos 58:1 (1/24) intron_pos 767:1 (2/24) intron_pos 792:2 (3/24) intron_pos 833:2 (4/24) intron_pos 915:0 (5/24) intron_pos 1070:0 (6/24) intron_pos 1132:0 (7/24) intron_pos 1230:0 (8/24) intron_pos 1444:0 (9/24) intron_pos 1577:1 (10/24) intron_pos 1620:0 (11/24) intron_pos 1677:0 (12/24) intron_pos 1744:0 (13/24) intron_pos 1931:0 (14/24) intron_pos 1951:0 (15/24) intron_pos 2059:2 (16/24) intron_pos 2198:0 (17/24) intron_pos 2353:0 (18/24) intron_pos 2392:0 (19/24) intron_pos 2432:2 (20/24) intron_pos 2676:0 (21/24) intron_pos 2761:0 (22/24) intron_pos 2822:0 (23/24) intron_pos 2936:0 (24/24) BEGIN 1 MEKLVKEGPT LLQLHKWEPS QFQLKLSEFR EAFISPSRQL LLLLSYHSEA LLLPLVAGRS 61 IGSEVSLSGD NEELNSPSCS GGSDPEKIES PCGSGVGSGE PGFVDNCSSS CNSFPFIFDA 121 KSVAWGSCGD TYNRHKDPLF RELLFVSGNH GVTVHAFCCT KDLSDKAKGK PNGELRHGEW 181 VEWGPSRLSQ KSEPERVSSS DGSKQWMQSF LIDLETTVID GTRQSRFPEK SAFPGSAEVV 241 SFSILNTDLP FSNLLFQDNS ILPKDNMPED GNVNDNNFLV ASDPTALDEK SRADMPVNNV 301 SVNSLYRCIK VFSSDAHSLI GFVMELSDCA STPRRNENER SKGKRNIFVA KLFSWGIEWV 361 SLVKFGESSI GPTNEWADFR LSDNFVICLS VSGLIFLYDV NSGDFISHGD ILQTCGRGLH 421 SSSDRQEATA EADQLSDFQN RAPSMSKTCI VGSTDRRKFR KLIVASHTPL IAAVDENGLV 481 YVLCVNDFVS KEYHMAAEPI PDLLHLGLGS LVGWKIGGMD IGQKKVHHPS SSGSRGEDAF 541 SRRDLSFSAS EISMSDPCLE RQQNNFDRRA GYSGSWLSGF SAQPKTNGLK LEKFRRDSHV 601 TRKMFLSAEK LGLDDNICFS PYGFTHFSRK YTNKDDRSCK IFHYSLQTHM TARDDSYLNY 661 DVNKNSIQGA EENFIGESVG CSFQGFLFLV TCDGLSVFLP SISITSNYPT IEAIEYLQPF 721 QTTVMGYRGR DDLAAGESRF PWQVEVIDRV ILFEGPEVAD HLCLENGWDL KIVRLRRLQM 781 ALDYLKYDDI NESLKMLGNV KLAEEGMLRV LFSAVYLLSR KDRNDNEISA VSRLLGLATM 841 FATEMIRRYG LLEYRKDVYM FDSKPRTQIL SLPAVSLNID VMENSRRLSE MGYLLEITRN 901 IQSRITRKFK KLGKGNNEKS LNLVDPNSLQ DDSQLEIVPD PASAESRQLD TSLFDTNEEL 961 ALTPMGMMTA GQIIDERSYA SGLVPQGIVE EKKVLPLENP KEMMARWKAN NLDLKTVVKD 1021 ALLSGRLPLA VLQLHLQHSK DVVEDGEHHD TFTEVRDIGR AIAYDLFLKG EPGVAIATLQ 1081 RLGEDVEACL NQLVFGTVRR SLRYQIAEEM RKLGFLRPYE DNVLERISLI ERLYPSSHFW 1141 ETYLARRKEL LKAALPFDSS EISLHLGGSS LFQHLKIECG EVDGVVLGSW TKINESASEH 1201 APDETDAVAG YWAAAAVWSN AWDQRTFDHI VLDQPLVMGV HVPWDSQLEY YMCHNDWDEV 1261 LKLLDLIPED VLYDGSLQIA LDGPKQSSGV NYSVSSRSEY ICSIEEVDAV LMDVPYIKIF 1321 RLPGDIRCSL WLTTLMEQEL ARKLIFLKEY WENALDVVYL LARAGVILGN CEVSFKEETC 1381 TPSLDLCLSI KKGGANVDTL NAVHKLFIHY CTQYNLPNLL DLYLDHHELV LDNDSLSSLQ 1441 EAVGDSHWAK WLLLSRIKGR EYDASFSNAR SIMSRNGAPN SEPSVPEIDE MVCTVDDIAD 1501 GAGEMAALAT MMCAPVPIQK SLSTGSVNRH TNSSAQCTLE NLRSFLQRFP TLWSKLVSAC 1561 LGEDISGNLL RTKTKNVLSE YLNWRDGVFF STARDTSLLQ MLPCWFPKAV RRLVQLYIQG 1621 PLGWLSFSGY PTGEYLLHRG VEFFINVDDP TEISAISWEA IIQKHIEEEL HHTKTEGTEL 1681 GLEHFLHRGR PLAAFNAFLE HRVEKLKLED QSGSSIHGQR NMQSDVPMLL APLTQSDESL 1741 LSSVIPLAIT HFGDSVLVAS CAFLLELCGL SASMLRIDVA SLRRISSFYK SNGNADMAHQ 1801 KSLKRSMFHS VSSEDDLMGS LARALANEYA YPDISSVPKQ KQNPSISGSQ PGLPLMLVLH 1861 HLEQASLPEI GVGRKTSGYW LLTGDGDGSE LRSQQTSASL HWSLVTLFCQ MHKIPLSTKY 1921 LAMLARDNDW VGFLSEAQLG GYPFDTVLNV ASKEFGDQRL KAHILTVLRY ANSKKKATTS 1981 FSDDPSRGLS CSPSEGGAYV SAELFRVLAY SEKLKNPGEY LLSKAKEFSW SILALIASCF 2041 PDVSPLSCLT IWLEITAARE TSSIKVNDIT TKIAENIGAA VVSTNSLPTD ARGVQFHYNR 2101 RNPKRRRLTA HTSVDLLASA NSLNISAGKT FCSHRTEAAE DEKAEDSSVI DDSSDEHASL 2161 SKMVAVLCEQ RLFLPLLKAF DLFLPSCSLL PFFRALQAFS QMRLSEASAH LGSFWGRVKE 2221 ESMHFQSNTA KDVNFGASWI SRTAVKAADA VLSACPSPYE KRCLLQLLAA TDFGDGGSAA 2281 TYYRRLYWKV NLAEPSLREN DLDLGNESLD DGSLLTALEK NRQWEQARNW AKQLETIGAT 2341 WTSSVHHVTE TQAESMVAEW KEFLWDVPEE RIALWGHCQT LFIRYSFPAL QAGLFFLRHA 2401 EVVEKDLPAR EIYELLLLSL QWLSGLTTLS HPVYPLHLLR EIETRVWLLA VEAESHVKNV 2461 GAFSPSSIGK DMVNGYSSNL IDRTASIITK MDSHISSATK NRIGEKHDAR AAGQGNQRNQ 2521 DTSTSIFGAS TKPKRRAKGN VPQIRHFVDS SDRNTDFEDS SSLINIKSEF QLQEESTGLE 2581 ISLSKWEESI EPAELERAVL SLLEFGQVTA AKQLQLKLAP GNLPSELIIL DAVMKLAMLS 2641 TPCRQVLLSM LDDEVRSVIQ SHSLKIDQPM IEPLQILENL STILNEGSGR GLARKIIAVI 2701 KAANILGLTF TEAYQKQPIE LLRLLSLKAQ DSFEEACLLV QTHSMPAASI AQILAESFLK 2761 GLLAAHRGGY IDSQKEEGPA PLLWRFSDFL KWAELCPSEQ EIGHALMRLV ITGQEIPHAC 2821 EVELLILSHH FYKSSTCLDG VDVLVALAAT RVEAYVAEGD FSCLARLITG VGNFHALNFI 2881 LNILIENGQL DLLLQKFSAA ADANTGTAQA VRSFRMAVLT SLNLYNPNDH DAFAMVYKHF 2941 DMKHETATLL EARADQAAQQ WFLRYDKDQN EDLLDSMRYY IEAAEVHTSI DAGNKARKAC 3001 GQASLVSLQI RMPDSKWLCL SETNARRALV DQSRFQEALI VAEAYGLNQP SEWALVLWNL 3061 MLKPELAEDF VAEFVAVLPL QASMLLELAR FYRAEMAARG DQSQFSVWLT GGGLPAEWAK 3121 YMWRSFRCLL KRTRDLRLRL QLATTATGFA DMVDVCMNAL DKVPENAGPL VLKKGHGGGY 3181 LPLM //