LOCUS ANM66210.1 3181 aa PRT PLN 23-MAR-2023
DEFINITION Arabidopsis thaliana spatacsin carboxy-terminus protein protein.
ACCESSION CP002687-7307
PROTEIN_ID ANM66210.1
SOURCE Arabidopsis thaliana (thale cress)
ORGANISM Arabidopsis thaliana
Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
Camelineae; Arabidopsis.
REFERENCE 1 (bases 1 to 18585056)
AUTHORS Mayer,K., Schuller,C., Wambutt,R., Murphy,G., Volckaert,G.,
Pohl,T., Dusterhoft,A., Stiekema,W., Entian,K.D., Terryn,N.,
Harris,B., Ansorge,W., Brandt,P., Grivell,L., Rieger,M.,
Weichselgartner,M., de Simone,V., Obermaier,B., Mache,R.,
Muller,M., Kreis,M., Delseny,M., Puigdomenech,P., Watson,M.,
Schmidtheini,T., Reichert,B., Portatelle,D., Perez-Alonso,M.,
Boutry,M., Bancroft,I., Vos,P., Hoheisel,J., Zimmermann,W.,
Wedler,H., Ridley,P., Langham,S.A., McCullagh,B., Bilham,L.,
Robben,J., Van der Schueren,J., Grymonprez,B., Chuang,Y.J.,
Vandenbussche,F., Braeken,M., Weltjens,I., Voet,M., Bastiaens,I.,
Aert,R., Defoor,E., Weitzenegger,T., Bothe,G., Ramsperger,U.,
Hilbert,H., Braun,M., Holzer,E., Brandt,A., Peters,S., van
Staveren,M., Dirske,W., Mooijman,P., Klein Lankhorst,R., Rose,M.,
Hauf,J., Kotter,P., Berneiser,S., Hempel,S., Feldpausch,M.,
Lamberth,S., Van den Daele,H., De Keyser,A., Buysshaert,C.,
Gielen,J., Villarroel,R., De Clercq,R., Van Montagu,M., Rogers,J.,
Cronin,A., Quail,M., Bray-Allen,S., Clark,L., Doggett,J., Hall,S.,
Kay,M., Lennard,N., McLay,K., Mayes,R., Pettett,A.,
Rajandream,M.A., Lyne,M., Benes,V., Rechmann,S., Borkova,D.,
Blocker,H., Scharfe,M., Grimm,M., Lohnert,T.H., Dose,S., de
Haan,M., Maarse,A., Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M.,
Fartmann,B., Granderath,K., Dauner,D., Herzl,A., Neumann,S.,
Argiriou,A., Vitale,D., Liguori,R., Piravandi,E., Massenet,O.,
Quigley,F., Clabauld,G., Mundlein,A., Felber,R., Schnabl,S.,
Hiller,R., Schmidt,W., Lecharny,A., Aubourg,S., Chefdor,F.,
Cooke,R., Berger,C., Montfort,A., Casacuberta,E., Gibbons,T.,
Weber,N., Vandenbol,M., Bargues,M., Terol,J., Torres,A.,
Perez-Perez,A., Purnelle,B., Bent,E., Johnson,S., Tacon,D.,
Jesse,T., Heijnen,L., Schwarz,S., Scholler,P., Heber,S., Francs,P.,
Bielke,C., Frishman,D., Haase,D., Lemcke,K., Mewes,H.W.,
Stocker,S., Zaccaria,P., Bevan,M., Wilson,R.K., de la Bastide,M.,
Habermann,K., Parnell,L., Dedhia,N., Gnoj,L., Schutz,K., Huang,E.,
Spiegel,L., Sehkon,M., Murray,J., Sheet,P., Cordes,M.,
Abu-Threideh,J., Stoneking,T., Kalicki,J., Graves,T., Harmon,G.,
Edwards,J., Latreille,P., Courtney,L., Cloud,J., Abbott,A.,
Scott,K., Johnson,D., Minx,P., Bentley,D., Fulton,B., Miller,N.,
Greco,T., Kemp,K., Kramer,J., Fulton,L., Mardis,E., Dante,M.,
Pepin,K., Hillier,L., Nelson,J., Spieth,J., Ryan,E., Andrews,S.,
Geisel,C., Layman,D., Du,H., Ali,J., Berghoff,A., Jones,K.,
Drone,K., Cotton,M., Joshu,C., Antonoiu,B., Zidanic,M., Strong,C.,
Sun,H., Lamar,B., Yordan,C., Ma,P., Zhong,J., Preston,R., Vil,D.,
Shekher,M., Matero,A., Shah,R., Swaby,I.K., O'Shaughnessy,A.,
Rodriguez,M., Hoffmann,J., Till,S., Granat,S., Shohdy,N.,
Hasegawa,A., Hameed,A., Lodhi,M., Johnson,A., Chen,E., Marra,M.,
Martienssen,R. and McCombie,W.R.
TITLE Sequence and analysis of chromosome 4 of the plant Arabidopsis
thaliana
JOURNAL Nature 402 (6763), 769-777 (1999)
PUBMED 10617198
REFERENCE 2 (bases 1 to 18585056)
AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
CONSRTM TAIR
TITLE Direct Submission
JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE 3 (bases 1 to 18585056)
AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
Vaughn,M. and Town,C.D.
TITLE Direct Submission
JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
9704 Medical Center Dr, Rockville, MD 20850, USA
REMARK Protein update by submitter
FEATURES Qualifiers
source /organism="Arabidopsis thaliana"
/mol_type="genomic DNA"
/db_xref="taxon:3702"
/chromosome="4"
/ecotype="Columbia"
protein /locus_tag="AT4G39420"
/gene_synonym="F23K16.50"
/gene_synonym="F23K16_50"
/db_xref="Araport:AT4G39420"
/db_xref="TAIR:AT4G39420"
intron_pos 58:1 (1/24)
intron_pos 767:1 (2/24)
intron_pos 792:2 (3/24)
intron_pos 833:2 (4/24)
intron_pos 915:0 (5/24)
intron_pos 1070:0 (6/24)
intron_pos 1132:0 (7/24)
intron_pos 1230:0 (8/24)
intron_pos 1444:0 (9/24)
intron_pos 1577:1 (10/24)
intron_pos 1617:0 (11/24)
intron_pos 1674:0 (12/24)
intron_pos 1741:0 (13/24)
intron_pos 1928:0 (14/24)
intron_pos 1948:0 (15/24)
intron_pos 2056:2 (16/24)
intron_pos 2195:0 (17/24)
intron_pos 2350:0 (18/24)
intron_pos 2389:0 (19/24)
intron_pos 2429:2 (20/24)
intron_pos 2673:0 (21/24)
intron_pos 2758:0 (22/24)
intron_pos 2819:0 (23/24)
intron_pos 2933:0 (24/24)
BEGIN
1 MEKLVKEGPT LLQLHKWEPS QFQLKLSEFR EAFISPSRQL LLLLSYHSEA LLLPLVAGRS
61 IGSEVSLSGD NEELNSPSCS GGSDPEKIES PCGSGVGSGE PGFVDNCSSS CNSFPFIFDA
121 KSVAWGSCGD TYNRHKDPLF RELLFVSGNH GVTVHAFCCT KDLSDKAKGK PNGELRHGEW
181 VEWGPSRLSQ KSEPERVSSS DGSKQWMQSF LIDLETTVID GTRQSRFPEK SAFPGSAEVV
241 SFSILNTDLP FSNLLFQDNS ILPKDNMPED GNVNDNNFLV ASDPTALDEK SRADMPVNNV
301 SVNSLYRCIK VFSSDAHSLI GFVMELSDCA STPRRNENER SKGKRNIFVA KLFSWGIEWV
361 SLVKFGESSI GPTNEWADFR LSDNFVICLS VSGLIFLYDV NSGDFISHGD ILQTCGRGLH
421 SSSDRQEATA EADQLSDFQN RAPSMSKTCI VGSTDRRKFR KLIVASHTPL IAAVDENGLV
481 YVLCVNDFVS KEYHMAAEPI PDLLHLGLGS LVGWKIGGMD IGQKKVHHPS SSGSRGEDAF
541 SRRDLSFSAS EISMSDPCLE RQQNNFDRRA GYSGSWLSGF SAQPKTNGLK LEKFRRDSHV
601 TRKMFLSAEK LGLDDNICFS PYGFTHFSRK YTNKDDRSCK IFHYSLQTHM TARDDSYLNY
661 DVNKNSIQGA EENFIGESVG CSFQGFLFLV TCDGLSVFLP SISITSNYPT IEAIEYLQPF
721 QTTVMGYRGR DDLAAGESRF PWQVEVIDRV ILFEGPEVAD HLCLENGWDL KIVRLRRLQM
781 ALDYLKYDDI NESLKMLGNV KLAEEGMLRV LFSAVYLLSR KDRNDNEISA VSRLLGLATM
841 FATEMIRRYG LLEYRKDVYM FDSKPRTQIL SLPAVSLNID VMENSRRLSE MGYLLEITRN
901 IQSRITRKFK KLGKGNNEKS LNLVDPNSLQ DDSQLEIVPD PASAESRQLD TSLFDTNEEL
961 ALTPMGMMTA GQIIDERSYA SGLVPQGIVE EKKVLPLENP KEMMARWKAN NLDLKTVVKD
1021 ALLSGRLPLA VLQLHLQHSK DVVEDGEHHD TFTEVRDIGR AIAYDLFLKG EPGVAIATLQ
1081 RLGEDVEACL NQLVFGTVRR SLRYQIAEEM RKLGFLRPYE DNVLERISLI ERLYPSSHFW
1141 ETYLARRKEL LKAALPFDSS EISLHLGGSS LFQHLKIECG EVDGVVLGSW TKINESASEH
1201 APDETDAVAG YWAAAAVWSN AWDQRTFDHI VLDQPLVMGV HVPWDSQLEY YMCHNDWDEV
1261 LKLLDLIPED VLYDGSLQIA LDGPKQSSGV NYSVSSRSEY ICSIEEVDAV LMDVPYIKIF
1321 RLPGDIRCSL WLTTLMEQEL ARKLIFLKEY WENALDVVYL LARAGVILGN CEVSFKEETC
1381 TPSLDLCLSI KKGGANVDTL NAVHKLFIHY CTQYNLPNLL DLYLDHHELV LDNDSLSSLQ
1441 EAVGDSHWAK WLLLSRIKGR EYDASFSNAR SIMSRNGAPN SEPSVPEIDE MVCTVDDIAD
1501 GAGEMAALAT MMCAPVPIQK SLSTGSVNRH TNSSAQCTLE NLRSFLQRFP TLWSKLVSAC
1561 LGEDISGNLL RTKTKNEYLN WRDGVFFSTA RDTSLLQMLP CWFPKAVRRL VQLYIQGPLG
1621 WLSFSGYPTG EYLLHRGVEF FINVDDPTEI SAISWEAIIQ KHIEEELHHT KTEGTELGLE
1681 HFLHRGRPLA AFNAFLEHRV EKLKLEDQSG SSIHGQRNMQ SDVPMLLAPL TQSDESLLSS
1741 VIPLAITHFG DSVLVASCAF LLELCGLSAS MLRIDVASLR RISSFYKSNG NADMAHQKSL
1801 KRSMFHSVSS EDDLMGSLAR ALANEYAYPD ISSVPKQKQN PSISGSQPGL PLMLVLHHLE
1861 QASLPEIGVG RKTSGYWLLT GDGDGSELRS QQTSASLHWS LVTLFCQMHK IPLSTKYLAM
1921 LARDNDWVGF LSEAQLGGYP FDTVLNVASK EFGDQRLKAH ILTVLRYANS KKKATTSFSD
1981 DPSRGLSCSP SEGGAYVSAE LFRVLAYSEK LKNPGEYLLS KAKEFSWSIL ALIASCFPDV
2041 SPLSCLTIWL EITAARETSS IKVNDITTKI AENIGAAVVS TNSLPTDARG VQFHYNRRNP
2101 KRRRLTAHTS VDLLASANSL NISAGKTFCS HRTEAAEDEK AEDSSVIDDS SDEHASLSKM
2161 VAVLCEQRLF LPLLKAFDLF LPSCSLLPFF RALQAFSQMR LSEASAHLGS FWGRVKEESM
2221 HFQSNTAKDV NFGASWISRT AVKAADAVLS ACPSPYEKRC LLQLLAATDF GDGGSAATYY
2281 RRLYWKVNLA EPSLRENDLD LGNESLDDGS LLTALEKNRQ WEQARNWAKQ LETIGATWTS
2341 SVHHVTETQA ESMVAEWKEF LWDVPEERIA LWGHCQTLFI RYSFPALQAG LFFLRHAEVV
2401 EKDLPAREIY ELLLLSLQWL SGLTTLSHPV YPLHLLREIE TRVWLLAVEA ESHVKNVGAF
2461 SPSSIGKDMV NGYSSNLIDR TASIITKMDS HISSATKNRI GEKHDARAAG QGNQRNQDTS
2521 TSIFGASTKP KRRAKGNVPQ IRHFVDSSDR NTDFEDSSSL INIKSEFQLQ EESTGLEISL
2581 SKWEESIEPA ELERAVLSLL EFGQVTAAKQ LQLKLAPGNL PSELIILDAV MKLAMLSTPC
2641 RQVLLSMLDD EVRSVIQSHS LKIDQPMIEP LQILENLSTI LNEGSGRGLA RKIIAVIKAA
2701 NILGLTFTEA YQKQPIELLR LLSLKAQDSF EEACLLVQTH SMPAASIAQI LAESFLKGLL
2761 AAHRGGYIDS QKEEGPAPLL WRFSDFLKWA ELCPSEQEIG HALMRLVITG QEIPHACEVE
2821 LLILSHHFYK SSTCLDGVDV LVALAATRVE AYVAEGDFSC LARLITGVGN FHALNFILNI
2881 LIENGQLDLL LQKFSAAADA NTGTAQAVRS FRMAVLTSLN LYNPNDHDAF AMVYKHFDMK
2941 HETATLLEAR ADQAAQQWFL RYDKDQNEDL LDSMRYYIEA AEVHTSIDAG NKARKACGQA
3001 SLVSLQIRMP DSKWLCLSET NARRALVDQS RFQEALIVAE AYGLNQPSEW ALVLWNLMLK
3061 PELAEDFVAE FVAVLPLQAS MLLELARFYR AEMAARGDQS QFSVWLTGGG LPAEWAKYMW
3121 RSFRCLLKRT RDLRLRLQLA TTATGFADMV DVCMNALDKV PENAGPLVLK KGHGGGYLPL
3181 M
//