LOCUS AEE87073.1 3184 aa PRT PLN 23-MAR-2023
DEFINITION Arabidopsis thaliana spatacsin carboxy-terminus protein protein.
ACCESSION CP002687-7306
PROTEIN_ID AEE87073.1
SOURCE Arabidopsis thaliana (thale cress)
ORGANISM Arabidopsis thaliana
Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
Camelineae; Arabidopsis.
REFERENCE 1 (bases 1 to 18585056)
AUTHORS Mayer,K., Schuller,C., Wambutt,R., Murphy,G., Volckaert,G.,
Pohl,T., Dusterhoft,A., Stiekema,W., Entian,K.D., Terryn,N.,
Harris,B., Ansorge,W., Brandt,P., Grivell,L., Rieger,M.,
Weichselgartner,M., de Simone,V., Obermaier,B., Mache,R.,
Muller,M., Kreis,M., Delseny,M., Puigdomenech,P., Watson,M.,
Schmidtheini,T., Reichert,B., Portatelle,D., Perez-Alonso,M.,
Boutry,M., Bancroft,I., Vos,P., Hoheisel,J., Zimmermann,W.,
Wedler,H., Ridley,P., Langham,S.A., McCullagh,B., Bilham,L.,
Robben,J., Van der Schueren,J., Grymonprez,B., Chuang,Y.J.,
Vandenbussche,F., Braeken,M., Weltjens,I., Voet,M., Bastiaens,I.,
Aert,R., Defoor,E., Weitzenegger,T., Bothe,G., Ramsperger,U.,
Hilbert,H., Braun,M., Holzer,E., Brandt,A., Peters,S., van
Staveren,M., Dirske,W., Mooijman,P., Klein Lankhorst,R., Rose,M.,
Hauf,J., Kotter,P., Berneiser,S., Hempel,S., Feldpausch,M.,
Lamberth,S., Van den Daele,H., De Keyser,A., Buysshaert,C.,
Gielen,J., Villarroel,R., De Clercq,R., Van Montagu,M., Rogers,J.,
Cronin,A., Quail,M., Bray-Allen,S., Clark,L., Doggett,J., Hall,S.,
Kay,M., Lennard,N., McLay,K., Mayes,R., Pettett,A.,
Rajandream,M.A., Lyne,M., Benes,V., Rechmann,S., Borkova,D.,
Blocker,H., Scharfe,M., Grimm,M., Lohnert,T.H., Dose,S., de
Haan,M., Maarse,A., Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M.,
Fartmann,B., Granderath,K., Dauner,D., Herzl,A., Neumann,S.,
Argiriou,A., Vitale,D., Liguori,R., Piravandi,E., Massenet,O.,
Quigley,F., Clabauld,G., Mundlein,A., Felber,R., Schnabl,S.,
Hiller,R., Schmidt,W., Lecharny,A., Aubourg,S., Chefdor,F.,
Cooke,R., Berger,C., Montfort,A., Casacuberta,E., Gibbons,T.,
Weber,N., Vandenbol,M., Bargues,M., Terol,J., Torres,A.,
Perez-Perez,A., Purnelle,B., Bent,E., Johnson,S., Tacon,D.,
Jesse,T., Heijnen,L., Schwarz,S., Scholler,P., Heber,S., Francs,P.,
Bielke,C., Frishman,D., Haase,D., Lemcke,K., Mewes,H.W.,
Stocker,S., Zaccaria,P., Bevan,M., Wilson,R.K., de la Bastide,M.,
Habermann,K., Parnell,L., Dedhia,N., Gnoj,L., Schutz,K., Huang,E.,
Spiegel,L., Sehkon,M., Murray,J., Sheet,P., Cordes,M.,
Abu-Threideh,J., Stoneking,T., Kalicki,J., Graves,T., Harmon,G.,
Edwards,J., Latreille,P., Courtney,L., Cloud,J., Abbott,A.,
Scott,K., Johnson,D., Minx,P., Bentley,D., Fulton,B., Miller,N.,
Greco,T., Kemp,K., Kramer,J., Fulton,L., Mardis,E., Dante,M.,
Pepin,K., Hillier,L., Nelson,J., Spieth,J., Ryan,E., Andrews,S.,
Geisel,C., Layman,D., Du,H., Ali,J., Berghoff,A., Jones,K.,
Drone,K., Cotton,M., Joshu,C., Antonoiu,B., Zidanic,M., Strong,C.,
Sun,H., Lamar,B., Yordan,C., Ma,P., Zhong,J., Preston,R., Vil,D.,
Shekher,M., Matero,A., Shah,R., Swaby,I.K., O'Shaughnessy,A.,
Rodriguez,M., Hoffmann,J., Till,S., Granat,S., Shohdy,N.,
Hasegawa,A., Hameed,A., Lodhi,M., Johnson,A., Chen,E., Marra,M.,
Martienssen,R. and McCombie,W.R.
TITLE Sequence and analysis of chromosome 4 of the plant Arabidopsis
thaliana
JOURNAL Nature 402 (6763), 769-777 (1999)
PUBMED 10617198
REFERENCE 2 (bases 1 to 18585056)
AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
CONSRTM TAIR
TITLE Direct Submission
JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE 3 (bases 1 to 18585056)
AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
Vaughn,M. and Town,C.D.
TITLE Direct Submission
JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
9704 Medical Center Dr, Rockville, MD 20850, USA
REMARK Protein update by submitter
FEATURES Qualifiers
source /organism="Arabidopsis thaliana"
/mol_type="genomic DNA"
/db_xref="taxon:3702"
/chromosome="4"
/ecotype="Columbia"
protein /locus_tag="AT4G39420"
/gene_synonym="F23K16.50"
/gene_synonym="F23K16_50"
/note="unknown protein; INVOLVED IN: biological_process
unknown; LOCATED IN: cellular_component unknown; EXPRESSED
IN: leaf; EXPRESSED DURING: LP.04 four leaves visible,
LP.02 two leaves visible; Has 20 Blast hits to 19 proteins
in 8 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
- 0; Plants - 20; Viruses - 0; Other Eukaryotes - 0
(source: NCBI BLink)."
/db_xref="TAIR:AT4G39420"
/db_xref="Araport:AT4G39420"
intron_pos 58:1 (1/24)
intron_pos 767:1 (2/24)
intron_pos 792:2 (3/24)
intron_pos 833:2 (4/24)
intron_pos 915:0 (5/24)
intron_pos 1070:0 (6/24)
intron_pos 1132:0 (7/24)
intron_pos 1230:0 (8/24)
intron_pos 1444:0 (9/24)
intron_pos 1577:1 (10/24)
intron_pos 1620:0 (11/24)
intron_pos 1677:0 (12/24)
intron_pos 1744:0 (13/24)
intron_pos 1931:0 (14/24)
intron_pos 1951:0 (15/24)
intron_pos 2059:2 (16/24)
intron_pos 2198:0 (17/24)
intron_pos 2353:0 (18/24)
intron_pos 2392:0 (19/24)
intron_pos 2432:2 (20/24)
intron_pos 2676:0 (21/24)
intron_pos 2761:0 (22/24)
intron_pos 2822:0 (23/24)
intron_pos 2936:0 (24/24)
BEGIN
1 MEKLVKEGPT LLQLHKWEPS QFQLKLSEFR EAFISPSRQL LLLLSYHSEA LLLPLVAGRS
61 IGSEVSLSGD NEELNSPSCS GGSDPEKIES PCGSGVGSGE PGFVDNCSSS CNSFPFIFDA
121 KSVAWGSCGD TYNRHKDPLF RELLFVSGNH GVTVHAFCCT KDLSDKAKGK PNGELRHGEW
181 VEWGPSRLSQ KSEPERVSSS DGSKQWMQSF LIDLETTVID GTRQSRFPEK SAFPGSAEVV
241 SFSILNTDLP FSNLLFQDNS ILPKDNMPED GNVNDNNFLV ASDPTALDEK SRADMPVNNV
301 SVNSLYRCIK VFSSDAHSLI GFVMELSDCA STPRRNENER SKGKRNIFVA KLFSWGIEWV
361 SLVKFGESSI GPTNEWADFR LSDNFVICLS VSGLIFLYDV NSGDFISHGD ILQTCGRGLH
421 SSSDRQEATA EADQLSDFQN RAPSMSKTCI VGSTDRRKFR KLIVASHTPL IAAVDENGLV
481 YVLCVNDFVS KEYHMAAEPI PDLLHLGLGS LVGWKIGGMD IGQKKVHHPS SSGSRGEDAF
541 SRRDLSFSAS EISMSDPCLE RQQNNFDRRA GYSGSWLSGF SAQPKTNGLK LEKFRRDSHV
601 TRKMFLSAEK LGLDDNICFS PYGFTHFSRK YTNKDDRSCK IFHYSLQTHM TARDDSYLNY
661 DVNKNSIQGA EENFIGESVG CSFQGFLFLV TCDGLSVFLP SISITSNYPT IEAIEYLQPF
721 QTTVMGYRGR DDLAAGESRF PWQVEVIDRV ILFEGPEVAD HLCLENGWDL KIVRLRRLQM
781 ALDYLKYDDI NESLKMLGNV KLAEEGMLRV LFSAVYLLSR KDRNDNEISA VSRLLGLATM
841 FATEMIRRYG LLEYRKDVYM FDSKPRTQIL SLPAVSLNID VMENSRRLSE MGYLLEITRN
901 IQSRITRKFK KLGKGNNEKS LNLVDPNSLQ DDSQLEIVPD PASAESRQLD TSLFDTNEEL
961 ALTPMGMMTA GQIIDERSYA SGLVPQGIVE EKKVLPLENP KEMMARWKAN NLDLKTVVKD
1021 ALLSGRLPLA VLQLHLQHSK DVVEDGEHHD TFTEVRDIGR AIAYDLFLKG EPGVAIATLQ
1081 RLGEDVEACL NQLVFGTVRR SLRYQIAEEM RKLGFLRPYE DNVLERISLI ERLYPSSHFW
1141 ETYLARRKEL LKAALPFDSS EISLHLGGSS LFQHLKIECG EVDGVVLGSW TKINESASEH
1201 APDETDAVAG YWAAAAVWSN AWDQRTFDHI VLDQPLVMGV HVPWDSQLEY YMCHNDWDEV
1261 LKLLDLIPED VLYDGSLQIA LDGPKQSSGV NYSVSSRSEY ICSIEEVDAV LMDVPYIKIF
1321 RLPGDIRCSL WLTTLMEQEL ARKLIFLKEY WENALDVVYL LARAGVILGN CEVSFKEETC
1381 TPSLDLCLSI KKGGANVDTL NAVHKLFIHY CTQYNLPNLL DLYLDHHELV LDNDSLSSLQ
1441 EAVGDSHWAK WLLLSRIKGR EYDASFSNAR SIMSRNGAPN SEPSVPEIDE MVCTVDDIAD
1501 GAGEMAALAT MMCAPVPIQK SLSTGSVNRH TNSSAQCTLE NLRSFLQRFP TLWSKLVSAC
1561 LGEDISGNLL RTKTKNVLSE YLNWRDGVFF STARDTSLLQ MLPCWFPKAV RRLVQLYIQG
1621 PLGWLSFSGY PTGEYLLHRG VEFFINVDDP TEISAISWEA IIQKHIEEEL HHTKTEGTEL
1681 GLEHFLHRGR PLAAFNAFLE HRVEKLKLED QSGSSIHGQR NMQSDVPMLL APLTQSDESL
1741 LSSVIPLAIT HFGDSVLVAS CAFLLELCGL SASMLRIDVA SLRRISSFYK SNGNADMAHQ
1801 KSLKRSMFHS VSSEDDLMGS LARALANEYA YPDISSVPKQ KQNPSISGSQ PGLPLMLVLH
1861 HLEQASLPEI GVGRKTSGYW LLTGDGDGSE LRSQQTSASL HWSLVTLFCQ MHKIPLSTKY
1921 LAMLARDNDW VGFLSEAQLG GYPFDTVLNV ASKEFGDQRL KAHILTVLRY ANSKKKATTS
1981 FSDDPSRGLS CSPSEGGAYV SAELFRVLAY SEKLKNPGEY LLSKAKEFSW SILALIASCF
2041 PDVSPLSCLT IWLEITAARE TSSIKVNDIT TKIAENIGAA VVSTNSLPTD ARGVQFHYNR
2101 RNPKRRRLTA HTSVDLLASA NSLNISAGKT FCSHRTEAAE DEKAEDSSVI DDSSDEHASL
2161 SKMVAVLCEQ RLFLPLLKAF DLFLPSCSLL PFFRALQAFS QMRLSEASAH LGSFWGRVKE
2221 ESMHFQSNTA KDVNFGASWI SRTAVKAADA VLSACPSPYE KRCLLQLLAA TDFGDGGSAA
2281 TYYRRLYWKV NLAEPSLREN DLDLGNESLD DGSLLTALEK NRQWEQARNW AKQLETIGAT
2341 WTSSVHHVTE TQAESMVAEW KEFLWDVPEE RIALWGHCQT LFIRYSFPAL QAGLFFLRHA
2401 EVVEKDLPAR EIYELLLLSL QWLSGLTTLS HPVYPLHLLR EIETRVWLLA VEAESHVKNV
2461 GAFSPSSIGK DMVNGYSSNL IDRTASIITK MDSHISSATK NRIGEKHDAR AAGQGNQRNQ
2521 DTSTSIFGAS TKPKRRAKGN VPQIRHFVDS SDRNTDFEDS SSLINIKSEF QLQEESTGLE
2581 ISLSKWEESI EPAELERAVL SLLEFGQVTA AKQLQLKLAP GNLPSELIIL DAVMKLAMLS
2641 TPCRQVLLSM LDDEVRSVIQ SHSLKIDQPM IEPLQILENL STILNEGSGR GLARKIIAVI
2701 KAANILGLTF TEAYQKQPIE LLRLLSLKAQ DSFEEACLLV QTHSMPAASI AQILAESFLK
2761 GLLAAHRGGY IDSQKEEGPA PLLWRFSDFL KWAELCPSEQ EIGHALMRLV ITGQEIPHAC
2821 EVELLILSHH FYKSSTCLDG VDVLVALAAT RVEAYVAEGD FSCLARLITG VGNFHALNFI
2881 LNILIENGQL DLLLQKFSAA ADANTGTAQA VRSFRMAVLT SLNLYNPNDH DAFAMVYKHF
2941 DMKHETATLL EARADQAAQQ WFLRYDKDQN EDLLDSMRYY IEAAEVHTSI DAGNKARKAC
3001 GQASLVSLQI RMPDSKWLCL SETNARRALV DQSRFQEALI VAEAYGLNQP SEWALVLWNL
3061 MLKPELAEDF VAEFVAVLPL QASMLLELAR FYRAEMAARG DQSQFSVWLT GGGLPAEWAK
3121 YMWRSFRCLL KRTRDLRLRL QLATTATGFA DMVDVCMNAL DKVPENAGPL VLKKGHGGGY
3181 LPLM
//