LOCUS AEE87072.1 2513 aa PRT PLN 23-MAR-2023
DEFINITION Arabidopsis thaliana spatacsin carboxy-terminus protein protein.
ACCESSION CP002687-7308
PROTEIN_ID AEE87072.1
SOURCE Arabidopsis thaliana (thale cress)
ORGANISM Arabidopsis thaliana
Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
Camelineae; Arabidopsis.
REFERENCE 1 (bases 1 to 18585056)
AUTHORS Mayer,K., Schuller,C., Wambutt,R., Murphy,G., Volckaert,G.,
Pohl,T., Dusterhoft,A., Stiekema,W., Entian,K.D., Terryn,N.,
Harris,B., Ansorge,W., Brandt,P., Grivell,L., Rieger,M.,
Weichselgartner,M., de Simone,V., Obermaier,B., Mache,R.,
Muller,M., Kreis,M., Delseny,M., Puigdomenech,P., Watson,M.,
Schmidtheini,T., Reichert,B., Portatelle,D., Perez-Alonso,M.,
Boutry,M., Bancroft,I., Vos,P., Hoheisel,J., Zimmermann,W.,
Wedler,H., Ridley,P., Langham,S.A., McCullagh,B., Bilham,L.,
Robben,J., Van der Schueren,J., Grymonprez,B., Chuang,Y.J.,
Vandenbussche,F., Braeken,M., Weltjens,I., Voet,M., Bastiaens,I.,
Aert,R., Defoor,E., Weitzenegger,T., Bothe,G., Ramsperger,U.,
Hilbert,H., Braun,M., Holzer,E., Brandt,A., Peters,S., van
Staveren,M., Dirske,W., Mooijman,P., Klein Lankhorst,R., Rose,M.,
Hauf,J., Kotter,P., Berneiser,S., Hempel,S., Feldpausch,M.,
Lamberth,S., Van den Daele,H., De Keyser,A., Buysshaert,C.,
Gielen,J., Villarroel,R., De Clercq,R., Van Montagu,M., Rogers,J.,
Cronin,A., Quail,M., Bray-Allen,S., Clark,L., Doggett,J., Hall,S.,
Kay,M., Lennard,N., McLay,K., Mayes,R., Pettett,A.,
Rajandream,M.A., Lyne,M., Benes,V., Rechmann,S., Borkova,D.,
Blocker,H., Scharfe,M., Grimm,M., Lohnert,T.H., Dose,S., de
Haan,M., Maarse,A., Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M.,
Fartmann,B., Granderath,K., Dauner,D., Herzl,A., Neumann,S.,
Argiriou,A., Vitale,D., Liguori,R., Piravandi,E., Massenet,O.,
Quigley,F., Clabauld,G., Mundlein,A., Felber,R., Schnabl,S.,
Hiller,R., Schmidt,W., Lecharny,A., Aubourg,S., Chefdor,F.,
Cooke,R., Berger,C., Montfort,A., Casacuberta,E., Gibbons,T.,
Weber,N., Vandenbol,M., Bargues,M., Terol,J., Torres,A.,
Perez-Perez,A., Purnelle,B., Bent,E., Johnson,S., Tacon,D.,
Jesse,T., Heijnen,L., Schwarz,S., Scholler,P., Heber,S., Francs,P.,
Bielke,C., Frishman,D., Haase,D., Lemcke,K., Mewes,H.W.,
Stocker,S., Zaccaria,P., Bevan,M., Wilson,R.K., de la Bastide,M.,
Habermann,K., Parnell,L., Dedhia,N., Gnoj,L., Schutz,K., Huang,E.,
Spiegel,L., Sehkon,M., Murray,J., Sheet,P., Cordes,M.,
Abu-Threideh,J., Stoneking,T., Kalicki,J., Graves,T., Harmon,G.,
Edwards,J., Latreille,P., Courtney,L., Cloud,J., Abbott,A.,
Scott,K., Johnson,D., Minx,P., Bentley,D., Fulton,B., Miller,N.,
Greco,T., Kemp,K., Kramer,J., Fulton,L., Mardis,E., Dante,M.,
Pepin,K., Hillier,L., Nelson,J., Spieth,J., Ryan,E., Andrews,S.,
Geisel,C., Layman,D., Du,H., Ali,J., Berghoff,A., Jones,K.,
Drone,K., Cotton,M., Joshu,C., Antonoiu,B., Zidanic,M., Strong,C.,
Sun,H., Lamar,B., Yordan,C., Ma,P., Zhong,J., Preston,R., Vil,D.,
Shekher,M., Matero,A., Shah,R., Swaby,I.K., O'Shaughnessy,A.,
Rodriguez,M., Hoffmann,J., Till,S., Granat,S., Shohdy,N.,
Hasegawa,A., Hameed,A., Lodhi,M., Johnson,A., Chen,E., Marra,M.,
Martienssen,R. and McCombie,W.R.
TITLE Sequence and analysis of chromosome 4 of the plant Arabidopsis
thaliana
JOURNAL Nature 402 (6763), 769-777 (1999)
PUBMED 10617198
REFERENCE 2 (bases 1 to 18585056)
AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
CONSRTM TAIR
TITLE Direct Submission
JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE 3 (bases 1 to 18585056)
AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
Vaughn,M. and Town,C.D.
TITLE Direct Submission
JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
9704 Medical Center Dr, Rockville, MD 20850, USA
REMARK Protein update by submitter
FEATURES Qualifiers
source /organism="Arabidopsis thaliana"
/mol_type="genomic DNA"
/db_xref="taxon:3702"
/chromosome="4"
/ecotype="Columbia"
protein /locus_tag="AT4G39420"
/gene_synonym="F23K16.50"
/gene_synonym="F23K16_50"
/inference="Similar to RNA sequence,
EST:INSD:EG468594.1,INSD:EG468593.1,INSD:EG468591.1,
INSD:EG468630.1,INSD:EG468597.1,INSD:EH898310.1,
INSD:EG468590.1,INSD:EH888347.1,INSD:BP844495.1,
INSD:EG468599.1,INSD:EG468600.1,INSD:EG468620.1"
/note="unknown protein; Has 46 Blast hits to 40 proteins
in 10 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 44; Viruses - 0; Other Eukaryotes - 2
(source: NCBI BLink)."
/db_xref="TAIR:AT4G39420"
/db_xref="Araport:AT4G39420"
intron_pos 58:1 (1/24)
intron_pos 99:1 (2/24)
intron_pos 124:2 (3/24)
intron_pos 165:2 (4/24)
intron_pos 247:0 (5/24)
intron_pos 402:0 (6/24)
intron_pos 464:0 (7/24)
intron_pos 562:0 (8/24)
intron_pos 776:0 (9/24)
intron_pos 909:1 (10/24)
intron_pos 949:0 (11/24)
intron_pos 1006:0 (12/24)
intron_pos 1073:0 (13/24)
intron_pos 1260:0 (14/24)
intron_pos 1280:0 (15/24)
intron_pos 1388:2 (16/24)
intron_pos 1527:0 (17/24)
intron_pos 1682:0 (18/24)
intron_pos 1721:0 (19/24)
intron_pos 1761:2 (20/24)
intron_pos 2005:0 (21/24)
intron_pos 2090:0 (22/24)
intron_pos 2151:0 (23/24)
intron_pos 2265:0 (24/24)
BEGIN
1 MEKLVKEGPT LLQLHKWEPS QFQLKLSEFR EAFISPSRQL LLLLSYHSEA LLLPLVAGRS
61 IGSEVSLSGD NEELNSPSCS GGSDPEKIES PCGSGVGSGW DLKIVRLRRL QMALDYLKYD
121 DINESLKMLG NVKLAEEGML RVLFSAVYLL SRKDRNDNEI SAVSRLLGLA TMFATEMIRR
181 YGLLEYRKDV YMFDSKPRTQ ILSLPAVSLN IDVMENSRRL SEMGYLLEIT RNIQSRITRK
241 FKKLGKGNNE KSLNLVDPNS LQDDSQLEIV PDPASAESRQ LDTSLFDTNE ELALTPMGMM
301 TAGQIIDERS YASGLVPQGI VEEKKVLPLE NPKEMMARWK ANNLDLKTVV KDALLSGRLP
361 LAVLQLHLQH SKDVVEDGEH HDTFTEVRDI GRAIAYDLFL KGEPGVAIAT LQRLGEDVEA
421 CLNQLVFGTV RRSLRYQIAE EMRKLGFLRP YEDNVLERIS LIERLYPSSH FWETYLARRK
481 ELLKAALPFD SSEISLHLGG SSLFQHLKIE CGEVDGVVLG SWTKINESAS EHAPDETDAV
541 AGYWAAAAVW SNAWDQRTFD HIVLDQPLVM GVHVPWDSQL EYYMCHNDWD EVLKLLDLIP
601 EDVLYDGSLQ IALDGPKQSS GVNYSVSSRS EYICSIEEVD AVLMDVPYIK IFRLPGDIRC
661 SLWLTTLMEQ ELARKLIFLK EYWENALDVV YLLARAGVIL GNCEVSFKEE TCTPSLDLCL
721 SIKKGGANVD TLNAVHKLFI HYCTQYNLPN LLDLYLDHHE LVLDNDSLSS LQEAVGDSHW
781 AKWLLLSRIK GREYDASFSN ARSIMSRNGA PNSEPSVPEI DEMVCTVDDI ADGAGEMAAL
841 ATMMCAPVPI QKSLSTGSVN RHTNSSAQCT LENLRSFLQR FPTLWSKLVS ACLGEDISGN
901 LLRTKTKNEY LNWRDGVFFS TARDTSLLQM LPCWFPKAVR RLVQLYIQGP LGWLSFSGYP
961 TGEYLLHRGV EFFINVDDPT EISAISWEAI IQKHIEEELH HTKTEGTELG LEHFLHRGRP
1021 LAAFNAFLEH RVEKLKLEDQ SGSSIHGQRN MQSDVPMLLA PLTQSDESLL SSVIPLAITH
1081 FGDSVLVASC AFLLELCGLS ASMLRIDVAS LRRISSFYKS NGNADMAHQK SLKRSMFHSV
1141 SSEDDLMGSL ARALANEYAY PDISSVPKQK QNPSISGSQP GLPLMLVLHH LEQASLPEIG
1201 VGRKTSGYWL LTGDGDGSEL RSQQTSASLH WSLVTLFCQM HKIPLSTKYL AMLARDNDWV
1261 GFLSEAQLGG YPFDTVLNVA SKEFGDQRLK AHILTVLRYA NSKKKATTSF SDDPSRGLSC
1321 SPSEGGAYVS AELFRVLAYS EKLKNPGEYL LSKAKEFSWS ILALIASCFP DVSPLSCLTI
1381 WLEITAARET SSIKVNDITT KIAENIGAAV VSTNSLPTDA RGVQFHYNRR NPKRRRLTAH
1441 TSVDLLASAN SLNISAGKTF CSHRTEAAED EKAEDSSVID DSSDEHASLS KMVAVLCEQR
1501 LFLPLLKAFD LFLPSCSLLP FFRALQAFSQ MRLSEASAHL GSFWGRVKEE SMHFQSNTAK
1561 DVNFGASWIS RTAVKAADAV LSACPSPYEK RCLLQLLAAT DFGDGGSAAT YYRRLYWKVN
1621 LAEPSLREND LDLGNESLDD GSLLTALEKN RQWEQARNWA KQLETIGATW TSSVHHVTET
1681 QAESMVAEWK EFLWDVPEER IALWGHCQTL FIRYSFPALQ AGLFFLRHAE VVEKDLPARE
1741 IYELLLLSLQ WLSGLTTLSH PVYPLHLLRE IETRVWLLAV EAESHVKNVG AFSPSSIGKD
1801 MVNGYSSNLI DRTASIITKM DSHISSATKN RIGEKHDARA AGQGNQRNQD TSTSIFGAST
1861 KPKRRAKGNV PQIRHFVDSS DRNTDFEDSS SLINIKSEFQ LQEESTGLEI SLSKWEESIE
1921 PAELERAVLS LLEFGQVTAA KQLQLKLAPG NLPSELIILD AVMKLAMLST PCRQVLLSML
1981 DDEVRSVIQS HSLKIDQPMI EPLQILENLS TILNEGSGRG LARKIIAVIK AANILGLTFT
2041 EAYQKQPIEL LRLLSLKAQD SFEEACLLVQ THSMPAASIA QILAESFLKG LLAAHRGGYI
2101 DSQKEEGPAP LLWRFSDFLK WAELCPSEQE IGHALMRLVI TGQEIPHACE VELLILSHHF
2161 YKSSTCLDGV DVLVALAATR VEAYVAEGDF SCLARLITGV GNFHALNFIL NILIENGQLD
2221 LLLQKFSAAA DANTGTAQAV RSFRMAVLTS LNLYNPNDHD AFAMVYKHFD MKHETATLLE
2281 ARADQAAQQW FLRYDKDQNE DLLDSMRYYI EAAEVHTSID AGNKARKACG QASLVSLQIR
2341 MPDSKWLCLS ETNARRALVD QSRFQEALIV AEAYGLNQPS EWALVLWNLM LKPELAEDFV
2401 AEFVAVLPLQ ASMLLELARF YRAEMAARGD QSQFSVWLTG GGLPAEWAKY MWRSFRCLLK
2461 RTRDLRLRLQ LATTATGFAD MVDVCMNALD KVPENAGPLV LKKGHGGGYL PLM
//