LOCUS AEE79989.1 477 aa PRT PLN 23-MAR-2023
DEFINITION Arabidopsis thaliana Peptidase family C54 protein.
ACCESSION CP002686-8782
PROTEIN_ID AEE79989.1
SOURCE Arabidopsis thaliana (thale cress)
ORGANISM Arabidopsis thaliana
Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
Camelineae; Arabidopsis.
REFERENCE 1 (bases 1 to 23459830)
AUTHORS Salanoubat,M., Lemcke,K., Rieger,M., Ansorge,W., Unseld,M.,
Fartmann,B., Valle,G., Blocker,H., Perez-Alonso,M., Obermaier,B.,
Delseny,M., Boutry,M., Grivell,L.A., Mache,R., Puigdomenech,P., De
Simone,V., Choisne,N., Artiguenave,F., Robert,C., Brottier,P.,
Wincker,P., Cattolico,L., Weissenbach,J., Saurin,W., Quetier,F.,
Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M., Benes,V.,
Wurmbach,E., Drzonek,H., Erfle,H., Jordan,N., Bangert,S.,
Wiedelmann,R., Kranz,H., Voss,H., Holland,R., Brandt,P.,
Nyakatura,G., Vezzi,A., D'Angelo,M., Pallavicini,A., Toppo,S.,
Simionati,B., Conrad,A., Hornischer,K., Kauer,G., Lohnert,T.H.,
Nordsiek,G., Reichelt,J., Scharfe,M., Schon,O., Bargues,M.,
Terol,J., Climent,J., Navarro,P., Collado,C., Perez-Perez,A.,
Ottenwalder,B., Duchemin,D., Cooke,R., Laudie,M., Berger-Llauro,C.,
Purnelle,B., Masuy,D., de Haan,M., Maarse,A.C., Alcaraz,J.P.,
Cottet,A., Casacuberta,E., Monfort,A., Argiriou,A., Flores,M.,
Liguori,R., Vitale,D., Mannhaupt,G., Haase,D., Schoof,H., Rudd,S.,
Zaccaria,P., Mewes,H.W., Mayer,K.F., Kaul,S., Town,C.D., Koo,H.L.,
Tallon,L.J., Jenkins,J., Rooney,T., Rizzo,M., Walts,A.,
Utterback,T., Fujii,C.Y., Shea,T.P., Creasy,T.H., Haas,B.,
Maiti,R., Wu,D., Peterson,J., Van Aken,S., Pai,G., Militscher,J.,
Sellers,P., Gill,J.E., Feldblyum,T.V., Preuss,D., Lin,X.,
Nierman,W.C., Salzberg,S.L., White,O., Venter,J.C., Fraser,C.M.,
Kaneko,T., Nakamura,Y., Sato,S., Kato,T., Asamizu,E., Sasamoto,S.,
Kimura,T., Idesawa,K., Kawashima,K., Kishida,Y., Kiyokawa,C.,
Kohara,M., Matsumoto,M., Matsuno,A., Muraki,A., Nakayama,S.,
Nakazaki,N., Shinpo,S., Takeuchi,C., Wada,T., Watanabe,A.,
Yamada,M., Yasuda,M. and Tabata,S.
CONSRTM European Union Chromosome 3 Arabidopsis Sequencing Consortium;
Institute for Genomic Research; Kazusa DNA Research Institute
TITLE Sequence and analysis of chromosome 3 of the plant Arabidopsis
thaliana
JOURNAL Nature 408 (6814), 820-822 (2000)
PUBMED 11130713
REFERENCE 2 (bases 1 to 23459830)
AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
CONSRTM TAIR
TITLE Direct Submission
JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE 3 (bases 1 to 23459830)
AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
Vaughn,M. and Town,C.D.
TITLE Direct Submission
JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
9704 Medical Center Dr, Rockville, MD 20850, USA
REMARK Protein update by submitter
FEATURES Qualifiers
source /organism="Arabidopsis thaliana"
/mol_type="genomic DNA"
/db_xref="taxon:3702"
/chromosome="3"
/ecotype="Columbia"
protein /locus_tag="AT3G59950"
/inference="Similar to RNA sequence,
EST:INSD:EG503150.1,INSD:BP593127.1,INSD:EL170250.1,
INSD:EH824862.1,INSD:AA404769.1,INSD:BP645960.1,
INSD:BP666999.1,INSD:EG503151.1,INSD:AV813762.1,
INSD:EG503140.1,INSD:EL971046.1,INSD:BP609347.1,
INSD:AV819686.1,INSD:EG523876.1,INSD:AV784881.1,
INSD:AV790674.1,INSD:AV441523.1,INSD:BP603713.1,
INSD:BP590529.1,INSD:ES109288.1,INSD:BP862449.1,
INSD:AV801044.1,INSD:EG503147.1,INSD:EG503143.1,
INSD:BP599709.1,INSD:EG523741.1,INSD:EL009559.1,
INSD:BP605836.1,INSD:BP615313.1,INSD:EG503137.1,
INSD:BP644860.1,INSD:AV802970.1,INSD:EH878624.1,
INSD:BP624277.1,INSD:BP625295.1,INSD:BP648003.1,
INSD:AI099953.1,INSD:BP802894.1,INSD:AV559575.1,
INSD:AI099966.1,INSD:BP616660.1,INSD:EG503134.1,
INSD:EL305551.1,INSD:AV807908.1,INSD:BP598380.1,
INSD:BG459201.1,INSD:BP602501.1,INSD:BP855185.1,
INSD:BP595513.1,INSD:DR243672.1,INSD:AV804776.1,
INSD:EL978037.1,INSD:BP638398.1,INSD:BP609919.1,
INSD:AV806811.1,INSD:AV789549.1,INSD:BP606142.1,
INSD:EL112266.1,INSD:BP609882.1,INSD:EH856653.1,
INSD:BX834153.1,INSD:BP612096.1,INSD:AV804439.1,
INSD:EG457507.1,INSD:BP605880.1,INSD:BP852382.1,
INSD:CB263876.1,INSD:BP620613.1,INSD:DR346708.1,
INSD:BP620060.1,INSD:EG503135.1,INSD:BP665561.1,
INSD:BP601805.1,INSD:BP643866.1,INSD:BP666558.1,
INSD:EL147472.1,INSD:BP618167.1,INSD:DR243677.1,
INSD:EG503146.1,INSD:BP625338.1,INSD:EL182399.1,
INSD:AV557372.1,INSD:BP665453.1,INSD:BE530454.1,
INSD:AV797862.1,INSD:EL035567.1,INSD:BP639238.1,
INSD:BP591725.1,INSD:AV820326.1,INSD:AV519866.1,
INSD:DR346712.1,INSD:EH956850.1,INSD:AV800783.1,
INSD:DR243675.1,INSD:R64998.1,INSD:AV789326.1,
INSD:DR371308.1,INSD:ES131563.1,INSD:EL068907.1,
INSD:AV821045.1,INSD:BP598581.1,INSD:AV536965.1,
INSD:BP637702.1,INSD:AV565819.1"
/inference="Similar to RNA sequence,
mRNA:INSD:BX823405.1,INSD:AB073172.1,INSD:AK226932.1"
/note="Peptidase family C54 protein; FUNCTIONS IN:
peptidase activity; INVOLVED IN: autophagy; LOCATED IN:
chloroplast; EXPRESSED IN: 23 plant structures; EXPRESSED
DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s:
Peptidase C54 (InterPro:IPR005078); BEST Arabidopsis
thaliana protein match is: Peptidase family C54 protein
(TAIR:AT2G44140.1); Has 844 Blast hits to 806 proteins in
218 species: Archaea - 0; Bacteria - 0; Metazoa - 426;
Fungi - 182; Plants - 81; Viruses - 0; Other Eukaryotes -
155 (source: NCBI BLink)."
/db_xref="TAIR:AT3G59950"
/db_xref="Araport:AT3G59950"
intron_pos 156:1 (1/7)
intron_pos 185:0 (2/7)
intron_pos 202:0 (3/7)
intron_pos 328:2 (4/7)
intron_pos 375:0 (5/7)
intron_pos 393:2 (6/7)
intron_pos 417:1 (7/7)
BEGIN
1 MKAICDRFVP SKCSSSSTSE KRDISSPTSL VSDSASSDNK SNLTLCSDVV ASSSPVSQLC
61 REASTSGHNP VCTTHSSWTV ILKTASMASG AIRRFQDRVL GPSRTGISSS TSEIWLLGVC
121 YKISEGESSE EADAGRVLAA FRQDFSSLIL MTYRRGFEPI GDTTYTSDVN WGCMLRSGQM
181 LFAQALLFQR LGRSWRKKDS EPADEKYLEI LELFGDTEAS AFSIHNLILA GESYGLAAGS
241 WVGPYAVCRS WESLARKNKE ETDDKHKSFS MAVHIVSGSE DGERGGAPIL CIEDVTKTCL
301 EFSEGETEWP PILLLVPLVL GLDRVNPRYI PSLIATFTFP QSLGILGGKP GASTYIVGVQ
361 EDKGFYLDPH DVQQVVTVKK ENQDVDTSSY HCNTLRYVPL ESLDPSLALG FYCQHKDDFD
421 DFCIRATKLA GDSNGAPLFT VTQSHRRNDC GIAETSSSTE TSTEISGEEH EDDWQLL
//