LOCUS AEE82996.1 371 aa PRT PLN 23-MAR-2023
DEFINITION Arabidopsis thaliana Papain family cysteine protease protein.
ACCESSION CP002687-1600
PROTEIN_ID AEE82996.1
SOURCE Arabidopsis thaliana (thale cress)
ORGANISM Arabidopsis thaliana
Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
Camelineae; Arabidopsis.
REFERENCE 1 (bases 1 to 18585056)
AUTHORS Mayer,K., Schuller,C., Wambutt,R., Murphy,G., Volckaert,G.,
Pohl,T., Dusterhoft,A., Stiekema,W., Entian,K.D., Terryn,N.,
Harris,B., Ansorge,W., Brandt,P., Grivell,L., Rieger,M.,
Weichselgartner,M., de Simone,V., Obermaier,B., Mache,R.,
Muller,M., Kreis,M., Delseny,M., Puigdomenech,P., Watson,M.,
Schmidtheini,T., Reichert,B., Portatelle,D., Perez-Alonso,M.,
Boutry,M., Bancroft,I., Vos,P., Hoheisel,J., Zimmermann,W.,
Wedler,H., Ridley,P., Langham,S.A., McCullagh,B., Bilham,L.,
Robben,J., Van der Schueren,J., Grymonprez,B., Chuang,Y.J.,
Vandenbussche,F., Braeken,M., Weltjens,I., Voet,M., Bastiaens,I.,
Aert,R., Defoor,E., Weitzenegger,T., Bothe,G., Ramsperger,U.,
Hilbert,H., Braun,M., Holzer,E., Brandt,A., Peters,S., van
Staveren,M., Dirske,W., Mooijman,P., Klein Lankhorst,R., Rose,M.,
Hauf,J., Kotter,P., Berneiser,S., Hempel,S., Feldpausch,M.,
Lamberth,S., Van den Daele,H., De Keyser,A., Buysshaert,C.,
Gielen,J., Villarroel,R., De Clercq,R., Van Montagu,M., Rogers,J.,
Cronin,A., Quail,M., Bray-Allen,S., Clark,L., Doggett,J., Hall,S.,
Kay,M., Lennard,N., McLay,K., Mayes,R., Pettett,A.,
Rajandream,M.A., Lyne,M., Benes,V., Rechmann,S., Borkova,D.,
Blocker,H., Scharfe,M., Grimm,M., Lohnert,T.H., Dose,S., de
Haan,M., Maarse,A., Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M.,
Fartmann,B., Granderath,K., Dauner,D., Herzl,A., Neumann,S.,
Argiriou,A., Vitale,D., Liguori,R., Piravandi,E., Massenet,O.,
Quigley,F., Clabauld,G., Mundlein,A., Felber,R., Schnabl,S.,
Hiller,R., Schmidt,W., Lecharny,A., Aubourg,S., Chefdor,F.,
Cooke,R., Berger,C., Montfort,A., Casacuberta,E., Gibbons,T.,
Weber,N., Vandenbol,M., Bargues,M., Terol,J., Torres,A.,
Perez-Perez,A., Purnelle,B., Bent,E., Johnson,S., Tacon,D.,
Jesse,T., Heijnen,L., Schwarz,S., Scholler,P., Heber,S., Francs,P.,
Bielke,C., Frishman,D., Haase,D., Lemcke,K., Mewes,H.W.,
Stocker,S., Zaccaria,P., Bevan,M., Wilson,R.K., de la Bastide,M.,
Habermann,K., Parnell,L., Dedhia,N., Gnoj,L., Schutz,K., Huang,E.,
Spiegel,L., Sehkon,M., Murray,J., Sheet,P., Cordes,M.,
Abu-Threideh,J., Stoneking,T., Kalicki,J., Graves,T., Harmon,G.,
Edwards,J., Latreille,P., Courtney,L., Cloud,J., Abbott,A.,
Scott,K., Johnson,D., Minx,P., Bentley,D., Fulton,B., Miller,N.,
Greco,T., Kemp,K., Kramer,J., Fulton,L., Mardis,E., Dante,M.,
Pepin,K., Hillier,L., Nelson,J., Spieth,J., Ryan,E., Andrews,S.,
Geisel,C., Layman,D., Du,H., Ali,J., Berghoff,A., Jones,K.,
Drone,K., Cotton,M., Joshu,C., Antonoiu,B., Zidanic,M., Strong,C.,
Sun,H., Lamar,B., Yordan,C., Ma,P., Zhong,J., Preston,R., Vil,D.,
Shekher,M., Matero,A., Shah,R., Swaby,I.K., O'Shaughnessy,A.,
Rodriguez,M., Hoffmann,J., Till,S., Granat,S., Shohdy,N.,
Hasegawa,A., Hameed,A., Lodhi,M., Johnson,A., Chen,E., Marra,M.,
Martienssen,R. and McCombie,W.R.
TITLE Sequence and analysis of chromosome 4 of the plant Arabidopsis
thaliana
JOURNAL Nature 402 (6763), 769-777 (1999)
PUBMED 10617198
REFERENCE 2 (bases 1 to 18585056)
AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
CONSRTM TAIR
TITLE Direct Submission
JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE 3 (bases 1 to 18585056)
AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
Vaughn,M. and Town,C.D.
TITLE Direct Submission
JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
9704 Medical Center Dr, Rockville, MD 20850, USA
REMARK Protein update by submitter
FEATURES Qualifiers
source /organism="Arabidopsis thaliana"
/mol_type="genomic DNA"
/db_xref="taxon:3702"
/chromosome="4"
/ecotype="Columbia"
protein /locus_tag="AT4G11320"
/gene_synonym="AtCP2"
/gene_synonym="CP2"
/gene_synonym="cysteine protease 2"
/gene_synonym="F8L21.110"
/gene_synonym="F8L21_110"
/inference="Similar to RNA sequence,
EST:INSD:DR301713.1,INSD:EL254168.1,INSD:BP604098.1,
INSD:DR301727.1,INSD:DR301700.1,INSD:EH955176.1,
INSD:EL071242.1,INSD:DR301742.1,INSD:DR301723.1,
INSD:BP578091.1,INSD:CF652663.1,INSD:EL215309.1,
INSD:T22938.1,INSD:BP660091.1,INSD:EH896048.1,
INSD:EH910268.1,INSD:BP580579.1,INSD:BP619676.1,
INSD:EL045160.1,INSD:DR301690.1,INSD:DR301732.1,
INSD:BP573237.1,INSD:BP575866.1,INSD:EL277496.1,
INSD:EL098641.1,INSD:EH897737.1,INSD:EH890406.1,
INSD:DR301736.1,INSD:EH915540.1,INSD:DR301745.1,
INSD:DR301729.1,INSD:DR301725.1,INSD:EL255304.1,
INSD:BP805772.1,INSD:DR301714.1,INSD:EH954226.1,
INSD:BP600275.1,INSD:EH968769.1,INSD:EL177938.1,
INSD:DR301735.1,INSD:DR301747.1,INSD:EL298965.1,
INSD:EH945621.1,INSD:EL250117.1,INSD:BP612387.1,
INSD:EL045375.1,INSD:DR301711.1,INSD:DR301707.1,
INSD:DR301689.1,INSD:EL161119.1,INSD:CF652358.1,
INSD:EL100007.1,INSD:EH820783.1,INSD:DR301695.1,
INSD:EL070761.1,INSD:H36946.1,INSD:DR301698.1,
INSD:DR301749.1,INSD:DR301721.1,INSD:BP608281.1,
INSD:DR301696.1,INSD:EL292809.1,INSD:EL212242.1,
INSD:EH984803.1,INSD:EH908713.1,INSD:EH851602.1,
INSD:BP571597.1,INSD:EG478384.1,INSD:EL053298.1,
INSD:EH930202.1,INSD:BP583338.1,INSD:DR301731.1,
INSD:BP809711.1,INSD:EL028922.1,INSD:DR301720.1,
INSD:BP581666.1,INSD:BP797894.1,INSD:BP807410.1,
INSD:CF651935.1,INSD:BP588413.1,INSD:DR301726.1,
INSD:EL213116.1,INSD:EH904030.1,INSD:BP793779.1,
INSD:DR301744.1,INSD:EL303884.1,INSD:BP808112.1,
INSD:DR301716.1,INSD:EG439246.1,INSD:DR301741.1,
INSD:BP612428.1,INSD:BP573874.1,INSD:EH937803.1,
INSD:DR301688.1,INSD:EH920401.1,INSD:DR301703.1,
INSD:T46532.1,INSD:BP652092.1,INSD:EL230724.1,
INSD:DR301738.1,INSD:BP599483.1,INSD:DR301750.1,
INSD:EL234573.1,INSD:BP802565.1,INSD:DR301708.1,
INSD:BP799106.1,INSD:H36258.1,INSD:EL262607.1,
INSD:BP799611.1,INSD:EL316841.1,INSD:DR301724.1,
INSD:BP797679.1,INSD:N38575.1,INSD:N96898.1,
INSD:ES161936.1,INSD:CF651624.1,INSD:EL131029.1,
INSD:EL281728.1,INSD:DR301709.1,INSD:EL182874.1,
INSD:EG495051.1,INSD:AV798538.1,INSD:BP807809.1,
INSD:DR301715.1,INSD:BP632105.1,INSD:DR301719.1,
INSD:EL196430.1,INSD:BP807997.1,INSD:EL020656.1,
INSD:BP784617.1,INSD:EL252306.1,INSD:H76810.1,
INSD:DR301722.1,INSD:BP791134.1,INSD:EL123473.1,
INSD:AV442093.1,INSD:BP614517.1,INSD:EH906196.1,
INSD:EL237791.1,INSD:DR301704.1,INSD:ES028785.1,
INSD:BP579590.1,INSD:AV441352.1,INSD:EL120129.1,
INSD:BP578989.1,INSD:BP583282.1,INSD:R90453.1,
INSD:DR301691.1,INSD:DR301697.1,INSD:BP583083.1,
INSD:BP605210.1,INSD:EH813878.1,INSD:BP800079.1,
INSD:DR301734.1,INSD:BP613091.1,INSD:EL319278.1,
INSD:BP641494.1,INSD:DR301743.1,INSD:AI100181.1,
INSD:DR301730.1,INSD:EH943303.1,INSD:BP564207.1,
INSD:N65149.1,INSD:EL286204.1,INSD:BP575486.1,
INSD:EL238295.1,INSD:BP564304.1,INSD:BP570038.1,
INSD:DR370973.1,INSD:EL181378.1,INSD:EL116251.1,
INSD:BP577755.1,INSD:DR301693.1,INSD:EL113156.1,
INSD:BP806377.1,INSD:EH922051.1,INSD:DR301706.1,
INSD:DR301746.1,INSD:BP599647.1,INSD:BP587894.1,
INSD:T20979.1,INSD:DR301692.1,INSD:EH845803.1,
INSD:AA712310.1,INSD:R89986.1,INSD:BX836995.1,
INSD:DR301712.1,INSD:EL328497.1,INSD:BX834194.1,
INSD:BP617959.1,INSD:DR301694.1,INSD:EG439245.1,
INSD:BP582346.1,INSD:EL061362.1,INSD:EL240891.1,
INSD:EL006200.1,INSD:EH912197.1,INSD:BP571386.1,
INSD:DR376137.1,INSD:DR301701.1,INSD:EH962069.1,
INSD:EL221537.1,INSD:BP802609.1,INSD:BP569836.1,
INSD:DR301685.1,INSD:DR301699.1,INSD:DR301687.1,
INSD:AV807325.1,INSD:EL293856.1,INSD:BP607786.1,
INSD:DR199690.1,INSD:DR301705.1,INSD:EL266384.1,
INSD:EL150079.1,INSD:T21936.1,INSD:BP571312.1,
INSD:AV818341.1,INSD:BP583252.1,INSD:DR301710.1,
INSD:EL009551.1,INSD:DR301686.1,INSD:BP802607.1,
INSD:BP797277.1,INSD:AI993744.1,INSD:EL262026.1,
INSD:H76176.1,INSD:T04497.1,INSD:DR301728.1,
INSD:DR301717.1,INSD:DR301702.1,INSD:BP617728.1,
INSD:EL254563.1,INSD:DR301740.1,INSD:EG478383.1,
INSD:DR301737.1,INSD:DR301748.1,INSD:DR301718.1,
INSD:H37052.1,INSD:EL323556.1,INSD:DR301739.1,
INSD:EL019502.1,INSD:DR301733.1"
/inference="Similar to RNA sequence,
mRNA:INSD:BX827227.1,INSD:BX827844.1,INSD:AY089180.1,
INSD:AY035055.1,INSD:AY051062.1"
/note="Papain family cysteine protease; FUNCTIONS IN:
cysteine-type endopeptidase activity, cysteine-type
peptidase activity; INVOLVED IN: proteolysis; LOCATED IN:
endomembrane system; CONTAINS InterPro DOMAIN/s:
Proteinase inhibitor I29, cathepsin propeptide
(InterPro:IPR013201), Peptidase C1A, papain
(InterPro:IPR013128), Peptidase C1A, papain C-terminal
(InterPro:IPR000668), Peptidase, cysteine peptidase active
site (InterPro:IPR000169); BEST Arabidopsis thaliana
protein match is: Papain family cysteine protease
(TAIR:AT4G11310.1); Has 30201 Blast hits to 17322 proteins
in 780 species: Archae - 12; Bacteria - 1396; Metazoa -
17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other
Eukaryotes - 2996 (source: NCBI BLink)."
/db_xref="TAIR:AT4G11320"
/db_xref="Araport:AT4G11320"
intron_pos 166:1 (1/3)
intron_pos 245:0 (2/3)
intron_pos 292:0 (3/3)
BEGIN
1 MGYAKSAMLI FLLALVIASC ATAMDMSVVS SNDNHHVTAG PGRRQGIFDA EATLMFESWM
61 VKHGKVYDSV AEKERRLTIF EDNLRFITNR NAENLSYRLG LNRFADLSLH EYGEICHGAD
121 PRPPRNHVFM TSSNRYKTSD GDVLPKSVDW RNEGAVTEVK DQGLCRSCWA FSTVGAVEGL
181 NKIVTGELVT LSEQDLINCN KENNGCGGGK VETAYEFIMN NGGLGTDNDY PYKALNGVCE
241 GRLKEDNKNV MIDGYENLPA NDEAALMKAV AHQPVTAVVD SSSREFQLYE SGVFDGTCGT
301 NLNHGVVVVG YGTENGRDYW IVKNSRGDTW GEAGYMKMAR NIANPRGLCG IAMRASYPLK
361 NSFSTDKVSV A
//