LOCUS AEE82996.1 371 aa PRT PLN 23-MAR-2023 DEFINITION Arabidopsis thaliana Papain family cysteine protease protein. ACCESSION CP002687-1600 PROTEIN_ID AEE82996.1 SOURCE Arabidopsis thaliana (thale cress) ORGANISM Arabidopsis thaliana Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis. REFERENCE 1 (bases 1 to 18585056) AUTHORS Mayer,K., Schuller,C., Wambutt,R., Murphy,G., Volckaert,G., Pohl,T., Dusterhoft,A., Stiekema,W., Entian,K.D., Terryn,N., Harris,B., Ansorge,W., Brandt,P., Grivell,L., Rieger,M., Weichselgartner,M., de Simone,V., Obermaier,B., Mache,R., Muller,M., Kreis,M., Delseny,M., Puigdomenech,P., Watson,M., Schmidtheini,T., Reichert,B., Portatelle,D., Perez-Alonso,M., Boutry,M., Bancroft,I., Vos,P., Hoheisel,J., Zimmermann,W., Wedler,H., Ridley,P., Langham,S.A., McCullagh,B., Bilham,L., Robben,J., Van der Schueren,J., Grymonprez,B., Chuang,Y.J., Vandenbussche,F., Braeken,M., Weltjens,I., Voet,M., Bastiaens,I., Aert,R., Defoor,E., Weitzenegger,T., Bothe,G., Ramsperger,U., Hilbert,H., Braun,M., Holzer,E., Brandt,A., Peters,S., van Staveren,M., Dirske,W., Mooijman,P., Klein Lankhorst,R., Rose,M., Hauf,J., Kotter,P., Berneiser,S., Hempel,S., Feldpausch,M., Lamberth,S., Van den Daele,H., De Keyser,A., Buysshaert,C., Gielen,J., Villarroel,R., De Clercq,R., Van Montagu,M., Rogers,J., Cronin,A., Quail,M., Bray-Allen,S., Clark,L., Doggett,J., Hall,S., Kay,M., Lennard,N., McLay,K., Mayes,R., Pettett,A., Rajandream,M.A., Lyne,M., Benes,V., Rechmann,S., Borkova,D., Blocker,H., Scharfe,M., Grimm,M., Lohnert,T.H., Dose,S., de Haan,M., Maarse,A., Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M., Fartmann,B., Granderath,K., Dauner,D., Herzl,A., Neumann,S., Argiriou,A., Vitale,D., Liguori,R., Piravandi,E., Massenet,O., Quigley,F., Clabauld,G., Mundlein,A., Felber,R., Schnabl,S., Hiller,R., Schmidt,W., Lecharny,A., Aubourg,S., Chefdor,F., Cooke,R., Berger,C., Montfort,A., Casacuberta,E., Gibbons,T., Weber,N., Vandenbol,M., Bargues,M., Terol,J., Torres,A., Perez-Perez,A., Purnelle,B., Bent,E., Johnson,S., Tacon,D., Jesse,T., Heijnen,L., Schwarz,S., Scholler,P., Heber,S., Francs,P., Bielke,C., Frishman,D., Haase,D., Lemcke,K., Mewes,H.W., Stocker,S., Zaccaria,P., Bevan,M., Wilson,R.K., de la Bastide,M., Habermann,K., Parnell,L., Dedhia,N., Gnoj,L., Schutz,K., Huang,E., Spiegel,L., Sehkon,M., Murray,J., Sheet,P., Cordes,M., Abu-Threideh,J., Stoneking,T., Kalicki,J., Graves,T., Harmon,G., Edwards,J., Latreille,P., Courtney,L., Cloud,J., Abbott,A., Scott,K., Johnson,D., Minx,P., Bentley,D., Fulton,B., Miller,N., Greco,T., Kemp,K., Kramer,J., Fulton,L., Mardis,E., Dante,M., Pepin,K., Hillier,L., Nelson,J., Spieth,J., Ryan,E., Andrews,S., Geisel,C., Layman,D., Du,H., Ali,J., Berghoff,A., Jones,K., Drone,K., Cotton,M., Joshu,C., Antonoiu,B., Zidanic,M., Strong,C., Sun,H., Lamar,B., Yordan,C., Ma,P., Zhong,J., Preston,R., Vil,D., Shekher,M., Matero,A., Shah,R., Swaby,I.K., O'Shaughnessy,A., Rodriguez,M., Hoffmann,J., Till,S., Granat,S., Shohdy,N., Hasegawa,A., Hameed,A., Lodhi,M., Johnson,A., Chen,E., Marra,M., Martienssen,R. and McCombie,W.R. TITLE Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana JOURNAL Nature 402 (6763), 769-777 (1999) PUBMED 10617198 REFERENCE 2 (bases 1 to 18585056) AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E. CONSRTM TAIR TITLE Direct Submission JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie Institution, 260 Panama Street, Stanford, CA, USA REFERENCE 3 (bases 1 to 18585056) AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M., Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R., Vaughn,M. and Town,C.D. TITLE Direct Submission JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute, 9704 Medical Center Dr, Rockville, MD 20850, USA REMARK Protein update by submitter FEATURES Qualifiers source /organism="Arabidopsis thaliana" /mol_type="genomic DNA" /db_xref="taxon:3702" /chromosome="4" /ecotype="Columbia" protein /locus_tag="AT4G11320" /gene_synonym="AtCP2" /gene_synonym="CP2" /gene_synonym="cysteine protease 2" /gene_synonym="F8L21.110" /gene_synonym="F8L21_110" /inference="Similar to RNA sequence, EST:INSD:DR301713.1,INSD:EL254168.1,INSD:BP604098.1, INSD:DR301727.1,INSD:DR301700.1,INSD:EH955176.1, INSD:EL071242.1,INSD:DR301742.1,INSD:DR301723.1, INSD:BP578091.1,INSD:CF652663.1,INSD:EL215309.1, INSD:T22938.1,INSD:BP660091.1,INSD:EH896048.1, INSD:EH910268.1,INSD:BP580579.1,INSD:BP619676.1, INSD:EL045160.1,INSD:DR301690.1,INSD:DR301732.1, INSD:BP573237.1,INSD:BP575866.1,INSD:EL277496.1, INSD:EL098641.1,INSD:EH897737.1,INSD:EH890406.1, INSD:DR301736.1,INSD:EH915540.1,INSD:DR301745.1, INSD:DR301729.1,INSD:DR301725.1,INSD:EL255304.1, INSD:BP805772.1,INSD:DR301714.1,INSD:EH954226.1, INSD:BP600275.1,INSD:EH968769.1,INSD:EL177938.1, INSD:DR301735.1,INSD:DR301747.1,INSD:EL298965.1, INSD:EH945621.1,INSD:EL250117.1,INSD:BP612387.1, INSD:EL045375.1,INSD:DR301711.1,INSD:DR301707.1, INSD:DR301689.1,INSD:EL161119.1,INSD:CF652358.1, INSD:EL100007.1,INSD:EH820783.1,INSD:DR301695.1, INSD:EL070761.1,INSD:H36946.1,INSD:DR301698.1, INSD:DR301749.1,INSD:DR301721.1,INSD:BP608281.1, INSD:DR301696.1,INSD:EL292809.1,INSD:EL212242.1, INSD:EH984803.1,INSD:EH908713.1,INSD:EH851602.1, INSD:BP571597.1,INSD:EG478384.1,INSD:EL053298.1, INSD:EH930202.1,INSD:BP583338.1,INSD:DR301731.1, INSD:BP809711.1,INSD:EL028922.1,INSD:DR301720.1, INSD:BP581666.1,INSD:BP797894.1,INSD:BP807410.1, INSD:CF651935.1,INSD:BP588413.1,INSD:DR301726.1, INSD:EL213116.1,INSD:EH904030.1,INSD:BP793779.1, INSD:DR301744.1,INSD:EL303884.1,INSD:BP808112.1, INSD:DR301716.1,INSD:EG439246.1,INSD:DR301741.1, INSD:BP612428.1,INSD:BP573874.1,INSD:EH937803.1, INSD:DR301688.1,INSD:EH920401.1,INSD:DR301703.1, INSD:T46532.1,INSD:BP652092.1,INSD:EL230724.1, INSD:DR301738.1,INSD:BP599483.1,INSD:DR301750.1, INSD:EL234573.1,INSD:BP802565.1,INSD:DR301708.1, INSD:BP799106.1,INSD:H36258.1,INSD:EL262607.1, INSD:BP799611.1,INSD:EL316841.1,INSD:DR301724.1, INSD:BP797679.1,INSD:N38575.1,INSD:N96898.1, INSD:ES161936.1,INSD:CF651624.1,INSD:EL131029.1, INSD:EL281728.1,INSD:DR301709.1,INSD:EL182874.1, INSD:EG495051.1,INSD:AV798538.1,INSD:BP807809.1, INSD:DR301715.1,INSD:BP632105.1,INSD:DR301719.1, INSD:EL196430.1,INSD:BP807997.1,INSD:EL020656.1, INSD:BP784617.1,INSD:EL252306.1,INSD:H76810.1, INSD:DR301722.1,INSD:BP791134.1,INSD:EL123473.1, INSD:AV442093.1,INSD:BP614517.1,INSD:EH906196.1, INSD:EL237791.1,INSD:DR301704.1,INSD:ES028785.1, INSD:BP579590.1,INSD:AV441352.1,INSD:EL120129.1, INSD:BP578989.1,INSD:BP583282.1,INSD:R90453.1, INSD:DR301691.1,INSD:DR301697.1,INSD:BP583083.1, INSD:BP605210.1,INSD:EH813878.1,INSD:BP800079.1, INSD:DR301734.1,INSD:BP613091.1,INSD:EL319278.1, INSD:BP641494.1,INSD:DR301743.1,INSD:AI100181.1, INSD:DR301730.1,INSD:EH943303.1,INSD:BP564207.1, INSD:N65149.1,INSD:EL286204.1,INSD:BP575486.1, INSD:EL238295.1,INSD:BP564304.1,INSD:BP570038.1, INSD:DR370973.1,INSD:EL181378.1,INSD:EL116251.1, INSD:BP577755.1,INSD:DR301693.1,INSD:EL113156.1, INSD:BP806377.1,INSD:EH922051.1,INSD:DR301706.1, INSD:DR301746.1,INSD:BP599647.1,INSD:BP587894.1, INSD:T20979.1,INSD:DR301692.1,INSD:EH845803.1, INSD:AA712310.1,INSD:R89986.1,INSD:BX836995.1, INSD:DR301712.1,INSD:EL328497.1,INSD:BX834194.1, INSD:BP617959.1,INSD:DR301694.1,INSD:EG439245.1, INSD:BP582346.1,INSD:EL061362.1,INSD:EL240891.1, INSD:EL006200.1,INSD:EH912197.1,INSD:BP571386.1, INSD:DR376137.1,INSD:DR301701.1,INSD:EH962069.1, INSD:EL221537.1,INSD:BP802609.1,INSD:BP569836.1, INSD:DR301685.1,INSD:DR301699.1,INSD:DR301687.1, INSD:AV807325.1,INSD:EL293856.1,INSD:BP607786.1, INSD:DR199690.1,INSD:DR301705.1,INSD:EL266384.1, INSD:EL150079.1,INSD:T21936.1,INSD:BP571312.1, INSD:AV818341.1,INSD:BP583252.1,INSD:DR301710.1, INSD:EL009551.1,INSD:DR301686.1,INSD:BP802607.1, INSD:BP797277.1,INSD:AI993744.1,INSD:EL262026.1, INSD:H76176.1,INSD:T04497.1,INSD:DR301728.1, INSD:DR301717.1,INSD:DR301702.1,INSD:BP617728.1, INSD:EL254563.1,INSD:DR301740.1,INSD:EG478383.1, INSD:DR301737.1,INSD:DR301748.1,INSD:DR301718.1, INSD:H37052.1,INSD:EL323556.1,INSD:DR301739.1, INSD:EL019502.1,INSD:DR301733.1" /inference="Similar to RNA sequence, mRNA:INSD:BX827227.1,INSD:BX827844.1,INSD:AY089180.1, INSD:AY035055.1,INSD:AY051062.1" /note="Papain family cysteine protease; FUNCTIONS IN: cysteine-type endopeptidase activity, cysteine-type peptidase activity; INVOLVED IN: proteolysis; LOCATED IN: endomembrane system; CONTAINS InterPro DOMAIN/s: Proteinase inhibitor I29, cathepsin propeptide (InterPro:IPR013201), Peptidase C1A, papain (InterPro:IPR013128), Peptidase C1A, papain C-terminal (InterPro:IPR000668), Peptidase, cysteine peptidase active site (InterPro:IPR000169); BEST Arabidopsis thaliana protein match is: Papain family cysteine protease (TAIR:AT4G11310.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink)." /db_xref="TAIR:AT4G11320" /db_xref="Araport:AT4G11320" intron_pos 166:1 (1/3) intron_pos 245:0 (2/3) intron_pos 292:0 (3/3) BEGIN 1 MGYAKSAMLI FLLALVIASC ATAMDMSVVS SNDNHHVTAG PGRRQGIFDA EATLMFESWM 61 VKHGKVYDSV AEKERRLTIF EDNLRFITNR NAENLSYRLG LNRFADLSLH EYGEICHGAD 121 PRPPRNHVFM TSSNRYKTSD GDVLPKSVDW RNEGAVTEVK DQGLCRSCWA FSTVGAVEGL 181 NKIVTGELVT LSEQDLINCN KENNGCGGGK VETAYEFIMN NGGLGTDNDY PYKALNGVCE 241 GRLKEDNKNV MIDGYENLPA NDEAALMKAV AHQPVTAVVD SSSREFQLYE SGVFDGTCGT 301 NLNHGVVVVG YGTENGRDYW IVKNSRGDTW GEAGYMKMAR NIANPRGLCG IAMRASYPLK 361 NSFSTDKVSV A //