LOCUS       AEE82996.1               371 aa    PRT              PLN 23-MAR-2023
DEFINITION  Arabidopsis thaliana Papain family cysteine protease protein.
ACCESSION   CP002687-1600
PROTEIN_ID  AEE82996.1
SOURCE      Arabidopsis thaliana (thale cress)
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
            Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
            Camelineae; Arabidopsis.
REFERENCE   1  (bases 1 to 18585056)
  AUTHORS   Mayer,K., Schuller,C., Wambutt,R., Murphy,G., Volckaert,G.,
            Pohl,T., Dusterhoft,A., Stiekema,W., Entian,K.D., Terryn,N.,
            Harris,B., Ansorge,W., Brandt,P., Grivell,L., Rieger,M.,
            Weichselgartner,M., de Simone,V., Obermaier,B., Mache,R.,
            Muller,M., Kreis,M., Delseny,M., Puigdomenech,P., Watson,M.,
            Schmidtheini,T., Reichert,B., Portatelle,D., Perez-Alonso,M.,
            Boutry,M., Bancroft,I., Vos,P., Hoheisel,J., Zimmermann,W.,
            Wedler,H., Ridley,P., Langham,S.A., McCullagh,B., Bilham,L.,
            Robben,J., Van der Schueren,J., Grymonprez,B., Chuang,Y.J.,
            Vandenbussche,F., Braeken,M., Weltjens,I., Voet,M., Bastiaens,I.,
            Aert,R., Defoor,E., Weitzenegger,T., Bothe,G., Ramsperger,U.,
            Hilbert,H., Braun,M., Holzer,E., Brandt,A., Peters,S., van
            Staveren,M., Dirske,W., Mooijman,P., Klein Lankhorst,R., Rose,M.,
            Hauf,J., Kotter,P., Berneiser,S., Hempel,S., Feldpausch,M.,
            Lamberth,S., Van den Daele,H., De Keyser,A., Buysshaert,C.,
            Gielen,J., Villarroel,R., De Clercq,R., Van Montagu,M., Rogers,J.,
            Cronin,A., Quail,M., Bray-Allen,S., Clark,L., Doggett,J., Hall,S.,
            Kay,M., Lennard,N., McLay,K., Mayes,R., Pettett,A.,
            Rajandream,M.A., Lyne,M., Benes,V., Rechmann,S., Borkova,D.,
            Blocker,H., Scharfe,M., Grimm,M., Lohnert,T.H., Dose,S., de
            Haan,M., Maarse,A., Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M.,
            Fartmann,B., Granderath,K., Dauner,D., Herzl,A., Neumann,S.,
            Argiriou,A., Vitale,D., Liguori,R., Piravandi,E., Massenet,O.,
            Quigley,F., Clabauld,G., Mundlein,A., Felber,R., Schnabl,S.,
            Hiller,R., Schmidt,W., Lecharny,A., Aubourg,S., Chefdor,F.,
            Cooke,R., Berger,C., Montfort,A., Casacuberta,E., Gibbons,T.,
            Weber,N., Vandenbol,M., Bargues,M., Terol,J., Torres,A.,
            Perez-Perez,A., Purnelle,B., Bent,E., Johnson,S., Tacon,D.,
            Jesse,T., Heijnen,L., Schwarz,S., Scholler,P., Heber,S., Francs,P.,
            Bielke,C., Frishman,D., Haase,D., Lemcke,K., Mewes,H.W.,
            Stocker,S., Zaccaria,P., Bevan,M., Wilson,R.K., de la Bastide,M.,
            Habermann,K., Parnell,L., Dedhia,N., Gnoj,L., Schutz,K., Huang,E.,
            Spiegel,L., Sehkon,M., Murray,J., Sheet,P., Cordes,M.,
            Abu-Threideh,J., Stoneking,T., Kalicki,J., Graves,T., Harmon,G.,
            Edwards,J., Latreille,P., Courtney,L., Cloud,J., Abbott,A.,
            Scott,K., Johnson,D., Minx,P., Bentley,D., Fulton,B., Miller,N.,
            Greco,T., Kemp,K., Kramer,J., Fulton,L., Mardis,E., Dante,M.,
            Pepin,K., Hillier,L., Nelson,J., Spieth,J., Ryan,E., Andrews,S.,
            Geisel,C., Layman,D., Du,H., Ali,J., Berghoff,A., Jones,K.,
            Drone,K., Cotton,M., Joshu,C., Antonoiu,B., Zidanic,M., Strong,C.,
            Sun,H., Lamar,B., Yordan,C., Ma,P., Zhong,J., Preston,R., Vil,D.,
            Shekher,M., Matero,A., Shah,R., Swaby,I.K., O'Shaughnessy,A.,
            Rodriguez,M., Hoffmann,J., Till,S., Granat,S., Shohdy,N.,
            Hasegawa,A., Hameed,A., Lodhi,M., Johnson,A., Chen,E., Marra,M.,
            Martienssen,R. and McCombie,W.R.
  TITLE     Sequence and analysis of chromosome 4 of the plant Arabidopsis
            thaliana
  JOURNAL   Nature 402 (6763), 769-777 (1999)
   PUBMED   10617198
REFERENCE   2  (bases 1 to 18585056)
  AUTHORS   Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
  CONSRTM   TAIR
  TITLE     Direct Submission
  JOURNAL   Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
            Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE   3  (bases 1 to 18585056)
  AUTHORS   Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
            Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
            Vaughn,M. and Town,C.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
            9704 Medical Center Dr, Rockville, MD 20850, USA
  REMARK    Protein update by submitter
FEATURES             Qualifiers
     source          /organism="Arabidopsis thaliana"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:3702"
                     /chromosome="4"
                     /ecotype="Columbia"
     protein         /locus_tag="AT4G11320"
                     /gene_synonym="AtCP2"
                     /gene_synonym="CP2"
                     /gene_synonym="cysteine protease 2"
                     /gene_synonym="F8L21.110"
                     /gene_synonym="F8L21_110"
                     /inference="Similar to RNA sequence,
                     EST:INSD:DR301713.1,INSD:EL254168.1,INSD:BP604098.1,
                     INSD:DR301727.1,INSD:DR301700.1,INSD:EH955176.1,
                     INSD:EL071242.1,INSD:DR301742.1,INSD:DR301723.1,
                     INSD:BP578091.1,INSD:CF652663.1,INSD:EL215309.1,
                     INSD:T22938.1,INSD:BP660091.1,INSD:EH896048.1,
                     INSD:EH910268.1,INSD:BP580579.1,INSD:BP619676.1,
                     INSD:EL045160.1,INSD:DR301690.1,INSD:DR301732.1,
                     INSD:BP573237.1,INSD:BP575866.1,INSD:EL277496.1,
                     INSD:EL098641.1,INSD:EH897737.1,INSD:EH890406.1,
                     INSD:DR301736.1,INSD:EH915540.1,INSD:DR301745.1,
                     INSD:DR301729.1,INSD:DR301725.1,INSD:EL255304.1,
                     INSD:BP805772.1,INSD:DR301714.1,INSD:EH954226.1,
                     INSD:BP600275.1,INSD:EH968769.1,INSD:EL177938.1,
                     INSD:DR301735.1,INSD:DR301747.1,INSD:EL298965.1,
                     INSD:EH945621.1,INSD:EL250117.1,INSD:BP612387.1,
                     INSD:EL045375.1,INSD:DR301711.1,INSD:DR301707.1,
                     INSD:DR301689.1,INSD:EL161119.1,INSD:CF652358.1,
                     INSD:EL100007.1,INSD:EH820783.1,INSD:DR301695.1,
                     INSD:EL070761.1,INSD:H36946.1,INSD:DR301698.1,
                     INSD:DR301749.1,INSD:DR301721.1,INSD:BP608281.1,
                     INSD:DR301696.1,INSD:EL292809.1,INSD:EL212242.1,
                     INSD:EH984803.1,INSD:EH908713.1,INSD:EH851602.1,
                     INSD:BP571597.1,INSD:EG478384.1,INSD:EL053298.1,
                     INSD:EH930202.1,INSD:BP583338.1,INSD:DR301731.1,
                     INSD:BP809711.1,INSD:EL028922.1,INSD:DR301720.1,
                     INSD:BP581666.1,INSD:BP797894.1,INSD:BP807410.1,
                     INSD:CF651935.1,INSD:BP588413.1,INSD:DR301726.1,
                     INSD:EL213116.1,INSD:EH904030.1,INSD:BP793779.1,
                     INSD:DR301744.1,INSD:EL303884.1,INSD:BP808112.1,
                     INSD:DR301716.1,INSD:EG439246.1,INSD:DR301741.1,
                     INSD:BP612428.1,INSD:BP573874.1,INSD:EH937803.1,
                     INSD:DR301688.1,INSD:EH920401.1,INSD:DR301703.1,
                     INSD:T46532.1,INSD:BP652092.1,INSD:EL230724.1,
                     INSD:DR301738.1,INSD:BP599483.1,INSD:DR301750.1,
                     INSD:EL234573.1,INSD:BP802565.1,INSD:DR301708.1,
                     INSD:BP799106.1,INSD:H36258.1,INSD:EL262607.1,
                     INSD:BP799611.1,INSD:EL316841.1,INSD:DR301724.1,
                     INSD:BP797679.1,INSD:N38575.1,INSD:N96898.1,
                     INSD:ES161936.1,INSD:CF651624.1,INSD:EL131029.1,
                     INSD:EL281728.1,INSD:DR301709.1,INSD:EL182874.1,
                     INSD:EG495051.1,INSD:AV798538.1,INSD:BP807809.1,
                     INSD:DR301715.1,INSD:BP632105.1,INSD:DR301719.1,
                     INSD:EL196430.1,INSD:BP807997.1,INSD:EL020656.1,
                     INSD:BP784617.1,INSD:EL252306.1,INSD:H76810.1,
                     INSD:DR301722.1,INSD:BP791134.1,INSD:EL123473.1,
                     INSD:AV442093.1,INSD:BP614517.1,INSD:EH906196.1,
                     INSD:EL237791.1,INSD:DR301704.1,INSD:ES028785.1,
                     INSD:BP579590.1,INSD:AV441352.1,INSD:EL120129.1,
                     INSD:BP578989.1,INSD:BP583282.1,INSD:R90453.1,
                     INSD:DR301691.1,INSD:DR301697.1,INSD:BP583083.1,
                     INSD:BP605210.1,INSD:EH813878.1,INSD:BP800079.1,
                     INSD:DR301734.1,INSD:BP613091.1,INSD:EL319278.1,
                     INSD:BP641494.1,INSD:DR301743.1,INSD:AI100181.1,
                     INSD:DR301730.1,INSD:EH943303.1,INSD:BP564207.1,
                     INSD:N65149.1,INSD:EL286204.1,INSD:BP575486.1,
                     INSD:EL238295.1,INSD:BP564304.1,INSD:BP570038.1,
                     INSD:DR370973.1,INSD:EL181378.1,INSD:EL116251.1,
                     INSD:BP577755.1,INSD:DR301693.1,INSD:EL113156.1,
                     INSD:BP806377.1,INSD:EH922051.1,INSD:DR301706.1,
                     INSD:DR301746.1,INSD:BP599647.1,INSD:BP587894.1,
                     INSD:T20979.1,INSD:DR301692.1,INSD:EH845803.1,
                     INSD:AA712310.1,INSD:R89986.1,INSD:BX836995.1,
                     INSD:DR301712.1,INSD:EL328497.1,INSD:BX834194.1,
                     INSD:BP617959.1,INSD:DR301694.1,INSD:EG439245.1,
                     INSD:BP582346.1,INSD:EL061362.1,INSD:EL240891.1,
                     INSD:EL006200.1,INSD:EH912197.1,INSD:BP571386.1,
                     INSD:DR376137.1,INSD:DR301701.1,INSD:EH962069.1,
                     INSD:EL221537.1,INSD:BP802609.1,INSD:BP569836.1,
                     INSD:DR301685.1,INSD:DR301699.1,INSD:DR301687.1,
                     INSD:AV807325.1,INSD:EL293856.1,INSD:BP607786.1,
                     INSD:DR199690.1,INSD:DR301705.1,INSD:EL266384.1,
                     INSD:EL150079.1,INSD:T21936.1,INSD:BP571312.1,
                     INSD:AV818341.1,INSD:BP583252.1,INSD:DR301710.1,
                     INSD:EL009551.1,INSD:DR301686.1,INSD:BP802607.1,
                     INSD:BP797277.1,INSD:AI993744.1,INSD:EL262026.1,
                     INSD:H76176.1,INSD:T04497.1,INSD:DR301728.1,
                     INSD:DR301717.1,INSD:DR301702.1,INSD:BP617728.1,
                     INSD:EL254563.1,INSD:DR301740.1,INSD:EG478383.1,
                     INSD:DR301737.1,INSD:DR301748.1,INSD:DR301718.1,
                     INSD:H37052.1,INSD:EL323556.1,INSD:DR301739.1,
                     INSD:EL019502.1,INSD:DR301733.1"
                     /inference="Similar to RNA sequence,
                     mRNA:INSD:BX827227.1,INSD:BX827844.1,INSD:AY089180.1,
                     INSD:AY035055.1,INSD:AY051062.1"
                     /note="Papain family cysteine protease; FUNCTIONS IN:
                     cysteine-type endopeptidase activity, cysteine-type
                     peptidase activity; INVOLVED IN: proteolysis; LOCATED IN:
                     endomembrane system; CONTAINS InterPro DOMAIN/s:
                     Proteinase inhibitor I29, cathepsin propeptide
                     (InterPro:IPR013201), Peptidase C1A, papain
                     (InterPro:IPR013128), Peptidase C1A, papain C-terminal
                     (InterPro:IPR000668), Peptidase, cysteine peptidase active
                     site (InterPro:IPR000169); BEST Arabidopsis thaliana
                     protein match is: Papain family cysteine protease
                     (TAIR:AT4G11310.1); Has 30201 Blast hits to 17322 proteins
                     in 780 species: Archae - 12; Bacteria - 1396; Metazoa -
                     17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other
                     Eukaryotes - 2996 (source: NCBI BLink)."
                     /db_xref="TAIR:AT4G11320"
                     /db_xref="Araport:AT4G11320"
     intron_pos      166:1 (1/3)
     intron_pos      245:0 (2/3)
     intron_pos      292:0 (3/3)
BEGIN
        1 MGYAKSAMLI FLLALVIASC ATAMDMSVVS SNDNHHVTAG PGRRQGIFDA EATLMFESWM
       61 VKHGKVYDSV AEKERRLTIF EDNLRFITNR NAENLSYRLG LNRFADLSLH EYGEICHGAD
      121 PRPPRNHVFM TSSNRYKTSD GDVLPKSVDW RNEGAVTEVK DQGLCRSCWA FSTVGAVEGL
      181 NKIVTGELVT LSEQDLINCN KENNGCGGGK VETAYEFIMN NGGLGTDNDY PYKALNGVCE
      241 GRLKEDNKNV MIDGYENLPA NDEAALMKAV AHQPVTAVVD SSSREFQLYE SGVFDGTCGT
      301 NLNHGVVVVG YGTENGRDYW IVKNSRGDTW GEAGYMKMAR NIANPRGLCG IAMRASYPLK
      361 NSFSTDKVSV A
//