LOCUS       AEE78621.1               632 aa    PRT              PLN 23-MAR-2023
DEFINITION  Arabidopsis thaliana Eukaryotic aspartyl protease family
            protein protein.
ACCESSION   CP002686-6935
PROTEIN_ID  AEE78621.1
SOURCE      Arabidopsis thaliana (thale cress)
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
            Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
            Camelineae; Arabidopsis.
REFERENCE   1  (bases 1 to 23459830)
  AUTHORS   Salanoubat,M., Lemcke,K., Rieger,M., Ansorge,W., Unseld,M.,
            Fartmann,B., Valle,G., Blocker,H., Perez-Alonso,M., Obermaier,B.,
            Delseny,M., Boutry,M., Grivell,L.A., Mache,R., Puigdomenech,P., De
            Simone,V., Choisne,N., Artiguenave,F., Robert,C., Brottier,P.,
            Wincker,P., Cattolico,L., Weissenbach,J., Saurin,W., Quetier,F.,
            Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M., Benes,V.,
            Wurmbach,E., Drzonek,H., Erfle,H., Jordan,N., Bangert,S.,
            Wiedelmann,R., Kranz,H., Voss,H., Holland,R., Brandt,P.,
            Nyakatura,G., Vezzi,A., D'Angelo,M., Pallavicini,A., Toppo,S.,
            Simionati,B., Conrad,A., Hornischer,K., Kauer,G., Lohnert,T.H.,
            Nordsiek,G., Reichelt,J., Scharfe,M., Schon,O., Bargues,M.,
            Terol,J., Climent,J., Navarro,P., Collado,C., Perez-Perez,A.,
            Ottenwalder,B., Duchemin,D., Cooke,R., Laudie,M., Berger-Llauro,C.,
            Purnelle,B., Masuy,D., de Haan,M., Maarse,A.C., Alcaraz,J.P.,
            Cottet,A., Casacuberta,E., Monfort,A., Argiriou,A., flores,M.,
            Liguori,R., Vitale,D., Mannhaupt,G., Haase,D., Schoof,H., Rudd,S.,
            Zaccaria,P., Mewes,H.W., Mayer,K.F., Kaul,S., Town,C.D., Koo,H.L.,
            Tallon,L.J., Jenkins,J., Rooney,T., Rizzo,M., Walts,A.,
            Utterback,T., Fujii,C.Y., Shea,T.P., Creasy,T.H., Haas,B.,
            Maiti,R., Wu,D., Peterson,J., Van Aken,S., Pai,G., Militscher,J.,
            Sellers,P., Gill,J.E., Feldblyum,T.V., Preuss,D., Lin,X.,
            Nierman,W.C., Salzberg,S.L., White,O., Venter,J.C., Fraser,C.M.,
            Kaneko,T., Nakamura,Y., Sato,S., Kato,T., Asamizu,E., Sasamoto,S.,
            Kimura,T., Idesawa,K., Kawashima,K., Kishida,Y., Kiyokawa,C.,
            Kohara,M., Matsumoto,M., Matsuno,A., Muraki,A., Nakayama,S.,
            Nakazaki,N., Shinpo,S., Takeuchi,C., Wada,T., Watanabe,A.,
            Yamada,M., Yasuda,M. and Tabata,S.
  CONSRTM   European Union Chromosome 3 Arabidopsis Sequencing Consortium;
            Institute for Genomic Research; Kazusa DNA Research Institute
  TITLE     Sequence and analysis of chromosome 3 of the plant Arabidopsis
            thaliana
  JOURNAL   Nature 408 (6814), 820-822 (2000)
   PUBMED   11130713
REFERENCE   2  (bases 1 to 23459830)
  AUTHORS   Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
  CONSRTM   TAIR
  TITLE     Direct Submission
  JOURNAL   Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
            Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE   3  (bases 1 to 23459830)
  AUTHORS   Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
            Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
            Vaughn,M. and Town,C.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
            9704 Medical Center Dr, Rockville, MD 20850, USA
  REMARK    Protein update by submitter
FEATURES             Qualifiers
     source          /organism="Arabidopsis thaliana"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:3702"
                     /chromosome="3"
                     /ecotype="Columbia"
     protein         /locus_tag="AT3G50050"
                     /inference="Similar to RNA sequence,
                     EST:INSD:AU237805.1,INSD:ES010409.1,INSD:EL208661.1,
                     INSD:AV548904.1,INSD:AV522293.1,INSD:ES066335.1,
                     INSD:EH954077.1,INSD:BP809654.1,INSD:AU228897.1"
                     /inference="Similar to RNA sequence,
                     mRNA:INSD:BT015816.1,INSD:BX824218.1,INSD:BT020205.1"
                     /note="Eukaryotic aspartyl protease family protein;
                     FUNCTIONS IN: aspartic-type endopeptidase activity;
                     INVOLVED IN: proteolysis; LOCATED IN: endomembrane system;
                     EXPRESSED IN: 17 plant structures; EXPRESSED DURING: 9
                     growth stages; CONTAINS InterPro DOMAIN/s: Peptidase
                     aspartic (InterPro:IPR021109), Peptidase aspartic,
                     catalytic (InterPro:IPR009007), Peptidase A1
                     (InterPro:IPR001461), Peptidase aspartic, active site
                     (InterPro:IPR001969); BEST Arabidopsis thaliana protein
                     match is: Eukaryotic aspartyl protease family protein
                     (TAIR:AT5G43100.1); Has 5152 Blast hits to 5136 proteins
                     in 391 species: Archae - 0; Bacteria - 0; Metazoa - 1866;
                     Fungi - 764; Plants - 2030; Viruses - 0; Other Eukaryotes
                     - 492 (source: NCBI BLink)."
                     /db_xref="TAIR:AT3G50050"
                     /db_xref="Araport:AT3G50050"
     intron_pos      91:2 (1/11)
     intron_pos      132:0 (2/11)
     intron_pos      274:2 (3/11)
     intron_pos      327:0 (4/11)
     intron_pos      354:2 (5/11)
     intron_pos      388:0 (6/11)
     intron_pos      412:1 (7/11)
     intron_pos      471:1 (8/11)
     intron_pos      513:0 (9/11)
     intron_pos      547:0 (10/11)
     intron_pos      579:2 (11/11)
BEGIN
        1 MALPSISSIG ATFSLLIYLS LPYSITAGEN NLLHQSPTAR SRRPMVFPLF LSQPNSSSRS
       61 ISIPHRKLHK SDSKSLPHSR MRLYDDLLIN GYYTTRLWIG TPPQMFALIV DSGSTVTYVP
      121 CSDCEQCGKH QDPKFQPEMS STYQPVKCNM DCNCDDDREQ CVYEREYAEH SSSKGVLGED
      181 LISFGNESQL TPQRAVFGCE TVETGDLYSQ RADGIIGLGQ GDLSLVDQLV DKGLISNSFG
      241 LCYGGMDVGG GSMILGGFDY PSDMVFTDSD PDRSPYYNID LTGIRVAGKQ LSLHSRVFDG
      301 EHGAVLDSGT TYAYLPDAAF AAFEEAVMRE VSTLKQIDGP DPNFKDTCFQ VAASNYVSEL
      361 SKIFPSVEMV FKSGQSWLLS PENYMFRHSK VHGAYCLGVF PNGKDHTTLL GGIVVRNTLV
      421 VYDRENSKVG FWRTNCSELS DRLHIDGAPP PATLPSNDSN PSHNSSSNLS GVTQVGQINL
      481 DIQLTVNSSY LKPRIEDLSK IFSKELDVKS SQVSLSNLTS KGNESLVRMV VLPPEPSTWF
      541 SNVTATNIVS RFTNHQIKLP EIFGNYQLVN YKLEPPRKRT NNNIVVIAIG IIAVIVGLSA
      601 YGAWLIWKRK QTSIPYKPVD EAIVAEQELQ PI
//