LOCUS       AEE84368.1              1380 aa    PRT              PLN 23-MAR-2023
DEFINITION  Arabidopsis thaliana tripeptidyl peptidase ii protein.
ACCESSION   CP002687-3635
PROTEIN_ID  AEE84368.1
SOURCE      Arabidopsis thaliana (thale cress)
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
            Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
            Camelineae; Arabidopsis.
REFERENCE   1  (bases 1 to 18585056)
  AUTHORS   Mayer,K., Schuller,C., Wambutt,R., Murphy,G., Volckaert,G.,
            Pohl,T., Dusterhoft,A., Stiekema,W., Entian,K.D., Terryn,N.,
            Harris,B., Ansorge,W., Brandt,P., Grivell,L., Rieger,M.,
            Weichselgartner,M., de Simone,V., Obermaier,B., Mache,R.,
            Muller,M., Kreis,M., Delseny,M., Puigdomenech,P., Watson,M.,
            Schmidtheini,T., Reichert,B., Portatelle,D., Perez-Alonso,M.,
            Boutry,M., Bancroft,I., Vos,P., Hoheisel,J., Zimmermann,W.,
            Wedler,H., Ridley,P., Langham,S.A., McCullagh,B., Bilham,L.,
            Robben,J., Van der Schueren,J., Grymonprez,B., Chuang,Y.J.,
            Vandenbussche,F., Braeken,M., Weltjens,I., Voet,M., Bastiaens,I.,
            Aert,R., Defoor,E., Weitzenegger,T., Bothe,G., Ramsperger,U.,
            Hilbert,H., Braun,M., Holzer,E., Brandt,A., Peters,S., van
            Staveren,M., Dirske,W., Mooijman,P., Klein Lankhorst,R., Rose,M.,
            Hauf,J., Kotter,P., Berneiser,S., Hempel,S., Feldpausch,M.,
            Lamberth,S., Van den Daele,H., De Keyser,A., Buysshaert,C.,
            Gielen,J., Villarroel,R., De Clercq,R., Van Montagu,M., Rogers,J.,
            Cronin,A., Quail,M., Bray-Allen,S., Clark,L., Doggett,J., Hall,S.,
            Kay,M., Lennard,N., McLay,K., Mayes,R., Pettett,A.,
            Rajandream,M.A., Lyne,M., Benes,V., Rechmann,S., Borkova,D.,
            Blocker,H., Scharfe,M., Grimm,M., Lohnert,T.H., Dose,S., de
            Haan,M., Maarse,A., Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M.,
            Fartmann,B., Granderath,K., Dauner,D., Herzl,A., Neumann,S.,
            Argiriou,A., Vitale,D., Liguori,R., Piravandi,E., Massenet,O.,
            Quigley,F., Clabauld,G., Mundlein,A., Felber,R., Schnabl,S.,
            Hiller,R., Schmidt,W., Lecharny,A., Aubourg,S., Chefdor,F.,
            Cooke,R., Berger,C., Montfort,A., Casacuberta,E., Gibbons,T.,
            Weber,N., Vandenbol,M., Bargues,M., Terol,J., Torres,A.,
            Perez-Perez,A., Purnelle,B., Bent,E., Johnson,S., Tacon,D.,
            Jesse,T., Heijnen,L., Schwarz,S., Scholler,P., Heber,S., Francs,P.,
            Bielke,C., Frishman,D., Haase,D., Lemcke,K., Mewes,H.W.,
            Stocker,S., Zaccaria,P., Bevan,M., Wilson,R.K., de la Bastide,M.,
            Habermann,K., Parnell,L., Dedhia,N., Gnoj,L., Schutz,K., Huang,E.,
            Spiegel,L., Sehkon,M., Murray,J., Sheet,P., Cordes,M.,
            Abu-Threideh,J., Stoneking,T., Kalicki,J., Graves,T., Harmon,G.,
            Edwards,J., Latreille,P., Courtney,L., Cloud,J., Abbott,A.,
            Scott,K., Johnson,D., Minx,P., Bentley,D., Fulton,B., Miller,N.,
            Greco,T., Kemp,K., Kramer,J., Fulton,L., Mardis,E., Dante,M.,
            Pepin,K., Hillier,L., Nelson,J., Spieth,J., Ryan,E., Andrews,S.,
            Geisel,C., Layman,D., Du,H., Ali,J., Berghoff,A., Jones,K.,
            Drone,K., Cotton,M., Joshu,C., Antonoiu,B., Zidanic,M., Strong,C.,
            Sun,H., Lamar,B., Yordan,C., Ma,P., Zhong,J., Preston,R., Vil,D.,
            Shekher,M., Matero,A., Shah,R., Swaby,I.K., O'Shaughnessy,A.,
            Rodriguez,M., Hoffmann,J., Till,S., Granat,S., Shohdy,N.,
            Hasegawa,A., Hameed,A., Lodhi,M., Johnson,A., Chen,E., Marra,M.,
            Martienssen,R. and McCombie,W.R.
  TITLE     Sequence and analysis of chromosome 4 of the plant Arabidopsis
            thaliana
  JOURNAL   Nature 402 (6763), 769-777 (1999)
   PUBMED   10617198
REFERENCE   2  (bases 1 to 18585056)
  AUTHORS   Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
  CONSRTM   TAIR
  TITLE     Direct Submission
  JOURNAL   Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
            Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE   3  (bases 1 to 18585056)
  AUTHORS   Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
            Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
            Vaughn,M. and Town,C.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
            9704 Medical Center Dr, Rockville, MD 20850, USA
  REMARK    Protein update by submitter
FEATURES             Qualifiers
     source          /organism="Arabidopsis thaliana"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:3702"
                     /chromosome="4"
                     /ecotype="Columbia"
     protein         /gene="TPP2"
                     /locus_tag="AT4G20850"
                     /gene_synonym="T13K14.10"
                     /gene_synonym="T13K14_10"
                     /gene_synonym="tripeptidyl peptidase ii"
                     /inference="Similar to RNA sequence,
                     EST:INSD:AV823426.1,INSD:N38092.1,INSD:W43747.1,
                     INSD:EH868919.1,INSD:W43589.1,INSD:EH827729.1,
                     INSD:ES182186.1,INSD:AV784385.1,INSD:BE527662.1,
                     INSD:EH990283.1,INSD:AV558901.1,INSD:EH972921.1,
                     INSD:T42395.1,INSD:BE523893.1,INSD:BE528153.1,
                     INSD:EH837097.1,INSD:ES084123.1,INSD:BP795363.1,
                     INSD:EL026431.1,INSD:EL136171.1,INSD:EL223734.1,
                     INSD:EH846819.1,INSD:EL176641.1,INSD:EL117864.1,
                     INSD:EL270607.1,INSD:CB259236.1,INSD:EL113155.1,
                     INSD:AI993714.1,INSD:EG482686.1,INSD:EH965909.1,
                     INSD:EH934026.1,INSD:EH897147.1,INSD:EG444974.1,
                     INSD:BP611572.1,INSD:AV798377.1,INSD:EG445360.1,
                     INSD:AV548713.1,INSD:EL088508.1,INSD:ES166485.1,
                     INSD:EH960355.1,INSD:EH799734.1,INSD:EL001648.1,
                     INSD:N96678.1,INSD:AV563686.1,INSD:EH800166.1,
                     INSD:EL327118.1,INSD:EH874302.1,INSD:EH811962.1,
                     INSD:ES130993.1,INSD:EH883915.1,INSD:EG429201.1,
                     INSD:N38093.1,INSD:AV546527.1,INSD:EL111548.1,
                     INSD:R65233.1,INSD:DR383318.1,INSD:EH810530.1,
                     INSD:EL191361.1,INSD:EL313487.1,INSD:ES071948.1,
                     INSD:EG482675.1,INSD:EH881760.1,INSD:ES106931.1,
                     INSD:BE524673.1,INSD:EL119175.1,INSD:EL184234.1,
                     INSD:BE528366.1"
                     /inference="Similar to RNA sequence, mRNA:INSD:AY096651.1"
                     /note="tripeptidyl peptidase ii (TPP2); FUNCTIONS IN:
                     tripeptidyl-peptidase activity; INVOLVED IN: proteolysis;
                     LOCATED IN: in 6 components; EXPRESSED IN: 25 plant
                     structures; EXPRESSED DURING: 15 growth stages; CONTAINS
                     InterPro DOMAIN/s: Peptidase S8/S53,
                     subtilisin/kexin/sedolisin (InterPro:IPR000209), Peptidase
                     S8, subtilisin-related (InterPro:IPR015500), Peptidase
                     S8/S53, subtilisin, active site (InterPro:IPR022398),
                     Peptidase S8A, tripeptidyl peptidase II
                     (InterPro:IPR022229); Has 6394 Blast hits to 6195 proteins
                     in 1270 species: Archae - 226; Bacteria - 4362; Metazoa -
                     666; Fungi - 272; Plants - 126; Viruses - 0; Other
                     Eukaryotes - 742 (source: NCBI BLink)."
                     /db_xref="TAIR:AT4G20850"
                     /db_xref="Araport:AT4G20850"
     intron_pos      145:1 (1/33)
     intron_pos      171:2 (2/33)
     intron_pos      196:1 (3/33)
     intron_pos      233:0 (4/33)
     intron_pos      260:0 (5/33)
     intron_pos      289:0 (6/33)
     intron_pos      336:2 (7/33)
     intron_pos      387:0 (8/33)
     intron_pos      428:0 (9/33)
     intron_pos      456:0 (10/33)
     intron_pos      518:2 (11/33)
     intron_pos      577:0 (12/33)
     intron_pos      617:2 (13/33)
     intron_pos      644:1 (14/33)
     intron_pos      664:0 (15/33)
     intron_pos      715:2 (16/33)
     intron_pos      775:1 (17/33)
     intron_pos      813:0 (18/33)
     intron_pos      870:0 (19/33)
     intron_pos      913:0 (20/33)
     intron_pos      946:2 (21/33)
     intron_pos      986:0 (22/33)
     intron_pos      1012:2 (23/33)
     intron_pos      1035:0 (24/33)
     intron_pos      1062:2 (25/33)
     intron_pos      1080:0 (26/33)
     intron_pos      1124:0 (27/33)
     intron_pos      1148:0 (28/33)
     intron_pos      1179:0 (29/33)
     intron_pos      1211:0 (30/33)
     intron_pos      1244:0 (31/33)
     intron_pos      1275:0 (32/33)
     intron_pos      1329:0 (33/33)
BEGIN
        1 MDLSLQLQIH GALINKGPSC TSYWASSSSL SLPRDFISSS TFLLHRRLRR RSCSRSRGIR
       61 LRRSGFSAMP CSSSDTLTAS RVGCGGGGGG GAVGGGAENA SVANFKLNES TFIASLMPKK
      121 EIRADCFIEA HPEYDGRGVV IAIFDSGFDP SAAGLHVTSD GKPKVLDVID CTGSGDIDTS
      181 TVVKANEDGH IRGASGATLV VNSSWKNPTG EWRVGSKLVY QLFTDDLTSR VKKERRKSWD
      241 EKNQEEIAKA VNNLYDFDQK HSKVEDAKLK KTREDLQSKV DFLKKQADKY EDKGPVIDAV
      301 VWHDGEVWRV ALDTQSLEED PDSGKLADFS PLTNYRIERK YGVFSRLDAC SFVANVYDEG
      361 KVLSIVTDSS PHGTHVAGIA TAHHPEEHLL NGVAPGAQII SCKIGDSRLG SMETGTGLTR
      421 ALIAALEHNC DLVNMSYGEP ALLPDYGRFV DLVTEAVNKR RLIFVSSAGN SGPALTTVGA
      481 PGGTTSSIIG VGAYVSPAMA AGAHSVVEPP SEGLEYTWSS RGPTSDGDLG VCISAPGGAV
      541 APVPTWTLQR RMLMNGTSMA SPSACGAIAL LLSAMKAEGI PVSPYSVRRA LENTSTPVGD
      601 LPEDKLTTGQ GLMQVDKAYE YLKQFQDYPC VFYQIKVNLS GKTIPTSRGI YLREGTACRQ
      661 STEWTIQVDP KFHEGASNLK ELVPFEECLE LHSTDEGVVR VPDYLLLTNN GRGFNVVVDP
      721 TNLGDGVHYF EVYGIDCKAP ERGPLFRIPV TIIIPKTVAN QPPVISFQQM SFISGHIERR
      781 YIEVPHGATW AEATMRTSGF DTTRRFYIDT LQVCPLRRPI KWESAPTFAS PSAKSFVFPV
      841 VSGQTMELAI AQFWSSGLGS REPTIVDFEI EFHGVGVDKE ELLLDGSEAP IKVEAEALLA
      901 SEKLVPIAVL NKIRVPYQPI DAQLKTLSTG RDRLLSGKQI LALTLTYKFK LEDSAEVKPY
      961 IPLLNNRIYD TKFESQFFMI SDTNKRVYAM GDVYPESSKL PKGEYKLQLY LRHENVELLE
     1021 KLKQLTVFIE RNMGEIRLNL HSEPDGPFTG NGAFKSSVLM PGVKEAFYLG PPTKDKLPKN
     1081 TPQGSMLVGE ISYGKLSFDE KEGKNPKDNP VSYPISYVVP PNKPEEDKKA ASAPTCSKSV
     1141 SERLEQEVRD TKIKFLGNLK QETEEERSEW RKLCTCLKSE YPDYTPLLAK ILEGLLSRSD
     1201 AGDKISHHEE IIEAANEVVR SVDVDELARF LLDKTEPEDD EAEKLKKKME VTRDQLADAL
     1261 YQKGLAMARI ENLKGEKEGE GEEESSQKDK FEENFKELTK WVDVKSSKYG TLTVLREKRL
     1321 SRLGTALKVL DDLIQNENET ANKKLYELKL DLLEEIGWSH LVTYEKQWMQ VRFPKSLPLF
//