LOCUS AEE83586.1 766 aa PRT PLN 23-MAR-2023 DEFINITION Arabidopsis thaliana pentacyclic triterpene synthase 1 protein. ACCESSION CP002687-2484 PROTEIN_ID AEE83586.1 SOURCE Arabidopsis thaliana (thale cress) ORGANISM Arabidopsis thaliana Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis. REFERENCE 1 (bases 1 to 18585056) AUTHORS Mayer,K., Schuller,C., Wambutt,R., Murphy,G., Volckaert,G., Pohl,T., Dusterhoft,A., Stiekema,W., Entian,K.D., Terryn,N., Harris,B., Ansorge,W., Brandt,P., Grivell,L., Rieger,M., Weichselgartner,M., de Simone,V., Obermaier,B., Mache,R., Muller,M., Kreis,M., Delseny,M., Puigdomenech,P., Watson,M., Schmidtheini,T., Reichert,B., Portatelle,D., Perez-Alonso,M., Boutry,M., Bancroft,I., Vos,P., Hoheisel,J., Zimmermann,W., Wedler,H., Ridley,P., Langham,S.A., McCullagh,B., Bilham,L., Robben,J., Van der Schueren,J., Grymonprez,B., Chuang,Y.J., Vandenbussche,F., Braeken,M., Weltjens,I., Voet,M., Bastiaens,I., Aert,R., Defoor,E., Weitzenegger,T., Bothe,G., Ramsperger,U., Hilbert,H., Braun,M., Holzer,E., Brandt,A., Peters,S., van Staveren,M., Dirske,W., Mooijman,P., Klein Lankhorst,R., Rose,M., Hauf,J., Kotter,P., Berneiser,S., Hempel,S., Feldpausch,M., Lamberth,S., Van den Daele,H., De Keyser,A., Buysshaert,C., Gielen,J., Villarroel,R., De Clercq,R., Van Montagu,M., Rogers,J., Cronin,A., Quail,M., Bray-Allen,S., Clark,L., Doggett,J., Hall,S., Kay,M., Lennard,N., McLay,K., Mayes,R., Pettett,A., Rajandream,M.A., Lyne,M., Benes,V., Rechmann,S., Borkova,D., Blocker,H., Scharfe,M., Grimm,M., Lohnert,T.H., Dose,S., de Haan,M., Maarse,A., Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M., Fartmann,B., Granderath,K., Dauner,D., Herzl,A., Neumann,S., Argiriou,A., Vitale,D., Liguori,R., Piravandi,E., Massenet,O., Quigley,F., Clabauld,G., Mundlein,A., Felber,R., Schnabl,S., Hiller,R., Schmidt,W., Lecharny,A., Aubourg,S., Chefdor,F., Cooke,R., Berger,C., Montfort,A., Casacuberta,E., Gibbons,T., Weber,N., Vandenbol,M., Bargues,M., Terol,J., Torres,A., Perez-Perez,A., Purnelle,B., Bent,E., Johnson,S., Tacon,D., Jesse,T., Heijnen,L., Schwarz,S., Scholler,P., Heber,S., Francs,P., Bielke,C., Frishman,D., Haase,D., Lemcke,K., Mewes,H.W., Stocker,S., Zaccaria,P., Bevan,M., Wilson,R.K., de la Bastide,M., Habermann,K., Parnell,L., Dedhia,N., Gnoj,L., Schutz,K., Huang,E., Spiegel,L., Sehkon,M., Murray,J., Sheet,P., Cordes,M., Abu-Threideh,J., Stoneking,T., Kalicki,J., Graves,T., Harmon,G., Edwards,J., Latreille,P., Courtney,L., Cloud,J., Abbott,A., Scott,K., Johnson,D., Minx,P., Bentley,D., Fulton,B., Miller,N., Greco,T., Kemp,K., Kramer,J., Fulton,L., Mardis,E., Dante,M., Pepin,K., Hillier,L., Nelson,J., Spieth,J., Ryan,E., Andrews,S., Geisel,C., Layman,D., Du,H., Ali,J., Berghoff,A., Jones,K., Drone,K., Cotton,M., Joshu,C., Antonoiu,B., Zidanic,M., Strong,C., Sun,H., Lamar,B., Yordan,C., Ma,P., Zhong,J., Preston,R., Vil,D., Shekher,M., Matero,A., Shah,R., Swaby,I.K., O'Shaughnessy,A., Rodriguez,M., Hoffmann,J., Till,S., Granat,S., Shohdy,N., Hasegawa,A., Hameed,A., Lodhi,M., Johnson,A., Chen,E., Marra,M., Martienssen,R. and McCombie,W.R. TITLE Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana JOURNAL Nature 402 (6763), 769-777 (1999) PUBMED 10617198 REFERENCE 2 (bases 1 to 18585056) AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E. CONSRTM TAIR TITLE Direct Submission JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie Institution, 260 Panama Street, Stanford, CA, USA REFERENCE 3 (bases 1 to 18585056) AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M., Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R., Vaughn,M. and Town,C.D. TITLE Direct Submission JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute, 9704 Medical Center Dr, Rockville, MD 20850, USA REMARK Protein update by submitter FEATURES Qualifiers source /organism="Arabidopsis thaliana" /mol_type="genomic DNA" /db_xref="taxon:3702" /chromosome="4" /ecotype="Columbia" protein /gene="PEN1" /locus_tag="AT4G15340" /gene_synonym="04C11" /gene_synonym="ATPEN1" /gene_synonym="DL3715C" /gene_synonym="FCAALL.158" /gene_synonym="PENTACYCLIC TRITERPENE SYNTHASE" /gene_synonym="pentacyclic triterpene synthase 1" /inference="Similar to RNA sequence, EST:INSD:AV540608.1,INSD:AI998912.1" /note="pentacyclic triterpene synthase 1 (PEN1); CONTAINS InterPro DOMAIN/s: Terpene synthase, conserved site (InterPro:IPR002365), Terpenoid cylases/protein prenyltransferase alpha-alpha toroid (InterPro:IPR008930), Squalene cyclase (InterPro:IPR018333), Prenyltransferase/squalene oxidase (InterPro:IPR001330); BEST Arabidopsis thaliana protein match is: baruol synthase 1 (TAIR:AT4G15370.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink)." /db_xref="TAIR:AT4G15340" /db_xref="Araport:AT4G15340" intron_pos 69:0 (1/13) intron_pos 131:0 (2/13) intron_pos 161:0 (3/13) intron_pos 229:0 (4/13) intron_pos 257:1 (5/13) intron_pos 313:0 (6/13) intron_pos 377:0 (7/13) intron_pos 415:0 (8/13) intron_pos 499:0 (9/13) intron_pos 532:0 (10/13) intron_pos 551:0 (11/13) intron_pos 566:2 (12/13) intron_pos 729:0 (13/13) BEGIN 1 MWRLRIGAKA GNDTHLFTTN NYVGRQIWEF DANAGSPQEL AEVEEARRNF SNNRSHYKAS 61 ADLLWRMQFL REKGFEQKIP RVRVEDAAKI RYEDAKTALK RGLHYFTALQ ADDGHWPADN 121 SGPNFFIAPL VICLYITGHL EKIFTVEHRI ELIRYMYNHQ NEDGGWGLHV ESPSIMFCTV 181 INYICLRIVG VEAGHDDDQG STCTKARKWI LDHGGATYTP LIGKACLSVL GVYDWSGCKP 241 MPPEFWFLPS SFPINGGTLW IYLRDIFMGL SYLYGKKFVA TPTPLILQLQ EELYPEPYTK 301 INWRLTRNRC AKEDLCYPSS FLQDLFWKGV HIFSESILNR WPFNKLIRQA ALRTTMKLLH 361 YQDEANRYIT GGSVPKAFHM LACWVEDPEG EYFKKHLARV SDFIWIGEDG LKIQSFGSQL 421 WDTVMSLHFL LDGVEDDVDD EIRSTLVKGY DYLKKSQVTE NPPSDHIKMF RHISKGGWTF 481 SDKDQGWPVS DCTAESLKCC LLFERMPSEF VGQKMDVEKL FDAVDFLLYL QSDNGGITAW 541 EPADGKTWLE WFSPVEFVQD TVIEHEYVEC TGSAIVALTQ FSKQFPEFRK KEVERFITNG 601 VKYIEDLQMK DGSWCGNWGV CFIYGTLFAV RGLVAAGKTF HNCEPIRRAV RFLLDTQNQE 661 GGWGESYLSC LRKKYTPLAG NKTNIVSTGQ ALMVLIMGGQ MERDPLPVHR AAKVVINLQL 721 DNGDFPQQEV MGVFNMNVLL HYPTYRNIYS LWALTLYTQA LRRLQP //