LOCUS AEC10425.1 590 aa PRT PLN 23-MAR-2023 DEFINITION Arabidopsis thaliana beta glucosidase 29 protein. ACCESSION CP002685-6875 PROTEIN_ID AEC10425.1 SOURCE Arabidopsis thaliana (thale cress) ORGANISM Arabidopsis thaliana Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis. REFERENCE 1 (bases 1 to 19698289) AUTHORS Lin,X., Kaul,S., Rounsley,S., Shea,T.P., Benito,M.I., Town,C.D., Fujii,C.Y., Mason,T., Bowman,C.L., Barnstead,M., Feldblyum,T.V., Buell,C.R., Ketchum,K.A., Lee,J., Ronning,C.M., Koo,H.L., Moffat,K.S., Cronin,L.A., Shen,M., Pai,G., Van Aken,S., Umayam,L., Tallon,L.J., Gill,J.E., Adams,M.D., Carrera,A.J., Creasy,T.H., Goodman,H.M., Somerville,C.R., Copenhaver,G.P., Preuss,D., Nierman,W.C., White,O., Eisen,J.A., Salzberg,S.L., Fraser,C.M. and Venter,J.C. TITLE Sequence and analysis of chromosome 2 of the plant Arabidopsis thaliana JOURNAL Nature 402 (6763), 761-768 (1999) PUBMED 10617197 REFERENCE 2 (bases 1 to 19698289) AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E. CONSRTM TAIR TITLE Direct Submission JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie Institution, 260 Panama Street, Stanford, CA, USA REFERENCE 3 (bases 1 to 19698289) AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M., Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R., Vaughn,M. and Town,C.D. TITLE Direct Submission JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute, 9704 Medical Center Dr, Rockville, MD 20850, USA REMARK Protein update by submitter FEATURES Qualifiers source /organism="Arabidopsis thaliana" /mol_type="genomic DNA" /db_xref="taxon:3702" /chromosome="2" /ecotype="Columbia" protein /gene="BGLU29" /locus_tag="AT2G44470" /gene_synonym="beta glucosidase 29" /gene_synonym="F4I1.28" /inference="Similar to RNA sequence, EST:INSD:EG434229.1,INSD:EG511424.1,INSD:EG511414.1, INSD:EG511415.1,INSD:EG511431.1,INSD:EG511423.1, INSD:BE522968.1,INSD:EG511429.1,INSD:AU238909.1, INSD:EG511421.1,INSD:EG511425.1,INSD:BE525003.1, INSD:EG511422.1,INSD:EG511418.1,INSD:BE529057.1, INSD:EG511412.1,INSD:EG511426.1,INSD:EG511416.1, INSD:EG511417.1,INSD:EG511428.1,INSD:EG511413.1, INSD:EG511427.1,INSD:EG511420.1" /note="beta glucosidase 29 (BGLU29); FUNCTIONS IN: cation binding, hydrolase activity, hydrolyzing O-glycosyl compounds, catalytic activity; INVOLVED IN: carbohydrate metabolic process; LOCATED IN: endomembrane system; CONTAINS InterPro DOMAIN/s: Glycoside hydrolase, family 1 (InterPro:IPR001360), Glycoside hydrolase, family 1, active site (InterPro:IPR018120), Glycoside hydrolase, catalytic core (InterPro:IPR017853), Glycoside hydrolase, subgroup, catalytic core (InterPro:IPR013781); BEST Arabidopsis thaliana protein match is: beta glucosidase 28 (TAIR:AT2G44460.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink)." /db_xref="Araport:AT2G44470" /db_xref="TAIR:AT2G44470" intron_pos 49:0 (1/12) intron_pos 72:1 (2/12) intron_pos 91:0 (3/12) intron_pos 116:1 (4/12) intron_pos 142:1 (5/12) intron_pos 171:2 (6/12) intron_pos 257:0 (7/12) intron_pos 296:2 (8/12) intron_pos 369:1 (9/12) intron_pos 381:0 (10/12) intron_pos 415:1 (11/12) intron_pos 451:2 (12/12) BEGIN 1 MNVQIFILLL IISWLTPKIT SLPPESQVLD RSSFPDDFVF GTAISAFQSE GATSEGGKSP 61 TIWDYFSHTF PERTNMQNAD VAVDFYHRYK DDIKLIEELN VDAFRFSISW ARLIPSGKVK 121 DGVNKEGVQF YKALIDELIA NGIQPSVTLY HWDHPQALED EYGGFLNPQI IEDFRNFARV 181 CFENFGDKVK MWTTINEPYV ISVAGYDTGI KAVGRCSKWV NSRCQAGDSA IEPYIVSHHL 241 LLSHAAAVQE FRNCNKTLQD GKIGIVISPW WLEPYDSTSS ADKEAVERGL PLELEWHLNP 301 VIYGDYPETM KKHVGNRLPA FTPEQSKMLI NSSDFIGVNY YSIHFTAHLP HIDHTRPRFR 361 TDHHFEKKLI NRSNHETGPG DDRGKIHSHP EGLRRVLNYI KDKYNNPIVY VKENGIDHYD 421 DGTKSRETIL KDTFRISYHQ DHLKQVHKAI IEDGCDVRGY YVWSLFDNFE WEHGYNSRFG 481 MYYVDFKNNL QRYPKDSVNW FKKFLSRPVV RSEETEDEKV CNVSRKEEKI NKALDVSEGF 541 KTSVDSIVNL IKNGSRIEEE DDEEERDFCA FKNHNDQLGF FLKLQNSLGF //