LOCUS AEC08754.1 614 aa PRT PLN 23-MAR-2023 DEFINITION Arabidopsis thaliana beta glucosidase 33 protein. ACCESSION CP002685-4607 PROTEIN_ID AEC08754.1 SOURCE Arabidopsis thaliana (thale cress) ORGANISM Arabidopsis thaliana Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis. REFERENCE 1 (bases 1 to 19698289) AUTHORS Lin,X., Kaul,S., Rounsley,S., Shea,T.P., Benito,M.I., Town,C.D., Fujii,C.Y., Mason,T., Bowman,C.L., Barnstead,M., Feldblyum,T.V., Buell,C.R., Ketchum,K.A., Lee,J., Ronning,C.M., Koo,H.L., Moffat,K.S., Cronin,L.A., Shen,M., Pai,G., Van Aken,S., Umayam,L., Tallon,L.J., Gill,J.E., Adams,M.D., Carrera,A.J., Creasy,T.H., Goodman,H.M., Somerville,C.R., Copenhaver,G.P., Preuss,D., Nierman,W.C., White,O., Eisen,J.A., Salzberg,S.L., Fraser,C.M. and Venter,J.C. TITLE Sequence and analysis of chromosome 2 of the plant Arabidopsis thaliana JOURNAL Nature 402 (6763), 761-768 (1999) PUBMED 10617197 REFERENCE 2 (bases 1 to 19698289) AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E. CONSRTM TAIR TITLE Direct Submission JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie Institution, 260 Panama Street, Stanford, CA, USA REFERENCE 3 (bases 1 to 19698289) AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M., Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R., Vaughn,M. and Town,C.D. TITLE Direct Submission JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute, 9704 Medical Center Dr, Rockville, MD 20850, USA REMARK Protein update by submitter FEATURES Qualifiers source /organism="Arabidopsis thaliana" /mol_type="genomic DNA" /db_xref="taxon:3702" /chromosome="2" /ecotype="Columbia" protein /gene="BGLU33" /locus_tag="AT2G32860" /gene_synonym="beta glucosidase 33" /gene_synonym="T21L14.20" /gene_synonym="T21L14_20" /inference="Similar to RNA sequence, EST:INSD:EL256646.1,INSD:EL050128.1,INSD:EL113981.1, INSD:EL304568.1,INSD:BP779996.1,INSD:EL049209.1, INSD:BP783343.1,INSD:EH885141.1,INSD:AV826710.1, INSD:EH948939.1,INSD:EL079774.1,INSD:BP783665.1, INSD:EH937690.1,INSD:BP789006.1,INSD:EL009960.1, INSD:BP788235.1,INSD:AV796103.1,INSD:EH891987.1" /inference="similar to RNA sequence, mRNA:INSD:BX819346.1" /note="beta glucosidase 33 (BGLU33); FUNCTIONS IN: cation binding, hydrolase activity, hydrolyzing O-glycosyl compounds, catalytic activity; INVOLVED IN: carbohydrate metabolic process; LOCATED IN: endomembrane system; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 10 growth stages; CONTAINS InterPro DOMAIN/s: Glycoside hydrolase, family 1 (InterPro:IPR001360), Glycoside hydrolase, family 1, active site (InterPro:IPR018120), Glycoside hydrolase, catalytic core (InterPro:IPR017853), Glycoside hydrolase, subgroup, catalytic core (InterPro:IPR013781); BEST Arabidopsis thaliana protein match is: Glycosyl hydrolase superfamily protein (TAIR:AT3G60140.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink)." /db_xref="Araport:AT2G32860" /db_xref="TAIR:AT2G32860" intron_pos 114:0 (1/11) intron_pos 137:1 (2/11) intron_pos 157:0 (3/11) intron_pos 182:1 (4/11) intron_pos 208:1 (5/11) intron_pos 237:2 (6/11) intron_pos 324:1 (7/11) intron_pos 362:2 (8/11) intron_pos 435:1 (9/11) intron_pos 481:1 (10/11) intron_pos 517:2 (11/11) BEGIN 1 MATATLTLFL GLLALTSTIL SFNADARPQP SDEDLGTIIG PHQTSFDDEI GIVIGPHATV 61 DDEDIDMDMG TTVGPQTNLN DDDLGTIIGP EFEIHKQDFP ADFIFGTSVS AYQVEGAKKG 121 SGRGLTSWDE FTHMFPEKVQ QNGDGDEGVD FYTRYKDDIK LMKELNTNGF RFSISWTRIL 181 PYGTIKKGVN EEGVKFYNDL INELLANGIQ PSVTLFHWES PLALEMEYGG FLNERIVEDF 241 REFANFCFKE FGDRVKNWAT FNEPSVYSVA GYSKGKKAPG RCSKWQAPKC PTGDSSEEPY 301 IVAHNQILAH LAAVDEFRNC KKVEGGGKIG IVLVSHWFEP KDPNSSEDVK AARRSLEYQL 361 GWFLRPLTYG QYPAEMLEDV NIRLREFTPE ESEKLRKSLD FVGLNYYGAF FSTPLAKVNS 421 SQLNYETDLR VNWTVITNNL SLPDLQTTSM GIVIYPAGLK NILKHIKDEY MDPEIYIMEN 481 GMDEIDYGTK NITEATNDYG RKEFIKSHIL IMGKSIRMDK VRLKGYYIWS LMDNFEWDKG 541 YKVRFGLYYV DYNDNMKRYI RSSGKWLSEF LDSKETLHKC YFEGHREKGY APKLFDVEYL 601 EPENSQLSYR SDFM //