LOCUS AEC08753.1 613 aa PRT PLN 23-MAR-2023
DEFINITION Arabidopsis thaliana beta glucosidase 33 protein.
ACCESSION CP002685-4608
PROTEIN_ID AEC08753.1
SOURCE Arabidopsis thaliana (thale cress)
ORGANISM Arabidopsis thaliana
Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
Camelineae; Arabidopsis.
REFERENCE 1 (bases 1 to 19698289)
AUTHORS Lin,X., Kaul,S., Rounsley,S., Shea,T.P., Benito,M.I., Town,C.D.,
Fujii,C.Y., Mason,T., Bowman,C.L., Barnstead,M., Feldblyum,T.V.,
Buell,C.R., Ketchum,K.A., Lee,J., Ronning,C.M., Koo,H.L.,
Moffat,K.S., Cronin,L.A., Shen,M., Pai,G., Van Aken,S., Umayam,L.,
Tallon,L.J., Gill,J.E., Adams,M.D., Carrera,A.J., Creasy,T.H.,
Goodman,H.M., Somerville,C.R., Copenhaver,G.P., Preuss,D.,
Nierman,W.C., White,O., Eisen,J.A., Salzberg,S.L., Fraser,C.M. and
Venter,J.C.
TITLE Sequence and analysis of chromosome 2 of the plant Arabidopsis
thaliana
JOURNAL Nature 402 (6763), 761-768 (1999)
PUBMED 10617197
REFERENCE 2 (bases 1 to 19698289)
AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
CONSRTM TAIR
TITLE Direct Submission
JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE 3 (bases 1 to 19698289)
AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
Vaughn,M. and Town,C.D.
TITLE Direct Submission
JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
9704 Medical Center Dr, Rockville, MD 20850, USA
REMARK Protein update by submitter
FEATURES Qualifiers
source /organism="Arabidopsis thaliana"
/mol_type="genomic DNA"
/db_xref="taxon:3702"
/chromosome="2"
/ecotype="Columbia"
protein /gene="BGLU33"
/locus_tag="AT2G32860"
/gene_synonym="beta glucosidase 33"
/gene_synonym="T21L14.20"
/gene_synonym="T21L14_20"
/inference="Similar to RNA sequence,
EST:INSD:EL256646.1,INSD:EL050128.1,INSD:EL113981.1,
INSD:EL304568.1,INSD:BP779996.1,INSD:EL049209.1,
INSD:BP783343.1,INSD:AV826710.1,INSD:EH948939.1,
INSD:EL025210.1,INSD:EL079774.1,INSD:BP783665.1,
INSD:EH937690.1,INSD:BP789006.1,INSD:EL009960.1,
INSD:BP788235.1,INSD:AV796103.1,INSD:EH891987.1"
/inference="similar to RNA sequence,
mRNA:INSD:AK226866.1,INSD:BX819346.1,INSD:AF083694.1"
/note="beta glucosidase 33 (BGLU33); FUNCTIONS IN: cation
binding, hydrolase activity, hydrolyzing O-glycosyl
compounds, catalytic activity; INVOLVED IN: carbohydrate
metabolic process; LOCATED IN: endomembrane system;
EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 10
growth stages; CONTAINS InterPro DOMAIN/s: Glycoside
hydrolase, family 1 (InterPro:IPR001360), Glycoside
hydrolase, family 1, active site (InterPro:IPR018120),
Glycoside hydrolase, catalytic core (InterPro:IPR017853),
Glycoside hydrolase, subgroup, catalytic core
(InterPro:IPR013781); BEST Arabidopsis thaliana protein
match is: Glycosyl hydrolase superfamily protein
(TAIR:AT3G60140.1); Has 11175 Blast hits to 10849 proteins
in 1458 species: Archae - 144; Bacteria - 7682; Metazoa -
711; Fungi - 199; Plants - 1439; Viruses - 0; Other
Eukaryotes - 1000 (source: NCBI BLink)."
/db_xref="Araport:AT2G32860"
/db_xref="TAIR:AT2G32860"
intron_pos 114:0 (1/12)
intron_pos 137:1 (2/12)
intron_pos 157:0 (3/12)
intron_pos 182:1 (4/12)
intron_pos 208:1 (5/12)
intron_pos 237:2 (6/12)
intron_pos 323:0 (7/12)
intron_pos 363:2 (8/12)
intron_pos 436:1 (9/12)
intron_pos 446:0 (10/12)
intron_pos 480:1 (11/12)
intron_pos 516:2 (12/12)
BEGIN
1 MATATLTLFL GLLALTSTIL SFNADARPQP SDEDLGTIIG PHQTSFDDEI GIVIGPHATV
61 DDEDIDMDMG TTVGPQTNLN DDDLGTIIGP EFEIHKQDFP ADFIFGTSVS AYQVEGAKKG
121 SGRGLTSWDE FTHMFPEKVQ QNGDGDEGVD FYTRYKDDIK LMKELNTNGF RFSISWTRIL
181 PYGTIKKGVN EEGVKFYNDL INELLANGIQ PSVTLFHWES PLALEMEYGG FLNERIVEDF
241 REFANFCFKE FGDRVKNWAT FNEPSVYSVA GYSKGKKAPG RCSKWQAPKC PTGDSSEEPY
301 IVAHNQILAH LAAVDEFRNC KKCQEGGGKI GIVLVSHWFE PKDPNSSEDV KAARRSLEYQ
361 LGWFLRPLTY GQYPAEMLED VNIRLREFTP EESEKLRKSL DFVGLNYYGA FFSTPLAKVN
421 SSQLNYETDL RVNWTDSQNN SPHLKTTSMG IVIYPAGLKN ILKHIKDEYM DPEIYIMENG
481 MDEIDYGTKN ITEATNDYGR KEFIKSHILI MGKSIRMDKV RLKGYYIWSL MDNFEWDKGY
541 KVRFGLYYVD YNDNMKRYIR SSGKWLSEFL DSKETLHKCY FEGHREKGYA PKLFDVEYLE
601 PENSQLSYRS DFM
//