LOCUS AEC06532.1 848 aa PRT PLN 23-MAR-2023
DEFINITION Arabidopsis thaliana glycosyl hydrolase family 35 protein protein.
ACCESSION CP002685-1546
PROTEIN_ID AEC06532.1
SOURCE Arabidopsis thaliana (thale cress)
ORGANISM Arabidopsis thaliana
Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
Camelineae; Arabidopsis.
REFERENCE 1 (bases 1 to 19698289)
AUTHORS Lin,X., Kaul,S., Rounsley,S., Shea,T.P., Benito,M.I., Town,C.D.,
Fujii,C.Y., Mason,T., Bowman,C.L., Barnstead,M., Feldblyum,T.V.,
Buell,C.R., Ketchum,K.A., Lee,J., Ronning,C.M., Koo,H.L.,
Moffat,K.S., Cronin,L.A., Shen,M., Pai,G., Van Aken,S., Umayam,L.,
Tallon,L.J., Gill,J.E., Adams,M.D., Carrera,A.J., Creasy,T.H.,
Goodman,H.M., Somerville,C.R., Copenhaver,G.P., Preuss,D.,
Nierman,W.C., White,O., Eisen,J.A., Salzberg,S.L., Fraser,C.M. and
Venter,J.C.
TITLE Sequence and analysis of chromosome 2 of the plant Arabidopsis
thaliana
JOURNAL Nature 402 (6763), 761-768 (1999)
PUBMED 10617197
REFERENCE 2 (bases 1 to 19698289)
AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
CONSRTM TAIR
TITLE Direct Submission
JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE 3 (bases 1 to 19698289)
AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
Vaughn,M. and Town,C.D.
TITLE Direct Submission
JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
9704 Medical Center Dr, Rockville, MD 20850, USA
REMARK Protein update by submitter
FEATURES Qualifiers
source /organism="Arabidopsis thaliana"
/mol_type="genomic DNA"
/db_xref="taxon:3702"
/chromosome="2"
/ecotype="Columbia"
protein /locus_tag="AT2G16730"
/gene_synonym="beta-galactosidase 13"
/gene_synonym="BGAL13"
/gene_synonym="T24I21.14"
/gene_synonym="T24I21_14"
/inference="Similar to RNA sequence,
EST:INSD:AU228003.1,INSD:BP562897.2"
/inference="similar to RNA sequence,
mRNA:INSD:BT004177.1,INSD:AJ270309.1"
/note="glycosyl hydrolase family 35 protein; FUNCTIONS IN:
beta-galactosidase activity; INVOLVED IN: lactose
catabolic process, using glucoside 3-dehydrogenase,
carbohydrate metabolic process, lactose catabolic process
via UDP-galactose, lactose catabolic process; LOCATED IN:
endomembrane system; EXPRESSED IN: 14 plant structures;
EXPRESSED DURING: L mature pollen stage, M germinated
pollen stage, 4 anthesis, C globular stage, petal
differentiation and expansion stage; CONTAINS InterPro
DOMAIN/s: Glycoside hydrolase, family 35, conserved site
(InterPro:IPR019801), Glycoside hydrolase family 2,
carbohydrate-binding (InterPro:IPR006104), Glycoside
hydrolase, family 35 (InterPro:IPR001944),
D-galactoside/L-rhamnose binding SUEL lectin
(InterPro:IPR000922), Glycoside hydrolase, catalytic core
(InterPro:IPR017853), Glycoside hydrolase, subgroup,
catalytic core (InterPro:IPR013781), Galactose-binding
domain-like (InterPro:IPR008979); BEST Arabidopsis
thaliana protein match is: beta-galactosidase 11
(TAIR:AT4G35010.1); Has 2592 Blast hits to 2188 proteins
in 502 species: Archae - 15; Bacteria - 1202; Metazoa -
413; Fungi - 220; Plants - 636; Viruses - 0; Other
Eukaryotes - 106 (source: NCBI BLink)."
/db_xref="Araport:AT2G16730"
/db_xref="TAIR:AT2G16730"
intron_pos 74:0 (1/17)
intron_pos 106:0 (2/17)
intron_pos 143:2 (3/17)
intron_pos 166:0 (4/17)
intron_pos 197:0 (5/17)
intron_pos 245:0 (6/17)
intron_pos 276:2 (7/17)
intron_pos 312:0 (8/17)
intron_pos 340:1 (9/17)
intron_pos 380:0 (10/17)
intron_pos 436:0 (11/17)
intron_pos 494:2 (12/17)
intron_pos 531:1 (13/17)
intron_pos 568:0 (14/17)
intron_pos 605:0 (15/17)
intron_pos 640:0 (16/17)
intron_pos 805:0 (17/17)
BEGIN
1 MKIHSSDHSW LLLAVLVILL SFSGALSSDD KEKKTKSVDK KKEVTYDGTS LIINGNRELL
61 YSGSIHYPRS TPEMWPNIIK RAKQGGLNTI QTYVFWNVHE PEQGKFNFSG RADLVKFIKL
121 IEKNGLYVTL RLGPFIQAEW THGGLPYWLR EVPGIFFRTD NEPFKEHTER YVKVVLDMMK
181 EEKLFASQGG PIILGQIENE YSAVQRAYKE DGLNYIKWAS KLVHSMDLGI PWVMCKQNDA
241 PDPMINACNG RHCGDTFPGP NKDNKPSLWT ENWTTQFRVF GDPPAQRSVE DIAYSVARFF
301 SKNGTHVNYY MYHGGTNFGR TSAHYVTTRY YDDAPLDEFG LEREPKYGHL KHLHNALNLC
361 KKALLWGQPR VEKPSNETEI RYYEQPGTKV CAAFLANNNT EAAEKIKFRG KEYLIPHRSI
421 SILPDCKTVV YNTGEIISHH TSRNFMKSKK ANKNFDFKVF TESVPSKIKG DSFIPVELYG
481 LTKDESDYGW YTTSFKIDDN DLSKKKGGKP NLRIASLGHA LHVWLNGEYL GNGHGSHEEK
541 SFVFQKPVTL KEGENHLTML GVLTGFPDSG SYMEHRYTGP RSVSILGLGS GTLDLTEENK
601 WGNKVGMEGE RLGIHAEEGL KKVKWEKASG KEPGMTWYQT YFDAPESQSA AAIRMNGMGK
661 GLIWVNGEGV GRYWMSFLSP LGQPTQIEYH IPRSFLKPKK NLLVIFEEEP NVKPELIDFV
721 IVNRDTVCSY IGENYTPSVR HWTRKNDQVQ AITDDVHLTA NLKCSGTKKI SAVEFASFGN
781 PNGTCGNFTL GSCNAPVSKK VVEKYCLGKA ECVIPVNKST FEQDKKDSCP KVEKKLAVQV
841 KCGRDKKN
//