LOCUS       AEE86952.1 1052 aa PRT PLN 23-MAR-2023
DEFINITION  Arabidopsis thaliana beta-galactosidase 14 protein.
ACCESSION   CP002687-7154
PROTEIN_ID  AEE86952.1
SOURCE      Arabidopsis thaliana (thale cress)
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
            Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
            Camelineae; Arabidopsis.
REFERENCE   1 (bases 1 to 18585056)
  AUTHORS   Mayer,K., Schuller,C., Wambutt,R., Murphy,G., Volckaert,G.,
            Pohl,T., Dusterhoft,A., Stiekema,W., Entian,K.D., Terryn,N.,
            Harris,B., Ansorge,W., Brandt,P., Grivell,L., Rieger,M.,
            Weichselgartner,M., de Simone,V., Obermaier,B., Mache,R.,
            Muller,M., Kreis,M., Delseny,M., Puigdomenech,P., Watson,M.,
            Schmidtheini,T., Reichert,B., Portatelle,D., Perez-Alonso,M.,
            Boutry,M., Bancroft,I., Vos,P., Hoheisel,J., Zimmermann,W.,
            Wedler,H., Ridley,P., Langham,S.A., McCullagh,B., Bilham,L.,
            Robben,J., Van der Schueren,J., Grymonprez,B., Chuang,Y.J.,
            Vandenbussche,F., Braeken,M., Weltjens,I., Voet,M., Bastiaens,I.,
            Aert,R., Defoor,E., Weitzenegger,T., Bothe,G., Ramsperger,U.,
            Hilbert,H., Braun,M., Holzer,E., Brandt,A., Peters,S.,
            van Staveren,M., Dirske,W., Mooijman,P., Klein Lankhorst,R.,
            Rose,M., Hauf,J., Kotter,P., Berneiser,S., Hempel,S.,
            Feldpausch,M., Lamberth,S., Van den Daele,H., De Keyser,A.,
            Buysshaert,C., Gielen,J., Villarroel,R., De Clercq,R.,
            Van Montagu,M., Rogers,J., Cronin,A., Quail,M., Bray-Allen,S.,
            Clark,L., Doggett,J., Hall,S., Kay,M., Lennard,N., McLay,K.,
            Mayes,R., Pettett,A., Rajandream,M.A., Lyne,M., Benes,V.,
            Rechmann,S., Borkova,D., Blocker,H., Scharfe,M., Grimm,M.,
            Lohnert,T.H., Dose,S., de Haan,M., Maarse,A., Schafer,M.,
            Muller-Auer,S., Gabel,C., Fuchs,M., Fartmann,B., Granderath,K.,
            Dauner,D., Herzl,A., Neumann,S., Argiriou,A., Vitale,D.,
            Liguori,R., Piravandi,E., Massenet,O., Quigley,F., Clabauld,G.,
            Mundlein,A., Felber,R., Schnabl,S., Hiller,R., Schmidt,W.,
            Lecharny,A., Aubourg,S., Chefdor,F., Cooke,R., Berger,C.,
            Montfort,A., Casacuberta,E., Gibbons,T., Weber,N., Vandenbol,M.,
            Bargues,M., Terol,J., Torres,A., Perez-Perez,A., Purnelle,B.,
            Bent,E., Johnson,S., Tacon,D., Jesse,T., Heijnen,L., Schwarz,S.,
            Scholler,P., Heber,S., Francs,P., Bielke,C., Frishman,D.,
            Haase,D., Lemcke,K., Mewes,H.W., Stocker,S., Zaccaria,P.,
            Bevan,M., Wilson,R.K., de la Bastide,M., Habermann,K.,
            Parnell,L., Dedhia,N., Gnoj,L., Schutz,K., Huang,E., Spiegel,L.,
            Sehkon,M., Murray,J., Sheet,P., Cordes,M., Abu-Threideh,J.,
            Stoneking,T., Kalicki,J., Graves,T., Harmon,G., Edwards,J.,
            Latreille,P., Courtney,L., Cloud,J., Abbott,A., Scott,K.,
            Johnson,D., Minx,P., Bentley,D., Fulton,B., Miller,N., Greco,T.,
            Kemp,K., Kramer,J., Fulton,L., Mardis,E., Dante,M., Pepin,K.,
            Hillier,L., Nelson,J., Spieth,J., Ryan,E., Andrews,S., Geisel,C.,
            Layman,D., Du,H., Ali,J., Berghoff,A., Jones,K., Drone,K.,
            Cotton,M., Joshu,C., Antonoiu,B., Zidanic,M., Strong,C., Sun,H.,
            Lamar,B., Yordan,C., Ma,P., Zhong,J., Preston,R., Vil,D.,
            Shekher,M., Matero,A., Shah,R., Swaby,I.K., O'Shaughnessy,A.,
            Rodriguez,M., Hoffmann,J., Till,S., Granat,S., Shohdy,N.,
            Hasegawa,A., Hameed,A., Lodhi,M., Johnson,A., Chen,E., Marra,M.,
            Martienssen,R. and McCombie,W.R.
  TITLE     Sequence and analysis of chromosome 4 of the plant Arabidopsis
            thaliana
  JOURNAL   Nature 402 (6763), 769-777 (1999)
   PUBMED   10617198
REFERENCE   2 (bases 1 to 18585056)
  AUTHORS   Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
  CONSRTM   TAIR
  TITLE     Direct Submission
  JOURNAL   Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
            Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE   3 (bases 1 to 18585056)
  AUTHORS   Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
            Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
            Vaughn,M. and Town,C.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-MAY-2016) Plant Genomics, J.
            Craig Venter Institute, 9704 Medical Center Dr, Rockville, MD
            20850, USA
  REMARK    Protein update by submitter
FEATURES             Qualifiers
     source
                     /organism="Arabidopsis thaliana"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:3702"
                     /chromosome="4"
                     /ecotype="Columbia"
     protein
                     /gene="BGAL14"
                     /locus_tag="AT4G38590"
                     /gene_synonym="beta-galactosidase 14"
                     /gene_synonym="F20M13.150"
                     /gene_synonym="F20M13_150"
                     /note="beta-galactosidase 14 (BGAL14); FUNCTIONS IN:
                     sugar binding, cation binding, beta-galactosidase
                     activity, hydrolase activity, hydrolyzing O-glycosyl
                     compounds, catalytic activity; INVOLVED IN: nuclear mRNA
                     splicing, via spliceosome, carbohydrate metabolic
                     process; LOCATED IN: endomembrane system, nucleus;
                     EXPRESSED IN: 12 plant structures; EXPRESSED DURING: 9
                     growth stages; CONTAINS InterPro DOMAIN/s: Glycoside
                     hydrolase, family 35, conserved site
                     (InterPro:IPR019801), Glycoside hydrolase, family 35
                     (InterPro:IPR001944), D-galactoside/L-rhamnose binding
                     SUEL lectin (InterPro:IPR000922), Glycoside hydrolase,
                     catalytic core (InterPro:IPR017853), PRP1 splicing
                     factor, N-terminal (InterPro:IPR010491),
                     Galactose-binding domain-like (InterPro:IPR008979),
                     Glycoside hydrolase, subgroup, catalytic core
                     (InterPro:IPR013781); BEST Arabidopsis thaliana protein
                     match is: glycosyl hydrolase family 35 protein
                     (TAIR:AT2G16730.1); Has 3072 Blast hits to 2969 proteins
                     in 587 species: Archae - 15; Bacteria - 892; Metazoa -
                     685; Fungi - 433; Plants - 695; Viruses - 19; Other
                     Eukaryotes - 333 (source: NCBI BLink)."
                     /db_xref="TAIR:AT4G38590"
                     /db_xref="Araport:AT4G38590"
     intron_pos      69:0 (1/17)
     intron_pos      101:0 (2/17)
     intron_pos      138:2 (3/17)
     intron_pos      192:0 (4/17)
     intron_pos      271:2 (5/17)
     intron_pos      307:0 (6/17)
     intron_pos      335:1 (7/17)
     intron_pos      375:0 (8/17)
     intron_pos      431:0 (9/17)
     intron_pos      485:2 (10/17)
     intron_pos      522:1 (11/17)
     intron_pos      559:0 (12/17)
     intron_pos      596:0 (13/17)
     intron_pos      630:0 (14/17)
     intron_pos      678:2 (15/17)
     intron_pos      797:0 (16/17)
     intron_pos      881:2 (17/17)
BEGIN
        1 MKSRTRYLIA ILLVISLCSK ASSHDDEKKK KGVTYDGSER NFIDHKWKKR ASFLWFCSLP
       61 SKHTSRKHMW PSIIDKARIG GLNTIQTYVF WNVHEPEQGK YDFKGRFDLV KFIKLIHEKG
      121 LYVTLRLGPF IQAEWNHGGL PYWLREVPDV YFRTNNEPFK EHTERYVRKI LGMMKEEKLF
      181 ASQGGPIILG QIENEYNAVQ LAYKENGEKY IKWAANLVES MNLGIPWVMC KQNDAPGNLI
      241 NACNGRHCGD TFPGPNRHDK PSLWTENWTT QFRVFGDPPT QRTVEDIAFS VARYFSKNGS
      301 HVNYYMYHGG TNFGRTSAHF VTTRYYDDAP LDEFGLEKAP KYGHLKHVHR ALRLCKKALF
      361 WGQLRAQTLG PDTEVRYYEQ PGTKVCAAFL SNNNTRDTNT IKFKGQDYVL PSRSISILPD
      421 CKTVVYNTAQ IVAQHSWRDF VKSEKTSKGL KFEMFSENIP SLLDGDSLIP GELYYLTKDK
      481 TDYACVKIDE DDFPDQKGLK TILRVASLGH ALIVYVNGEY AGKAHGRHEM KSFEFAKPVN
      541 FKTGDNRISI LGVLTGLPDS GSYMEHRFAG PRAISIIGLK SGTRDLTENN EWGHLAGLEG
      601 EKKEVYTEEG SKKVKWEKDG KRKPLTWYKT YFETPEGVNA VAIRMKAMGK GLIWVNGIGV
      661 GRYWMSFLSP LGEPTQTEYH IPRSFMKGEK KKNMLVILEE EPGVKLESID FVLVNRDTIC
      721 SNVGEDYPVS VKSWKREGPK IVSRSKDMRL KAVMRCPPEK QMVEVQFASF GDPTGTCGNF
      781 TMGKCSASKS KEVVEKECLG RNYCSIVVAR ETFGDKGCPE IVKTLAVQVK CEKKEGKQDE
      841 KKKKEDKDEE EEDDEDDDEE EEEEDKENKD TKDMENKNQD ILDSDSALVS DLGFGPFSTV
      901 VVNVPLIGGA APPQPRFNLM PPSNYVAGLG RGAAGFTTRS DIGPARANGD GNADVNHKFD
      961 DFEGHDAGLF ANAESDDQDK EADAIWDAID RRMDSRRKDR REAKLKQEIE NYRASNPKVS
     1021 GQFVDLTRKL HTLSEDEWDS IPEIGNYSHR LY
//