LOCUS       AEC09214.1               591 aa    PRT              PLN 23-MAR-2023
DEFINITION  Arabidopsis thaliana cell wall invertase 4 protein.
ACCESSION   CP002685-5225
PROTEIN_ID  AEC09214.1
SOURCE      Arabidopsis thaliana (thale cress)
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
            Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
            Camelineae; Arabidopsis.
REFERENCE   1  (bases 1 to 19698289)
  AUTHORS   Lin,X., Kaul,S., Rounsley,S., Shea,T.P., Benito,M.I., Town,C.D.,
            Fujii,C.Y., Mason,T., Bowman,C.L., Barnstead,M., Feldblyum,T.V.,
            Buell,C.R., Ketchum,K.A., Lee,J., Ronning,C.M., Koo,H.L.,
            Moffat,K.S., Cronin,L.A., Shen,M., Pai,G., Van Aken,S., Umayam,L.,
            Tallon,L.J., Gill,J.E., Adams,M.D., Carrera,A.J., Creasy,T.H.,
            Goodman,H.M., Somerville,C.R., Copenhaver,G.P., Preuss,D.,
            Nierman,W.C., White,O., Eisen,J.A., Salzberg,S.L., Fraser,C.M. and
            Venter,J.C.
  TITLE     Sequence and analysis of chromosome 2 of the plant Arabidopsis
            thaliana
  JOURNAL   Nature 402 (6763), 761-768 (1999)
   PUBMED   10617197
REFERENCE   2  (bases 1 to 19698289)
  AUTHORS   Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
  CONSRTM   TAIR
  TITLE     Direct Submission
  JOURNAL   Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
            Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE   3  (bases 1 to 19698289)
  AUTHORS   Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
            Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
            Vaughn,M. and Town,C.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
            9704 Medical Center Dr, Rockville, MD 20850, USA
  REMARK    Protein update by submitter
FEATURES             Qualifiers
     source          /organism="Arabidopsis thaliana"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:3702"
                     /chromosome="2"
                     /ecotype="Columbia"
     protein         /gene="cwINV4"
                     /locus_tag="AT2G36190"
                     /gene_synonym="AtCWIN4"
                     /gene_synonym="AtcwINV4"
                     /gene_synonym="cell wall invertase 4"
                     /gene_synonym="F9C22.8"
                     /inference="Similar to RNA sequence,
                     EST:INSD:AV557097.1,INSD:ES099899.1,INSD:ES005937.1,
                     INSD:ES140098.1,INSD:AV561590.1,INSD:ES101399.1,
                     INSD:DR299745.1,INSD:DR380051.1,INSD:ES184971.1"
                     /note="cell wall invertase 4 (cwINV4); CONTAINS InterPro
                     DOMAIN/s: Glycoside hydrolase, family 32
                     (InterPro:IPR001362), Glycoside hydrolase, family 32,
                     active site (InterPro:IPR018053), Glycosyl hydrolases
                     family 32, N-terminal (InterPro:IPR013148), Glycosyl
                     hydrolase family 32, C-terminal (InterPro:IPR013189),
                     Concanavalin A-like lectin/glucanase (InterPro:IPR008985);
                     BEST Arabidopsis thaliana protein match is: cell wall
                     invertase 2 (TAIR:AT3G52600.1); Has 4207 Blast hits to
                     4168 proteins in 1243 species: Archae - 18; Bacteria -
                     2607; Metazoa - 78; Fungi - 284; Plants - 1032; Viruses -
                     0; Other Eukaryotes - 188 (source: NCBI BLink)."
                     /db_xref="Araport:AT2G36190"
                     /db_xref="TAIR:AT2G36190"
     intron_pos      64:1 (1/5)
     intron_pos      67:1 (2/5)
     intron_pos      354:0 (3/5)
     intron_pos      489:2 (4/5)
     intron_pos      528:0 (5/5)
BEGIN
        1 MAISNVISVL LLLLVLINLS NQNIKGIDAF HQIYEELQSE SVESVNHLHR PSFHFQPPKH
       61 WINDPNGPVY YKGLYHLFYQ YNTKGAVWGN IIWAHSVSKD LVNWEALEPA LSPSKWFDIG
      121 GTWSGSITIV PGKGPIILYT GVNQNETQLQ NYAIPEDPSD PYLRKWIKPD DNPIAIPDYT
      181 MNGSAFRDPT TAWFSKDGHW RTVVGSKRKR RGIAYIYRSR DFKHWVKAKH PVHSKQSTGM
      241 WECPDFFPVS LTDFRNGLDL DYVGPNTKHV LKVSLDITRY EYYTLGKYDL KKDRYIPDGN
      301 TPDGWEGLRF DYGNFYASKT FFDYKKNRRI LWGWANESDT VEDDILKGWA GLQVIPRTVL
      361 LDSSKKQLVF WPVEEIESLR GNYVRMNNHD IKMGQRIEVK GITPAQADVE VTFYVGSLEK
      421 AEIFDPSFTW KPLELCNIKG SNVRGGVGPF GLITLATPDL EEYTPVFFRV FNDTKTHKPK
      481 VLMCSDARPS SLKQDTGLLA KDRMYKPSFA GFVDVDMADG RISLRSLIDH SVVESFGALG
      541 KTVITSRVYP VKAVKENAHL YVFNNGTQTV TIESLNAWNM DRPLQMNDGA L
//