LOCUS       AEE86144.1              1432 aa    PRT              PLN 23-MAR-2023
DEFINITION  Arabidopsis thaliana glycine-rich protein protein.
ACCESSION   CP002687-6058
PROTEIN_ID  AEE86144.1
SOURCE      Arabidopsis thaliana (thale cress)
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
            Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
            Camelineae; Arabidopsis.
REFERENCE   1  (bases 1 to 18585056)
  AUTHORS   Mayer,K., Schuller,C., Wambutt,R., Murphy,G., Volckaert,G.,
            Pohl,T., Dusterhoft,A., Stiekema,W., Entian,K.D., Terryn,N.,
            Harris,B., Ansorge,W., Brandt,P., Grivell,L., Rieger,M.,
            Weichselgartner,M., de Simone,V., Obermaier,B., Mache,R.,
            Muller,M., Kreis,M., Delseny,M., Puigdomenech,P., Watson,M.,
            Schmidtheini,T., Reichert,B., Portatelle,D., Perez-Alonso,M.,
            Boutry,M., Bancroft,I., Vos,P., Hoheisel,J., Zimmermann,W.,
            Wedler,H., Ridley,P., Langham,S.A., McCullagh,B., Bilham,L.,
            Robben,J., Van der Schueren,J., Grymonprez,B., Chuang,Y.J.,
            Vandenbussche,F., Braeken,M., Weltjens,I., Voet,M., Bastiaens,I.,
            Aert,R., Defoor,E., Weitzenegger,T., Bothe,G., Ramsperger,U.,
            Hilbert,H., Braun,M., Holzer,E., Brandt,A., Peters,S., van
            Staveren,M., Dirske,W., Mooijman,P., Klein Lankhorst,R., Rose,M.,
            Hauf,J., Kotter,P., Berneiser,S., Hempel,S., Feldpausch,M.,
            Lamberth,S., Van den Daele,H., De Keyser,A., Buysshaert,C.,
            Gielen,J., Villarroel,R., De Clercq,R., Van Montagu,M., Rogers,J.,
            Cronin,A., Quail,M., Bray-Allen,S., Clark,L., Doggett,J., Hall,S.,
            Kay,M., Lennard,N., McLay,K., Mayes,R., Pettett,A.,
            Rajandream,M.A., Lyne,M., Benes,V., Rechmann,S., Borkova,D.,
            Blocker,H., Scharfe,M., Grimm,M., Lohnert,T.H., Dose,S., de
            Haan,M., Maarse,A., Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M.,
            Fartmann,B., Granderath,K., Dauner,D., Herzl,A., Neumann,S.,
            Argiriou,A., Vitale,D., Liguori,R., Piravandi,E., Massenet,O.,
            Quigley,F., Clabauld,G., Mundlein,A., Felber,R., Schnabl,S.,
            Hiller,R., Schmidt,W., Lecharny,A., Aubourg,S., Chefdor,F.,
            Cooke,R., Berger,C., Montfort,A., Casacuberta,E., Gibbons,T.,
            Weber,N., Vandenbol,M., Bargues,M., Terol,J., Torres,A.,
            Perez-Perez,A., Purnelle,B., Bent,E., Johnson,S., Tacon,D.,
            Jesse,T., Heijnen,L., Schwarz,S., Scholler,P., Heber,S., Francs,P.,
            Bielke,C., Frishman,D., Haase,D., Lemcke,K., Mewes,H.W.,
            Stocker,S., Zaccaria,P., Bevan,M., Wilson,R.K., de la Bastide,M.,
            Habermann,K., Parnell,L., Dedhia,N., Gnoj,L., Schutz,K., Huang,E.,
            Spiegel,L., Sehkon,M., Murray,J., Sheet,P., Cordes,M.,
            Abu-Threideh,J., Stoneking,T., Kalicki,J., Graves,T., Harmon,G.,
            Edwards,J., Latreille,P., Courtney,L., Cloud,J., Abbott,A.,
            Scott,K., Johnson,D., Minx,P., Bentley,D., Fulton,B., Miller,N.,
            Greco,T., Kemp,K., Kramer,J., Fulton,L., Mardis,E., Dante,M.,
            Pepin,K., Hillier,L., Nelson,J., Spieth,J., Ryan,E., Andrews,S.,
            Geisel,C., Layman,D., Du,H., Ali,J., Berghoff,A., Jones,K.,
            Drone,K., Cotton,M., Joshu,C., Antonoiu,B., Zidanic,M., Strong,C.,
            Sun,H., Lamar,B., Yordan,C., Ma,P., Zhong,J., Preston,R., Vil,D.,
            Shekher,M., Matero,A., Shah,R., Swaby,I.K., O'Shaughnessy,A.,
            Rodriguez,M., Hoffmann,J., Till,S., Granat,S., Shohdy,N.,
            Hasegawa,A., Hameed,A., Lodhi,M., Johnson,A., Chen,E., Marra,M.,
            Martienssen,R. and McCombie,W.R.
  TITLE     Sequence and analysis of chromosome 4 of the plant Arabidopsis
            thaliana
  JOURNAL   Nature 402 (6763), 769-777 (1999)
   PUBMED   10617198
REFERENCE   2  (bases 1 to 18585056)
  AUTHORS   Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
  CONSRTM   TAIR
  TITLE     Direct Submission
  JOURNAL   Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
            Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE   3  (bases 1 to 18585056)
  AUTHORS   Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
            Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
            Vaughn,M. and Town,C.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
            9704 Medical Center Dr, Rockville, MD 20850, USA
  REMARK    Protein update by submitter
FEATURES             Qualifiers
     source          /organism="Arabidopsis thaliana"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:3702"
                     /chromosome="4"
                     /ecotype="Columbia"
     protein         /locus_tag="AT4G32920"
                     /gene_synonym="F26P21.40"
                     /gene_synonym="F26P21_40"
                     /inference="Similar to RNA sequence,
                     EST:INSD:AV804520.1,INSD:ES006844.1,INSD:AV528963.1,
                     INSD:EL106194.1,INSD:Z26795.1,INSD:AV829130.1,
                     INSD:CD531211.1,INSD:ES015107.1,INSD:BP617192.1,
                     INSD:AV829505.1"
                     /inference="Similar to RNA sequence,
                     mRNA:INSD:AK226977.1,INSD:AY057633.1,INSD:BT002256.1"
                     /note="glycine-rich protein; FUNCTIONS IN:
                     molecular_function unknown; INVOLVED IN:
                     biological_process unknown; LOCATED IN: vacuole; EXPRESSED
                     IN: 21 plant structures; EXPRESSED DURING: 11 growth
                     stages; BEST Arabidopsis thaliana protein match is:
                     unknown protein (TAIR:AT5G11700.1); Has 30201 Blast hits
                     to 17322 proteins in 780 species: Archae - 12; Bacteria -
                     1396; Metazoa - 17338; Fungi - 3422; Plants - 5037;
                     Viruses - 0; Other Eukaryotes - 2996 (source: NCBI
                     BLink)."
                     /db_xref="TAIR:AT4G32920"
                     /db_xref="Araport:AT4G32920"
     intron_pos      271:2 (1/21)
     intron_pos      310:1 (2/21)
     intron_pos      380:0 (3/21)
     intron_pos      420:0 (4/21)
     intron_pos      463:0 (5/21)
     intron_pos      505:0 (6/21)
     intron_pos      523:2 (7/21)
     intron_pos      557:0 (8/21)
     intron_pos      596:1 (9/21)
     intron_pos      666:1 (10/21)
     intron_pos      773:2 (11/21)
     intron_pos      805:0 (12/21)
     intron_pos      841:1 (13/21)
     intron_pos      932:0 (14/21)
     intron_pos      974:2 (15/21)
     intron_pos      1053:0 (16/21)
     intron_pos      1116:0 (17/21)
     intron_pos      1208:2 (18/21)
     intron_pos      1274:0 (19/21)
     intron_pos      1355:0 (20/21)
     intron_pos      1384:2 (21/21)
BEGIN
        1 MISISIPMVR FCLCFAFVIL VSANPKLINS WDETAIRFEP LSPSPAPEPS PDDDDSSVSC
       61 VDDLGGVGSL DSTCKLVADL NLTRDLNITG KGNLHVLPGV RLVCQFPGCS ISVNISGNFS
      121 LAENSSVIAG TFRLAAENAE FGLSSAVDTT GLAGEPPPDT SGTPEGVEGA GGGYGGRGAC
      181 CLSDTTTKIP EDVFGGDVYG WSSLEKPEIY GSRGGSTSNE VDYGGGGGGT VAIEILGYIS
      241 LNGSVLADGA SGGVKGGGGS GGSIFVMAHK MAGNGRLSAS GGDGYAGGGG GRVSVDIYSR
      301 HSDPKIFFNG GRSFGCPENA GAAGTLYDVI SESLTIDNHN KTTYTDTLLL EFPNHRLFTN
      361 LYIRNMAKVA VPLRWSRVQV QGLISLSNGG ELNFGLPRYA SSEFELFAEE LLMSNSAIKV
      421 YGALRMTVKV FLMLKSRMFI DGGGVTILGT SMLEISNLLV LKESSVIQSN GNLGVHGQGL
      481 LNLTGTGDTI EAQRLILSLF YSIQVGAGAV LRGPLQNAST GGLTPKLYCQ RQDCPVELLH
      541 PPEDCNVNSS LPFTLQICRV EDITVEGLIK GSVIQFHLAR TVLVRSSGTI SADGMGCKGG
      601 VGTGRFLRSG IGSGGGHGGK GGSGCYNHTC IEGGESYGNA DLPCELGSGS GNEESTDSVA
      661 GGGIIVLGSL EHPLSSLSLE GSITTDGESP RKTLKGLSNS SLGPGGGSGG TVLLFLRTLE
      721 IGRSAILSSI GGNGSLKGGG GGSGGRIHFH WSDIPTGDVY HPVAIVKGRV YVRGGMGIIE
      781 DNIGGNGTLT GKACPEGLYG LFCEECPSGT YKNVTGSDKA LCHLCPANDI PHRAVYVTVR
      841 GGVAETPCPY KCISDRYHMP HCYTTLEELI YTFGGPWLFG VLLVVVLLLL ALVFSVARMK
      901 FVSGDELHGS APTQHGSQID HSFPFLESLN EVMETSRVEE SQGHMHRIYF LGPNTFSEPW
      961 HLSHTPPEEI KEIVYEAAFN GFVDEVNVIA AYQWWEGAIY IMLSVLVYPL AWSWQQSRRR
     1021 LKFQKLRDFV RSEYDHSCLR SCRSRALYEG LKVAATPDLM LAHLDFFLGG DEKRSDLPPQ
     1081 VHQRLPMPLI FGGDGSYMAY YSLQSDDILT SLLSQLVPPT TWYRFVAGLN AQLRLVQQGK
     1141 LRSTFRSVMR WIETHGNPAL KRHGVRVDLA RFQALSSSSC QYGILVHTIA DEVASTRSDD
     1201 ETEQQHPWGT QIENHSGDFR ENFQPLRSEI NHVRHQECGE IIDIGSLQFL KEEKDVLSLI
     1261 SFLIHNTKPV GHQDLVGLVI SVLLLGDLTL TLLTLLQLYS ISLLEVFLAM FILPLSIIFP
     1321 FPAGVSALFS HGPRRSASRT RVYALWNVTS LVNVVVAFVC GYVHYHGSSS GKKIPYLQPW
     1381 NISMDENEWW IFPVALFLCK VLQSQLVNWH VANLEIQDYS LYSDDSELFW QS
//