LOCUS AEE86144.1 1432 aa PRT PLN 23-MAR-2023 DEFINITION Arabidopsis thaliana glycine-rich protein protein. ACCESSION CP002687-6058 PROTEIN_ID AEE86144.1 SOURCE Arabidopsis thaliana (thale cress) ORGANISM Arabidopsis thaliana Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis. REFERENCE 1 (bases 1 to 18585056) AUTHORS Mayer,K., Schuller,C., Wambutt,R., Murphy,G., Volckaert,G., Pohl,T., Dusterhoft,A., Stiekema,W., Entian,K.D., Terryn,N., Harris,B., Ansorge,W., Brandt,P., Grivell,L., Rieger,M., Weichselgartner,M., de Simone,V., Obermaier,B., Mache,R., Muller,M., Kreis,M., Delseny,M., Puigdomenech,P., Watson,M., Schmidtheini,T., Reichert,B., Portatelle,D., Perez-Alonso,M., Boutry,M., Bancroft,I., Vos,P., Hoheisel,J., Zimmermann,W., Wedler,H., Ridley,P., Langham,S.A., McCullagh,B., Bilham,L., Robben,J., Van der Schueren,J., Grymonprez,B., Chuang,Y.J., Vandenbussche,F., Braeken,M., Weltjens,I., Voet,M., Bastiaens,I., Aert,R., Defoor,E., Weitzenegger,T., Bothe,G., Ramsperger,U., Hilbert,H., Braun,M., Holzer,E., Brandt,A., Peters,S., van Staveren,M., Dirske,W., Mooijman,P., Klein Lankhorst,R., Rose,M., Hauf,J., Kotter,P., Berneiser,S., Hempel,S., Feldpausch,M., Lamberth,S., Van den Daele,H., De Keyser,A., Buysshaert,C., Gielen,J., Villarroel,R., De Clercq,R., Van Montagu,M., Rogers,J., Cronin,A., Quail,M., Bray-Allen,S., Clark,L., Doggett,J., Hall,S., Kay,M., Lennard,N., McLay,K., Mayes,R., Pettett,A., Rajandream,M.A., Lyne,M., Benes,V., Rechmann,S., Borkova,D., Blocker,H., Scharfe,M., Grimm,M., Lohnert,T.H., Dose,S., de Haan,M., Maarse,A., Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M., Fartmann,B., Granderath,K., Dauner,D., Herzl,A., Neumann,S., Argiriou,A., Vitale,D., Liguori,R., Piravandi,E., Massenet,O., Quigley,F., Clabauld,G., Mundlein,A., Felber,R., Schnabl,S., Hiller,R., Schmidt,W., Lecharny,A., Aubourg,S., Chefdor,F., Cooke,R., Berger,C., Montfort,A., Casacuberta,E., Gibbons,T., Weber,N., Vandenbol,M., Bargues,M., Terol,J., Torres,A., Perez-Perez,A., Purnelle,B., Bent,E., Johnson,S., Tacon,D., Jesse,T., Heijnen,L., Schwarz,S., Scholler,P., Heber,S., Francs,P., Bielke,C., Frishman,D., Haase,D., Lemcke,K., Mewes,H.W., Stocker,S., Zaccaria,P., Bevan,M., Wilson,R.K., de la Bastide,M., Habermann,K., Parnell,L., Dedhia,N., Gnoj,L., Schutz,K., Huang,E., Spiegel,L., Sehkon,M., Murray,J., Sheet,P., Cordes,M., Abu-Threideh,J., Stoneking,T., Kalicki,J., Graves,T., Harmon,G., Edwards,J., Latreille,P., Courtney,L., Cloud,J., Abbott,A., Scott,K., Johnson,D., Minx,P., Bentley,D., Fulton,B., Miller,N., Greco,T., Kemp,K., Kramer,J., Fulton,L., Mardis,E., Dante,M., Pepin,K., Hillier,L., Nelson,J., Spieth,J., Ryan,E., Andrews,S., Geisel,C., Layman,D., Du,H., Ali,J., Berghoff,A., Jones,K., Drone,K., Cotton,M., Joshu,C., Antonoiu,B., Zidanic,M., Strong,C., Sun,H., Lamar,B., Yordan,C., Ma,P., Zhong,J., Preston,R., Vil,D., Shekher,M., Matero,A., Shah,R., Swaby,I.K., O'Shaughnessy,A., Rodriguez,M., Hoffmann,J., Till,S., Granat,S., Shohdy,N., Hasegawa,A., Hameed,A., Lodhi,M., Johnson,A., Chen,E., Marra,M., Martienssen,R. and McCombie,W.R. TITLE Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana JOURNAL Nature 402 (6763), 769-777 (1999) PUBMED 10617198 REFERENCE 2 (bases 1 to 18585056) AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E. CONSRTM TAIR TITLE Direct Submission JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie Institution, 260 Panama Street, Stanford, CA, USA REFERENCE 3 (bases 1 to 18585056) AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M., Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R., Vaughn,M. and Town,C.D. TITLE Direct Submission JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute, 9704 Medical Center Dr, Rockville, MD 20850, USA REMARK Protein update by submitter FEATURES Qualifiers source /organism="Arabidopsis thaliana" /mol_type="genomic DNA" /db_xref="taxon:3702" /chromosome="4" /ecotype="Columbia" protein /locus_tag="AT4G32920" /gene_synonym="F26P21.40" /gene_synonym="F26P21_40" /inference="Similar to RNA sequence, EST:INSD:AV804520.1,INSD:ES006844.1,INSD:AV528963.1, INSD:EL106194.1,INSD:Z26795.1,INSD:AV829130.1, INSD:CD531211.1,INSD:ES015107.1,INSD:BP617192.1, INSD:AV829505.1" /inference="Similar to RNA sequence, mRNA:INSD:AK226977.1,INSD:AY057633.1,INSD:BT002256.1" /note="glycine-rich protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: vacuole; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 11 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G11700.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink)." /db_xref="TAIR:AT4G32920" /db_xref="Araport:AT4G32920" intron_pos 271:2 (1/21) intron_pos 310:1 (2/21) intron_pos 380:0 (3/21) intron_pos 420:0 (4/21) intron_pos 463:0 (5/21) intron_pos 505:0 (6/21) intron_pos 523:2 (7/21) intron_pos 557:0 (8/21) intron_pos 596:1 (9/21) intron_pos 666:1 (10/21) intron_pos 773:2 (11/21) intron_pos 805:0 (12/21) intron_pos 841:1 (13/21) intron_pos 932:0 (14/21) intron_pos 974:2 (15/21) intron_pos 1053:0 (16/21) intron_pos 1116:0 (17/21) intron_pos 1208:2 (18/21) intron_pos 1274:0 (19/21) intron_pos 1355:0 (20/21) intron_pos 1384:2 (21/21) BEGIN 1 MISISIPMVR FCLCFAFVIL VSANPKLINS WDETAIRFEP LSPSPAPEPS PDDDDSSVSC 61 VDDLGGVGSL DSTCKLVADL NLTRDLNITG KGNLHVLPGV RLVCQFPGCS ISVNISGNFS 121 LAENSSVIAG TFRLAAENAE FGLSSAVDTT GLAGEPPPDT SGTPEGVEGA GGGYGGRGAC 181 CLSDTTTKIP EDVFGGDVYG WSSLEKPEIY GSRGGSTSNE VDYGGGGGGT VAIEILGYIS 241 LNGSVLADGA SGGVKGGGGS GGSIFVMAHK MAGNGRLSAS GGDGYAGGGG GRVSVDIYSR 301 HSDPKIFFNG GRSFGCPENA GAAGTLYDVI SESLTIDNHN KTTYTDTLLL EFPNHRLFTN 361 LYIRNMAKVA VPLRWSRVQV QGLISLSNGG ELNFGLPRYA SSEFELFAEE LLMSNSAIKV 421 YGALRMTVKV FLMLKSRMFI DGGGVTILGT SMLEISNLLV LKESSVIQSN GNLGVHGQGL 481 LNLTGTGDTI EAQRLILSLF YSIQVGAGAV LRGPLQNAST GGLTPKLYCQ RQDCPVELLH 541 PPEDCNVNSS LPFTLQICRV EDITVEGLIK GSVIQFHLAR TVLVRSSGTI SADGMGCKGG 601 VGTGRFLRSG IGSGGGHGGK GGSGCYNHTC IEGGESYGNA DLPCELGSGS GNEESTDSVA 661 GGGIIVLGSL EHPLSSLSLE GSITTDGESP RKTLKGLSNS SLGPGGGSGG TVLLFLRTLE 721 IGRSAILSSI GGNGSLKGGG GGSGGRIHFH WSDIPTGDVY HPVAIVKGRV YVRGGMGIIE 781 DNIGGNGTLT GKACPEGLYG LFCEECPSGT YKNVTGSDKA LCHLCPANDI PHRAVYVTVR 841 GGVAETPCPY KCISDRYHMP HCYTTLEELI YTFGGPWLFG VLLVVVLLLL ALVFSVARMK 901 FVSGDELHGS APTQHGSQID HSFPFLESLN EVMETSRVEE SQGHMHRIYF LGPNTFSEPW 961 HLSHTPPEEI KEIVYEAAFN GFVDEVNVIA AYQWWEGAIY IMLSVLVYPL AWSWQQSRRR 1021 LKFQKLRDFV RSEYDHSCLR SCRSRALYEG LKVAATPDLM LAHLDFFLGG DEKRSDLPPQ 1081 VHQRLPMPLI FGGDGSYMAY YSLQSDDILT SLLSQLVPPT TWYRFVAGLN AQLRLVQQGK 1141 LRSTFRSVMR WIETHGNPAL KRHGVRVDLA RFQALSSSSC QYGILVHTIA DEVASTRSDD 1201 ETEQQHPWGT QIENHSGDFR ENFQPLRSEI NHVRHQECGE IIDIGSLQFL KEEKDVLSLI 1261 SFLIHNTKPV GHQDLVGLVI SVLLLGDLTL TLLTLLQLYS ISLLEVFLAM FILPLSIIFP 1321 FPAGVSALFS HGPRRSASRT RVYALWNVTS LVNVVVAFVC GYVHYHGSSS GKKIPYLQPW 1381 NISMDENEWW IFPVALFLCK VLQSQLVNWH VANLEIQDYS LYSDDSELFW QS //