LOCUS AEE86145.1 1432 aa PRT PLN 23-MAR-2023
DEFINITION Arabidopsis thaliana glycine-rich protein protein.
ACCESSION CP002687-6059
PROTEIN_ID AEE86145.1
SOURCE Arabidopsis thaliana (thale cress)
ORGANISM Arabidopsis thaliana
Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
Camelineae; Arabidopsis.
REFERENCE 1 (bases 1 to 18585056)
AUTHORS Mayer,K., Schuller,C., Wambutt,R., Murphy,G., Volckaert,G.,
Pohl,T., Dusterhoft,A., Stiekema,W., Entian,K.D., Terryn,N.,
Harris,B., Ansorge,W., Brandt,P., Grivell,L., Rieger,M.,
Weichselgartner,M., de Simone,V., Obermaier,B., Mache,R.,
Muller,M., Kreis,M., Delseny,M., Puigdomenech,P., Watson,M.,
Schmidtheini,T., Reichert,B., Portatelle,D., Perez-Alonso,M.,
Boutry,M., Bancroft,I., Vos,P., Hoheisel,J., Zimmermann,W.,
Wedler,H., Ridley,P., Langham,S.A., McCullagh,B., Bilham,L.,
Robben,J., Van der Schueren,J., Grymonprez,B., Chuang,Y.J.,
Vandenbussche,F., Braeken,M., Weltjens,I., Voet,M., Bastiaens,I.,
Aert,R., Defoor,E., Weitzenegger,T., Bothe,G., Ramsperger,U.,
Hilbert,H., Braun,M., Holzer,E., Brandt,A., Peters,S., van
Staveren,M., Dirske,W., Mooijman,P., Klein Lankhorst,R., Rose,M.,
Hauf,J., Kotter,P., Berneiser,S., Hempel,S., Feldpausch,M.,
Lamberth,S., Van den Daele,H., De Keyser,A., Buysshaert,C.,
Gielen,J., Villarroel,R., De Clercq,R., Van Montagu,M., Rogers,J.,
Cronin,A., Quail,M., Bray-Allen,S., Clark,L., Doggett,J., Hall,S.,
Kay,M., Lennard,N., McLay,K., Mayes,R., Pettett,A.,
Rajandream,M.A., Lyne,M., Benes,V., Rechmann,S., Borkova,D.,
Blocker,H., Scharfe,M., Grimm,M., Lohnert,T.H., Dose,S., de
Haan,M., Maarse,A., Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M.,
Fartmann,B., Granderath,K., Dauner,D., Herzl,A., Neumann,S.,
Argiriou,A., Vitale,D., Liguori,R., Piravandi,E., Massenet,O.,
Quigley,F., Clabauld,G., Mundlein,A., Felber,R., Schnabl,S.,
Hiller,R., Schmidt,W., Lecharny,A., Aubourg,S., Chefdor,F.,
Cooke,R., Berger,C., Montfort,A., Casacuberta,E., Gibbons,T.,
Weber,N., Vandenbol,M., Bargues,M., Terol,J., Torres,A.,
Perez-Perez,A., Purnelle,B., Bent,E., Johnson,S., Tacon,D.,
Jesse,T., Heijnen,L., Schwarz,S., Scholler,P., Heber,S., Francs,P.,
Bielke,C., Frishman,D., Haase,D., Lemcke,K., Mewes,H.W.,
Stocker,S., Zaccaria,P., Bevan,M., Wilson,R.K., de la Bastide,M.,
Habermann,K., Parnell,L., Dedhia,N., Gnoj,L., Schutz,K., Huang,E.,
Spiegel,L., Sehkon,M., Murray,J., Sheet,P., Cordes,M.,
Abu-Threideh,J., Stoneking,T., Kalicki,J., Graves,T., Harmon,G.,
Edwards,J., Latreille,P., Courtney,L., Cloud,J., Abbott,A.,
Scott,K., Johnson,D., Minx,P., Bentley,D., Fulton,B., Miller,N.,
Greco,T., Kemp,K., Kramer,J., Fulton,L., Mardis,E., Dante,M.,
Pepin,K., Hillier,L., Nelson,J., Spieth,J., Ryan,E., Andrews,S.,
Geisel,C., Layman,D., Du,H., Ali,J., Berghoff,A., Jones,K.,
Drone,K., Cotton,M., Joshu,C., Antonoiu,B., Zidanic,M., Strong,C.,
Sun,H., Lamar,B., Yordan,C., Ma,P., Zhong,J., Preston,R., Vil,D.,
Shekher,M., Matero,A., Shah,R., Swaby,I.K., O'Shaughnessy,A.,
Rodriguez,M., Hoffmann,J., Till,S., Granat,S., Shohdy,N.,
Hasegawa,A., Hameed,A., Lodhi,M., Johnson,A., Chen,E., Marra,M.,
Martienssen,R. and McCombie,W.R.
TITLE Sequence and analysis of chromosome 4 of the plant Arabidopsis
thaliana
JOURNAL Nature 402 (6763), 769-777 (1999)
PUBMED 10617198
REFERENCE 2 (bases 1 to 18585056)
AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
CONSRTM TAIR
TITLE Direct Submission
JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE 3 (bases 1 to 18585056)
AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
Vaughn,M. and Town,C.D.
TITLE Direct Submission
JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
9704 Medical Center Dr, Rockville, MD 20850, USA
REMARK Protein update by submitter
FEATURES Qualifiers
source /organism="Arabidopsis thaliana"
/mol_type="genomic DNA"
/db_xref="taxon:3702"
/chromosome="4"
/ecotype="Columbia"
protein /locus_tag="AT4G32920"
/gene_synonym="F26P21.40"
/gene_synonym="F26P21_40"
/note="glycine-rich protein; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G11700.1)."
/db_xref="TAIR:AT4G32920"
/db_xref="Araport:AT4G32920"
intron_pos 271:2 (1/21)
intron_pos 310:1 (2/21)
intron_pos 380:0 (3/21)
intron_pos 420:0 (4/21)
intron_pos 463:0 (5/21)
intron_pos 505:0 (6/21)
intron_pos 523:2 (7/21)
intron_pos 557:0 (8/21)
intron_pos 596:1 (9/21)
intron_pos 666:1 (10/21)
intron_pos 773:2 (11/21)
intron_pos 805:0 (12/21)
intron_pos 841:1 (13/21)
intron_pos 932:0 (14/21)
intron_pos 974:2 (15/21)
intron_pos 1053:0 (16/21)
intron_pos 1116:0 (17/21)
intron_pos 1208:2 (18/21)
intron_pos 1274:0 (19/21)
intron_pos 1355:0 (20/21)
intron_pos 1384:2 (21/21)
BEGIN
1 MISISIPMVR FCLCFAFVIL VSANPKLINS WDETAIRFEP LSPSPAPEPS PDDDDSSVSC
61 VDDLGGVGSL DSTCKLVADL NLTRDLNITG KGNLHVLPGV RLVCQFPGCS ISVNISGNFS
121 LAENSSVIAG TFRLAAENAE FGLSSAVDTT GLAGEPPPDT SGTPEGVEGA GGGYGGRGAC
181 CLSDTTTKIP EDVFGGDVYG WSSLEKPEIY GSRGGSTSNE VDYGGGGGGT VAIEILGYIS
241 LNGSVLADGA SGGVKGGGGS GGSIFVMAHK MAGNGRLSAS GGDGYAGGGG GRVSVDIYSR
301 HSDPKIFFNG GRSFGCPENA GAAGTLYDVI SESLTIDNHN KTTYTDTLLL EFPNHRLFTN
361 LYIRNMAKVA VPLRWSRVQV QGLISLSNGG ELNFGLPRYA SSEFELFAEE LLMSNSAIKV
421 YGALRMTVKV FLMLKSRMFI DGGGVTILGT SMLEISNLLV LKESSVIQSN GNLGVHGQGL
481 LNLTGTGDTI EAQRLILSLF YSIQVGAGAV LRGPLQNAST GGLTPKLYCQ RQDCPVELLH
541 PPEDCNVNSS LPFTLQICRV EDITVEGLIK GSVIQFHLAR TVLVRSSGTI SADGMGCKGG
601 VGTGRFLRSG IGSGGGHGGK GGSGCYNHTC IEGGESYGNA DLPCELGSGS GNEESTDSVA
661 GGGIIVLGSL EHPLSSLSLE GSITTDGESP RKTLKGLSNS SLGPGGGSGG TVLLFLRTLE
721 IGRSAILSSI GGNGSLKGGG GGSGGRIHFH WSDIPTGDVY HPVAIVKGRV YVRGGMGIIE
781 DNIGGNGTLT GKACPEGLYG LFCEECPSGT YKNVTGSDKA LCHLCPANDI PHRAVYVTVR
841 GGVAETPCPY KCISDRYHMP HCYTTLEELI YTFGGPWLFG VLLVVVLLLL ALVFSVARMK
901 FVSGDELHGS APTQHGSQID HSFPFLESLN EVMETSRVEE SQGHMHRIYF LGPNTFSEPW
961 HLSHTPPEEI KEIVYEAAFN GFVDEVNVIA AYQWWEGAIY IMLSVLVYPL AWSWQQSRRR
1021 LKFQKLRDFV RSEYDHSCLR SCRSRALYEG LKVAATPDLM LAHLDFFLGG DEKRSDLPPQ
1081 VHQRLPMPLI FGGDGSYMAY YSLQSDDILT SLLSQLVPPT TWYRFVAGLN AQLRLVQQGK
1141 LRSTFRSVMR WIETHGNPAL KRHGVRVDLA RFQALSSSSC QYGILVHTIA DEVASTRSDD
1201 ETEQQHPWGT QIENHSGDFR ENFQPLRSEI NHVRHQECGE IIDIGSLQFL KEEKDVLSLI
1261 SFLIHNTKPV GHQDLVGLVI SVLLLGDLTL TLLTLLQLYS ISLLEVFLAM FILPLSIIFP
1321 FPAGVSALFS HGPRRSASRT RVYALWNVTS LVNVVVAFVC GYVHYHGSSS GKKIPYLQPW
1381 NISMDENEWW IFPVALFLCK VLQSQLVNWH VANLEIQDYS LYSDDSELFW QS
//