LOCUS AEE82629.1 1041 aa PRT PLN 23-MAR-2023 DEFINITION Arabidopsis thaliana global transcription factor group A2 protein. ACCESSION CP002687-1126 PROTEIN_ID AEE82629.1 SOURCE Arabidopsis thaliana (thale cress) ORGANISM Arabidopsis thaliana Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis. REFERENCE 1 (bases 1 to 18585056) AUTHORS Mayer,K., Schuller,C., Wambutt,R., Murphy,G., Volckaert,G., Pohl,T., Dusterhoft,A., Stiekema,W., Entian,K.D., Terryn,N., Harris,B., Ansorge,W., Brandt,P., Grivell,L., Rieger,M., Weichselgartner,M., de Simone,V., Obermaier,B., Mache,R., Muller,M., Kreis,M., Delseny,M., Puigdomenech,P., Watson,M., Schmidtheini,T., Reichert,B., Portatelle,D., Perez-Alonso,M., Boutry,M., Bancroft,I., Vos,P., Hoheisel,J., Zimmermann,W., Wedler,H., Ridley,P., Langham,S.A., McCullagh,B., Bilham,L., Robben,J., Van der Schueren,J., Grymonprez,B., Chuang,Y.J., Vandenbussche,F., Braeken,M., Weltjens,I., Voet,M., Bastiaens,I., Aert,R., Defoor,E., Weitzenegger,T., Bothe,G., Ramsperger,U., Hilbert,H., Braun,M., Holzer,E., Brandt,A., Peters,S., van Staveren,M., Dirske,W., Mooijman,P., Klein Lankhorst,R., Rose,M., Hauf,J., Kotter,P., Berneiser,S., Hempel,S., Feldpausch,M., Lamberth,S., Van den Daele,H., De Keyser,A., Buysshaert,C., Gielen,J., Villarroel,R., De Clercq,R., Van Montagu,M., Rogers,J., Cronin,A., Quail,M., Bray-Allen,S., Clark,L., Doggett,J., Hall,S., Kay,M., Lennard,N., McLay,K., Mayes,R., Pettett,A., Rajandream,M.A., Lyne,M., Benes,V., Rechmann,S., Borkova,D., Blocker,H., Scharfe,M., Grimm,M., Lohnert,T.H., Dose,S., de Haan,M., Maarse,A., Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M., Fartmann,B., Granderath,K., Dauner,D., Herzl,A., Neumann,S., Argiriou,A., Vitale,D., Liguori,R., Piravandi,E., Massenet,O., Quigley,F., Clabauld,G., Mundlein,A., Felber,R., Schnabl,S., Hiller,R., Schmidt,W., Lecharny,A., Aubourg,S., Chefdor,F., Cooke,R., Berger,C., Montfort,A., Casacuberta,E., Gibbons,T., Weber,N., Vandenbol,M., Bargues,M., Terol,J., Torres,A., Perez-Perez,A., Purnelle,B., Bent,E., Johnson,S., Tacon,D., Jesse,T., Heijnen,L., Schwarz,S., Scholler,P., Heber,S., Francs,P., Bielke,C., Frishman,D., Haase,D., Lemcke,K., Mewes,H.W., Stocker,S., Zaccaria,P., Bevan,M., Wilson,R.K., de la Bastide,M., Habermann,K., Parnell,L., Dedhia,N., Gnoj,L., Schutz,K., Huang,E., Spiegel,L., Sehkon,M., Murray,J., Sheet,P., Cordes,M., Abu-Threideh,J., Stoneking,T., Kalicki,J., Graves,T., Harmon,G., Edwards,J., Latreille,P., Courtney,L., Cloud,J., Abbott,A., Scott,K., Johnson,D., Minx,P., Bentley,D., Fulton,B., Miller,N., Greco,T., Kemp,K., Kramer,J., Fulton,L., Mardis,E., Dante,M., Pepin,K., Hillier,L., Nelson,J., Spieth,J., Ryan,E., Andrews,S., Geisel,C., Layman,D., Du,H., Ali,J., Berghoff,A., Jones,K., Drone,K., Cotton,M., Joshu,C., Antonoiu,B., Zidanic,M., Strong,C., Sun,H., Lamar,B., Yordan,C., Ma,P., Zhong,J., Preston,R., Vil,D., Shekher,M., Matero,A., Shah,R., Swaby,I.K., O'Shaughnessy,A., Rodriguez,M., Hoffmann,J., Till,S., Granat,S., Shohdy,N., Hasegawa,A., Hameed,A., Lodhi,M., Johnson,A., Chen,E., Marra,M., Martienssen,R. and McCombie,W.R. TITLE Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana JOURNAL Nature 402 (6763), 769-777 (1999) PUBMED 10617198 REFERENCE 2 (bases 1 to 18585056) AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E. CONSRTM TAIR TITLE Direct Submission JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie Institution, 260 Panama Street, Stanford, CA, USA REFERENCE 3 (bases 1 to 18585056) AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M., Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R., Vaughn,M. and Town,C.D. TITLE Direct Submission JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute, 9704 Medical Center Dr, Rockville, MD 20850, USA REMARK Protein update by submitter FEATURES Qualifiers source /organism="Arabidopsis thaliana" /mol_type="genomic DNA" /db_xref="taxon:3702" /chromosome="4" /ecotype="Columbia" protein /gene="GTA2" /locus_tag="AT4G08350" /gene_synonym="global transcription factor group A2" /gene_synonym="GTA02" /gene_synonym="SPT5-2" /gene_synonym="T28D5.40" /gene_synonym="T28D5_40" /inference="Similar to RNA sequence, EST:INSD:EG507999.1,INSD:EG508014.1,INSD:EH927635.1, INSD:EL206451.1,INSD:EG508003.1,INSD:EG452920.1, INSD:EG508016.1,INSD:EL977998.1,INSD:BE525873.1, INSD:ES057295.1,INSD:DR354509.1,INSD:EL981005.1, INSD:EG452913.1,INSD:EL971292.1,INSD:EG452914.1, INSD:EG507989.1,INSD:EG452916.1,INSD:EL974739.1, INSD:EG508013.1,INSD:ES105987.1,INSD:EL051697.1, INSD:BP633079.1,INSD:ES053809.1,INSD:EG508005.1, INSD:ES041277.1,INSD:ES175510.1,INSD:ES003693.1, INSD:ES103579.1,INSD:EG508011.1,INSD:EG508007.1, INSD:EG508002.1,INSD:ES168495.1,INSD:AI992901.1, INSD:ES008860.1,INSD:EL243563.1,INSD:ES130278.1, INSD:EG452911.1,INSD:EG508009.1,INSD:AA586259.1, INSD:EL973602.1,INSD:EL994211.1,INSD:ES048559.1, INSD:EG507998.1,INSD:EG507988.1,INSD:ES154686.1, INSD:EG525877.1,INSD:EG525888.1,INSD:ES023368.1, INSD:EG508004.1,INSD:ES101433.1,INSD:EG452915.1, INSD:EH900574.1,INSD:EG508017.1,INSD:EG508010.1, INSD:EG508006.1,INSD:EG507990.1,INSD:EG508012.1, INSD:ES041563.1,INSD:EL978890.1,INSD:ES037268.1, INSD:EL298312.1,INSD:EL285199.1,INSD:T20718.1, INSD:EH869191.1,INSD:EG507994.1,INSD:EL192355.1, INSD:EL982805.1" /note="global transcription factor group A2 (GTA2); FUNCTIONS IN: transcription elongation regulator activity, structural constituent of ribosome, sequence-specific DNA binding transcription factor activity; INVOLVED IN: translation, regulation of transcription from RNA polymerase II promoter, positive regulation of RNA elongation from RNA polymerase II promoter; LOCATED IN: ribosome, intracellular; EXPRESSED IN: guard cell; CONTAINS InterPro DOMAIN/s: Translation protein SH3-like (InterPro:IPR008991), Transcription elongation factor Spt5 (InterPro:IPR017071), Transcription antitermination protein, NusG, N-terminal (InterPro:IPR006645), KOW (InterPro:IPR005824), Ribosomal protein L24/L26, conserved site (InterPro:IPR005825), Transcription elongation factor Spt5, NGN domain (InterPro:IPR005100); BEST Arabidopsis thaliana protein match is: Transcription elongation factor Spt5 (TAIR:AT2G34210.1); Has 14630 Blast hits to 9620 proteins in 607 species: Archae - 121; Bacteria - 647; Metazoa - 6069; Fungi - 2592; Plants - 1061; Viruses - 307; Other Eukaryotes - 3833 (source: NCBI BLink)." /db_xref="TAIR:AT4G08350" /db_xref="Araport:AT4G08350" intron_pos 112:1 (1/21) intron_pos 191:0 (2/21) intron_pos 240:0 (3/21) intron_pos 294:0 (4/21) intron_pos 322:0 (5/21) intron_pos 345:2 (6/21) intron_pos 465:0 (7/21) intron_pos 517:0 (8/21) intron_pos 549:2 (9/21) intron_pos 566:0 (10/21) intron_pos 617:0 (11/21) intron_pos 665:0 (12/21) intron_pos 699:1 (13/21) intron_pos 752:1 (14/21) intron_pos 767:2 (15/21) intron_pos 799:1 (16/21) intron_pos 825:2 (17/21) intron_pos 844:0 (18/21) intron_pos 883:1 (19/21) intron_pos 929:1 (20/21) intron_pos 961:0 (21/21) BEGIN 1 MPRSRDEDDE LDGDYEALDL EEEEEEDEEE EEERGRGGGG SRRKRGRSNF IDDYAEEDSQ 61 EEDDDDEDYG SSRGGKGAAS KRKKPSASIF LDREAHQVDD EDEEEEDEAE DDFIVDNGTD 121 LPDERGDRRY ERRFLPRDEN DEDVEDLERR IQERFSSRHH EEYDEEATEV EQQALLPSVR 181 DPKLWMVKCA IGREREVAVC LMQKFIDRGA DLQIRSVVAL DHLKNFIYVE ADKEAHVKEA 241 IKGMRNIYAN QKILLVPIRE MTDVLSVESK AIDLSRDTWV RMKIGTYKGD LAKVVDVDNV 301 RQRVTVKLIP RIDLQALASK LDGREVSKKK AFVPPPRFMN IDEARELHIR VERRRDHMTG 361 DYFENIGGML FKDGFHYKQV SLKSITVQNV TPTFDELEKF NKPSENGEGD FGGLSTLFAN 421 RKKGHFMKGD AVIVIKGDLK NLKGWVEKVD EENVLIRSEV KGLPDPLAVN ERELCKYFEP 481 GNHVKVVSGT HEGATGMVVK VDQHVLIILS DTTKEHVRVF ADHVVESSEV TTGVTKIGDY 541 ELHDLVLLDN LSFGVIIRLE NEAFQVLKGV PDRPEVALVK LREIKCKLEK KINVQDRYKN 601 VIAVKDDVRV IEGPSKGKQG PVKHIYKGVL FIYDRHHLEH AGFICAKCTS CIVVGGSRSG 661 ANRNGGDSLS RYGNFKAPAP VPSSPGRFQR GRGGGYNNSG GRHGGGRGRG DDSLLGTTVK 721 IRLGPFKGYR GPVVEVKGNS VRVELEMKIV TVDRGAISDN VATTPFRDTS RYSMGSETPM 781 HPSRTPLHPY MTPMRDSGAT PIHDGMRTPM RDRAWNPYTP MSPPRDNWED GNPGSWGTSP 841 QYQPGSPPSR AYEAPTPGSG WASTPGGSYS DAGTPRDHGS AYANAPSPYL PSTPGQPMTP 901 SSASYLPGTP GGQPMTPGTG LDVMSPVIGG DAEAWFMPDI LVDIHKAGED TDVGVIRDVS 961 DGTCKVSLGS SGEGDTIMAL PSELEIIPPR KSDRVKIVGG QYRGSTGKLI GIDGSDGIVK 1021 IDDNLDVKIL DLALLAKFVQ P //