LOCUS AEE86929.1 1465 aa PRT PLN 23-MAR-2023 DEFINITION Arabidopsis thaliana RPAP1-like, carboxy-terminal protein protein. ACCESSION CP002687-7122 PROTEIN_ID AEE86929.1 SOURCE Arabidopsis thaliana (thale cress) ORGANISM Arabidopsis thaliana Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis. REFERENCE 1 (bases 1 to 18585056) AUTHORS Mayer,K., Schuller,C., Wambutt,R., Murphy,G., Volckaert,G., Pohl,T., Dusterhoft,A., Stiekema,W., Entian,K.D., Terryn,N., Harris,B., Ansorge,W., Brandt,P., Grivell,L., Rieger,M., Weichselgartner,M., de Simone,V., Obermaier,B., Mache,R., Muller,M., Kreis,M., Delseny,M., Puigdomenech,P., Watson,M., Schmidtheini,T., Reichert,B., Portatelle,D., Perez-Alonso,M., Boutry,M., Bancroft,I., Vos,P., Hoheisel,J., Zimmermann,W., Wedler,H., Ridley,P., Langham,S.A., McCullagh,B., Bilham,L., Robben,J., Van der Schueren,J., Grymonprez,B., Chuang,Y.J., Vandenbussche,F., Braeken,M., Weltjens,I., Voet,M., Bastiaens,I., Aert,R., Defoor,E., Weitzenegger,T., Bothe,G., Ramsperger,U., Hilbert,H., Braun,M., Holzer,E., Brandt,A., Peters,S., van Staveren,M., Dirske,W., Mooijman,P., Klein Lankhorst,R., Rose,M., Hauf,J., Kotter,P., Berneiser,S., Hempel,S., Feldpausch,M., Lamberth,S., Van den Daele,H., De Keyser,A., Buysshaert,C., Gielen,J., Villarroel,R., De Clercq,R., Van Montagu,M., Rogers,J., Cronin,A., Quail,M., Bray-Allen,S., Clark,L., Doggett,J., Hall,S., Kay,M., Lennard,N., McLay,K., Mayes,R., Pettett,A., Rajandream,M.A., Lyne,M., Benes,V., Rechmann,S., Borkova,D., Blocker,H., Scharfe,M., Grimm,M., Lohnert,T.H., Dose,S., de Haan,M., Maarse,A., Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M., Fartmann,B., Granderath,K., Dauner,D., Herzl,A., Neumann,S., Argiriou,A., Vitale,D., Liguori,R., Piravandi,E., Massenet,O., Quigley,F., Clabauld,G., Mundlein,A., Felber,R., Schnabl,S., Hiller,R., Schmidt,W., Lecharny,A., Aubourg,S., Chefdor,F., Cooke,R., Berger,C., Montfort,A., Casacuberta,E., Gibbons,T., Weber,N., Vandenbol,M., Bargues,M., Terol,J., Torres,A., Perez-Perez,A., Purnelle,B., Bent,E., Johnson,S., Tacon,D., Jesse,T., Heijnen,L., Schwarz,S., Scholler,P., Heber,S., Francs,P., Bielke,C., Frishman,D., Haase,D., Lemcke,K., Mewes,H.W., Stocker,S., Zaccaria,P., Bevan,M., Wilson,R.K., de la Bastide,M., Habermann,K., Parnell,L., Dedhia,N., Gnoj,L., Schutz,K., Huang,E., Spiegel,L., Sehkon,M., Murray,J., Sheet,P., Cordes,M., Abu-Threideh,J., Stoneking,T., Kalicki,J., Graves,T., Harmon,G., Edwards,J., Latreille,P., Courtney,L., Cloud,J., Abbott,A., Scott,K., Johnson,D., Minx,P., Bentley,D., Fulton,B., Miller,N., Greco,T., Kemp,K., Kramer,J., Fulton,L., Mardis,E., Dante,M., Pepin,K., Hillier,L., Nelson,J., Spieth,J., Ryan,E., Andrews,S., Geisel,C., Layman,D., Du,H., Ali,J., Berghoff,A., Jones,K., Drone,K., Cotton,M., Joshu,C., Antonoiu,B., Zidanic,M., Strong,C., Sun,H., Lamar,B., Yordan,C., Ma,P., Zhong,J., Preston,R., Vil,D., Shekher,M., Matero,A., Shah,R., Swaby,I.K., O'Shaughnessy,A., Rodriguez,M., Hoffmann,J., Till,S., Granat,S., Shohdy,N., Hasegawa,A., Hameed,A., Lodhi,M., Johnson,A., Chen,E., Marra,M., Martienssen,R. and McCombie,W.R. TITLE Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana JOURNAL Nature 402 (6763), 769-777 (1999) PUBMED 10617198 REFERENCE 2 (bases 1 to 18585056) AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E. CONSRTM TAIR TITLE Direct Submission JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie Institution, 260 Panama Street, Stanford, CA, USA REFERENCE 3 (bases 1 to 18585056) AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M., Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R., Vaughn,M. and Town,C.D. TITLE Direct Submission JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute, 9704 Medical Center Dr, Rockville, MD 20850, USA REMARK Protein update by submitter FEATURES Qualifiers source /organism="Arabidopsis thaliana" /mol_type="genomic DNA" /db_xref="taxon:3702" /chromosome="4" /ecotype="Columbia" protein /gene="IYO" /locus_tag="AT4G38440" /gene_synonym="F22I13.210" /gene_synonym="F22I13_210" /gene_synonym="MINIYO" /inference="Similar to RNA sequence, EST:INSD:EL330261.1,INSD:AU238236.1,INSD:BP601548.1, INSD:BP806817.1,INSD:AI996603.1,INSD:EH993045.1, INSD:EH809052.1,INSD:EG524997.1,INSD:AV523122.1, INSD:EH889968.1,INSD:EG508266.1,INSD:EL116570.1, INSD:ES160263.1,INSD:EG523092.1,INSD:EG508263.1, INSD:AA605418.1,INSD:R90473.1,INSD:EG523079.1, INSD:EG524993.1,INSD:AU229383.1" /inference="Similar to RNA sequence, mRNA:INSD:BT005439.1,INSD:AK117387.1" /note="LOCATED IN: chloroplast; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 12 growth stages; CONTAINS InterPro DOMAIN/s: RNA polymerase II-associated protein 1, C-terminal (InterPro:IPR013929), RNA polymerase II-associated protein 1, N-terminal (InterPro:IPR013930); Has 276 Blast hits to 220 proteins in 102 species: Archae - 0; Bacteria - 2; Metazoa - 151; Fungi - 65; Plants - 41; Viruses - 0; Other Eukaryotes - 17 (source: NCBI BLink)." /db_xref="TAIR:AT4G38440" /db_xref="Araport:AT4G38440" intron_pos 59:0 (1/9) intron_pos 339:1 (2/9) intron_pos 378:0 (3/9) intron_pos 437:2 (4/9) intron_pos 473:0 (5/9) intron_pos 560:0 (6/9) intron_pos 623:0 (7/9) intron_pos 1337:0 (8/9) intron_pos 1407:0 (9/9) BEGIN 1 MEQSSGRVNP EQPNNVLASL VGSIVEKGIS ENKPPSKPLP PRPSLLSFPV ARHRSHGPHL 61 APVGSSIAQP KDYNDDQEEE EAEERFMNAD SIAAFAKPLQ RKEKKDMDLG RWKDMVSGDD 121 PASTHVPQQS RKLKIIETRP PYVASADAAT TSSNTLLAAR ASDQREFVSD KAPFIKNLGT 181 KERVPLNASP PLAVSNGLGT RHASSSLESD IDVENHAKLQ TMSPDEIAEA QAELLDKMDP 241 ALLSILKKRG EAKLKKRKHS VQGVSITDET AKNSRTEGHF VTPKVMAIPK EKSVVQKPGI 301 AQGFVWDAWT ERVEAARDLR FSFDGNVVEE DVVSPAETGG KWSGVESAAE RDFLRTEGDP 361 GAAGYTIKEA IALARSVIPG QRCLALHLLA SVLDKALNKL CQSRIGYARE EKDKSTDWEA 421 IWAYALGPEP ELVLALRMAL DDNHASVVIA CVKVIQCLLS CSLNENFFNI LENMGPHGKD 481 IFTASVFRSK PEIDLGFLRG CYWKYSAKPS NIVAFREEIL DDGTEDTDTI QKDVFVAGQD 541 VAAGLVRMDI LPRIYHLLET EPTAALEDSI ISVTIAIARH SPKCTTAILK YPKFVQTIVK 601 RFQLNKRMDV LSSQINSVRL LKVLARYDQS TCMEFVKNGT FNAVTWHLFQ FTSSLDSWVK 661 LGKQNCKLSS TLMVEQLRFW KVCIHSGCCV SRFPELFPAL CLWLSCPSFE KLREKNLISE 721 FTSVSNEAYL VLEAFAETLP NMYSQNIPRN ESGTWDWSYV SPMIDSALSW ITLAPQLLKW 781 EKGIESVSVS TTTLLWLYSG VMRTISKVLE KISAEGEEEP LPWLPEFVPK IGLAIIKHKL 841 LSFSVADVSR FGKDSSRCSS FMEYLCFLRE RSQDDELALA SVNCLHGLTR TIVSIQNLIE 901 SARSKMKAPH QVSISTGDES VLANGILAES LAELTSVSCS FRDSVSSEWP IVQSIELHKR 961 GGLAPGVGLG WGASGGGFWS TRVLLAQAGA GLLSLFLNIS LSDSQNDQGS VGFMDKVNSA 1021 LAMCLIAGPR DYLLVERAFE YVLRPHALEH LACCIKSNKK NISFEWECSE GDYHRMSSML 1081 ASHFRHRWLQ QKGRSIAEEG VSGVRKGTVG LETIHEDGEM SNSSTQDKKS DSSTIEWAHQ 1141 RMPLPPHWFL SAISAVHSGK TSTGPPESTE LLEVAKAGVF FLAGLESSSG FGSLPSPVVS 1201 VPLVWKFHAL STVLLVGMDI IEDKNTRNLY NYLQELYGQF LDEARLNHRD TELLRFKSDI 1261 HENYSTFLEM VVEQYAAVSY GDVVYGRQVS VYLHQCVEHS VRLSAWTVLS NARVLELLPS 1321 LDKCLGEADG YLEPVEENEA VLEAYLKSWT CGALDRAATR GSVAYTLVVH HFSSLVFCNQ 1381 AKDKVSLRNK IVKTLVRDLS RKRHREGMML DLLRYKKGSA NAMEEEVIAA ETEKRMEVLK 1441 EGCEGNSTLL LELEKLKSAA LCGRR //