LOCUS       AEE86929.1              1465 aa    PRT              PLN 23-MAR-2023
DEFINITION  Arabidopsis thaliana RPAP1-like, carboxy-terminal protein protein.
ACCESSION   CP002687-7122
PROTEIN_ID  AEE86929.1
SOURCE      Arabidopsis thaliana (thale cress)
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
            Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
            Camelineae; Arabidopsis.
REFERENCE   1  (bases 1 to 18585056)
  AUTHORS   Mayer,K., Schuller,C., Wambutt,R., Murphy,G., Volckaert,G.,
            Pohl,T., Dusterhoft,A., Stiekema,W., Entian,K.D., Terryn,N.,
            Harris,B., Ansorge,W., Brandt,P., Grivell,L., Rieger,M.,
            Weichselgartner,M., de Simone,V., Obermaier,B., Mache,R.,
            Muller,M., Kreis,M., Delseny,M., Puigdomenech,P., Watson,M.,
            Schmidtheini,T., Reichert,B., Portatelle,D., Perez-Alonso,M.,
            Boutry,M., Bancroft,I., Vos,P., Hoheisel,J., Zimmermann,W.,
            Wedler,H., Ridley,P., Langham,S.A., McCullagh,B., Bilham,L.,
            Robben,J., Van der Schueren,J., Grymonprez,B., Chuang,Y.J.,
            Vandenbussche,F., Braeken,M., Weltjens,I., Voet,M., Bastiaens,I.,
            Aert,R., Defoor,E., Weitzenegger,T., Bothe,G., Ramsperger,U.,
            Hilbert,H., Braun,M., Holzer,E., Brandt,A., Peters,S., van
            Staveren,M., Dirske,W., Mooijman,P., Klein Lankhorst,R., Rose,M.,
            Hauf,J., Kotter,P., Berneiser,S., Hempel,S., Feldpausch,M.,
            Lamberth,S., Van den Daele,H., De Keyser,A., Buysshaert,C.,
            Gielen,J., Villarroel,R., De Clercq,R., Van Montagu,M., Rogers,J.,
            Cronin,A., Quail,M., Bray-Allen,S., Clark,L., Doggett,J., Hall,S.,
            Kay,M., Lennard,N., McLay,K., Mayes,R., Pettett,A.,
            Rajandream,M.A., Lyne,M., Benes,V., Rechmann,S., Borkova,D.,
            Blocker,H., Scharfe,M., Grimm,M., Lohnert,T.H., Dose,S., de
            Haan,M., Maarse,A., Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M.,
            Fartmann,B., Granderath,K., Dauner,D., Herzl,A., Neumann,S.,
            Argiriou,A., Vitale,D., Liguori,R., Piravandi,E., Massenet,O.,
            Quigley,F., Clabauld,G., Mundlein,A., Felber,R., Schnabl,S.,
            Hiller,R., Schmidt,W., Lecharny,A., Aubourg,S., Chefdor,F.,
            Cooke,R., Berger,C., Montfort,A., Casacuberta,E., Gibbons,T.,
            Weber,N., Vandenbol,M., Bargues,M., Terol,J., Torres,A.,
            Perez-Perez,A., Purnelle,B., Bent,E., Johnson,S., Tacon,D.,
            Jesse,T., Heijnen,L., Schwarz,S., Scholler,P., Heber,S., Francs,P.,
            Bielke,C., Frishman,D., Haase,D., Lemcke,K., Mewes,H.W.,
            Stocker,S., Zaccaria,P., Bevan,M., Wilson,R.K., de la Bastide,M.,
            Habermann,K., Parnell,L., Dedhia,N., Gnoj,L., Schutz,K., Huang,E.,
            Spiegel,L., Sehkon,M., Murray,J., Sheet,P., Cordes,M.,
            Abu-Threideh,J., Stoneking,T., Kalicki,J., Graves,T., Harmon,G.,
            Edwards,J., Latreille,P., Courtney,L., Cloud,J., Abbott,A.,
            Scott,K., Johnson,D., Minx,P., Bentley,D., Fulton,B., Miller,N.,
            Greco,T., Kemp,K., Kramer,J., Fulton,L., Mardis,E., Dante,M.,
            Pepin,K., Hillier,L., Nelson,J., Spieth,J., Ryan,E., Andrews,S.,
            Geisel,C., Layman,D., Du,H., Ali,J., Berghoff,A., Jones,K.,
            Drone,K., Cotton,M., Joshu,C., Antonoiu,B., Zidanic,M., Strong,C.,
            Sun,H., Lamar,B., Yordan,C., Ma,P., Zhong,J., Preston,R., Vil,D.,
            Shekher,M., Matero,A., Shah,R., Swaby,I.K., O'Shaughnessy,A.,
            Rodriguez,M., Hoffmann,J., Till,S., Granat,S., Shohdy,N.,
            Hasegawa,A., Hameed,A., Lodhi,M., Johnson,A., Chen,E., Marra,M.,
            Martienssen,R. and McCombie,W.R.
  TITLE     Sequence and analysis of chromosome 4 of the plant Arabidopsis
            thaliana
  JOURNAL   Nature 402 (6763), 769-777 (1999)
   PUBMED   10617198
REFERENCE   2  (bases 1 to 18585056)
  AUTHORS   Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
  CONSRTM   TAIR
  TITLE     Direct Submission
  JOURNAL   Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
            Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE   3  (bases 1 to 18585056)
  AUTHORS   Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
            Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
            Vaughn,M. and Town,C.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
            9704 Medical Center Dr, Rockville, MD 20850, USA
  REMARK    Protein update by submitter
FEATURES             Qualifiers
     source          /organism="Arabidopsis thaliana"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:3702"
                     /chromosome="4"
                     /ecotype="Columbia"
     protein         /gene="IYO"
                     /locus_tag="AT4G38440"
                     /gene_synonym="F22I13.210"
                     /gene_synonym="F22I13_210"
                     /gene_synonym="MINIYO"
                     /inference="Similar to RNA sequence,
                     EST:INSD:EL330261.1,INSD:AU238236.1,INSD:BP601548.1,
                     INSD:BP806817.1,INSD:AI996603.1,INSD:EH993045.1,
                     INSD:EH809052.1,INSD:EG524997.1,INSD:AV523122.1,
                     INSD:EH889968.1,INSD:EG508266.1,INSD:EL116570.1,
                     INSD:ES160263.1,INSD:EG523092.1,INSD:EG508263.1,
                     INSD:AA605418.1,INSD:R90473.1,INSD:EG523079.1,
                     INSD:EG524993.1,INSD:AU229383.1"
                     /inference="Similar to RNA sequence,
                     mRNA:INSD:BT005439.1,INSD:AK117387.1"
                     /note="LOCATED IN: chloroplast; EXPRESSED IN: 21 plant
                     structures; EXPRESSED DURING: 12 growth stages; CONTAINS
                     InterPro DOMAIN/s: RNA polymerase II-associated protein 1,
                     C-terminal (InterPro:IPR013929), RNA polymerase
                     II-associated protein 1, N-terminal (InterPro:IPR013930);
                     Has 276 Blast hits to 220 proteins in 102 species: Archae
                     - 0; Bacteria - 2; Metazoa - 151; Fungi - 65; Plants - 41;
                     Viruses - 0; Other Eukaryotes - 17 (source: NCBI BLink)."
                     /db_xref="TAIR:AT4G38440"
                     /db_xref="Araport:AT4G38440"
     intron_pos      59:0 (1/9)
     intron_pos      339:1 (2/9)
     intron_pos      378:0 (3/9)
     intron_pos      437:2 (4/9)
     intron_pos      473:0 (5/9)
     intron_pos      560:0 (6/9)
     intron_pos      623:0 (7/9)
     intron_pos      1337:0 (8/9)
     intron_pos      1407:0 (9/9)
BEGIN
        1 MEQSSGRVNP EQPNNVLASL VGSIVEKGIS ENKPPSKPLP PRPSLLSFPV ARHRSHGPHL
       61 APVGSSIAQP KDYNDDQEEE EAEERFMNAD SIAAFAKPLQ RKEKKDMDLG RWKDMVSGDD
      121 PASTHVPQQS RKLKIIETRP PYVASADAAT TSSNTLLAAR ASDQREFVSD KAPFIKNLGT
      181 KERVPLNASP PLAVSNGLGT RHASSSLESD IDVENHAKLQ TMSPDEIAEA QAELLDKMDP
      241 ALLSILKKRG EAKLKKRKHS VQGVSITDET AKNSRTEGHF VTPKVMAIPK EKSVVQKPGI
      301 AQGFVWDAWT ERVEAARDLR FSFDGNVVEE DVVSPAETGG KWSGVESAAE RDFLRTEGDP
      361 GAAGYTIKEA IALARSVIPG QRCLALHLLA SVLDKALNKL CQSRIGYARE EKDKSTDWEA
      421 IWAYALGPEP ELVLALRMAL DDNHASVVIA CVKVIQCLLS CSLNENFFNI LENMGPHGKD
      481 IFTASVFRSK PEIDLGFLRG CYWKYSAKPS NIVAFREEIL DDGTEDTDTI QKDVFVAGQD
      541 VAAGLVRMDI LPRIYHLLET EPTAALEDSI ISVTIAIARH SPKCTTAILK YPKFVQTIVK
      601 RFQLNKRMDV LSSQINSVRL LKVLARYDQS TCMEFVKNGT FNAVTWHLFQ FTSSLDSWVK
      661 LGKQNCKLSS TLMVEQLRFW KVCIHSGCCV SRFPELFPAL CLWLSCPSFE KLREKNLISE
      721 FTSVSNEAYL VLEAFAETLP NMYSQNIPRN ESGTWDWSYV SPMIDSALSW ITLAPQLLKW
      781 EKGIESVSVS TTTLLWLYSG VMRTISKVLE KISAEGEEEP LPWLPEFVPK IGLAIIKHKL
      841 LSFSVADVSR FGKDSSRCSS FMEYLCFLRE RSQDDELALA SVNCLHGLTR TIVSIQNLIE
      901 SARSKMKAPH QVSISTGDES VLANGILAES LAELTSVSCS FRDSVSSEWP IVQSIELHKR
      961 GGLAPGVGLG WGASGGGFWS TRVLLAQAGA GLLSLFLNIS LSDSQNDQGS VGFMDKVNSA
     1021 LAMCLIAGPR DYLLVERAFE YVLRPHALEH LACCIKSNKK NISFEWECSE GDYHRMSSML
     1081 ASHFRHRWLQ QKGRSIAEEG VSGVRKGTVG LETIHEDGEM SNSSTQDKKS DSSTIEWAHQ
     1141 RMPLPPHWFL SAISAVHSGK TSTGPPESTE LLEVAKAGVF FLAGLESSSG FGSLPSPVVS
     1201 VPLVWKFHAL STVLLVGMDI IEDKNTRNLY NYLQELYGQF LDEARLNHRD TELLRFKSDI
     1261 HENYSTFLEM VVEQYAAVSY GDVVYGRQVS VYLHQCVEHS VRLSAWTVLS NARVLELLPS
     1321 LDKCLGEADG YLEPVEENEA VLEAYLKSWT CGALDRAATR GSVAYTLVVH HFSSLVFCNQ
     1381 AKDKVSLRNK IVKTLVRDLS RKRHREGMML DLLRYKKGSA NAMEEEVIAA ETEKRMEVLK
     1441 EGCEGNSTLL LELEKLKSAA LCGRR
//