LOCUS       AEE75130.1              1329 aa    PRT              PLN 23-MAR-2023
DEFINITION  Arabidopsis thaliana Cleavage and polyadenylation specificity
            factor (CPSF) A subunit protein protein.
ACCESSION   CP002686-2136
PROTEIN_ID  AEE75130.1
SOURCE      Arabidopsis thaliana (thale cress)
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
            Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
            Camelineae; Arabidopsis.
REFERENCE   1  (bases 1 to 23459830)
  AUTHORS   Salanoubat,M., Lemcke,K., Rieger,M., Ansorge,W., Unseld,M.,
            Fartmann,B., Valle,G., Blocker,H., Perez-Alonso,M., Obermaier,B.,
            Delseny,M., Boutry,M., Grivell,L.A., Mache,R., Puigdomenech,P., De
            Simone,V., Choisne,N., Artiguenave,F., Robert,C., Brottier,P.,
            Wincker,P., Cattolico,L., Weissenbach,J., Saurin,W., Quetier,F.,
            Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M., Benes,V.,
            Wurmbach,E., Drzonek,H., Erfle,H., Jordan,N., Bangert,S.,
            Wiedelmann,R., Kranz,H., Voss,H., Holland,R., Brandt,P.,
            Nyakatura,G., Vezzi,A., D'Angelo,M., Pallavicini,A., Toppo,S.,
            Simionati,B., Conrad,A., Hornischer,K., Kauer,G., Lohnert,T.H.,
            Nordsiek,G., Reichelt,J., Scharfe,M., Schon,O., Bargues,M.,
            Terol,J., Climent,J., Navarro,P., Collado,C., Perez-Perez,A.,
            Ottenwalder,B., Duchemin,D., Cooke,R., Laudie,M., Berger-Llauro,C.,
            Purnelle,B., Masuy,D., de Haan,M., Maarse,A.C., Alcaraz,J.P.,
            Cottet,A., Casacuberta,E., Monfort,A., Argiriou,A., flores,M.,
            Liguori,R., Vitale,D., Mannhaupt,G., Haase,D., Schoof,H., Rudd,S.,
            Zaccaria,P., Mewes,H.W., Mayer,K.F., Kaul,S., Town,C.D., Koo,H.L.,
            Tallon,L.J., Jenkins,J., Rooney,T., Rizzo,M., Walts,A.,
            Utterback,T., Fujii,C.Y., Shea,T.P., Creasy,T.H., Haas,B.,
            Maiti,R., Wu,D., Peterson,J., Van Aken,S., Pai,G., Militscher,J.,
            Sellers,P., Gill,J.E., Feldblyum,T.V., Preuss,D., Lin,X.,
            Nierman,W.C., Salzberg,S.L., White,O., Venter,J.C., Fraser,C.M.,
            Kaneko,T., Nakamura,Y., Sato,S., Kato,T., Asamizu,E., Sasamoto,S.,
            Kimura,T., Idesawa,K., Kawashima,K., Kishida,Y., Kiyokawa,C.,
            Kohara,M., Matsumoto,M., Matsuno,A., Muraki,A., Nakayama,S.,
            Nakazaki,N., Shinpo,S., Takeuchi,C., Wada,T., Watanabe,A.,
            Yamada,M., Yasuda,M. and Tabata,S.
  CONSRTM   European Union Chromosome 3 Arabidopsis Sequencing Consortium;
            Institute for Genomic Research; Kazusa DNA Research Institute
  TITLE     Sequence and analysis of chromosome 3 of the plant Arabidopsis
            thaliana
  JOURNAL   Nature 408 (6814), 820-822 (2000)
   PUBMED   11130713
REFERENCE   2  (bases 1 to 23459830)
  AUTHORS   Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
  CONSRTM   TAIR
  TITLE     Direct Submission
  JOURNAL   Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
            Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE   3  (bases 1 to 23459830)
  AUTHORS   Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
            Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
            Vaughn,M. and Town,C.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
            9704 Medical Center Dr, Rockville, MD 20850, USA
  REMARK    Protein update by submitter
FEATURES             Qualifiers
     source          /organism="Arabidopsis thaliana"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:3702"
                     /chromosome="3"
                     /ecotype="Columbia"
     protein         /locus_tag="AT3G11960"
                     /inference="Similar to RNA sequence,
                     EST:INSD:EH839868.1,INSD:EL202745.1,INSD:BG459493.1,
                     INSD:AV550329.1,INSD:ES124065.1,INSD:EL125903.1,
                     INSD:EH929984.1,INSD:AV547569.1,INSD:AV523965.1,
                     INSD:AV529912.1,INSD:AV538174.1,INSD:AV546518.1,
                     INSD:BP839374.1,INSD:BP828082.1,INSD:AV543012.1,
                     INSD:AV529749.1,INSD:EG515693.1,INSD:EL059289.1,
                     INSD:ES202817.1,INSD:EG515692.1,INSD:EL291402.1,
                     INSD:EH872364.1"
                     /note="Cleavage and polyadenylation specificity factor
                     (CPSF) A subunit protein; FUNCTIONS IN: nucleic acid
                     binding; INVOLVED IN: biological_process unknown; LOCATED
                     IN: nucleus, chloroplast; EXPRESSED IN: 23 plant
                     structures; EXPRESSED DURING: 13 growth stages; CONTAINS
                     InterPro DOMAIN/s: Cleavage/polyadenylation specificity
                     factor, A subunit, C-terminal (InterPro:IPR004871); BEST
                     Arabidopsis thaliana protein match is: damaged DNA binding
                     protein 1A (TAIR:AT4G05420.2); Has 954 Blast hits to 777
                     proteins in 185 species: Archae - 0; Bacteria - 0; Metazoa
                     - 323; Fungi - 233; Plants - 246; Viruses - 0; Other
                     Eukaryotes - 152 (source: NCBI BLink)."
                     /db_xref="TAIR:AT3G11960"
                     /db_xref="Araport:AT3G11960"
     intron_pos      68:0 (1/12)
     intron_pos      113:0 (2/12)
     intron_pos      138:2 (3/12)
     intron_pos      185:2 (4/12)
     intron_pos      835:0 (5/12)
     intron_pos      937:2 (6/12)
     intron_pos      1043:0 (7/12)
     intron_pos      1095:0 (8/12)
     intron_pos      1148:1 (9/12)
     intron_pos      1182:0 (10/12)
     intron_pos      1227:2 (11/12)
     intron_pos      1264:0 (12/12)
BEGIN
        1 MAAPEDESSA QSQSSPATAA PTPPPSSSPS SAGDHYLAKC ILRPSVVLQV AYGYFRSPSS
       61 RDIVFGKETC IELVVIGEDG IVESVCEQYV FGTIKDLAVI PQSSKLYSNS LQMGKDLLAV
      121 LSDSGKLSFL SFSNEMHRIS YPSEDGGNGS SIQAISGTIW SMCFISKDFN ESKEYAPILA
      181 IVINRKGSLM NELALFRWNV KEESICLISE YVETGALAHS IVEVPHSSGF AFLFRIGDVL
      241 LMDLRDPQNP CCLFRTSLDF VPASLMEEHF VEESCRVQDG DDEGCNVVVC ALLELRDHEV
      301 RDHDPMFIDT ESDIGKLSSK NVSSWTWEPE NNHNPRMIIC LDNGDFFMFE LIYEDDGVKV
      361 NLSECLYKGL PCKDILWIEG GFLATFAEMA DGTVFKLGTE KLHWMSSIQN IAPILDFSVM
      421 DDQNEKRDQI FACCGVTPEG SLRIIRSGIN VEKLLKTAPV YQGITGTWTV KMKLTDVYHS
      481 FLVLSFVEET RVLSVGLSFK DVTDSVGFQS DVCTFACGLV ADGLLVQIHQ DAIRLCMPTM
      541 DAHSDGIPVS SPFFSSWFPE NVSISLGAVG QNLIVVSTSN PCFLSILGVK SVSSQCCEIY
      601 EIQRVTLQYE VSCISVPQKH IGKKRSRDSS PDNFCKAAIP SAMEQGYTFL IGTHKPSVEV
      661 LSFTEDGVGV RVLASGLVSL TNTMGTVISG CIPQDVRLVL VDQLYVLSGL RNGMLLRFEW
      721 APFSNSSGLN CPDYFSHCKE EMDTVVGKKD NLPVNLLLIA TRRIGITPVF LVPFSDSLDS
      781 DIIALSDRPW LLQTARQSLS YTSISFQPST HATPVCSFEC PQGILFVSEN CLHLVEMVHS
      841 KRRNAQKFQL GGTPRKVIYH SESKLLIVMR TDLYDTCTSD ICCVDPLSGS VLSSYKLKPG
      901 ETGKSMELVR VGNEHVLVVG TSLSSGPAIL PSGEAESTKG RVIILCLEHT QNSDSGSMTI
      961 CSKACSSSQR TSPFHDVVGY TTENLSSSSL CSSPDDYSYD GIKLDEAETW QLRLASSTTW
     1021 PGMVLAICPY LDHYFLASAG NAFYVCGFPN DSPERMKRFA VGRTRFMITS LRTYFTRIVV
     1081 GDCRDGVLFY SYHEESKKLH QIYCDPAQRL VADCFLMDAN SVAVSDRKGS IAILSCKDHS
     1141 DFGMKHLVKI PHDNPEYSSP ESNLNLNCAY YMGEIAMSIK KGCNIYKLPA DDVLRSYGLS
     1201 KSIDTADDTI IAGTLLGSIF VFAPISSEEY ELLEGVQAKL GIHPLTAPVL GNDHNEFRGR
     1261 ENPSQARKIL DGDMLAQFLE LTNRQQESVL STPQPSPSTS KASSKQRSFP PLMLHQVVQL
     1321 LERVHYALH
//