LOCUS AEE75130.1 1329 aa PRT PLN 23-MAR-2023
DEFINITION Arabidopsis thaliana Cleavage and polyadenylation specificity
factor (CPSF) A subunit protein protein.
ACCESSION CP002686-2136
PROTEIN_ID AEE75130.1
SOURCE Arabidopsis thaliana (thale cress)
ORGANISM Arabidopsis thaliana
Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
Camelineae; Arabidopsis.
REFERENCE 1 (bases 1 to 23459830)
AUTHORS Salanoubat,M., Lemcke,K., Rieger,M., Ansorge,W., Unseld,M.,
Fartmann,B., Valle,G., Blocker,H., Perez-Alonso,M., Obermaier,B.,
Delseny,M., Boutry,M., Grivell,L.A., Mache,R., Puigdomenech,P., De
Simone,V., Choisne,N., Artiguenave,F., Robert,C., Brottier,P.,
Wincker,P., Cattolico,L., Weissenbach,J., Saurin,W., Quetier,F.,
Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M., Benes,V.,
Wurmbach,E., Drzonek,H., Erfle,H., Jordan,N., Bangert,S.,
Wiedelmann,R., Kranz,H., Voss,H., Holland,R., Brandt,P.,
Nyakatura,G., Vezzi,A., D'Angelo,M., Pallavicini,A., Toppo,S.,
Simionati,B., Conrad,A., Hornischer,K., Kauer,G., Lohnert,T.H.,
Nordsiek,G., Reichelt,J., Scharfe,M., Schon,O., Bargues,M.,
Terol,J., Climent,J., Navarro,P., Collado,C., Perez-Perez,A.,
Ottenwalder,B., Duchemin,D., Cooke,R., Laudie,M., Berger-Llauro,C.,
Purnelle,B., Masuy,D., de Haan,M., Maarse,A.C., Alcaraz,J.P.,
Cottet,A., Casacuberta,E., Monfort,A., Argiriou,A., flores,M.,
Liguori,R., Vitale,D., Mannhaupt,G., Haase,D., Schoof,H., Rudd,S.,
Zaccaria,P., Mewes,H.W., Mayer,K.F., Kaul,S., Town,C.D., Koo,H.L.,
Tallon,L.J., Jenkins,J., Rooney,T., Rizzo,M., Walts,A.,
Utterback,T., Fujii,C.Y., Shea,T.P., Creasy,T.H., Haas,B.,
Maiti,R., Wu,D., Peterson,J., Van Aken,S., Pai,G., Militscher,J.,
Sellers,P., Gill,J.E., Feldblyum,T.V., Preuss,D., Lin,X.,
Nierman,W.C., Salzberg,S.L., White,O., Venter,J.C., Fraser,C.M.,
Kaneko,T., Nakamura,Y., Sato,S., Kato,T., Asamizu,E., Sasamoto,S.,
Kimura,T., Idesawa,K., Kawashima,K., Kishida,Y., Kiyokawa,C.,
Kohara,M., Matsumoto,M., Matsuno,A., Muraki,A., Nakayama,S.,
Nakazaki,N., Shinpo,S., Takeuchi,C., Wada,T., Watanabe,A.,
Yamada,M., Yasuda,M. and Tabata,S.
CONSRTM European Union Chromosome 3 Arabidopsis Sequencing Consortium;
Institute for Genomic Research; Kazusa DNA Research Institute
TITLE Sequence and analysis of chromosome 3 of the plant Arabidopsis
thaliana
JOURNAL Nature 408 (6814), 820-822 (2000)
PUBMED 11130713
REFERENCE 2 (bases 1 to 23459830)
AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
CONSRTM TAIR
TITLE Direct Submission
JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE 3 (bases 1 to 23459830)
AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
Vaughn,M. and Town,C.D.
TITLE Direct Submission
JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
9704 Medical Center Dr, Rockville, MD 20850, USA
REMARK Protein update by submitter
FEATURES Qualifiers
source /organism="Arabidopsis thaliana"
/mol_type="genomic DNA"
/db_xref="taxon:3702"
/chromosome="3"
/ecotype="Columbia"
protein /locus_tag="AT3G11960"
/inference="Similar to RNA sequence,
EST:INSD:EH839868.1,INSD:EL202745.1,INSD:BG459493.1,
INSD:AV550329.1,INSD:ES124065.1,INSD:EL125903.1,
INSD:EH929984.1,INSD:AV547569.1,INSD:AV523965.1,
INSD:AV529912.1,INSD:AV538174.1,INSD:AV546518.1,
INSD:BP839374.1,INSD:BP828082.1,INSD:AV543012.1,
INSD:AV529749.1,INSD:EG515693.1,INSD:EL059289.1,
INSD:ES202817.1,INSD:EG515692.1,INSD:EL291402.1,
INSD:EH872364.1"
/note="Cleavage and polyadenylation specificity factor
(CPSF) A subunit protein; FUNCTIONS IN: nucleic acid
binding; INVOLVED IN: biological_process unknown; LOCATED
IN: nucleus, chloroplast; EXPRESSED IN: 23 plant
structures; EXPRESSED DURING: 13 growth stages; CONTAINS
InterPro DOMAIN/s: Cleavage/polyadenylation specificity
factor, A subunit, C-terminal (InterPro:IPR004871); BEST
Arabidopsis thaliana protein match is: damaged DNA binding
protein 1A (TAIR:AT4G05420.2); Has 954 Blast hits to 777
proteins in 185 species: Archae - 0; Bacteria - 0; Metazoa
- 323; Fungi - 233; Plants - 246; Viruses - 0; Other
Eukaryotes - 152 (source: NCBI BLink)."
/db_xref="TAIR:AT3G11960"
/db_xref="Araport:AT3G11960"
intron_pos 68:0 (1/12)
intron_pos 113:0 (2/12)
intron_pos 138:2 (3/12)
intron_pos 185:2 (4/12)
intron_pos 835:0 (5/12)
intron_pos 937:2 (6/12)
intron_pos 1043:0 (7/12)
intron_pos 1095:0 (8/12)
intron_pos 1148:1 (9/12)
intron_pos 1182:0 (10/12)
intron_pos 1227:2 (11/12)
intron_pos 1264:0 (12/12)
BEGIN
1 MAAPEDESSA QSQSSPATAA PTPPPSSSPS SAGDHYLAKC ILRPSVVLQV AYGYFRSPSS
61 RDIVFGKETC IELVVIGEDG IVESVCEQYV FGTIKDLAVI PQSSKLYSNS LQMGKDLLAV
121 LSDSGKLSFL SFSNEMHRIS YPSEDGGNGS SIQAISGTIW SMCFISKDFN ESKEYAPILA
181 IVINRKGSLM NELALFRWNV KEESICLISE YVETGALAHS IVEVPHSSGF AFLFRIGDVL
241 LMDLRDPQNP CCLFRTSLDF VPASLMEEHF VEESCRVQDG DDEGCNVVVC ALLELRDHEV
301 RDHDPMFIDT ESDIGKLSSK NVSSWTWEPE NNHNPRMIIC LDNGDFFMFE LIYEDDGVKV
361 NLSECLYKGL PCKDILWIEG GFLATFAEMA DGTVFKLGTE KLHWMSSIQN IAPILDFSVM
421 DDQNEKRDQI FACCGVTPEG SLRIIRSGIN VEKLLKTAPV YQGITGTWTV KMKLTDVYHS
481 FLVLSFVEET RVLSVGLSFK DVTDSVGFQS DVCTFACGLV ADGLLVQIHQ DAIRLCMPTM
541 DAHSDGIPVS SPFFSSWFPE NVSISLGAVG QNLIVVSTSN PCFLSILGVK SVSSQCCEIY
601 EIQRVTLQYE VSCISVPQKH IGKKRSRDSS PDNFCKAAIP SAMEQGYTFL IGTHKPSVEV
661 LSFTEDGVGV RVLASGLVSL TNTMGTVISG CIPQDVRLVL VDQLYVLSGL RNGMLLRFEW
721 APFSNSSGLN CPDYFSHCKE EMDTVVGKKD NLPVNLLLIA TRRIGITPVF LVPFSDSLDS
781 DIIALSDRPW LLQTARQSLS YTSISFQPST HATPVCSFEC PQGILFVSEN CLHLVEMVHS
841 KRRNAQKFQL GGTPRKVIYH SESKLLIVMR TDLYDTCTSD ICCVDPLSGS VLSSYKLKPG
901 ETGKSMELVR VGNEHVLVVG TSLSSGPAIL PSGEAESTKG RVIILCLEHT QNSDSGSMTI
961 CSKACSSSQR TSPFHDVVGY TTENLSSSSL CSSPDDYSYD GIKLDEAETW QLRLASSTTW
1021 PGMVLAICPY LDHYFLASAG NAFYVCGFPN DSPERMKRFA VGRTRFMITS LRTYFTRIVV
1081 GDCRDGVLFY SYHEESKKLH QIYCDPAQRL VADCFLMDAN SVAVSDRKGS IAILSCKDHS
1141 DFGMKHLVKI PHDNPEYSSP ESNLNLNCAY YMGEIAMSIK KGCNIYKLPA DDVLRSYGLS
1201 KSIDTADDTI IAGTLLGSIF VFAPISSEEY ELLEGVQAKL GIHPLTAPVL GNDHNEFRGR
1261 ENPSQARKIL DGDMLAQFLE LTNRQQESVL STPQPSPSTS KASSKQRSFP PLMLHQVVQL
1321 LERVHYALH
//