LOCUS AEE77393.2 1482 aa PRT PLN 23-MAR-2023
DEFINITION Arabidopsis thaliana 5'-3' exonuclease family protein protein.
ACCESSION CP002686-5262
PROTEIN_ID AEE77393.2
SOURCE Arabidopsis thaliana (thale cress)
ORGANISM Arabidopsis thaliana
Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
Camelineae; Arabidopsis.
REFERENCE 1 (bases 1 to 23459830)
AUTHORS Salanoubat,M., Lemcke,K., Rieger,M., Ansorge,W., Unseld,M.,
Fartmann,B., Valle,G., Blocker,H., Perez-Alonso,M., Obermaier,B.,
Delseny,M., Boutry,M., Grivell,L.A., Mache,R., Puigdomenech,P., De
Simone,V., Choisne,N., Artiguenave,F., Robert,C., Brottier,P.,
Wincker,P., Cattolico,L., Weissenbach,J., Saurin,W., Quetier,F.,
Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M., Benes,V.,
Wurmbach,E., Drzonek,H., Erfle,H., Jordan,N., Bangert,S.,
Wiedelmann,R., Kranz,H., Voss,H., Holland,R., Brandt,P.,
Nyakatura,G., Vezzi,A., D'Angelo,M., Pallavicini,A., Toppo,S.,
Simionati,B., Conrad,A., Hornischer,K., Kauer,G., Lohnert,T.H.,
Nordsiek,G., Reichelt,J., Scharfe,M., Schon,O., Bargues,M.,
Terol,J., Climent,J., Navarro,P., Collado,C., Perez-Perez,A.,
Ottenwalder,B., Duchemin,D., Cooke,R., Laudie,M., Berger-Llauro,C.,
Purnelle,B., Masuy,D., de Haan,M., Maarse,A.C., Alcaraz,J.P.,
Cottet,A., Casacuberta,E., Monfort,A., Argiriou,A., flores,M.,
Liguori,R., Vitale,D., Mannhaupt,G., Haase,D., Schoof,H., Rudd,S.,
Zaccaria,P., Mewes,H.W., Mayer,K.F., Kaul,S., Town,C.D., Koo,H.L.,
Tallon,L.J., Jenkins,J., Rooney,T., Rizzo,M., Walts,A.,
Utterback,T., Fujii,C.Y., Shea,T.P., Creasy,T.H., Haas,B.,
Maiti,R., Wu,D., Peterson,J., Van Aken,S., Pai,G., Militscher,J.,
Sellers,P., Gill,J.E., Feldblyum,T.V., Preuss,D., Lin,X.,
Nierman,W.C., Salzberg,S.L., White,O., Venter,J.C., Fraser,C.M.,
Kaneko,T., Nakamura,Y., Sato,S., Kato,T., Asamizu,E., Sasamoto,S.,
Kimura,T., Idesawa,K., Kawashima,K., Kishida,Y., Kiyokawa,C.,
Kohara,M., Matsumoto,M., Matsuno,A., Muraki,A., Nakayama,S.,
Nakazaki,N., Shinpo,S., Takeuchi,C., Wada,T., Watanabe,A.,
Yamada,M., Yasuda,M. and Tabata,S.
CONSRTM European Union Chromosome 3 Arabidopsis Sequencing Consortium;
Institute for Genomic Research; Kazusa DNA Research Institute
TITLE Sequence and analysis of chromosome 3 of the plant Arabidopsis
thaliana
JOURNAL Nature 408 (6814), 820-822 (2000)
PUBMED 11130713
REFERENCE 2 (bases 1 to 23459830)
AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
CONSRTM TAIR
TITLE Direct Submission
JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE 3 (bases 1 to 23459830)
AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
Vaughn,M. and Town,C.D.
TITLE Direct Submission
JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
9704 Medical Center Dr, Rockville, MD 20850, USA
REMARK Protein update by submitter
FEATURES Qualifiers
source /organism="Arabidopsis thaliana"
/mol_type="genomic DNA"
/db_xref="taxon:3702"
/chromosome="3"
/ecotype="Columbia"
protein /gene="UVH3"
/locus_tag="AT3G28030"
/gene_synonym="ULTRAVIOLET HYPERSENSITIVE 3"
/gene_synonym="UV REPAIR DEFECTIVE 1"
/gene_synonym="UVR1"
/inference="Similar to RNA sequence,
EST:INSD:ES088142.1,INSD:EL279245.1,INSD:EH993652.1"
/inference="Similar to RNA sequence, mRNA:INSD:AF312711.1"
/note="ULTRAVIOLET HYPERSENSITIVE 3 (UVH3); FUNCTIONS IN:
protein binding, nuclease activity; INVOLVED IN: DNA
repair, response to UV-B, response to heat,
non-photoreactive DNA repair; LOCATED IN: nucleus;
EXPRESSED IN: stem, leaf whorl, seed; EXPRESSED DURING: E
expanded cotyledon stage; CONTAINS InterPro DOMAIN/s: XPG
conserved site (InterPro:IPR019974), Xeroderma pigmentosum
group G protein (InterPro:IPR001044), XPG N-terminal
(InterPro:IPR006085), DNA repair protein (XPGC)/yeast Rad
(InterPro:IPR006084), 5'-3' exonuclease, C-terminal
subdomain (InterPro:IPR020045), Helix-hairpin-helix motif,
class 2 (InterPro:IPR008918), XPG/RAD2 endonuclease
(InterPro:IPR006086); BEST Arabidopsis thaliana protein
match is: 5'-3' exonuclease family protein
(TAIR:AT1G01880.1); Has 8115 Blast hits to 5779 proteins
in 624 species: Archae - 528; Bacteria - 414; Metazoa -
2441; Fungi - 1496; Plants - 576; Viruses - 71; Other
Eukaryotes - 2589 (source: NCBI BLink)."
/db_xref="TAIR:AT3G28030"
/db_xref="Araport:AT3G28030"
intron_pos 30:1 (1/16)
intron_pos 114:0 (2/16)
intron_pos 174:2 (3/16)
intron_pos 194:0 (4/16)
intron_pos 230:2 (5/16)
intron_pos 273:0 (6/16)
intron_pos 299:0 (7/16)
intron_pos 317:0 (8/16)
intron_pos 378:2 (9/16)
intron_pos 931:0 (10/16)
intron_pos 996:0 (11/16)
intron_pos 1024:2 (12/16)
intron_pos 1104:0 (13/16)
intron_pos 1149:2 (14/16)
intron_pos 1178:0 (15/16)
intron_pos 1320:0 (16/16)
BEGIN
1 MGVQGLWELL APVGRRVSVE TLANKRLAID ASIWMVQFIK AMRDEKGDMV QNAHLIGFFR
61 RICKLLFLRT KPIFVFDGAT PALKRRTVIA RRRQRENAQT KIRKTAEKLL LNRLKDIRLK
121 EQAKDIKNQR LKQDDSDRVK KRVSSDSVED NLRVPVEEDD VGASFFQEEK LDEVSQASLV
181 GETGVDDVVK ESVKDDPKGK GVLLDGDDLD NLVQDSSVQG KDYQEKLDEM LAASLAAEEE
241 RNFTSKASTS AAAIPSEEDE EEDSDGDEEI LLPVMDGNID PAVLASLPPS MQLDLLAQMR
301 EKLMAENRQK YQKVKKAPEK FSELQIEAYL KTVAFRREIN EVQRSAGGRA VGGVQTSRIA
361 SEANREFIFS SSFAGDKEVL ASAREGRNDE NQKKTSQQSL PVSVKNASPL KKSDATIELD
421 RDEPKNPDEN IEVYIDERGR FRIRNRHMGI QMTRDIQRNL HLMKEKERTA SGSMAKNDET
481 FSAWENFPTE DQFLEKSPVE KDVVDLEIQN DDSMLHPPSS IEISFDHDGG GKDLNDEDDM
541 FLQLAAGGPV TISSTENDPK EDTSPWASDS DWEEVPVEQN TSVSKLEANL SNQHIPKDIS
601 IAEGVAWEEY SCKNANNSVE NDTVTKITKG YLEEEADLQE AIKKSLLELH DKESGDVLEE
661 NQSVRVNLVV DKPSEDSLCS RETVGEAEEE RFLDEITILK TSGAISEQSN TSVAGNADGQ
721 KGITKQFGTH PSSGSNNVSH AVSNKLSKVK SVISPEKALN VASQNRMLST MAKQHNEEGS
781 ESFGGESVKV SAMPIADEEI TGFLDEKDNA DGESSIMMDD KRDYSRRKIQ SLVTESRDPS
841 RNVVRSRIGI LHDTDSQNER REENNSNEHT FNIDSSTDFE EKGVPVEFSE ANIEEEIRVL
901 DQEFVSLGDE QRKLERNAES VSSEMFAECQ ELLQIFGIPY IIAPMEAEAQ CAFMEQSNLV
961 DGIVTDDSDV FLFGARSVYK NIFDDRKYVE TYFMKDIEKE LGLSRDKIIR MAMLLGSDYT
1021 EGISGIGIVN AIEVVTAFPE EDGLQKFREW VESPDPTILG KTDAKTGSKV KKRGSASVDN
1081 KGIISGASTD DTEEIKQIFM DQHRKVSKNW HIPLTFPSEA VISAYLNPQV DLSTEKFSWG
1141 KPDLSVLRKL CWEKFNWNGK KTDELLLPVL KEYEKRETQL RIEAFYSFNE RFAKIRSKRI
1201 NKAVKGIGGG LSSDVADHTL QEGPRKRNKK KVAPHETEDN NTSDKDSPIA NEKVKNKRKR
1261 LEKPSSSRGR GRAQKRGRGR GRVQKDLLEL SDGSSDDDDD DDKVVELEAK PANLQKVRKS
1321 TRSRNPVMYS AKEDDELDES RSNEGSPSEN FEEVDEGRIG NDDSVDASIN DCPSEDYIQT
1381 GGGFCADEAD EIGDAHLEDK ATDDYRVIGG GFCVDEDETA EENTMDDDAE ILKMESEEQR
1441 KKGKRRNEED ASLDENVDIH FGNSSAGGLS AMPFLKRKKR KN
//