LOCUS AEE74621.1 445 aa PRT PLN 23-MAR-2023
DEFINITION Arabidopsis thaliana DNA glycosylase superfamily protein.
ACCESSION CP002686-1425
PROTEIN_ID AEE74621.1
SOURCE Arabidopsis thaliana (thale cress)
ORGANISM Arabidopsis thaliana
Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
Camelineae; Arabidopsis.
REFERENCE 1 (bases 1 to 23459830)
AUTHORS Salanoubat,M., Lemcke,K., Rieger,M., Ansorge,W., Unseld,M.,
Fartmann,B., Valle,G., Blocker,H., Perez-Alonso,M., Obermaier,B.,
Delseny,M., Boutry,M., Grivell,L.A., Mache,R., Puigdomenech,P., De
Simone,V., Choisne,N., Artiguenave,F., Robert,C., Brottier,P.,
Wincker,P., Cattolico,L., Weissenbach,J., Saurin,W., Quetier,F.,
Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M., Benes,V.,
Wurmbach,E., Drzonek,H., Erfle,H., Jordan,N., Bangert,S.,
Wiedelmann,R., Kranz,H., Voss,H., Holland,R., Brandt,P.,
Nyakatura,G., Vezzi,A., D'Angelo,M., Pallavicini,A., Toppo,S.,
Simionati,B., Conrad,A., Hornischer,K., Kauer,G., Lohnert,T.H.,
Nordsiek,G., Reichelt,J., Scharfe,M., Schon,O., Bargues,M.,
Terol,J., Climent,J., Navarro,P., Collado,C., Perez-Perez,A.,
Ottenwalder,B., Duchemin,D., Cooke,R., Laudie,M., Berger-Llauro,C.,
Purnelle,B., Masuy,D., de Haan,M., Maarse,A.C., Alcaraz,J.P.,
Cottet,A., Casacuberta,E., Monfort,A., Argiriou,A., Flores,M.,
Liguori,R., Vitale,D., Mannhaupt,G., Haase,D., Schoof,H., Rudd,S.,
Zaccaria,P., Mewes,H.W., Mayer,K.F., Kaul,S., Town,C.D., Koo,H.L.,
Tallon,L.J., Jenkins,J., Rooney,T., Rizzo,M., Walts,A.,
Utterback,T., Fujii,C.Y., Shea,T.P., Creasy,T.H., Haas,B.,
Maiti,R., Wu,D., Peterson,J., Van Aken,S., Pai,G., Militscher,J.,
Sellers,P., Gill,J.E., Feldblyum,T.V., Preuss,D., Lin,X.,
Nierman,W.C., Salzberg,S.L., White,O., Venter,J.C., Fraser,C.M.,
Kaneko,T., Nakamura,Y., Sato,S., Kato,T., Asamizu,E., Sasamoto,S.,
Kimura,T., Idesawa,K., Kawashima,K., Kishida,Y., Kiyokawa,C.,
Kohara,M., Matsumoto,M., Matsuno,A., Muraki,A., Nakayama,S.,
Nakazaki,N., Shinpo,S., Takeuchi,C., Wada,T., Watanabe,A.,
Yamada,M., Yasuda,M. and Tabata,S.
CONSRTM European Union Chromosome 3 Arabidopsis Sequencing Consortium;
Institute for Genomic Research; Kazusa DNA Research Institute
TITLE Sequence and analysis of chromosome 3 of the plant Arabidopsis
thaliana
JOURNAL Nature 408 (6814), 820-822 (2000)
PUBMED 11130713
REFERENCE 2 (bases 1 to 23459830)
AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
CONSRTM TAIR
TITLE Direct Submission
JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE 3 (bases 1 to 23459830)
AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
Vaughn,M. and Town,C.D.
TITLE Direct Submission
JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
9704 Medical Center Dr, Rockville, MD 20850, USA
REMARK Protein update by submitter
FEATURES Qualifiers
source /organism="Arabidopsis thaliana"
/mol_type="genomic DNA"
/db_xref="taxon:3702"
/chromosome="3"
/ecotype="Columbia"
protein /locus_tag="AT3G07930"
/inference="Similar to RNA sequence,
EST:INSD:BP562815.2,INSD:EL067837.1,INSD:EL205861.1,
INSD:EL340825.1,INSD:EL194999.1,INSD:DR238605.1,
INSD:DR238604.1,INSD:DR238603.1,INSD:BX837740.1,
INSD:ES076571.1,INSD:EH964791.1,INSD:AU227253.1"
/inference="Similar to RNA sequence,
mRNA:INSD:BX824133.1,INSD:BT028919.1"
/note="DNA glycosylase superfamily protein; FUNCTIONS IN:
catalytic activity; INVOLVED IN: DNA repair, base-excision
repair; EXPRESSED IN: 23 plant structures; EXPRESSED
DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: DNA
glycosylase (InterPro:IPR011257), HhH-GPD domain
(InterPro:IPR003265); Has 35333 Blast hits to 34131
proteins in 2444 species: Archaea - 798; Bacteria - 22429;
Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0;
Other Eukaryotes - 9610 (source: NCBI BLink)."
/db_xref="TAIR:AT3G07930"
/db_xref="Araport:AT3G07930"
intron_pos 343:0 (1/1)
BEGIN
1 MVPPIIYKYK RRKDRRLGRD DDSSVMMTRR RPDSDFIEVS DENRSFALFK EDDEKNRDLG
61 LVDDGSTNLV LQCHDDGCSL EKDNSNSLDD LFSGFVYKGV RRRKRDDFGS ITTSNLVSPQ
121 IADDDDDSVS DSHIERQECS EFHVEVRRVS PYFQGSTVSQ QSKEGCDSDS VCSKEGCSKV
181 QAKVPRVSPY FQASTISQCD SDIVSSSQSG RNYRKGSSKR QVKVRRVSPY FQESTVSEQP
241 NQAPKGLRNY FKVVKVSRYF HADGIQVNES QKEKSRNVRK TPIVSPVLSL SQKTDDVYLR
301 KTPDNTWVPP RSPCNLLQED HWHDPWRVLV ICMLLNKTSG AQTRGVISDL FGLCTDAKTA
361 TEVKEEEIEN LIKPLGLQKK RTKMIQRLSL EYLQESWTHV TQLHGVGKYA ADAYAIFCNG
421 NWDRVKPNDH MLNYYWDYLR IRYKL
//