LOCUS       AEE76124.1               330 aa    PRT              PLN 23-MAR-2023
DEFINITION  Arabidopsis thaliana uracil dna glycosylase protein.
ACCESSION   CP002686-3477
PROTEIN_ID  AEE76124.1
SOURCE      Arabidopsis thaliana (thale cress)
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
            Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
            Camelineae; Arabidopsis.
REFERENCE   1  (bases 1 to 23459830)
  AUTHORS   Salanoubat,M., Lemcke,K., Rieger,M., Ansorge,W., Unseld,M.,
            Fartmann,B., Valle,G., Blocker,H., Perez-Alonso,M., Obermaier,B.,
            Delseny,M., Boutry,M., Grivell,L.A., Mache,R., Puigdomenech,P., De
            Simone,V., Choisne,N., Artiguenave,F., Robert,C., Brottier,P.,
            Wincker,P., Cattolico,L., Weissenbach,J., Saurin,W., Quetier,F.,
            Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M., Benes,V.,
            Wurmbach,E., Drzonek,H., Erfle,H., Jordan,N., Bangert,S.,
            Wiedelmann,R., Kranz,H., Voss,H., Holland,R., Brandt,P.,
            Nyakatura,G., Vezzi,A., D'Angelo,M., Pallavicini,A., Toppo,S.,
            Simionati,B., Conrad,A., Hornischer,K., Kauer,G., Lohnert,T.H.,
            Nordsiek,G., Reichelt,J., Scharfe,M., Schon,O., Bargues,M.,
            Terol,J., Climent,J., Navarro,P., Collado,C., Perez-Perez,A.,
            Ottenwalder,B., Duchemin,D., Cooke,R., Laudie,M., Berger-Llauro,C.,
            Purnelle,B., Masuy,D., de Haan,M., Maarse,A.C., Alcaraz,J.P.,
            Cottet,A., Casacuberta,E., Monfort,A., Argiriou,A., flores,M.,
            Liguori,R., Vitale,D., Mannhaupt,G., Haase,D., Schoof,H., Rudd,S.,
            Zaccaria,P., Mewes,H.W., Mayer,K.F., Kaul,S., Town,C.D., Koo,H.L.,
            Tallon,L.J., Jenkins,J., Rooney,T., Rizzo,M., Walts,A.,
            Utterback,T., Fujii,C.Y., Shea,T.P., Creasy,T.H., Haas,B.,
            Maiti,R., Wu,D., Peterson,J., Van Aken,S., Pai,G., Militscher,J.,
            Sellers,P., Gill,J.E., Feldblyum,T.V., Preuss,D., Lin,X.,
            Nierman,W.C., Salzberg,S.L., White,O., Venter,J.C., Fraser,C.M.,
            Kaneko,T., Nakamura,Y., Sato,S., Kato,T., Asamizu,E., Sasamoto,S.,
            Kimura,T., Idesawa,K., Kawashima,K., Kishida,Y., Kiyokawa,C.,
            Kohara,M., Matsumoto,M., Matsuno,A., Muraki,A., Nakayama,S.,
            Nakazaki,N., Shinpo,S., Takeuchi,C., Wada,T., Watanabe,A.,
            Yamada,M., Yasuda,M. and Tabata,S.
  CONSRTM   European Union Chromosome 3 Arabidopsis Sequencing Consortium;
            Institute for Genomic Research; Kazusa DNA Research Institute
  TITLE     Sequence and analysis of chromosome 3 of the plant Arabidopsis
            thaliana
  JOURNAL   Nature 408 (6814), 820-822 (2000)
   PUBMED   11130713
REFERENCE   2  (bases 1 to 23459830)
  AUTHORS   Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
  CONSRTM   TAIR
  TITLE     Direct Submission
  JOURNAL   Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
            Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE   3  (bases 1 to 23459830)
  AUTHORS   Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
            Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
            Vaughn,M. and Town,C.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
            9704 Medical Center Dr, Rockville, MD 20850, USA
  REMARK    Protein update by submitter
FEATURES             Qualifiers
     source          /organism="Arabidopsis thaliana"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:3702"
                     /chromosome="3"
                     /ecotype="Columbia"
     protein         /gene="UNG"
                     /locus_tag="AT3G18630"
                     /gene_synonym="ATUNG"
                     /gene_synonym="uracil dna glycosylase"
                     /inference="Similar to RNA sequence,
                     EST:INSD:EG435578.1,INSD:EG427656.1,INSD:EG439822.1,
                     INSD:ES081988.1,INSD:EG427733.1,INSD:EG435576.1,
                     INSD:EL148812.1,INSD:EG435577.1,INSD:EG429511.1,
                     INSD:EG429522.1,INSD:EG429520.1,INSD:EG439841.1,
                     INSD:EG427734.1,INSD:EG439848.1,INSD:EG439842.1,
                     INSD:EG429514.1,INSD:EG497837.1,INSD:EG439850.1,
                     INSD:EG427657.1,INSD:EH833070.1,INSD:DR274001.1,
                     INSD:EG439821.1,INSD:AV782014.1,INSD:EG429516.1,
                     INSD:EL191269.1,INSD:EL235040.1,INSD:EG429515.1,
                     INSD:EG429512.1,INSD:EG439837.1,INSD:EG439838.1,
                     INSD:EG439856.1,INSD:EG435575.1,INSD:EG439857.1,
                     INSD:EG439839.1,INSD:DR380378.1,INSD:EG427658.1,
                     INSD:EG439823.1,INSD:EG429513.1,INSD:EG429521.1"
                     /inference="Similar to RNA sequence,
                     mRNA:INSD:BT029175.1,INSD:AY084956.1"
                     /note="uracil dna glycosylase (UNG); FUNCTIONS IN: uracil
                     DNA N-glycosylase activity; INVOLVED IN: DNA repair,
                     base-excision repair; LOCATED IN: mitochondrion; EXPRESSED
                     IN: 15 plant structures; EXPRESSED DURING: 8 growth
                     stages; CONTAINS InterPro DOMAIN/s: Uracil-DNA glycosylase
                     (InterPro:IPR002043), Uracil-DNA glycosylase-like
                     (InterPro:IPR005122); BEST Arabidopsis thaliana protein
                     match is: unknown protein (TAIR:AT2G10550.1); Has 5606
                     Blast hits to 5606 proteins in 2219 species: Archae - 2;
                     Bacteria - 4117; Metazoa - 124; Fungi - 141; Plants - 47;
                     Viruses - 234; Other Eukaryotes - 941 (source: NCBI
                     BLink)."
                     /db_xref="TAIR:AT3G18630"
                     /db_xref="Araport:AT3G18630"
     intron_pos      96:1 (1/6)
     intron_pos      173:0 (2/6)
     intron_pos      226:0 (3/6)
     intron_pos      236:1 (4/6)
     intron_pos      281:2 (5/6)
     intron_pos      309:2 (6/6)
BEGIN
        1 MASSTPKTLM DFFQPAKRLK ASPSSSSFPA VSVAGGSRDL GSVANSPPRV TVTTSVADDS
       61 SGLTPEQIAR AEFNKFVAKS KRNLAVCSER VTKAKSEGNC YVPLSELLVE ESWLKALPGE
      121 FHKPYAKSLS DFLEREIITD SKSPLIYPPQ HLIFNALNTT PFDRVKTVII GQDPYHGPGQ
      181 AMGLSFSVPE GEKLPSSLLN IFKELHKDVG CSIPRHGNLQ KWAVQGVLLL NAVLTVRSKQ
      241 PNSHAKKGWE QFTDAVIQSI SQQKEGVVFL LWGRYAQEKS KLIDATKHHI LTAAHPSGLS
      301 ANRGFFDCRH FSRANQLLEE MGIPPIDWQL
//