LOCUS AEC08549.1 379 aa PRT PLN 23-MAR-2023 DEFINITION Arabidopsis thaliana DNA glycosylase superfamily protein protein. ACCESSION CP002685-4327 PROTEIN_ID AEC08549.1 SOURCE Arabidopsis thaliana (thale cress) ORGANISM Arabidopsis thaliana Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis. REFERENCE 1 (bases 1 to 19698289) AUTHORS Lin,X., Kaul,S., Rounsley,S., Shea,T.P., Benito,M.I., Town,C.D., Fujii,C.Y., Mason,T., Bowman,C.L., Barnstead,M., Feldblyum,T.V., Buell,C.R., Ketchum,K.A., Lee,J., Ronning,C.M., Koo,H.L., Moffat,K.S., Cronin,L.A., Shen,M., Pai,G., Van Aken,S., Umayam,L., Tallon,L.J., Gill,J.E., Adams,M.D., Carrera,A.J., Creasy,T.H., Goodman,H.M., Somerville,C.R., Copenhaver,G.P., Preuss,D., Nierman,W.C., White,O., Eisen,J.A., Salzberg,S.L., Fraser,C.M. and Venter,J.C. TITLE Sequence and analysis of chromosome 2 of the plant Arabidopsis thaliana JOURNAL Nature 402 (6763), 761-768 (1999) PUBMED 10617197 REFERENCE 2 (bases 1 to 19698289) AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E. CONSRTM TAIR TITLE Direct Submission JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie Institution, 260 Panama Street, Stanford, CA, USA REFERENCE 3 (bases 1 to 19698289) AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M., Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R., Vaughn,M. and Town,C.D. TITLE Direct Submission JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute, 9704 Medical Center Dr, Rockville, MD 20850, USA REMARK Protein update by submitter FEATURES Qualifiers source /organism="Arabidopsis thaliana" /mol_type="genomic DNA" /db_xref="taxon:3702" /chromosome="2" /ecotype="Columbia" protein /gene="ATNTH1" /locus_tag="AT2G31450" /gene_synonym="T28P16.6" /gene_synonym="T28P16_6" /inference="Similar to RNA sequence, EST:INSD:EH885681.1,INSD:ES112115.1,INSD:EL312158.1, INSD:DR380559.1,INSD:DR251791.1,INSD:AV790013.1, INSD:DR251796.1,INSD:DR251793.1,INSD:DR251795.1, INSD:EH829618.1" /inference="similar to RNA sequence, mRNA:INSD:AY085041.1,INSD:BX819806.1,INSD:AJ272248.1, INSD:BX820625.1" /note="ATNTH1; CONTAINS InterPro DOMAIN/s: Helix-hairpin-helix motif (InterPro:IPR000445), Helix-hairpin-helix DNA-binding motif, class 1 (InterPro:IPR003583), Endonuclease III, iron-sulphur binding site (InterPro:IPR004035), DNA glycosylase (InterPro:IPR011257), Endonuclease III-like, iron-sulphur cluster loop motif (InterPro:IPR003651), Endonuclease III, conserved site-2 (InterPro:IPR004036), HhH-GPD domain (InterPro:IPR003265); BEST Arabidopsis thaliana protein match is: endonuclease III 2 (TAIR:AT1G05900.2); Has 14067 Blast hits to 14061 proteins in 2669 species: Archae - 367; Bacteria - 9308; Metazoa - 224; Fungi - 193; Plants - 158; Viruses - 0; Other Eukaryotes - 3817 (source: NCBI BLink)." /db_xref="Araport:AT2G31450" /db_xref="TAIR:AT2G31450" intron_pos 54:1 (1/10) intron_pos 94:0 (2/10) intron_pos 115:1 (3/10) intron_pos 131:1 (4/10) intron_pos 171:0 (5/10) intron_pos 193:1 (6/10) intron_pos 224:0 (7/10) intron_pos 270:0 (8/10) intron_pos 305:0 (9/10) intron_pos 333:0 (10/10) BEGIN 1 MILLVNGGAA TSIHPNAARF YRIGTMSRQI HGAVSSSKHI SLKTQHPLSD SNSELAYGAS 61 GSETRVYTRK KRLKQEPFEP LEKYSGKGVN THKLCGLPDI EDFAYKKTIG SPSSSRSTET 121 SITVTSVKTA GYPPENWVEV LEGIRQMRSS EDAPVDSMGC DKAGSFLPPT ERRFAVLLGA 181 LLSSQTKDQV NNAAIHRLHQ NGLLTPEAVD KADESTIKEL IYPVGFYTRK ATYMKKIARI 241 CLVKYDGDIP SSLDDLLSLP GIGPKMAHLI LHIAWNDVQG ICVDTHVHRI CNRLGWVSRP 301 GTKQKTTSPE ETRVALQQWL PKEEWVAINP LLVGFGQMIC TPIRPRCEAC SVSKLCPAAF 361 KETSSPSSKL KKSNRSKEP //