LOCUS AEC08860.1 351 aa PRT PLN 23-MAR-2023 DEFINITION Arabidopsis thaliana AT hook motif DNA-binding family protein protein. ACCESSION CP002685-4755 PROTEIN_ID AEC08860.1 SOURCE Arabidopsis thaliana (thale cress) ORGANISM Arabidopsis thaliana Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis. REFERENCE 1 (bases 1 to 19698289) AUTHORS Lin,X., Kaul,S., Rounsley,S., Shea,T.P., Benito,M.I., Town,C.D., Fujii,C.Y., Mason,T., Bowman,C.L., Barnstead,M., Feldblyum,T.V., Buell,C.R., Ketchum,K.A., Lee,J., Ronning,C.M., Koo,H.L., Moffat,K.S., Cronin,L.A., Shen,M., Pai,G., Van Aken,S., Umayam,L., Tallon,L.J., Gill,J.E., Adams,M.D., Carrera,A.J., Creasy,T.H., Goodman,H.M., Somerville,C.R., Copenhaver,G.P., Preuss,D., Nierman,W.C., White,O., Eisen,J.A., Salzberg,S.L., Fraser,C.M. and Venter,J.C. TITLE Sequence and analysis of chromosome 2 of the plant Arabidopsis thaliana JOURNAL Nature 402 (6763), 761-768 (1999) PUBMED 10617197 REFERENCE 2 (bases 1 to 19698289) AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E. CONSRTM TAIR TITLE Direct Submission JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie Institution, 260 Panama Street, Stanford, CA, USA REFERENCE 3 (bases 1 to 19698289) AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M., Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R., Vaughn,M. and Town,C.D. TITLE Direct Submission JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute, 9704 Medical Center Dr, Rockville, MD 20850, USA REMARK Protein update by submitter FEATURES Qualifiers source /organism="Arabidopsis thaliana" /mol_type="genomic DNA" /db_xref="taxon:3702" /chromosome="2" /ecotype="Columbia" protein /locus_tag="AT2G33620" /gene_synonym="AHL10" /gene_synonym="AT-hook motif nuclear localized protein 10" /gene_synonym="F4P9.39" /gene_synonym="F4P9_39" /inference="Similar to RNA sequence, EST:INSD:BP644851.1,INSD:ES053756.1,INSD:ES029568.1, INSD:BP573511.1,INSD:EL976795.1,INSD:BP573715.1, INSD:EG482640.1,INSD:BP614544.1,INSD:BP672023.1, INSD:ES072568.1,INSD:ES056014.1,INSD:BP634434.1, INSD:BP578438.1,INSD:ES052114.1,INSD:EL282090.1, INSD:EL267306.1,INSD:ES064634.1,INSD:EL003770.1, INSD:BP582269.1,INSD:AV782942.1,INSD:BP645347.1, INSD:BP652921.1,INSD:ES035421.1,INSD:AV800673.1, INSD:EL321137.1,INSD:BP576823.1,INSD:BP582414.1, INSD:DR378268.1,INSD:AV784873.1,INSD:BP574845.1, INSD:BP609513.1,INSD:BP579150.1,INSD:BP615644.1, INSD:ES203117.1,INSD:EL255217.1,INSD:BP582782.1, INSD:AV545411.1,INSD:DR379612.1,INSD:EL295377.1, INSD:ES094891.1,INSD:EH852307.1,INSD:AV530318.1, INSD:DR263980.1,INSD:BP665444.1,INSD:BP616649.1, INSD:ES119629.1,INSD:BP577850.1,INSD:AV810136.1, INSD:AV558588.1,INSD:BP638865.1,INSD:ES210856.1, INSD:EL059280.1,INSD:BP575366.1,INSD:Z34237.1, INSD:ES036562.1,INSD:AV822240.1,INSD:BP581024.1, INSD:ES001170.1,INSD:ES130335.1,INSD:AV791148.1, INSD:EL971109.1,INSD:BP580619.1,INSD:ES121966.1, INSD:BP572798.1,INSD:EL216447.1,INSD:AV793303.1, INSD:AV567384.1,INSD:BP622328.1,INSD:BP579401.1, INSD:BP778329.1,INSD:BP619359.1,INSD:BP576319.1, INSD:EL334697.1,INSD:BP610816.1,INSD:ES158571.1, INSD:AV820247.1" /inference="similar to RNA sequence, mRNA:INSD:AY081729.1,INSD:AF385705.1" /note="AT hook motif DNA-binding family protein; FUNCTIONS IN: DNA binding; LOCATED IN: cytosol; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF296 (InterPro:IPR005175), AT hook, DNA-binding motif (InterPro:IPR017956); BEST Arabidopsis thaliana protein match is: AT-hook motif nuclear-localized protein 1 (TAIR:AT4G12080.1); Has 969 Blast hits to 965 proteins in 147 species: Archae - 0; Bacteria - 205; Metazoa - 5; Fungi - 3; Plants - 754; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink)." /db_xref="Araport:AT2G33620" /db_xref="TAIR:AT2G33620" intron_pos 157:1 (1/4) intron_pos 175:0 (2/4) intron_pos 219:0 (3/4) intron_pos 273:0 (4/4) BEGIN 1 MSGSETGLMA ATRESMQFTM ALHQQQQHSQ AQPQQSQNRP LSFGGDDGTA LYKQPMRSVS 61 PPQQYQPNSA GENSVLNMNL PGGESGGMTG TGSEPVKKRR GRPRKYGPDS GEMSLGLNPG 121 APSFTVSQPS SGGDGGEKKR GRPPGSSSKR LKLQALGSTG IGFTPHVLTV LAGEDVSSKI 181 MALTHNGPRA VCVLSANGAI SNVTLRQSAT SGGTVTYEGR FEILSLSGSF HLLENNGQRS 241 RTGGLSVSLS SPDGNVLGGS VAGLLIAASP VQIVVGSFLP DGEKEPKQHV GQMGLSSPVL 301 PRVAPTQVLM TPSSPQSRGT MSESSCGGGH GSPIHQSTGG PYNNTINMPW K //