LOCUS AEE83541.1 768 aa PRT PLN 23-MAR-2023 DEFINITION Arabidopsis thaliana hAT transposon superfamily protein. ACCESSION CP002687-2398 PROTEIN_ID AEE83541.1 SOURCE Arabidopsis thaliana (thale cress) ORGANISM Arabidopsis thaliana Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis. REFERENCE 1 (bases 1 to 18585056) AUTHORS Mayer,K., Schuller,C., Wambutt,R., Murphy,G., Volckaert,G., Pohl,T., Dusterhoft,A., Stiekema,W., Entian,K.D., Terryn,N., Harris,B., Ansorge,W., Brandt,P., Grivell,L., Rieger,M., Weichselgartner,M., de Simone,V., Obermaier,B., Mache,R., Muller,M., Kreis,M., Delseny,M., Puigdomenech,P., Watson,M., Schmidtheini,T., Reichert,B., Portatelle,D., Perez-Alonso,M., Boutry,M., Bancroft,I., Vos,P., Hoheisel,J., Zimmermann,W., Wedler,H., Ridley,P., Langham,S.A., McCullagh,B., Bilham,L., Robben,J., Van der Schueren,J., Grymonprez,B., Chuang,Y.J., Vandenbussche,F., Braeken,M., Weltjens,I., Voet,M., Bastiaens,I., Aert,R., Defoor,E., Weitzenegger,T., Bothe,G., Ramsperger,U., Hilbert,H., Braun,M., Holzer,E., Brandt,A., Peters,S., van Staveren,M., Dirske,W., Mooijman,P., Klein Lankhorst,R., Rose,M., Hauf,J., Kotter,P., Berneiser,S., Hempel,S., Feldpausch,M., Lamberth,S., Van den Daele,H., De Keyser,A., Buysshaert,C., Gielen,J., Villarroel,R., De Clercq,R., Van Montagu,M., Rogers,J., Cronin,A., Quail,M., Bray-Allen,S., Clark,L., Doggett,J., Hall,S., Kay,M., Lennard,N., McLay,K., Mayes,R., Pettett,A., Rajandream,M.A., Lyne,M., Benes,V., Rechmann,S., Borkova,D., Blocker,H., Scharfe,M., Grimm,M., Lohnert,T.H., Dose,S., de Haan,M., Maarse,A., Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M., Fartmann,B., Granderath,K., Dauner,D., Herzl,A., Neumann,S., Argiriou,A., Vitale,D., Liguori,R., Piravandi,E., Massenet,O., Quigley,F., Clabauld,G., Mundlein,A., Felber,R., Schnabl,S., Hiller,R., Schmidt,W., Lecharny,A., Aubourg,S., Chefdor,F., Cooke,R., Berger,C., Montfort,A., Casacuberta,E., Gibbons,T., Weber,N., Vandenbol,M., Bargues,M., Terol,J., Torres,A., Perez-Perez,A., Purnelle,B., Bent,E., Johnson,S., Tacon,D., Jesse,T., Heijnen,L., Schwarz,S., Scholler,P., Heber,S., Francs,P., Bielke,C., Frishman,D., Haase,D., Lemcke,K., Mewes,H.W., Stocker,S., Zaccaria,P., Bevan,M., Wilson,R.K., de la Bastide,M., Habermann,K., Parnell,L., Dedhia,N., Gnoj,L., Schutz,K., Huang,E., Spiegel,L., Sehkon,M., Murray,J., Sheet,P., Cordes,M., Abu-Threideh,J., Stoneking,T., Kalicki,J., Graves,T., Harmon,G., Edwards,J., Latreille,P., Courtney,L., Cloud,J., Abbott,A., Scott,K., Johnson,D., Minx,P., Bentley,D., Fulton,B., Miller,N., Greco,T., Kemp,K., Kramer,J., Fulton,L., Mardis,E., Dante,M., Pepin,K., Hillier,L., Nelson,J., Spieth,J., Ryan,E., Andrews,S., Geisel,C., Layman,D., Du,H., Ali,J., Berghoff,A., Jones,K., Drone,K., Cotton,M., Joshu,C., Antonoiu,B., Zidanic,M., Strong,C., Sun,H., Lamar,B., Yordan,C., Ma,P., Zhong,J., Preston,R., Vil,D., Shekher,M., Matero,A., Shah,R., Swaby,I.K., O'Shaughnessy,A., Rodriguez,M., Hoffmann,J., Till,S., Granat,S., Shohdy,N., Hasegawa,A., Hameed,A., Lodhi,M., Johnson,A., Chen,E., Marra,M., Martienssen,R. and McCombie,W.R. TITLE Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana JOURNAL Nature 402 (6763), 769-777 (1999) PUBMED 10617198 REFERENCE 2 (bases 1 to 18585056) AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E. CONSRTM TAIR TITLE Direct Submission JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie Institution, 260 Panama Street, Stanford, CA, USA REFERENCE 3 (bases 1 to 18585056) AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M., Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R., Vaughn,M. and Town,C.D. TITLE Direct Submission JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute, 9704 Medical Center Dr, Rockville, MD 20850, USA REMARK Protein update by submitter FEATURES Qualifiers source /organism="Arabidopsis thaliana" /mol_type="genomic DNA" /db_xref="taxon:3702" /chromosome="4" /ecotype="Columbia" protein /locus_tag="AT4G15020" /gene_synonym="DL3551W" /gene_synonym="FCAALL.174" /inference="Similar to RNA sequence, EST:INSD:AV814292.1,INSD:ES171064.1,INSD:EH971998.1, INSD:ES191709.1,INSD:EL975505.1,INSD:EH850060.1, INSD:EL240611.1,INSD:N97094.1,INSD:BP794973.1, INSD:EL975589.1,INSD:EL227580.1,INSD:BE527083.1, INSD:EL300232.1,INSD:AV557073.1,INSD:EL986743.1" /inference="Similar to RNA sequence, mRNA:INSD:BT030003.1,INSD:AK226708.1" /note="hAT transposon superfamily; FUNCTIONS IN: protein dimerization activity, DNA binding; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 19 plant structures; EXPRESSED DURING: 8 growth stages; CONTAINS InterPro DOMAIN/s: HAT dimerisation (InterPro:IPR008906), Zinc finger, BED-type predicted (InterPro:IPR003656), Protein of unknown function DUF659 (InterPro:IPR007021); BEST Arabidopsis thaliana protein match is: hAT transposon superfamily (TAIR:AT3G22220.2); Has 879 Blast hits to 805 proteins in 22 species: Archae - 0; Bacteria - 0; Metazoa - 5; Fungi - 2; Plants - 863; Viruses - 0; Other Eukaryotes - 9 (source: NCBI BLink)." /db_xref="TAIR:AT4G15020" /db_xref="Araport:AT4G15020" intron_pos 609:1 (1/3) intron_pos 677:2 (2/3) intron_pos 740:1 (3/3) BEGIN 1 MDAELEPVAL TPQKQDNAWK HCEIYKYGDR LQMRCLYCRK MFKGGGITRV KEHLAGKKGQ 61 GTICDQVPED VRLFLQQCID GTVRRQRKRH KSSSEPLSVA SLPPIEGDMM VVQPDVNDGF 121 KSPGSSDVVV QNESLLSGRT KQRTYRSKKN AFENGSASNN VDLIGRDMDN LIPVAISSVK 181 NIVHPSFRDR ENTIHMAIGR FLFGIGADFD AVNSVNFQPM IDAIASGGFG VSAPTHDDLR 241 GWILKNCVEE MAKEIDECKA MWKRTGCSIL VEELNSDKGF KVLNFLVYCP EKVVFLKSVD 301 ASEVLSSADK LFELLSELVE EVGSTNVVQV ITKCDDYYVD AGKRLMLVYP SLYWVPCAAH 361 CIDQMLEEFG KLGWISETIE QAQAITRFVY NHSGVLNLMW KFTSGNDILL PAFSSSATNF 421 ATLGRIAELK SNLQAMVTSA EWNECSYSEE PSGLVMNALT DEAFWKAVAL VNHLTSPLLR 481 ALRIVCSEKR PAMGYVYAAL YRAKDAIKTH LVNREDYIIY WKIIDRWWEQ QQHIPLLAAG 541 FFLNPKLFYN TNEEIRSELI LSVLDCIERL VPDDKIQDKI IKELTSYKTA GGVFGRNLAI 601 RARDTMLPAE WWSTYGESCL NLSRFAIRIL SQTCSSSVSC RRNQIPVEHI YQSKNSIEQK 661 RLSDLVFVQY NMRLRQLGPG SGDDTLDPLS HNRIDVLKEW VSGDQACVEG NGSADWKSLE 721 SIHRNQVAPI IDDTEDLGSG FDDIEIFKVE KEVRDEGYYT NTSEKLFT //