LOCUS AEE76508.1 1132 aa PRT PLN 23-MAR-2023
DEFINITION Arabidopsis thaliana DNA binding protein.
ACCESSION CP002686-4028
PROTEIN_ID AEE76508.1
SOURCE Arabidopsis thaliana (thale cress)
ORGANISM Arabidopsis thaliana
Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
Camelineae; Arabidopsis.
REFERENCE 1 (bases 1 to 23459830)
AUTHORS Salanoubat,M., Lemcke,K., Rieger,M., Ansorge,W., Unseld,M.,
Fartmann,B., Valle,G., Blocker,H., Perez-Alonso,M., Obermaier,B.,
Delseny,M., Boutry,M., Grivell,L.A., Mache,R., Puigdomenech,P., De
Simone,V., Choisne,N., Artiguenave,F., Robert,C., Brottier,P.,
Wincker,P., Cattolico,L., Weissenbach,J., Saurin,W., Quetier,F.,
Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M., Benes,V.,
Wurmbach,E., Drzonek,H., Erfle,H., Jordan,N., Bangert,S.,
Wiedelmann,R., Kranz,H., Voss,H., Holland,R., Brandt,P.,
Nyakatura,G., Vezzi,A., D'Angelo,M., Pallavicini,A., Toppo,S.,
Simionati,B., Conrad,A., Hornischer,K., Kauer,G., Lohnert,T.H.,
Nordsiek,G., Reichelt,J., Scharfe,M., Schon,O., Bargues,M.,
Terol,J., Climent,J., Navarro,P., Collado,C., Perez-Perez,A.,
Ottenwalder,B., Duchemin,D., Cooke,R., Laudie,M., Berger-Llauro,C.,
Purnelle,B., Masuy,D., de Haan,M., Maarse,A.C., Alcaraz,J.P.,
Cottet,A., Casacuberta,E., Monfort,A., Argiriou,A., Flores,M.,
Liguori,R., Vitale,D., Mannhaupt,G., Haase,D., Schoof,H., Rudd,S.,
Zaccaria,P., Mewes,H.W., Mayer,K.F., Kaul,S., Town,C.D., Koo,H.L.,
Tallon,L.J., Jenkins,J., Rooney,T., Rizzo,M., Walts,A.,
Utterback,T., Fujii,C.Y., Shea,T.P., Creasy,T.H., Haas,B.,
Maiti,R., Wu,D., Peterson,J., Van Aken,S., Pai,G., Militscher,J.,
Sellers,P., Gill,J.E., Feldblyum,T.V., Preuss,D., Lin,X.,
Nierman,W.C., Salzberg,S.L., White,O., Venter,J.C., Fraser,C.M.,
Kaneko,T., Nakamura,Y., Sato,S., Kato,T., Asamizu,E., Sasamoto,S.,
Kimura,T., Idesawa,K., Kawashima,K., Kishida,Y., Kiyokawa,C.,
Kohara,M., Matsumoto,M., Matsuno,A., Muraki,A., Nakayama,S.,
Nakazaki,N., Shinpo,S., Takeuchi,C., Wada,T., Watanabe,A.,
Yamada,M., Yasuda,M. and Tabata,S.
CONSRTM European Union Chromosome 3 Arabidopsis Sequencing Consortium;
Institute for Genomic Research; Kazusa DNA Research Institute
TITLE Sequence and analysis of chromosome 3 of the plant Arabidopsis
thaliana
JOURNAL Nature 408 (6814), 820-822 (2000)
PUBMED 11130713
REFERENCE 2 (bases 1 to 23459830)
AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
CONSRTM TAIR
TITLE Direct Submission
JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE 3 (bases 1 to 23459830)
AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
Vaughn,M. and Town,C.D.
TITLE Direct Submission
JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
9704 Medical Center Dr, Rockville, MD 20850, USA
REMARK Protein update by submitter
FEATURES Qualifiers
source /organism="Arabidopsis thaliana"
/mol_type="genomic DNA"
/db_xref="taxon:3702"
/chromosome="3"
/ecotype="Columbia"
protein /gene="ALY3"
/locus_tag="AT3G21430"
/gene_synonym="ALWAYS EARLY 3"
/gene_synonym="ARABIDOPSIS THALIANA ALWAYS EARLY 3"
/gene_synonym="ATALY3"
/inference="Similar to RNA sequence,
EST:INSD:ES194472.1,INSD:AV522871.1,INSD:EL117535.1,
INSD:ES088394.1,INSD:ES211217.1,INSD:BP615345.1,
INSD:T21550.1,INSD:BP825583.1,INSD:EL210287.1,
INSD:ES202604.1,INSD:AI100605.1,INSD:T76391.1,
INSD:ES117090.1,INSD:EL099065.1,INSD:ES098620.1,
INSD:AV528407.1,INSD:EG457142.1,INSD:EH812577.1,
INSD:AA042383.1,INSD:ES189040.1"
/inference="Similar to RNA sequence,
mRNA:INSD:AK230069.1,INSD:AJ583497.1"
/note="ALWAYS EARLY 3 (ALY3); FUNCTIONS IN: DNA binding;
INVOLVED IN: biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 23 plant
structures; EXPRESSED DURING: 13 growth stages; CONTAINS
InterPro DOMAIN/s: SANT, DNA-binding (InterPro:IPR001005),
Homeodomain-like (InterPro:IPR009057), Myb, DNA-binding
(InterPro:IPR014778), DIRP (InterPro:IPR010561); BEST
Arabidopsis thaliana protein match is: DIRP; Myb-like
DNA-binding domain (TAIR:AT3G05380.4); Has 30201 Blast
hits to 17322 proteins in 780 species: Archaea - 12;
Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants -
5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI
BLink)."
/db_xref="TAIR:AT3G21430"
/db_xref="Araport:AT3G21430"
intron_pos 36:0 (1/18)
intron_pos 69:0 (2/18)
intron_pos 90:0 (3/18)
intron_pos 116:0 (4/18)
intron_pos 176:1 (5/18)
intron_pos 263:0 (6/18)
intron_pos 382:1 (7/18)
intron_pos 408:1 (8/18)
intron_pos 484:0 (9/18)
intron_pos 503:0 (10/18)
intron_pos 588:0 (11/18)
intron_pos 647:2 (12/18)
intron_pos 742:0 (13/18)
intron_pos 816:0 (14/18)
intron_pos 869:0 (15/18)
intron_pos 918:0 (16/18)
intron_pos 989:0 (17/18)
intron_pos 1079:0 (18/18)
BEGIN
1 MAPSRSKKSK YKKKPRAKAV SPHKDEESMS KTKQRKRKLS DMLGPQWSKE ELERFYEGYR
61 KFGKEWKKVA GFVHSRSAEM VEALYTMNKA YLSLPEGTAS VVGLTAMMTD HYSVLHGGSD
121 SEQENNEGIE TPRSAPKRSR VKSSDHPSIG LEGLSDRLQF RSSSGFMPSL KKRRTETMPR
181 AVGKRTPRIP ISYTLEKDTR ERYLSPVKRG LNQKGDDTDD DMEHEIALAL AEASQRGGST
241 KNSHTPNRKA KMYPPDKKGE RMRADIDLAI AKLHATDMED VRCEPSLGST EADNADYSGG
301 RNDLTHGEGS SAVEKQQKGR TYYRRRVGIK EEDAKEACSG TDEAPSLGAP DEKFEQEREG
361 KALKFTYKVS RRKSKKSLFT ADEDTACDAL HTLADLSLMM PETATDTESS VQAEEKKAGE
421 AYVSDFKGTD PASMSKSSSL RNSKQRRYGS NDLCNPELER KSPSSSLIQK RRQKALPAKV
481 RENVLKDELA ASSQVIEPCN SKGIGEEYKP VGRGKRSASI RNSHEKKSAK SHDHTSSSNN
541 IVEEDESAPS NAVIKKQVNL PTKVRSRRKI VTEKPLTIDD GKISETIEKF SHCISSFRAR
601 RWCIFEWFYS AIDYPWFARQ EFVEYLDHVG LGHVPRLTRV EWGVIRSSLG KPRRFSEQFL
661 KEEKEKLYLY RDSVRKHYDE LNTGMREGLP MDLARPLNVS QRVICLHPKS REIHDGNVLT
721 VDHCRYRIQF DNPELGVEFV KDTECMPLNP LENMPASLAR HYAFSNYHIQ NPIEEKMHER
781 AKESMLEGYP KLSCETGHLL SSPNYNISNS LKQEKVDISS SNPQAQDGVD EALALQLFNS
841 QPSSIGQIQA READVQALSE LTRALDKKEL VLRELKCMND EVVESQKDGH NNALKDSESF
901 KKQYAAVLFQ LSEINEQVSL ALLGLRQRNT YQENVPYSSI RRMSKSGEPD GQLTYEDNNA
961 SDTNGFHVSE IVESSRIKAR KMVYRAVQAL ELLRKDENNN VNMEEAIDFV NNQLSIDQTE
1021 GSSVQQTQGG QDQRLPSTPN PPSSTPANDS HLNQPDQNDL QVPSDLVSRC IATLLMIQKC
1081 TERQFPPSEV AQVLDSAVAS LQPCCSQNLP IYTEIQKCMG IIRNQILALV PS
//