LOCUS AEC07335.1 819 aa PRT PLN 23-MAR-2023
DEFINITION Arabidopsis thaliana DNA-binding protein, putative (duplicated
DUF1399) protein.
ACCESSION CP002685-2642
PROTEIN_ID AEC07335.1
SOURCE Arabidopsis thaliana (thale cress)
ORGANISM Arabidopsis thaliana
Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
Camelineae; Arabidopsis.
REFERENCE 1 (bases 1 to 19698289)
AUTHORS Lin,X., Kaul,S., Rounsley,S., Shea,T.P., Benito,M.I., Town,C.D.,
Fujii,C.Y., Mason,T., Bowman,C.L., Barnstead,M., Feldblyum,T.V.,
Buell,C.R., Ketchum,K.A., Lee,J., Ronning,C.M., Koo,H.L.,
Moffat,K.S., Cronin,L.A., Shen,M., Pai,G., Van Aken,S., Umayam,L.,
Tallon,L.J., Gill,J.E., Adams,M.D., Carrera,A.J., Creasy,T.H.,
Goodman,H.M., Somerville,C.R., Copenhaver,G.P., Preuss,D.,
Nierman,W.C., White,O., Eisen,J.A., Salzberg,S.L., Fraser,C.M. and
Venter,J.C.
TITLE Sequence and analysis of chromosome 2 of the plant Arabidopsis
thaliana
JOURNAL Nature 402 (6763), 761-768 (1999)
PUBMED 10617197
REFERENCE 2 (bases 1 to 19698289)
AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
CONSRTM TAIR
TITLE Direct Submission
JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE 3 (bases 1 to 19698289)
AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
Vaughn,M. and Town,C.D.
TITLE Direct Submission
JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
9704 Medical Center Dr, Rockville, MD 20850, USA
REMARK Protein update by submitter
FEATURES Qualifiers
source /organism="Arabidopsis thaliana"
/mol_type="genomic DNA"
/db_xref="taxon:3702"
/chromosome="2"
/ecotype="Columbia"
protein /locus_tag="AT2G22660"
/gene_synonym="T9I22.10"
/gene_synonym="T9I22_10"
/inference="Similar to RNA sequence,
EST:INSD:AV782975.1,INSD:DR353012.1,INSD:EG523488.1,
INSD:AA394703.1,INSD:DR355164.1,INSD:DR353008.1,
INSD:BP605136.1,INSD:BP601929.1,INSD:BP603093.1,
INSD:AV822262.1,INSD:ES182219.1,INSD:EL268833.1,
INSD:DR363194.1,INSD:EL985356.1,INSD:BP587132.1,
INSD:ES004603.1,INSD:EL171878.1,INSD:BP615304.1,
INSD:ES207736.1,INSD:R84099.1,INSD:AV552363.1,
INSD:BP592710.1,INSD:DR353014.1,INSD:EH805538.1,
INSD:AV527335.1,INSD:AV560576.1,INSD:EH946112.1,
INSD:BP844536.1,INSD:BP600749.1,INSD:EH872600.1,
INSD:BP661916.1,INSD:DR353011.1,INSD:EG499939.1,
INSD:BP598841.1,INSD:DR363192.1,INSD:DR353010.1,
INSD:EH908374.1,INSD:T88195.1,INSD:EH822151.1,
INSD:DR353007.1,INSD:ES079117.1,INSD:BE527391.1,
INSD:BP621803.1,INSD:ES015647.1,INSD:BP860387.1,
INSD:EL044959.1,INSD:EG528845.1,INSD:EL062242.1,
INSD:CF774195.1,INSD:EL223306.1,INSD:BP588984.1,
INSD:W43604.1,INSD:ES119361.1,INSD:BE530254.1,
INSD:BP805982.1,INSD:EL254359.1,INSD:ES073011.1,
INSD:ES019762.1,INSD:ES032678.1,INSD:BP599799.1,
INSD:EL261434.1,INSD:AV806421.1,INSD:ES163374.1,
INSD:BP780654.1,INSD:EL118470.1,INSD:DR363191.1,
INSD:DR226304.1,INSD:AV520844.1,INSD:ES035070.1,
INSD:AV442570.1,INSD:BP804244.1,INSD:BP611624.1,
INSD:AV528678.1,INSD:BP597051.1,INSD:ES066169.1,
INSD:DR353009.1,INSD:AV542962.1,INSD:ES184326.1,
INSD:EH813850.1,INSD:AV523092.1,INSD:EH935194.1,
INSD:DR379196.1,INSD:DR353013.1,INSD:BP809302.1,
INSD:DR363195.1,INSD:N96064.1,INSD:ES123581.1,
INSD:CB258471.1,INSD:EH953132.1,INSD:ES034413.1,
INSD:ES048019.1,INSD:EH820217.1,INSD:R65205.1,
INSD:BP591232.1,INSD:BP781546.1,INSD:EG528846.1,
INSD:AA597600.1,INSD:BU635224.1,INSD:AV542824.1,
INSD:N65082.1,INSD:BP808113.1,INSD:AI994025.1,
INSD:EL969956.1,INSD:BP592377.1,INSD:EH831705.1,
INSD:EL231295.1,INSD:AV440663.1,INSD:DR363193.1,
INSD:BP587022.1,INSD:EL088036.1,INSD:T44373.1,
INSD:EG499148.1"
/note="Protein of unknown function (duplicated DUF1399);
FUNCTIONS IN: molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: plasma membrane;
EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 13
growth stages; CONTAINS InterPro DOMAIN/s: Protein of
unknown function DUF1399 (InterPro:IPR009836); BEST
Arabidopsis thaliana protein match is: Protein of unknown
function (duplicated DUF1399) (TAIR:AT4G37900.1); Has
35333 BLAST hits to 34131 proteins in 2444 species: Archaea
- 798; Bacteria - 22429; Metazoa - 974; Fungi - 991;
Plants - 531; Viruses - 0; Other Eukaryotes - 9610
(source: NCBI BLink)."
/db_xref="Araport:AT2G22660"
/db_xref="TAIR:AT2G22660"
intron_pos 55:2 (1/7)
intron_pos 96:0 (2/7)
intron_pos 181:0 (3/7)
intron_pos 333:0 (4/7)
intron_pos 514:2 (5/7)
intron_pos 579:0 (6/7)
intron_pos 631:0 (7/7)
BEGIN
1 MDKEKDHEVE WLEAQKIEIS VDLLAAAKQH LLFLETVDRN RWLYDGPALE KAIYRYNACW
61 LPLLVKYSES SSVSEGSLVP PLDCEWIWHC HRLNPVRYNS DCEQFYGRVL DNSGVLSSVD
121 GNCKLKTEDL WKRLYPDEPY ELDLDNIDLE DISEKSSALE KCTKYDLVSA VKRQSPFYYQ
181 VSRSHVNSDI FLQEAVARYK GFLYLIKMNR ERSLKRFCVP TYDVDLIWHT HQLHPVSYCD
241 DMVKLIGKVL EHDDTDSDRG KGKKLDTGFS KTTAQWEETF GTRYWKAGAM HRGKTPVPVT
301 NSPYASDVLV KDPTAKDDFQ NLIQFPEVEV VEVLLEIIGV RNLPDGHKGK VSVMFSKTQP
361 DSLFNAERRL TILSEVGEKQ VATFQCEPTG ELVFKLISCS PSKIPVSREP KNLGFASLSL
421 KEFLFPVITQ LSVEKWLELT PSKGSQTDTK PISLRVAVSF TPPVRSPSVL HMVQSRPSCK
481 GSCFFPIIGK SRLAKSSTHI VDETQTEVIT LQIRNSADGG ILKDDQRQVM GVTDSGETRV
541 LAVYTGSFWS LLDSKWSLKQ INASTADNPL FEILGPRVVK IFSGRKLDYE PKHCANLRSD
601 LDFMTLVEFS KQHPYGKTVG LVDMRFGSIE AKENWLLLPG IVSAFILHTV LKKGGSEGFN
661 VTTKDIKEES KQTKLVAATE NNVNANSTNV ETQTAITAPK KGSGCGGGCS GECGNMVKAA
721 NASGCGSSCS GECGDMVKSA ANASGCGSGC SGECGNMVKA ANASGGGYGA RCKAAKASGC
781 GGGCGGGCGG GCGDMVKSVN ASGCGGGCNG ECGNMVKAA
//