LOCUS AEC07335.1 819 aa PRT PLN 23-MAR-2023 DEFINITION Arabidopsis thaliana DNA-binding protein, putative (duplicated DUF1399) protein. ACCESSION CP002685-2642 PROTEIN_ID AEC07335.1 SOURCE Arabidopsis thaliana (thale cress) ORGANISM Arabidopsis thaliana Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis. REFERENCE 1 (bases 1 to 19698289) AUTHORS Lin,X., Kaul,S., Rounsley,S., Shea,T.P., Benito,M.I., Town,C.D., Fujii,C.Y., Mason,T., Bowman,C.L., Barnstead,M., Feldblyum,T.V., Buell,C.R., Ketchum,K.A., Lee,J., Ronning,C.M., Koo,H.L., Moffat,K.S., Cronin,L.A., Shen,M., Pai,G., Van Aken,S., Umayam,L., Tallon,L.J., Gill,J.E., Adams,M.D., Carrera,A.J., Creasy,T.H., Goodman,H.M., Somerville,C.R., Copenhaver,G.P., Preuss,D., Nierman,W.C., White,O., Eisen,J.A., Salzberg,S.L., Fraser,C.M. and Venter,J.C. TITLE Sequence and analysis of chromosome 2 of the plant Arabidopsis thaliana JOURNAL Nature 402 (6763), 761-768 (1999) PUBMED 10617197 REFERENCE 2 (bases 1 to 19698289) AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E. CONSRTM TAIR TITLE Direct Submission JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie Institution, 260 Panama Street, Stanford, CA, USA REFERENCE 3 (bases 1 to 19698289) AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M., Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R., Vaughn,M. and Town,C.D. TITLE Direct Submission JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute, 9704 Medical Center Dr, Rockville, MD 20850, USA REMARK Protein update by submitter FEATURES Qualifiers source /organism="Arabidopsis thaliana" /mol_type="genomic DNA" /db_xref="taxon:3702" /chromosome="2" /ecotype="Columbia" protein /locus_tag="AT2G22660" /gene_synonym="T9I22.10" /gene_synonym="T9I22_10" /inference="Similar to RNA sequence, EST:INSD:AV782975.1,INSD:DR353012.1,INSD:EG523488.1, INSD:AA394703.1,INSD:DR355164.1,INSD:DR353008.1, INSD:BP605136.1,INSD:BP601929.1,INSD:BP603093.1, INSD:AV822262.1,INSD:ES182219.1,INSD:EL268833.1, INSD:DR363194.1,INSD:EL985356.1,INSD:BP587132.1, INSD:ES004603.1,INSD:EL171878.1,INSD:BP615304.1, INSD:ES207736.1,INSD:R84099.1,INSD:AV552363.1, INSD:BP592710.1,INSD:DR353014.1,INSD:EH805538.1, INSD:AV527335.1,INSD:AV560576.1,INSD:EH946112.1, INSD:BP844536.1,INSD:BP600749.1,INSD:EH872600.1, INSD:BP661916.1,INSD:DR353011.1,INSD:EG499939.1, INSD:BP598841.1,INSD:DR363192.1,INSD:DR353010.1, INSD:EH908374.1,INSD:T88195.1,INSD:EH822151.1, INSD:DR353007.1,INSD:ES079117.1,INSD:BE527391.1, INSD:BP621803.1,INSD:ES015647.1,INSD:BP860387.1, INSD:EL044959.1,INSD:EG528845.1,INSD:EL062242.1, INSD:CF774195.1,INSD:EL223306.1,INSD:BP588984.1, INSD:W43604.1,INSD:ES119361.1,INSD:BE530254.1, INSD:BP805982.1,INSD:EL254359.1,INSD:ES073011.1, INSD:ES019762.1,INSD:ES032678.1,INSD:BP599799.1, INSD:EL261434.1,INSD:AV806421.1,INSD:ES163374.1, INSD:BP780654.1,INSD:EL118470.1,INSD:DR363191.1, INSD:DR226304.1,INSD:AV520844.1,INSD:ES035070.1, INSD:AV442570.1,INSD:BP804244.1,INSD:BP611624.1, INSD:AV528678.1,INSD:BP597051.1,INSD:ES066169.1, INSD:DR353009.1,INSD:AV542962.1,INSD:ES184326.1, INSD:EH813850.1,INSD:AV523092.1,INSD:EH935194.1, INSD:DR379196.1,INSD:DR353013.1,INSD:BP809302.1, INSD:DR363195.1,INSD:N96064.1,INSD:ES123581.1, INSD:CB258471.1,INSD:EH953132.1,INSD:ES034413.1, INSD:ES048019.1,INSD:EH820217.1,INSD:R65205.1, INSD:BP591232.1,INSD:BP781546.1,INSD:EG528846.1, INSD:AA597600.1,INSD:BU635224.1,INSD:AV542824.1, INSD:N65082.1,INSD:BP808113.1,INSD:AI994025.1, INSD:EL969956.1,INSD:BP592377.1,INSD:EH831705.1, INSD:EL231295.1,INSD:AV440663.1,INSD:DR363193.1, INSD:BP587022.1,INSD:EL088036.1,INSD:T44373.1, INSD:EG499148.1" /note="Protein of unknown function (duplicated DUF1399); FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1399 (InterPro:IPR009836); BEST Arabidopsis thaliana protein match is: Protein of unknown function (duplicated DUF1399) (TAIR:AT4G37900.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink)." /db_xref="Araport:AT2G22660" /db_xref="TAIR:AT2G22660" intron_pos 55:2 (1/7) intron_pos 96:0 (2/7) intron_pos 181:0 (3/7) intron_pos 333:0 (4/7) intron_pos 514:2 (5/7) intron_pos 579:0 (6/7) intron_pos 631:0 (7/7) BEGIN 1 MDKEKDHEVE WLEAQKIEIS VDLLAAAKQH LLFLETVDRN RWLYDGPALE KAIYRYNACW 61 LPLLVKYSES SSVSEGSLVP PLDCEWIWHC HRLNPVRYNS DCEQFYGRVL DNSGVLSSVD 121 GNCKLKTEDL WKRLYPDEPY ELDLDNIDLE DISEKSSALE KCTKYDLVSA VKRQSPFYYQ 181 VSRSHVNSDI FLQEAVARYK GFLYLIKMNR ERSLKRFCVP TYDVDLIWHT HQLHPVSYCD 241 DMVKLIGKVL EHDDTDSDRG KGKKLDTGFS KTTAQWEETF GTRYWKAGAM HRGKTPVPVT 301 NSPYASDVLV KDPTAKDDFQ NLIQFPEVEV VEVLLEIIGV RNLPDGHKGK VSVMFSKTQP 361 DSLFNAERRL TILSEVGEKQ VATFQCEPTG ELVFKLISCS PSKIPVSREP KNLGFASLSL 421 KEFLFPVITQ LSVEKWLELT PSKGSQTDTK PISLRVAVSF TPPVRSPSVL HMVQSRPSCK 481 GSCFFPIIGK SRLAKSSTHI VDETQTEVIT LQIRNSADGG ILKDDQRQVM GVTDSGETRV 541 LAVYTGSFWS LLDSKWSLKQ INASTADNPL FEILGPRVVK IFSGRKLDYE PKHCANLRSD 601 LDFMTLVEFS KQHPYGKTVG LVDMRFGSIE AKENWLLLPG IVSAFILHTV LKKGGSEGFN 661 VTTKDIKEES KQTKLVAATE NNVNANSTNV ETQTAITAPK KGSGCGGGCS GECGNMVKAA 721 NASGCGSSCS GECGDMVKSA ANASGCGSGC SGECGNMVKA ANASGGGYGA RCKAAKASGC 781 GGGCGGGCGG GCGDMVKSVN ASGCGGGCNG ECGNMVKAA //