LOCUS       AEC08329.1              1002 aa    PRT    PLN    23-MAR-2023
DEFINITION  Arabidopsis thaliana Double Clp-N motif-containing P-loop
            nucleoside triphosphate hydrolases superfamily protein protein.
ACCESSION   CP002685-4019
PROTEIN_ID  AEC08329.1
SOURCE      Arabidopsis thaliana (thale cress)
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
            Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
            Camelineae; Arabidopsis.
REFERENCE   1  (bases 1 to 19698289)
  AUTHORS   Lin,X., Kaul,S., Rounsley,S., Shea,T.P., Benito,M.I., Town,C.D.,
            Fujii,C.Y., Mason,T., Bowman,C.L., Barnstead,M., Feldblyum,T.V.,
            Buell,C.R., Ketchum,K.A., Lee,J., Ronning,C.M., Koo,H.L.,
            Moffat,K.S., Cronin,L.A., Shen,M., Pai,G., Van Aken,S., Umayam,L.,
            Tallon,L.J., Gill,J.E., Adams,M.D., Carrera,A.J., Creasy,T.H.,
            Goodman,H.M., Somerville,C.R., Copenhaver,G.P., Preuss,D.,
            Nierman,W.C., White,O., Eisen,J.A., Salzberg,S.L., Fraser,C.M. and
            Venter,J.C.
  TITLE     Sequence and analysis of chromosome 2 of the plant Arabidopsis
            thaliana
  JOURNAL   Nature 402 (6763), 761-768 (1999)
   PUBMED   10617197
REFERENCE   2  (bases 1 to 19698289)
  AUTHORS   Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
  CONSRTM   TAIR
  TITLE     Direct Submission
  JOURNAL   Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
            Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE   3  (bases 1 to 19698289)
  AUTHORS   Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
            Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
            Vaughn,M. and Town,C.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
            9704 Medical Center Dr, Rockville, MD 20850, USA
  REMARK    Protein update by submitter
FEATURES             Qualifiers
     source          /organism="Arabidopsis thaliana"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:3702"
                     /chromosome="2"
                     /ecotype="Columbia"
     protein         /locus_tag="AT2G29970"
                     /gene_synonym="F23F1.11"
                     /gene_synonym="F23F1_11"
                     /gene_synonym="SMAX1-like 7"
                     /gene_synonym="SMXL7"
                     /inference="Similar to RNA sequence,
                     EST:INSD:EH868752.1,INSD:BE527736.1,INSD:EL085674.1,
                     INSD:ES035552.1,INSD:AV782243.1,INSD:EH886703.1,
                     INSD:EL982030.1,INSD:BP807163.1,INSD:AV523958.1,
                     INSD:EL288723.1,INSD:AV809156.1,INSD:AV527684.1,
                     INSD:AI999685.1,INSD:EG524600.1,INSD:EL087809.1,
                     INSD:AV792851.1,INSD:BX838112.1,INSD:AV821710.1,
                     INSD:EH878456.1,INSD:BP606404.1,INSD:DR312642.1,
                     INSD:EL234638.1,INSD:BP602371.1,INSD:EL200912.1,
                     INSD:EL294228.1,INSD:EH891759.1,INSD:DR356185.1,
                     INSD:AV529740.1,INSD:EH895870.1,INSD:BP601786.1,
                     INSD:AV528906.1,INSD:EH978040.1,INSD:EL077541.1,
                     INSD:DR371512.1,INSD:DR312643.1,INSD:BP779658.1,
                     INSD:BX834922.1"
                     /inference="similar to RNA sequence,
                     mRNA:INSD:AY039922.1"
                     /note="Double Clp-N motif-containing P-loop nucleoside
                     triphosphate hydrolases superfamily protein; BEST
                     Arabidopsis thaliana protein match is: Double Clp-N
                     motif-containing P-loop nucleoside triphosphate
                     hydrolases superfamily protein (TAIR:AT1G07200.2); Has
                     10913 Blast hits to 10488 proteins in 2533 species:
                     Archae - 15; Bacteria - 9122; Metazoa - 6; Fungi - 149;
                     Plants - 525; Viruses - 0; Other Eukaryotes - 1096
                     (source: NCBI BLink)."
                     /db_xref="Araport:AT2G29970"
                     /db_xref="TAIR:AT2G29970"
     intron_pos      387:2 (1/2)
     intron_pos      468:0 (2/2)
BEGIN
        1 MPTPVTTARQ CLTEETARAL DDAVSVARRR SHAQTTSLHA VSGLLTMPSS ILREVCISRA
       61 AHNTPYSSRL QFRALELCVG VSLDRLPSSK STPTTTVEED PPVSNSLMAA IKRSQATQRR
      121 HPETYHLHQI HGNNNTETTS VLKVELKYFI LSILDDPIVS RVFGEAGFRS TDIKLDVLHP
      181 PVTSQFSSRF TSRSRIPPLF LCNLPESDSG RVRFGFPFGD LDENCRRIGE VLARKDKKNP
      241 LLVGVCGVEA LKTFTDSINR GKFGFLPLEI SGLSVVSIKI SEVLVDGSRI DIKFDDLGRL
      301 KSGMVLNLGE LKVLASDVFS VDVIEKFVLK LADLLKLHRE KLWFIGSVSS NETYLKLIER
      361 FPTIDKDWNL HLLPITSSSQ GLYPKSSLMG SFVPFGGFFS STSDFRIPSS SSMNQTLPRC
      421 HLCNEKYEQE VTAFAKSGSM IDDQCSEKLP SWLRNVEHEH EKGNLGKVKD DPNVLASRIP
      481 ALQKKWDDIC QRIHQTPAFP KLSFQPVRPQ FPLQLGSSSQ TKMSLGSPTE KIVCTRTSES
      541 FQGMVALPQN PPHQPGLSVK ISKPKHTEDL SSSTTNSPLS FVTTDLGLGT IYASKNQEPS
      601 TPVSVERRDF EVIKEKQLLS ASRYCKDFKS LRELLSRKVG FQNEAVNAIS EIVCGYRDES
      661 RRRNNHVATT SNVWLALLGP DKAGKKKVAL ALAEVFCGGQ DNFICVDFKS QDSLDDRFRG
      721 KTVVDYIAGE VARRADSVVF IENVEKAEFP DQIRLSEAMR TGKLRDSHGR EISMKNVIVV
      781 ATISGSDKAS DCHVLEEPVK YSEERVLNAK NWTLQIKLAD TSNVNKNGPN KRRQEEAETE
      841 VTELRALKSQ RSFLDLNLPV DEIEANEDEA YTMSENTEAW LEDFVEQVDG KVTFKLIDFD
      901 ELAKNIKRNI LSLFHLSFGP ETHLEIENDV ILKILAALRW SSDEEKTFDQ WLQTVLAPSF
      961 AKARQKCVPA APFSVKLVAS RESPAEEETT GIQQFPARVE VI
//