LOCUS AEC10589.1 723 aa PRT PLN 23-MAR-2023 DEFINITION Arabidopsis thaliana sterile alpha motif (SAM) domain- containing protein protein. ACCESSION CP002685-7095 PROTEIN_ID AEC10589.1 SOURCE Arabidopsis thaliana (thale cress) ORGANISM Arabidopsis thaliana Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis. REFERENCE 1 (bases 1 to 19698289) AUTHORS Lin,X., Kaul,S., Rounsley,S., Shea,T.P., Benito,M.I., Town,C.D., Fujii,C.Y., Mason,T., Bowman,C.L., Barnstead,M., Feldblyum,T.V., Buell,C.R., Ketchum,K.A., Lee,J., Ronning,C.M., Koo,H.L., Moffat,K.S., Cronin,L.A., Shen,M., Pai,G., Van Aken,S., Umayam,L., Tallon,L.J., Gill,J.E., Adams,M.D., Carrera,A.J., Creasy,T.H., Goodman,H.M., Somerville,C.R., Copenhaver,G.P., Preuss,D., Nierman,W.C., White,O., Eisen,J.A., Salzberg,S.L., Fraser,C.M. and Venter,J.C. TITLE Sequence and analysis of chromosome 2 of the plant Arabidopsis thaliana JOURNAL Nature 402 (6763), 761-768 (1999) PUBMED 10617197 REFERENCE 2 (bases 1 to 19698289) AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E. CONSRTM TAIR TITLE Direct Submission JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie Institution, 260 Panama Street, Stanford, CA, USA REFERENCE 3 (bases 1 to 19698289) AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M., Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R., Vaughn,M. and Town,C.D. TITLE Direct Submission JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute, 9704 Medical Center Dr, Rockville, MD 20850, USA REMARK Protein update by submitter FEATURES Qualifiers source /organism="Arabidopsis thaliana" /mol_type="genomic DNA" /db_xref="taxon:3702" /chromosome="2" /ecotype="Columbia" protein /locus_tag="AT2G45700" /gene_synonym="F4I18.32" /inference="Similar to RNA sequence, EST:INSD:EH988600.1,INSD:AU227370.1,INSD:BP858083.1, INSD:EH843507.1,INSD:BP845612.1,INSD:EL287847.1, INSD:AU236449.1,INSD:ES005001.1,INSD:ES017767.1, INSD:AV547263.1,INSD:ES100149.1,INSD:ES050016.1, INSD:BP587436.1,INSD:EH883276.1" /inference="similar to RNA sequence, mRNA:INSD:BX820730.1,INSD:BT006104.1,INSD:BT005774.1, INSD:AK228444.1" /note="sterile alpha motif (SAM) domain-containing protein; FUNCTIONS IN: hydrolase activity; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Sterile alpha motif-type (InterPro:IPR013761), Sterile alpha motif (InterPro:IPR001660), Sterile alpha motif homology (InterPro:IPR010993), Sterile alpha motif, type 1 (InterPro:IPR021129), DNA repair metallo-beta-lactamase (InterPro:IPR011084), Beta-lactamase-like (InterPro:IPR001279); BEST Arabidopsis thaliana protein match is: DNA repair metallo-beta-lactamase family protein (TAIR:AT3G26680.3); Has 2157 Blast hits to 2099 proteins in 358 species: Archae - 119; Bacteria - 227; Metazoa - 850; Fungi - 290; Plants - 353; Viruses - 0; Other Eukaryotes - 318 (source: NCBI BLink)." /db_xref="Araport:AT2G45700" /db_xref="TAIR:AT2G45700" intron_pos 212:0 (1/9) intron_pos 277:0 (2/9) intron_pos 406:0 (3/9) intron_pos 427:1 (4/9) intron_pos 500:0 (5/9) intron_pos 536:0 (6/9) intron_pos 571:1 (7/9) intron_pos 642:0 (8/9) intron_pos 676:2 (9/9) BEGIN 1 MSNTVEDDDD DFQIPPSSQL SIRKPLHPTN ANNISHRPPN KKPRLCRYPG KENVTPPPSP 61 DPDLFCSSST PHCILDCIPS SVDCSLGDFN GPISSLGEED KEDKDDCIKV NREGYLCNSM 121 EARLLKSRIC LGFDSGIHED DEGFVESNSE LDVLINLCSE SEGRSGEFSL GKDDSIQCPL 181 CSMDISSLSE EQRQVHSNTC LDKSYNQPSE QDSLRKCENL SSLIKESIDD PVQLPQLVTD 241 LSPVLKWLRS LGLAKYEDVF IREEIDWDTL QSLTEEDLLS IGITSLGPRK KIVNALSGVR 301 DPFASSAEVQ AQSHCTSGHV TERQRDKSTT RKASEPKKPT ANKLITEFFP GQATEGTKIR 361 TAPKPVAEKS PSDSSSRRAV RRNGNNGKSK VIPHWNCIPG TPFRVDAFKY LTRDCCHWFL 421 THFHLDHYQG LTKSFSHGKI YCSLVTAKLV NMKIGIPWER LQVLDLGQKV NISGIDVTCF 481 DANHCPGSIM ILFEPANGKA VLHTGDFRYS EEMSNWLIGS HISSLILDTT YCNPQYDFPK 541 QEAVIQFVVE AIQAEAFNPK TLFLIGSYTI GKERLFLEVA RVLREKIYIN PAKLKLLECL 601 GFSKDDIQWF TVKEEESHIH VVPLWTLASF KRLKHVANRY TNRYSLIVAF SPTGWTSGKT 661 KKKSPGRRLQ QGTIIRYEVP YSEHSSFTEL KEFVQKVSPE VIIPSVNNDG PDSAAAMVSL 721 LVT //