LOCUS AEC10608.1 523 aa PRT PLN 23-MAR-2023 DEFINITION Arabidopsis thaliana O-glucosyltransferase rumi-like protein (DUF821) protein. ACCESSION CP002685-7121 PROTEIN_ID AEC10608.1 SOURCE Arabidopsis thaliana (thale cress) ORGANISM Arabidopsis thaliana Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis. REFERENCE 1 (bases 1 to 19698289) AUTHORS Lin,X., Kaul,S., Rounsley,S., Shea,T.P., Benito,M.I., Town,C.D., Fujii,C.Y., Mason,T., Bowman,C.L., Barnstead,M., Feldblyum,T.V., Buell,C.R., Ketchum,K.A., Lee,J., Ronning,C.M., Koo,H.L., Moffat,K.S., Cronin,L.A., Shen,M., Pai,G., Van Aken,S., Umayam,L., Tallon,L.J., Gill,J.E., Adams,M.D., Carrera,A.J., Creasy,T.H., Goodman,H.M., Somerville,C.R., Copenhaver,G.P., Preuss,D., Nierman,W.C., White,O., Eisen,J.A., Salzberg,S.L., Fraser,C.M. and Venter,J.C. TITLE Sequence and analysis of chromosome 2 of the plant Arabidopsis thaliana JOURNAL Nature 402 (6763), 761-768 (1999) PUBMED 10617197 REFERENCE 2 (bases 1 to 19698289) AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E. CONSRTM TAIR TITLE Direct Submission JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie Institution, 260 Panama Street, Stanford, CA, USA REFERENCE 3 (bases 1 to 19698289) AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M., Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R., Vaughn,M. and Town,C.D. TITLE Direct Submission JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute, 9704 Medical Center Dr, Rockville, MD 20850, USA REMARK Protein update by submitter FEATURES Qualifiers source /organism="Arabidopsis thaliana" /mol_type="genomic DNA" /db_xref="taxon:3702" /chromosome="2" /ecotype="Columbia" protein /locus_tag="AT2G45840" /gene_synonym="F4I18.18" /inference="Similar to RNA sequence, EST:INSD:BP813673.1,INSD:EG421743.1,INSD:EG421788.1, INSD:EG421834.1,INSD:EG421714.1,INSD:EG421711.1, INSD:EG421725.1,INSD:AV796104.1,INSD:BP828595.1, INSD:EG421737.1,INSD:EG421726.1,INSD:EG421789.1, INSD:EG421740.1,INSD:EG421807.1,INSD:EG421783.1, INSD:EG421835.1,INSD:AU238756.1,INSD:EG421738.1, INSD:BP656892.1,INSD:EG421741.1,INSD:EG421815.1, INSD:EG421784.1,INSD:EG421719.1,INSD:EG421723.1, INSD:EG421785.1,INSD:AV797574.1,INSD:EG421727.1, INSD:EG421790.1,INSD:EG421782.1,INSD:EG421722.1, INSD:EG421729.1,INSD:EH956476.1,INSD:EG421732.1, INSD:EG421813.1,INSD:EG421730.1,INSD:EG421728.1, INSD:EG421832.1,INSD:EG421739.1,INSD:EG421805.1, INSD:EG421713.1,INSD:BP795664.1,INSD:EG421721.1, INSD:EG421710.1,INSD:EG421724.1,INSD:EG421830.1, INSD:AU229991.1,INSD:DR381857.1,INSD:EG421715.1, INSD:BP827785.1,INSD:EG421806.1,INSD:BP590815.1, INSD:EG421812.1" /inference="similar to RNA sequence, mRNA:INSD:DQ056580.1,INSD:AK229568.1" /note="CONTAINS InterPro DOMAIN/s: Lipopolysaccharide-modifying protein (InterPro:IPR006598), Protein of unknown function DUF821, CAP10-like (InterPro:IPR008539); BEST Arabidopsis thaliana protein match is: Arabidopsis thaliana protein of unknown function (DUF821) (TAIR:AT3G61280.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink)." /db_xref="Araport:AT2G45840" /db_xref="TAIR:AT2G45840" intron_pos 54:0 (1/5) intron_pos 249:2 (2/5) intron_pos 309:0 (3/5) intron_pos 330:2 (4/5) intron_pos 401:0 (5/5) BEGIN 1 MSSHVVDDQR QPNNSISSLS SKLTKPWITT TIFIFVFFFF IILVGASLRW MDMFLIGGGR 61 IKVTPIFTRN TNATIPKEKL TTPLNFTLQC SLDQNIATQT CPASNPEKSQ PSKDEPETCP 121 DYFRWIHKDL EAWRETGITR ETLERASDKA HFRLIIKGGR VYVHQYKKSF QTRDVFTIWG 181 IVQLLRMYPG QVPDLELLFM CHDSPEIWRR DYRPRPGVNV TWPPPPLFHY CGHSGAFDIV 241 FPDWSFWGWP EINIKEWNKQ SELISEGIKK VKWEEREPYA YWKGNPGVAM VRRDLMHCHD 301 PMVHLYRQDW SREGRIGYRT SNLEDQCTHR YKIYVEGRAW SVSEKYILAC DSMTLLVKPF 361 YFDFFTRSLV PMEHYWPIRP QEKCSDIVFA VHWGNNNTKK ARAIGRNGSG YVRKNLKMKY 421 VYDYMLHLLQ SYGKLMKMNV EVPQGAKEVC PETMACPING GRMRQSMDDS LVMSPSVKAT 481 CEMPPPFEED ELKKFLEKKE SVEKEVEKWT NEYWQEQKKI LKH //