LOCUS AEC09699.1 336 aa PRT PLN 23-MAR-2023 DEFINITION Arabidopsis thaliana Nucleotide-diphospho-sugar transferases superfamily protein protein. ACCESSION CP002685-5878 PROTEIN_ID AEC09699.1 SOURCE Arabidopsis thaliana (thale cress) ORGANISM Arabidopsis thaliana Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis. REFERENCE 1 (bases 1 to 19698289) AUTHORS Lin,X., Kaul,S., Rounsley,S., Shea,T.P., Benito,M.I., Town,C.D., Fujii,C.Y., Mason,T., Bowman,C.L., Barnstead,M., Feldblyum,T.V., Buell,C.R., Ketchum,K.A., Lee,J., Ronning,C.M., Koo,H.L., Moffat,K.S., Cronin,L.A., Shen,M., Pai,G., Van Aken,S., Umayam,L., Tallon,L.J., Gill,J.E., Adams,M.D., Carrera,A.J., Creasy,T.H., Goodman,H.M., Somerville,C.R., Copenhaver,G.P., Preuss,D., Nierman,W.C., White,O., Eisen,J.A., Salzberg,S.L., Fraser,C.M. and Venter,J.C. TITLE Sequence and analysis of chromosome 2 of the plant Arabidopsis thaliana JOURNAL Nature 402 (6763), 761-768 (1999) PUBMED 10617197 REFERENCE 2 (bases 1 to 19698289) AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E. CONSRTM TAIR TITLE Direct Submission JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie Institution, 260 Panama Street, Stanford, CA, USA REFERENCE 3 (bases 1 to 19698289) AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M., Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R., Vaughn,M. and Town,C.D. TITLE Direct Submission JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute, 9704 Medical Center Dr, Rockville, MD 20850, USA REMARK Protein update by submitter FEATURES Qualifiers source /organism="Arabidopsis thaliana" /mol_type="genomic DNA" /db_xref="taxon:3702" /chromosome="2" /ecotype="Columbia" protein /locus_tag="AT2G39630" /gene_synonym="F12L6.29" /gene_synonym="F12L6_29" /inference="Similar to RNA sequence, EST:INSD:DR256910.1,INSD:AI997399.1,INSD:ES006398.1, INSD:BP632315.1,INSD:EL978186.1,INSD:EL267540.1, INSD:EL027353.1,INSD:ES103156.1,INSD:ES118341.1, INSD:BP856786.1,INSD:EH928274.1,INSD:BP810397.2, INSD:AA067555.1,INSD:AV564341.1,INSD:BP855450.1, INSD:AV563532.1,INSD:ES096028.1,INSD:AV790247.1, INSD:DR256912.1,INSD:EH923508.1,INSD:EL068757.1, INSD:DR256917.1,INSD:AV824746.1,INSD:DR372491.1, INSD:ES008548.1,INSD:EL030927.1,INSD:EH902205.1, INSD:DR256920.1,INSD:DR256919.1,INSD:DR256915.1, INSD:ES145316.1,INSD:DR256911.1,INSD:BE038160.1, INSD:DR256908.1,INSD:BP668474.1,INSD:ES054713.1, INSD:EH918458.1,INSD:DR256913.1,INSD:DR256909.1, INSD:AV787501.1,INSD:DR256914.1,INSD:EL255552.1, INSD:AV816380.1,INSD:EH862771.1,INSD:AV562121.1, INSD:N38608.1,INSD:BP848056.1,INSD:EH926928.1, INSD:BX835687.1,INSD:EL004904.1" /inference="similar to RNA sequence, mRNA:INSD:AY056120.1,INSD:BX820046.1,INSD:AY078031.1" /note="Nucleotide-diphospho-sugar transferases superfamily protein; FUNCTIONS IN: transferase activity, transferring glycosyl groups; INVOLVED IN: biosynthetic process, protein amino acid glycosylation; LOCATED IN: endoplasmic reticulum; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Glycosyl transferase, family 2 (InterPro:IPR001173); BEST Arabidopsis thaliana protein match is: Nucleotide-diphospho-sugar transferases superfamily protein (TAIR:AT1G20575.1); Has 14679 Blast hits to 14665 proteins in 2386 species: Archae - 662; Bacteria - 10857; Metazoa - 271; Fungi - 260; Plants - 99; Viruses - 22; Other Eukaryotes - 2508 (source: NCBI BLink)." /db_xref="Araport:AT2G39630" /db_xref="TAIR:AT2G39630" intron_pos 38:2 (1/9) intron_pos 55:0 (2/9) intron_pos 90:2 (3/9) intron_pos 106:0 (4/9) intron_pos 151:0 (5/9) intron_pos 181:0 (6/9) intron_pos 225:0 (7/9) intron_pos 253:0 (8/9) intron_pos 274:2 (9/9) BEGIN 1 MEFLVTVAEF SLWLLLIVLF GFLSVVVFEA WRRRHSNVSV ETVTTLDDPK SIKPIPCPHI 61 TDPAEKYLSL IVPAYNEELR LPAALEETMD YLQDRASRDK SFSFEVVIVD DGSVDGTKRV 121 AFDFIRKYTI DNIRVIPLGK NQGKGEAIRK GMLHSRGQLL LMLDADGATK VTDLEKLENQ 181 INAVAREEYS IRNPASKDMD FKIGDVQVSA FGSRAHLEEK ALATRKWYRN FLMKGFHLVV 241 LLAAGPGIRD TQCGFKMFTR AAARRLFTNV HLKRWCFDVE LVYLCKRFNI PMVEISVKWS 301 EIPGSKVSML SIPNMLWELA LMSVGYRTGM WKIHQV //