LOCUS AEC09148.1 497 aa PRT PLN 23-MAR-2023 DEFINITION Arabidopsis thaliana Nucleotide-diphospho-sugar transferases superfamily protein protein. ACCESSION CP002685-5142 PROTEIN_ID AEC09148.1 SOURCE Arabidopsis thaliana (thale cress) ORGANISM Arabidopsis thaliana Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis. REFERENCE 1 (bases 1 to 19698289) AUTHORS Lin,X., Kaul,S., Rounsley,S., Shea,T.P., Benito,M.I., Town,C.D., Fujii,C.Y., Mason,T., Bowman,C.L., Barnstead,M., Feldblyum,T.V., Buell,C.R., Ketchum,K.A., Lee,J., Ronning,C.M., Koo,H.L., Moffat,K.S., Cronin,L.A., Shen,M., Pai,G., Van Aken,S., Umayam,L., Tallon,L.J., Gill,J.E., Adams,M.D., Carrera,A.J., Creasy,T.H., Goodman,H.M., Somerville,C.R., Copenhaver,G.P., Preuss,D., Nierman,W.C., White,O., Eisen,J.A., Salzberg,S.L., Fraser,C.M. and Venter,J.C. TITLE Sequence and analysis of chromosome 2 of the plant Arabidopsis thaliana JOURNAL Nature 402 (6763), 761-768 (1999) PUBMED 10617197 REFERENCE 2 (bases 1 to 19698289) AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E. CONSRTM TAIR TITLE Direct Submission JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie Institution, 260 Panama Street, Stanford, CA, USA REFERENCE 3 (bases 1 to 19698289) AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M., Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R., Vaughn,M. and Town,C.D. TITLE Direct Submission JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute, 9704 Medical Center Dr, Rockville, MD 20850, USA REMARK Protein update by submitter FEATURES Qualifiers source /organism="Arabidopsis thaliana" /mol_type="genomic DNA" /db_xref="taxon:3702" /chromosome="2" /ecotype="Columbia" protein /gene="PGSIP7" /locus_tag="AT2G35710" /gene_synonym="plant glycogenin-like starch initiation protein 7" /gene_synonym="T20F21.27" /inference="Similar to RNA sequence, EST:INSD:EG460159.1,INSD:EG460173.1,INSD:EG505148.1, INSD:EG460179.1,INSD:EG460154.1,INSD:EG418542.1, INSD:EG460186.1,INSD:EG460178.1,INSD:BP603927.1, INSD:EG460150.1,INSD:EG505143.1,INSD:AV786045.1, INSD:EG505145.1,INSD:EG460183.1,INSD:AV440620.1, INSD:EG460155.1,INSD:BP807282.1,INSD:EG505161.1, INSD:EG505147.1,INSD:BP655101.1,INSD:EG460168.1, INSD:EG460164.1,INSD:EG505158.1,INSD:EG460184.1, INSD:DR307428.1,INSD:CB255123.1,INSD:EG460167.1, INSD:AV824411.1,INSD:EG460153.1,INSD:EG460175.1, INSD:EG418564.1,INSD:EG505151.1,INSD:EG505152.1, INSD:EG460170.1,INSD:EG505144.1,INSD:EG505159.1, INSD:EG460169.1,INSD:EG505140.1,INSD:EG505157.1, INSD:EG418531.1,INSD:EG460180.1,INSD:EG505155.1, INSD:EG505156.1,INSD:EG505149.1,INSD:BP600601.1, INSD:EG505141.1,INSD:EG418553.1" /inference="similar to RNA sequence, mRNA:INSD:AY096617.1,INSD:BX820810.1,INSD:AY063949.1, INSD:BX820847.1" /note="Nucleotide-diphospho-sugar transferases superfamily protein; FUNCTIONS IN: transferase activity, transferring hexosyl groups, transferase activity, transferring glycosyl groups; INVOLVED IN: carbohydrate biosynthetic process, biosynthetic process; LOCATED IN: endomembrane system; EXPRESSED IN: 11 plant structures; EXPRESSED DURING: 7 growth stages; CONTAINS InterPro DOMAIN/s: Glycosyl transferase, family 8 (InterPro:IPR002495); BEST Arabidopsis thaliana protein match is: Nucleotide-diphospho-sugar transferases superfamily protein (TAIR:AT4G16600.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink)." /db_xref="Araport:AT2G35710" /db_xref="TAIR:AT2G35710" intron_pos 115:2 (1/3) intron_pos 269:1 (2/3) intron_pos 318:2 (3/3) BEGIN 1 MDLQRGFVFL SLVLSFMIIE TTAYRERQLL LLQPPQETAI DTANAVVTVQ DRGLKTRRPE 61 HKNAYATMMY MGTPRDYEFY VATRVLIRSL RSLHVEADLV VIASLDVPLR WVQTLEEEDG 121 AKVVRVENVD NPYRRQTNFN SRFKLTLNKL YAWALSDYDR VVMLDADNLF LKKADELFQC 181 GRFCAVFINP CIFHTGLFVL QPSVEVFKDM LHELQVGRKN PDGADQGFLV SYFSDLLDQP 241 LFSPPSNGSV LNGHLRLPLG YQMDASYFYL KLRWNIPCGP NSVITFPGAV WLKPWYWWSW 301 PVLPLGFSWH EQRRATIGYS AEMPLVIIQA MFYLGIIVVT RLARPNITKL CYRRSDRNLT 361 TIQAGFKLIA LLSVVAAYIF PFFTIPHTIH PLIGWSLYLM ASFALSSISI NTLLLPTLPV 421 LTPWLGILGT LLVMAFPWYP DGVVRALSVF AYAFCCAPFV WVSFRKITSH LQVLIEKEVL 481 FPRLGDSGVT SGFSKLY //