LOCUS       AAF80123.1               435 aa    PRT              PLN 29-JUN-2000
DEFINITION  Arabidopsis thaliana T21E18.5 protein.
ACCESSION   AC024174-5
PROTEIN_ID  AAF80123.1
SOURCE      Arabidopsis thaliana (thale cress)
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
            Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
            Camelineae; Arabidopsis.
REFERENCE   1  (bases 1 to 74316)
  AUTHORS   Sakano,H., Vaysberg,M., Lee,J., Lenz,C., Liu,S.X., Pham,P.,
            Toriumi,M., Yu,G., Chin,C., Chiou,J., Choi,E., Chung,M.,
            Gonzalez,A., Howng,B., Liu,A., Altafi,H., Brooks,S., Buehler,E.,
            Chao,Q., Conn,L., Conway,A.B., Hansen,N.F., Johnson-Hopson,C.,
            Khan,S., Kim,C., Lam,B., Miranda,M., Nguyen,M., Palm,C.J.,
            Shinn,P., Southwick,A., Davis,R.W., Ecker,J.R., Federspiel,N.A. and
            Theologis,A.
  TITLE     The sequence of BAC T21E18 from Arabidopsis thaliana chromosome 1
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 74316)
  AUTHORS   Theologis,A.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-FEB-2000) Plant Gene Expression Center, 800 Buchanan
            Street, Albany, CA 94710, USA
REFERENCE   3  (bases 1 to 74316)
  AUTHORS   Theologis,A.
  TITLE     Direct Submission
  JOURNAL   Submitted (15-MAR-2000) Plant Gene Expression Center, 800 Buchanan
            Street, Albany, CA 94710, USA
REFERENCE   4  (bases 1 to 74316)
  AUTHORS   Theologis,A.
  TITLE     Direct Submission
  JOURNAL   Submitted (29-JUN-2000) Plant Gene Expression Center, 800 Buchanan
            St., Albany, CA 94710, USA
COMMENT     On Mar 15, 2000 this sequence version replaced AC024174.1.
            The sequence is of BAC T21E18 from Arabidopsis thaliana chromosome
            1.  The sequence does not represent the sequence of the entire
            insert of this clone because BAC T21E18 contains an E.coli
            insertion element 10 (IS10) at position 101153.  The IS10 (1329 bp
            in size) and the target site duplication (TGCATGGTC) have been
            removed from this entry.  The correct Arabidopsis sequence is
            confirmed by sequencing the PCR product from genomic DNA.  The
            sequence is also shorter by 30500 bp because we submit only the
            unique sequence of the clone.  However, in order to facilitate the
            joining of overlapping clones in the future, for creation of larger
            contigs, we provide small overlaps (200 bp) between overlapping
            sumbitted clones.  The 5' end of this sequence overlaps by 200 bp
            to the 3' end of the sequence of the clone T20M3.
FEATURES             Qualifiers
     source          /organism="Arabidopsis thaliana"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:3702"
                     /chromosome="1"
                     /clone="T21E18"
                     /ecotype="Columbia"
     protein         /gene="T21E18.5"
                     /inference="non-experimental evidence, no additional
                     details recorded"
                     /note="Contains similarity to UDPG glucosyltransferase
                     from Solanum berthaultii gi|2232354 and contains
                     UDP-glycoronysyl and UDP-glucosyl transferases PF|00201
                     domain. ESTs gb|AV551176, gb|Z46581, gb|AV439781,
                     gb|AV542358, gb|AV525326, gb|AV538963, gb|Z46580,
                     gb|AV547292, gb|AV532314, gb|AV565317, gb|AV542340 come
                     from this gene."
BEGIN
        1 MTTTTTKKPH VLVIPFPQSG HMVPHLDLTH QILLRGATVT VLVTPKNSSY LDALRSLHSP
       61 EHFKTLILPF PSHPCIPSGV ESLQQLPLEA IVHMFDALSR LHDPLVDFLS RQPPSDLPDA
      121 ILGSSFLSPW INKVADAFSI KSISFLPINA HSISVMWAQE DRSFFNDLET ATTESYGLVI
      181 NSFYDLEPEF VETVKTRFLN HHRIWTVGPL LPFKAGVDRG GQSSIPPAKV SAWLDSCPED
      241 NSVVYVGFGS QIRLTAEQTA ALAAALEKSS VRFIWAVRDA AKKVNSSDNS VEEDVIPAGF
      301 EERVKEKGLV IRGWAPQTMI LEHRAVGSYL THLGWGSVLE GMVGGVMLLA WPMQADHFFN
      361 TTLIVDKLRA AVRVGENRDS VPDSDKLARI LAESAREDLP ERVTLMKLRE KAMEAIKEGG
      421 SSYKNLDELV AEMCL
//