LOCUS AY088313 1789 bp mRNA linear PLN 27-JAN-2006 DEFINITION Arabidopsis thaliana clone 5509 mRNA, complete sequence. ACCESSION AY088313 VERSION AY088313.1 KEYWORDS FLI_CDNA. SOURCE Arabidopsis thaliana (thale cress) ORGANISM Arabidopsis thaliana Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis. REFERENCE 1 (bases 1 to 1789) AUTHORS Haas,B.J., Volfovsky,N., Town,C.D., Troukhan,M., Alexandrov,N., Feldmann,K.A., Flavell,R.B., White,O. and Salzberg,S.L. TITLE Full-length messenger RNA sequences greatly improve genome annotation JOURNAL Genome Biol. 3 (6), RESEARCH0029 (2002) PUBMED 12093376 REFERENCE 2 (bases 1 to 1789) AUTHORS Alexandrov,N.N., Troukhan,M.E., Brover,V.V., Tatarinova,T., Flavell,R.B. and Feldmann,K.A. TITLE Features of Arabidopsis genes and genome discovered using full-length cDNAs JOURNAL Plant Mol. Biol. 60 (1), 69-85 (2006) PUBMED 16463100 REFERENCE 3 (bases 1 to 1789) AUTHORS Brover,V., Troukhan,M., Alexandrov,N., Lu,Y.-P., Flavell,R. and Feldmann,K. TITLE Direct Submission JOURNAL Submitted (11-MAR-2002) Ceres, Inc, 3007 Malibu Canyon Road, Malibu, CA 90265, USA COMMENT This clone sequence is one of 5,000 Ceres full-length cDNAs made available to TIGR and Genbank. The following quality assessment of this set was done by comparison with known proteins: two percent of the clones are estimated to be 5'-truncated; less than one percent are 3'-truncated; approximately two percent represent alternative splice variants, including unspliced introns and spliced exons; one percent may contain premature stop codons; five percent may have frame shifts in a coding region. A sequence is considered to be 5'-truncated if it lacks the translation initiation start (ATG). A sequence is considered to be 3'-truncated if it lacks the C-terminal end of the encoded protein. Please note that these cDNA sequences are derived from the Ws or LAer ecotypes and therefore may contain polymorphisms when compared to sequences from Col-0. Genset carried out the library production and sequencing of the full-length clones. Ceres, Inc. carried out the clustering of the 5' sequences, selection of clones, and sequence assembly. FEATURES Location/Qualifiers source 1..1789 /organism="Arabidopsis thaliana" /mol_type="mRNA" /db_xref="taxon:3702" /clone="5509" CDS 84..1592 /codon_start=1 /product="putative UDP-N-acetylglucosamine pyrophosphorylase" /protein_id="AAM65852.1" /translation="MKEPTTEIEIETSAVTTILPPPLPPTASPHQALVERLKDYGQED VFSLWDELSPEERDLLLRDIENLDLPRIDRIIRCSLHSQGLPVAAIEPVPENCVSTVE ERTKEDREKWWKMGLKAIYEGKLGVVLLSGGQGTRLGSSDPKGCYNIGLPSGKSLFQI QAERILCVQRLASQAMSEASPTRPVTIQWYIMTSPFTHEPTQKFFESHKYFGLEPDQV TFFQQGALPCISKDGKFIMETPFSLSKAPDGNGGVYTALKSSRLLEDMASRGIKYVDC YGVDNVLVRVADPTFLGYFIDKSAASAAKVVRKAYPQEKVGVFVRRGKGGPLTVVEYT ELDQSMASATNQQTGRLQYCWSNVCLHMFTLDFLNQVANGLEKDSVYHLAEKKIPSIN GDIVGLKLEQFIFDCFPYAPSTALFEVLREEEFAPVKNANGSNYDTPESARLLVLRLH TRWVIAAGGFLTHSVPLYATGVEVSPLCSYAGENLEAICRGRTFHAPCEISL" BASE COUNT 482 a 355 c 438 g 514 t ORIGIN 1 ccctcgcgca tcttcggtgc tataaaacaa actcacgact agatcatttt tggagtgaga 61 tttacgtttg atagactact acaatgaagg agccaacgac ggagatagag attgaaactt 121 cagctgtcac aacgatcctg cctcctcctc ttcctccgac ggcgtcacct catcaggcgt 181 tggtggagag gctcaaggat tatggacagg aagatgtttt ctctctctgg gatgaactct 241 caccggaaga gcgagatctc ctcctccgag atatcgagaa tttggatctt ccaaggatag 301 atcggatcat cagatgctca cttcactcac aagggttgcc agtggcggca atagaaccgg 361 tgccggagaa ttgtgtgtca acggtggagg aaagaactaa ggaagacaga gaaaaatggt 421 ggaaaatggg attaaaagct atctacgaag gcaaattggg tgtggtgctt ttatctggtg 481 gacagggaac aagacttgga agttcagatc caaaagggtg ttataatatc ggactgccat 541 ctgggaaatc actttttcag attcaagctg agaggatctt atgtgtccaa aggcttgctt 601 ctcaggcaat gagtgaggca agtccaactc gcccagttac aatacagtgg tatataatga 661 ccagtccatt tactcatgaa ccaacacaaa aattcttcga gagtcacaag tattttggcc 721 ttgaaccaga tcaagtcacc ttttttcaac aaggagctct gccttgcatt tcaaaggatg 781 gcaagtttat catggagaca cctttcagcc tatccaaggc gccggatggg aacgggggag 841 tttatacagc tttaaaatct tcaaggttat tagaagatat ggcttcgagg gggattaaat 901 atgtggattg ctatggtgtt gacaatgttc tggttcgagt agctgaccct acttttctgg 961 gatacttcat cgacaaaagt gcagcttcag ctgcaaaagt agtgcgcaag gcatatccac 1021 aggaaaaagt tggagtattt gtaaggaggg gaaaaggtgg gcctttgact gtagttgagt 1081 acacagagct tgaccagtct atggcttctg caactaatca acaaacagga cgtcttcaat 1141 attgctggag taacgtgtgc ttacacatgt tcactctgga tttccttaac caagttgcga 1201 atgggctgga aaaagacagc gtttaccatt tggcggagaa gaagataccg tctataaatg 1261 gcgacatagt gggactaaaa ctagaacagt tcatattcga ttgctttcct tatgctcctt 1321 cgactgcact ttttgaggtg ttgagggagg aagagtttgc accggtgaag aacgcaaacg 1381 ggtcgaatta cgacacaccg gaaagcgcaa gactgttggt tctacgactg catacacgtt 1441 gggtcatagc agctggtgga tttctaacac attccgttcc tttatatgcg actggtgtgg 1501 aagtgtcacc attgtgctcg tacgctggag aaaatctaga agcgatttgt cggggaagaa 1561 cctttcacgc accatgtgaa atctccctct aatcttcttc tttctttttc ttatctattc 1621 ttgtaatttt gtcattgtct ttgctttttc ttttggcttg gttcttcttt tgtgttggtg 1681 tttggtctct atatatgaac tgtaactggc ggaggtggtc agtagtctct tttcaatgtg 1741 ttactcttca ttattgcaac aatagtgaag tctccgaaat tttggtgtt //