LOCUS       AAF80138.1              1322 aa    PRT              PLN 29-JUN-2000
DEFINITION  Arabidopsis thaliana T21E18.20 protein.
ACCESSION   AC024174-20
PROTEIN_ID  AAF80138.1
SOURCE      Arabidopsis thaliana (thale cress)
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
            Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
            Camelineae; Arabidopsis.
REFERENCE   1  (bases 1 to 74316)
  AUTHORS   Sakano,H., Vaysberg,M., Lee,J., Lenz,C., Liu,S.X., Pham,P.,
            Toriumi,M., Yu,G., Chin,C., Chiou,J., Choi,E., Chung,M.,
            Gonzalez,A., Howng,B., Liu,A., Altafi,H., Brooks,S., Buehler,E.,
            Chao,Q., Conn,L., Conway,A.B., Hansen,N.F., Johnson-Hopson,C.,
            Khan,S., Kim,C., Lam,B., Miranda,M., Nguyen,M., Palm,C.J.,
            Shinn,P., Southwick,A., Davis,R.W., Ecker,J.R., Federspiel,N.A. and
            Theologis,A.
  TITLE     The sequence of BAC T21E18 from Arabidopsis thaliana chromosome 1
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 74316)
  AUTHORS   Theologis,A.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-FEB-2000) Plant Gene Expression Center, 800 Buchanan
            Street, Albany, CA 94710, USA
REFERENCE   3  (bases 1 to 74316)
  AUTHORS   Theologis,A.
  TITLE     Direct Submission
  JOURNAL   Submitted (15-MAR-2000) Plant Gene Expression Center, 800 Buchanan
            Street, Albany, CA 94710, USA
REFERENCE   4  (bases 1 to 74316)
  AUTHORS   Theologis,A.
  TITLE     Direct Submission
  JOURNAL   Submitted (29-JUN-2000) Plant Gene Expression Center, 800 Buchanan
            St., Albany, CA 94710, USA
COMMENT     On Mar 15, 2000 this sequence version replaced AC024174.1.
            The sequence is of BAC T21E18 from Arabidopsis thaliana chromosome
            1.  The sequence does not represent the sequence of the entire
            insert of this clone because BAC T21E18 contains an E.coli
            insertion element 10 (IS10) at position 101153.  The IS10 (1329 bp
            in size) and the target site duplication (TGCATGGTC) have been
            removed from this entry.  The correct Arabidopsis sequence is
            confirmed by sequencing the PCR product from genomic DNA.  The
            sequence is also shorter by 30500 bp because we submit only the
            unique sequence of the clone.  However, in order to facilitate the
            joining of overlapping clones in the future, for creation of larger
            contigs, we provide small overlaps (200 bp) between overlapping
            sumbitted clones.  The 5' end of this sequence overlaps by 200 bp
            to the 3' end of the sequence of the clone T20M3.
FEATURES             Qualifiers
     source          /organism="Arabidopsis thaliana"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:3702"
                     /chromosome="1"
                     /clone="T21E18"
                     /ecotype="Columbia"
     protein         /gene="T21E18.20"
                     /inference="non-experimental evidence, no additional
                     details recorded"
                     /note="Contains similarity to an unknown protein T5J8.5
                     gi|4263522 from Arabidopsis thaliana BAC T5J8 gb|AC004044
                     and contains multiple PPR PF|01535 repeats. ESTs
                     gb|AV565358, gb|AV558710, gb|AV524184 come from this
                     gene."
     intron_pos      32:2 (1/11)
     intron_pos      80:2 (2/11)
     intron_pos      108:0 (3/11)
     intron_pos      122:0 (4/11)
     intron_pos      142:0 (5/11)
     intron_pos      182:0 (6/11)
     intron_pos      585:0 (7/11)
     intron_pos      619:0 (8/11)
     intron_pos      666:0 (9/11)
     intron_pos      708:0 (10/11)
     intron_pos      734:2 (11/11)
BEGIN
        1 MGYTLQQILR SICSNTDWNY AVFWKLNHHS PMVLTLEDVY CVNHERGLMP ESLHGGRHAH
       61 DPLGLAVAKM SYHVHSLGEG IVGQVAISGQ HQWIFSEYLN DSHSTLQVHN GWESQISAGI
      121 KTILIVAVGS CGVVQLGSLC KVEEDPALVT HIRHLFLALT DPLADHASNL MQCDINSPSD
      181 RPKIPSKCLH EASPDFSGEF DKAMDMEGLN IVSQNTSNRS NDLPYNFTPT YFHMERTAQV
      241 IGGLEAVQPS MFGSNDCVTS GFSVGVVDTK HKNQVDISDM SKVIYDEETG GYRYSRELDP
      301 NFQHYSRNHV RNSGGTSALA MESDRLKAGS SYPQLDSTVL TALKTDKDYS RRNEVFQPSE
      361 SQGSIFVKDT EHRQEEKSES SQLDALTASL CSFSGSELLE ALGPAFSKTS TDYGELAKFE
      421 SAAAIRRTND MSHSHLTFES SSENLLDAVV ASMSNGDGNV RREISSSRST QSLLTTAEMA
      481 QAEPFGHNKQ NIVSTVDSVI SQPPLADGLI QQNPSNICGA FSSIGFSSTC LSSSSDQFPT
      541 SLEIPKKNKK RAKPGESSRP RPRDRQLIQD RIKELRELVP NGSKCSIDSL LECTIKHMLF
      601 LQSVSQHADK LTKSASSKMQ HKDTGTLGIS STEQGSSWAV EIGGHLQVCS IMVENLDKEG
      661 VMLIEMLCEE CSHFLEIANV IRSLELIILR GTTEKQGEKT WICFVVEGQN NKVMHRMDIL
      721 WSLVQIFQPK ATNSLHLYRQ SQILYMNAFA NVHSLRVPSH HLRDFSASLS LAPPNLKKII
      781 KQCSTPKLLE SALAAMIKTS LNQDCRLMNQ FITACTSFKR LDLAVSTMTQ MQEPNVFVYN
      841 ALFKGFVTCS HPIRSLELYV RMLRDSVSPS SYTYSSLVKA SSFASRFGES LQAHIWKFGF
      901 GFHVKIQTTL IDFYSATGRI REARKVFDEM PERDDIAWTT MVSAYRRVLD MDSANSLANQ
      961 MSEKNEATSN CLINGYMGLG NLEQAESLFN QMPVKDIISW TTMIKGYSQN KRYREAIAVF
     1021 YKMMEEGIIP DEVTMSTVIS ACAHLGVLEI GKEVHMYTLQ NGFVLDVYIG SALVDMYSKC
     1081 GSLERALLVF FNLPKKNLFC WNSIIEGLAA HGFAQEALKM FAKMEMESVK PNAVTFVSVF
     1141 TACTHAGLVD EGRRIYRSMI DDYSIVSNVE HYGGMVHLFS KAGLIYEALE LIGNMEFEPN
     1201 AVIWGALLDG CRIHKNLVIA EIAFNKLMVL EPMNSGYYFL LVSMYAEQNR WRDVAEIRGR
     1261 MRELGIEKIC PGTSSIRIDK RDHLFAAADK SHSASDEVCL LLDEIYDQMG LAGYVQETEN
     1321 VY
//