LOCUS AAF80138.1 1322 aa PRT PLN 29-JUN-2000 DEFINITION Arabidopsis thaliana T21E18.20 protein. ACCESSION AC024174-20 PROTEIN_ID AAF80138.1 SOURCE Arabidopsis thaliana (thale cress) ORGANISM Arabidopsis thaliana Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis. REFERENCE 1 (bases 1 to 74316) AUTHORS Sakano,H., Vaysberg,M., Lee,J., Lenz,C., Liu,S.X., Pham,P., Toriumi,M., Yu,G., Chin,C., Chiou,J., Choi,E., Chung,M., Gonzalez,A., Howng,B., Liu,A., Altafi,H., Brooks,S., Buehler,E., Chao,Q., Conn,L., Conway,A.B., Hansen,N.F., Johnson-Hopson,C., Khan,S., Kim,C., Lam,B., Miranda,M., Nguyen,M., Palm,C.J., Shinn,P., Southwick,A., Davis,R.W., Ecker,J.R., Federspiel,N.A. and Theologis,A. TITLE The sequence of BAC T21E18 from Arabidopsis thaliana chromosome 1 JOURNAL Unpublished REFERENCE 2 (bases 1 to 74316) AUTHORS Theologis,A. TITLE Direct Submission JOURNAL Submitted (25-FEB-2000) Plant Gene Expression Center, 800 Buchanan Street, Albany, CA 94710, USA REFERENCE 3 (bases 1 to 74316) AUTHORS Theologis,A. TITLE Direct Submission JOURNAL Submitted (15-MAR-2000) Plant Gene Expression Center, 800 Buchanan Street, Albany, CA 94710, USA REFERENCE 4 (bases 1 to 74316) AUTHORS Theologis,A. TITLE Direct Submission JOURNAL Submitted (29-JUN-2000) Plant Gene Expression Center, 800 Buchanan St., Albany, CA 94710, USA COMMENT On Mar 15, 2000 this sequence version replaced AC024174.1. The sequence is of BAC T21E18 from Arabidopsis thaliana chromosome 1. The sequence does not represent the sequence of the entire insert of this clone because BAC T21E18 contains an E.coli insertion element 10 (IS10) at position 101153. The IS10 (1329 bp in size) and the target site duplication (TGCATGGTC) have been removed from this entry. The correct Arabidopsis sequence is confirmed by sequencing the PCR product from genomic DNA. The sequence is also shorter by 30500 bp because we submit only the unique sequence of the clone. However, in order to facilitate the joining of overlapping clones in the future, for creation of larger contigs, we provide small overlaps (200 bp) between overlapping sumbitted clones. The 5' end of this sequence overlaps by 200 bp to the 3' end of the sequence of the clone T20M3. FEATURES Qualifiers source /organism="Arabidopsis thaliana" /mol_type="genomic DNA" /db_xref="taxon:3702" /chromosome="1" /clone="T21E18" /ecotype="Columbia" protein /gene="T21E18.20" /inference="non-experimental evidence, no additional details recorded" /note="Contains similarity to an unknown protein T5J8.5 gi|4263522 from Arabidopsis thaliana BAC T5J8 gb|AC004044 and contains multiple PPR PF|01535 repeats. ESTs gb|AV565358, gb|AV558710, gb|AV524184 come from this gene." intron_pos 32:2 (1/11) intron_pos 80:2 (2/11) intron_pos 108:0 (3/11) intron_pos 122:0 (4/11) intron_pos 142:0 (5/11) intron_pos 182:0 (6/11) intron_pos 585:0 (7/11) intron_pos 619:0 (8/11) intron_pos 666:0 (9/11) intron_pos 708:0 (10/11) intron_pos 734:2 (11/11) BEGIN 1 MGYTLQQILR SICSNTDWNY AVFWKLNHHS PMVLTLEDVY CVNHERGLMP ESLHGGRHAH 61 DPLGLAVAKM SYHVHSLGEG IVGQVAISGQ HQWIFSEYLN DSHSTLQVHN GWESQISAGI 121 KTILIVAVGS CGVVQLGSLC KVEEDPALVT HIRHLFLALT DPLADHASNL MQCDINSPSD 181 RPKIPSKCLH EASPDFSGEF DKAMDMEGLN IVSQNTSNRS NDLPYNFTPT YFHMERTAQV 241 IGGLEAVQPS MFGSNDCVTS GFSVGVVDTK HKNQVDISDM SKVIYDEETG GYRYSRELDP 301 NFQHYSRNHV RNSGGTSALA MESDRLKAGS SYPQLDSTVL TALKTDKDYS RRNEVFQPSE 361 SQGSIFVKDT EHRQEEKSES SQLDALTASL CSFSGSELLE ALGPAFSKTS TDYGELAKFE 421 SAAAIRRTND MSHSHLTFES SSENLLDAVV ASMSNGDGNV RREISSSRST QSLLTTAEMA 481 QAEPFGHNKQ NIVSTVDSVI SQPPLADGLI QQNPSNICGA FSSIGFSSTC LSSSSDQFPT 541 SLEIPKKNKK RAKPGESSRP RPRDRQLIQD RIKELRELVP NGSKCSIDSL LECTIKHMLF 601 LQSVSQHADK LTKSASSKMQ HKDTGTLGIS STEQGSSWAV EIGGHLQVCS IMVENLDKEG 661 VMLIEMLCEE CSHFLEIANV IRSLELIILR GTTEKQGEKT WICFVVEGQN NKVMHRMDIL 721 WSLVQIFQPK ATNSLHLYRQ SQILYMNAFA NVHSLRVPSH HLRDFSASLS LAPPNLKKII 781 KQCSTPKLLE SALAAMIKTS LNQDCRLMNQ FITACTSFKR LDLAVSTMTQ MQEPNVFVYN 841 ALFKGFVTCS HPIRSLELYV RMLRDSVSPS SYTYSSLVKA SSFASRFGES LQAHIWKFGF 901 GFHVKIQTTL IDFYSATGRI REARKVFDEM PERDDIAWTT MVSAYRRVLD MDSANSLANQ 961 MSEKNEATSN CLINGYMGLG NLEQAESLFN QMPVKDIISW TTMIKGYSQN KRYREAIAVF 1021 YKMMEEGIIP DEVTMSTVIS ACAHLGVLEI GKEVHMYTLQ NGFVLDVYIG SALVDMYSKC 1081 GSLERALLVF FNLPKKNLFC WNSIIEGLAA HGFAQEALKM FAKMEMESVK PNAVTFVSVF 1141 TACTHAGLVD EGRRIYRSMI DDYSIVSNVE HYGGMVHLFS KAGLIYEALE LIGNMEFEPN 1201 AVIWGALLDG CRIHKNLVIA EIAFNKLMVL EPMNSGYYFL LVSMYAEQNR WRDVAEIRGR 1261 MRELGIEKIC PGTSSIRIDK RDHLFAAADK SHSASDEVCL LLDEIYDQMG LAGYVQETEN 1321 VY //