LOCUS       AAF80119.1               614 aa    PRT              PLN 29-JUN-2000
DEFINITION  Arabidopsis thaliana T21E18.1 protein.
ACCESSION   AC024174-1
PROTEIN_ID  AAF80119.1
SOURCE      Arabidopsis thaliana (thale cress)
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
            Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
            Camelineae; Arabidopsis.
REFERENCE   1  (bases 1 to 74316)
  AUTHORS   Sakano,H., Vaysberg,M., Lee,J., Lenz,C., Liu,S.X., Pham,P.,
            Toriumi,M., Yu,G., Chin,C., Chiou,J., Choi,E., Chung,M.,
            Gonzalez,A., Howng,B., Liu,A., Altafi,H., Brooks,S., Buehler,E.,
            Chao,Q., Conn,L., Conway,A.B., Hansen,N.F., Johnson-Hopson,C.,
            Khan,S., Kim,C., Lam,B., Miranda,M., Nguyen,M., Palm,C.J.,
            Shinn,P., Southwick,A., Davis,R.W., Ecker,J.R., Federspiel,N.A. and
            Theologis,A.
  TITLE     The sequence of BAC T21E18 from Arabidopsis thaliana chromosome 1
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 74316)
  AUTHORS   Theologis,A.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-FEB-2000) Plant Gene Expression Center, 800 Buchanan
            Street, Albany, CA 94710, USA
REFERENCE   3  (bases 1 to 74316)
  AUTHORS   Theologis,A.
  TITLE     Direct Submission
  JOURNAL   Submitted (15-MAR-2000) Plant Gene Expression Center, 800 Buchanan
            Street, Albany, CA 94710, USA
REFERENCE   4  (bases 1 to 74316)
  AUTHORS   Theologis,A.
  TITLE     Direct Submission
  JOURNAL   Submitted (29-JUN-2000) Plant Gene Expression Center, 800 Buchanan
            St., Albany, CA 94710, USA
COMMENT     On Mar 15, 2000 this sequence version replaced AC024174.1.
            The sequence is of BAC T21E18 from Arabidopsis thaliana chromosome
            1.  The sequence does not represent the sequence of the entire
            insert of this clone because BAC T21E18 contains an E.coli
            insertion element 10 (IS10) at position 101153.  The IS10 (1329 bp
            in size) and the target site duplication (TGCATGGTC) have been
            removed from this entry.  The correct Arabidopsis sequence is
            confirmed by sequencing the PCR product from genomic DNA.  The
            sequence is also shorter by 30500 bp because we submit only the
            unique sequence of the clone.  However, in order to facilitate the
            joining of overlapping clones in the future, for creation of larger
            contigs, we provide small overlaps (200 bp) between overlapping
            sumbitted clones.  The 5' end of this sequence overlaps by 200 bp
            to the 3' end of the sequence of the clone T20M3.
FEATURES             Qualifiers
     source          /organism="Arabidopsis thaliana"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:3702"
                     /chromosome="1"
                     /clone="T21E18"
                     /ecotype="Columbia"
     protein         /gene="T21E18.1"
                     /inference="non-experimental evidence, no additional
                     details recorded"
                     /note="Contains similarity to an amino acid transporter
                     cationic 1 (Atrc1) from Mus musculus gi|6671596 and
                     contains an amino acid permease PF|00324 domain. ESTs
                     gb|AI995600, gb|AV566914, gb|AV531134 come from this
                     gene."
     intron_pos      117:0 (1/7)
     intron_pos      266:0 (2/7)
     intron_pos      325:0 (3/7)
     intron_pos      400:0 (4/7)
     intron_pos      459:0 (5/7)
     intron_pos      538:0 (6/7)
     intron_pos      570:0 (7/7)
BEGIN
        1 MVGPIFIDLD SDYPGGDFSH RLDQSSTLKL KFPWEATKVS ATNASPPPHG SLISVLQLSD
       61 LNHCPLRPRR LPFAPPLATL WFVASAFSTS YCWASALPSA PVSSSSPEPS LAMLDLNRLV
      121 IIAAVACLGV TISFLLAGAS CVLNALCYAE LSSRFPAVVG GAYMYSYSAF NEITAFLVFV
      181 QLMLDYHIGA ASISRSLASY AVALLELFPA LKGSIPLWMG SGKELLGGLL SLNILAPILL
      241 ALLTLVLCQG VRESSAVNSV MTATKVVIVL VVICAGAFEI DVANWSPFAP NGFKAVLTGA
      301 TVVFFSYVGF DAVANSAEES KNPQRDLPIG IMGSLLVCIS LYIGVCLVLT GMVPFSLLSE
      361 DAPLAEAFSS KGMKFVSILI SIGAVAGLTT TLLVGLYVQS RLYLGLGRDG LLPSIFSRIH
      421 PTLHTPLHSQ IWCGIVAGVL AGIFNVHSLS HILSVGTLTG YSVVAACVVA LRLNDKKDRE
      481 SSNRWTSSWQ EGVICLVIIA CSGFGAGVFY RFSASVIFIL LSVGVAVVAS AVLHYRQAYA
      541 LPLGSGFSCP GVPIVPSVCI FFNIFLFAQL HYEAWIRFVV VSVLATAVYA LYGQYHADPS
      601 MLDYQRAPET ESDA
//