LOCUS       AEE77893.1              1096 aa    PRT              PLN 23-MAR-2023
DEFINITION  Arabidopsis thaliana hypothetical protein.
ACCESSION   CP002686-5918
PROTEIN_ID  AEE77893.1
SOURCE      Arabidopsis thaliana (thale cress)
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
            Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
            Camelineae; Arabidopsis.
REFERENCE   1  (bases 1 to 23459830)
  AUTHORS   Salanoubat,M., Lemcke,K., Rieger,M., Ansorge,W., Unseld,M.,
            Fartmann,B., Valle,G., Blocker,H., Perez-Alonso,M., Obermaier,B.,
            Delseny,M., Boutry,M., Grivell,L.A., Mache,R., Puigdomenech,P., De
            Simone,V., Choisne,N., Artiguenave,F., Robert,C., Brottier,P.,
            Wincker,P., Cattolico,L., Weissenbach,J., Saurin,W., Quetier,F.,
            Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M., Benes,V.,
            Wurmbach,E., Drzonek,H., Erfle,H., Jordan,N., Bangert,S.,
            Wiedelmann,R., Kranz,H., Voss,H., Holland,R., Brandt,P.,
            Nyakatura,G., Vezzi,A., D'Angelo,M., Pallavicini,A., Toppo,S.,
            Simionati,B., Conrad,A., Hornischer,K., Kauer,G., Lohnert,T.H.,
            Nordsiek,G., Reichelt,J., Scharfe,M., Schon,O., Bargues,M.,
            Terol,J., Climent,J., Navarro,P., Collado,C., Perez-Perez,A.,
            Ottenwalder,B., Duchemin,D., Cooke,R., Laudie,M., Berger-Llauro,C.,
            Purnelle,B., Masuy,D., de Haan,M., Maarse,A.C., Alcaraz,J.P.,
            Cottet,A., Casacuberta,E., Monfort,A., Argiriou,A., Flores,M.,
            Liguori,R., Vitale,D., Mannhaupt,G., Haase,D., Schoof,H., Rudd,S.,
            Zaccaria,P., Mewes,H.W., Mayer,K.F., Kaul,S., Town,C.D., Koo,H.L.,
            Tallon,L.J., Jenkins,J., Rooney,T., Rizzo,M., Walts,A.,
            Utterback,T., Fujii,C.Y., Shea,T.P., Creasy,T.H., Haas,B.,
            Maiti,R., Wu,D., Peterson,J., Van Aken,S., Pai,G., Militscher,J.,
            Sellers,P., Gill,J.E., Feldblyum,T.V., Preuss,D., Lin,X.,
            Nierman,W.C., Salzberg,S.L., White,O., Venter,J.C., Fraser,C.M.,
            Kaneko,T., Nakamura,Y., Sato,S., Kato,T., Asamizu,E., Sasamoto,S.,
            Kimura,T., Idesawa,K., Kawashima,K., Kishida,Y., Kiyokawa,C.,
            Kohara,M., Matsumoto,M., Matsuno,A., Muraki,A., Nakayama,S.,
            Nakazaki,N., Shinpo,S., Takeuchi,C., Wada,T., Watanabe,A.,
            Yamada,M., Yasuda,M. and Tabata,S.
  CONSRTM   European Union Chromosome 3 Arabidopsis Sequencing Consortium;
            Institute for Genomic Research; Kazusa DNA Research Institute
  TITLE     Sequence and analysis of chromosome 3 of the plant Arabidopsis
            thaliana
  JOURNAL   Nature 408 (6814), 820-822 (2000)
   PUBMED   11130713
REFERENCE   2  (bases 1 to 23459830)
  AUTHORS   Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
  CONSRTM   TAIR
  TITLE     Direct Submission
  JOURNAL   Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
            Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE   3  (bases 1 to 23459830)
  AUTHORS   Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
            Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
            Vaughn,M. and Town,C.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
            9704 Medical Center Dr, Rockville, MD 20850, USA
  REMARK    Protein update by submitter
FEATURES             Qualifiers
     source          /organism="Arabidopsis thaliana"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:3702"
                     /chromosome="3"
                     /ecotype="Columbia"
     protein         /gene="CEF"
                     /locus_tag="AT3G44340"
                     /gene_synonym="clone eighty-four"
                     /inference="Similar to RNA sequence,
                     EST:INSD:Z25619.1,INSD:AV548723.1,INSD:ES192795.1,
                     INSD:AV546405.1,INSD:AV567609.1,INSD:ES123483.1,
                     INSD:EG481134.1,INSD:AU237916.1,INSD:BP817019.1,
                     INSD:AV537182.1,INSD:DR357754.1,INSD:CD531602.1,
                     INSD:EL123478.1,INSD:AV545803.1,INSD:EL029187.1,
                     INSD:AV546943.1,INSD:AV546492.1,INSD:BP790883.1,
                     INSD:EG481135.1,INSD:BP671541.1,INSD:AV554813.1,
                     INSD:DR382645.1,INSD:BX836551.1,INSD:AV529827.1,
                     INSD:AV548495.1,INSD:EL047338.1,INSD:EH828184.1,
                     INSD:EG514376.1,INSD:ES117338.1,INSD:BX840863.1,
                     INSD:AV546100.1,INSD:ES171073.1,INSD:AV554927.1,
                     INSD:AV554781.1,INSD:EL093957.1,INSD:AV524025.1,
                     INSD:EG514240.1,INSD:AV563549.1,INSD:BE530792.1,
                     INSD:AV546472.1"
                     /inference="Similar to RNA sequence, mRNA:INSD:AJ251579.1"
                     /note="clone eighty-four (CEF); FUNCTIONS IN: transporter
                     activity, zinc ion binding; INVOLVED IN: response to
                     oxidative stress, vesicle-mediated transport; LOCATED IN:
                     COPII vesicle coat, chloroplast; EXPRESSED IN: 24 plant
                     structures; EXPRESSED DURING: 15 growth stages; CONTAINS
                     InterPro DOMAIN/s: Sec23/Sec24, helical domain
                     (InterPro:IPR006900), Sec23/Sec24 beta-sandwich
                     (InterPro:IPR012990), Sec23/Sec24, trunk domain
                     (InterPro:IPR006896), Zinc finger, Sec23/Sec24-type
                     (InterPro:IPR006895), Gelsolin domain
                     (InterPro:IPR007123); BEST Arabidopsis thaliana protein
                     match is: Sec23/Sec24 protein transport family protein
                     (TAIR:AT4G32640.2); Has 125534 Blast hits to 60845
                     proteins in 1833 species: Archaea - 62; Bacteria - 18823;
                     Metazoa - 61754; Fungi - 17926; Plants - 11260; Viruses -
                     2145; Other Eukaryotes - 13564 (source: NCBI BLink)."
                     /db_xref="TAIR:AT3G44340"
                     /db_xref="Araport:AT3G44340"
     intron_pos      304:0 (1/23)
     intron_pos      364:0 (2/23)
     intron_pos      389:0 (3/23)
     intron_pos      422:0 (4/23)
     intron_pos      459:1 (5/23)
     intron_pos      502:0 (6/23)
     intron_pos      542:0 (7/23)
     intron_pos      569:0 (8/23)
     intron_pos      595:0 (9/23)
     intron_pos      626:0 (10/23)
     intron_pos      644:1 (11/23)
     intron_pos      670:0 (12/23)
     intron_pos      694:0 (13/23)
     intron_pos      722:0 (14/23)
     intron_pos      762:0 (15/23)
     intron_pos      785:0 (16/23)
     intron_pos      811:0 (17/23)
     intron_pos      857:1 (18/23)
     intron_pos      912:1 (19/23)
     intron_pos      961:0 (20/23)
     intron_pos      1021:0 (21/23)
     intron_pos      1050:2 (22/23)
     intron_pos      1061:1 (23/23)
BEGIN
        1 MAAPVPPGAY RPNNNQQNSG GPPNFVPGSQ GNPNSLAANM QNLNINRPPP PMPGSGPRPS
       61 PPFGQSPQSF PQQQQQQPRP SPMARPGPPP PAAMARPGGP PQVSQPGGFP PVGRPVAPPS
      121 NQPPFGGRPS TGPLVGGGSS FPQPGGFPAS GPPGGVPSGP PSGARPIGFG SPPPMGPGMS
      181 MPPPSGMPGG PLSNGPPPSG MHGGHLSNGP PPSGMPGGPL SNGPPPPMMG PGAFPRGSQF
      241 TSGPMMAPPP PYGQPPNAGP FTGNSPLSSP PAHSIPPPTN FPGVPYGRPP MPGGFPYGAP
      301 PQQLPSAPGT PGSIYGMGPM QNQSMTSVSS PSKIDLNQIP RPGSSSSPIV YETRVENKAN
      361 PPPPTTVDYI TRDTGNSSPR YMRCTINQIP CTVDLLSTSG MQLALIVQPM ALSHPSEEPI
      421 QVVDFGESGP VRCSRCKGYV NPFMKFIDQG RKFICNLCGY TDETPRDYQC NLGPDGRRRD
      481 ADERPELCRG TVDFVATKEY MVRDPMPAVY FFLIDVSMNA IQTGATAAAC SAIQQVLSDL
      541 PEGPRTFVGI ATFDSTIHFY NLKRALQQPL MLIVPDVQDV YTPLETDVIV QLSECRQHLE
      601 ILLESIPTMF QESKSPESAF GAAVKAAFLA MKSTGGKLMV FQSVLPSVGI GALSSREADG
      661 RANASAGEKE AHKLLQPADK TLRTMAIEFA EYQVCVDLFI TTQAYVDMAS ISEIPRTTGG
      721 QVYCYYPFSA LSDPPKLYND LRWNITRPQG FEAVMRVRCS QGIQVQEYSG NFCKRIPTDI
      781 DLPAIDCDKA IMVTLKHDDK LQDGAECGFQ CALLYTTISG ERRIRVLNLS IPCTNMLSNL
      841 FRSADLDSQF ACMLKQAANE IPSKALPLVK EQATNDCITI LHSYRKFCAT VTSTGQLILP
      901 EALKLLPLYT LALTKGVGLR MDGRIDDRSF WINHVSSLST PLAIPLVYPR MIAVHDLDAN
      961 DNEENVVPCP IPLQSEHLSD EGVYFLENGE DGLIYIGESV NSDILQKLFN VRSAAELPSQ
     1021 YVLQKYDNQL SKKFNDVVNE IRRQRSSYLR IKLCKKGDPA GNMLFQSYMV EDRGSGGASY
     1081 VDFLVSVHRQ IQHKLN
//