LOCUS       AEE86064.1               365 aa    PRT              PLN 23-MAR-2023
DEFINITION  Arabidopsis thaliana choice-of-anchor C domain protein,
            putative (Protein of unknown function, DUF642) protein.
ACCESSION   CP002687-5945
PROTEIN_ID  AEE86064.1
SOURCE      Arabidopsis thaliana (thale cress)
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
            Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
            Camelineae; Arabidopsis.
REFERENCE   1  (bases 1 to 18585056)
  AUTHORS   Mayer,K., Schuller,C., Wambutt,R., Murphy,G., Volckaert,G.,
            Pohl,T., Dusterhoft,A., Stiekema,W., Entian,K.D., Terryn,N.,
            Harris,B., Ansorge,W., Brandt,P., Grivell,L., Rieger,M.,
            Weichselgartner,M., de Simone,V., Obermaier,B., Mache,R.,
            Muller,M., Kreis,M., Delseny,M., Puigdomenech,P., Watson,M.,
            Schmidtheini,T., Reichert,B., Portatelle,D., Perez-Alonso,M.,
            Boutry,M., Bancroft,I., Vos,P., Hoheisel,J., Zimmermann,W.,
            Wedler,H., Ridley,P., Langham,S.A., McCullagh,B., Bilham,L.,
            Robben,J., Van der Schueren,J., Grymonprez,B., Chuang,Y.J.,
            Vandenbussche,F., Braeken,M., Weltjens,I., Voet,M., Bastiaens,I.,
            Aert,R., Defoor,E., Weitzenegger,T., Bothe,G., Ramsperger,U.,
            Hilbert,H., Braun,M., Holzer,E., Brandt,A., Peters,S., van
            Staveren,M., Dirske,W., Mooijman,P., Klein Lankhorst,R., Rose,M.,
            Hauf,J., Kotter,P., Berneiser,S., Hempel,S., Feldpausch,M.,
            Lamberth,S., Van den Daele,H., De Keyser,A., Buysshaert,C.,
            Gielen,J., Villarroel,R., De Clercq,R., Van Montagu,M., Rogers,J.,
            Cronin,A., Quail,M., Bray-Allen,S., Clark,L., Doggett,J., Hall,S.,
            Kay,M., Lennard,N., McLay,K., Mayes,R., Pettett,A.,
            Rajandream,M.A., Lyne,M., Benes,V., Rechmann,S., Borkova,D.,
            Blocker,H., Scharfe,M., Grimm,M., Lohnert,T.H., Dose,S., de
            Haan,M., Maarse,A., Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M.,
            Fartmann,B., Granderath,K., Dauner,D., Herzl,A., Neumann,S.,
            Argiriou,A., Vitale,D., Liguori,R., Piravandi,E., Massenet,O.,
            Quigley,F., Clabauld,G., Mundlein,A., Felber,R., Schnabl,S.,
            Hiller,R., Schmidt,W., Lecharny,A., Aubourg,S., Chefdor,F.,
            Cooke,R., Berger,C., Montfort,A., Casacuberta,E., Gibbons,T.,
            Weber,N., Vandenbol,M., Bargues,M., Terol,J., Torres,A.,
            Perez-Perez,A., Purnelle,B., Bent,E., Johnson,S., Tacon,D.,
            Jesse,T., Heijnen,L., Schwarz,S., Scholler,P., Heber,S., Francs,P.,
            Bielke,C., Frishman,D., Haase,D., Lemcke,K., Mewes,H.W.,
            Stocker,S., Zaccaria,P., Bevan,M., Wilson,R.K., de la Bastide,M.,
            Habermann,K., Parnell,L., Dedhia,N., Gnoj,L., Schutz,K., Huang,E.,
            Spiegel,L., Sehkon,M., Murray,J., Sheet,P., Cordes,M.,
            Abu-Threideh,J., Stoneking,T., Kalicki,J., Graves,T., Harmon,G.,
            Edwards,J., Latreille,P., Courtney,L., Cloud,J., Abbott,A.,
            Scott,K., Johnson,D., Minx,P., Bentley,D., Fulton,B., Miller,N.,
            Greco,T., Kemp,K., Kramer,J., Fulton,L., Mardis,E., Dante,M.,
            Pepin,K., Hillier,L., Nelson,J., Spieth,J., Ryan,E., Andrews,S.,
            Geisel,C., Layman,D., Du,H., Ali,J., Berghoff,A., Jones,K.,
            Drone,K., Cotton,M., Joshu,C., Antonoiu,B., Zidanic,M., Strong,C.,
            Sun,H., Lamar,B., Yordan,C., Ma,P., Zhong,J., Preston,R., Vil,D.,
            Shekher,M., Matero,A., Shah,R., Swaby,I.K., O'Shaughnessy,A.,
            Rodriguez,M., Hoffmann,J., Till,S., Granat,S., Shohdy,N.,
            Hasegawa,A., Hameed,A., Lodhi,M., Johnson,A., Chen,E., Marra,M.,
            Martienssen,R. and McCombie,W.R.
  TITLE     Sequence and analysis of chromosome 4 of the plant Arabidopsis
            thaliana
  JOURNAL   Nature 402 (6763), 769-777 (1999)
   PUBMED   10617198
REFERENCE   2  (bases 1 to 18585056)
  AUTHORS   Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
  CONSRTM   TAIR
  TITLE     Direct Submission
  JOURNAL   Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
            Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE   3  (bases 1 to 18585056)
  AUTHORS   Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
            Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
            Vaughn,M. and Town,C.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
            9704 Medical Center Dr, Rockville, MD 20850, USA
  REMARK    Protein update by submitter
FEATURES             Qualifiers
     source          /organism="Arabidopsis thaliana"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:3702"
                     /chromosome="4"
                     /ecotype="Columbia"
     protein         /locus_tag="AT4G32460"
                     /gene_synonym="F8B4.160"
                     /gene_synonym="F8B4_160"
                     /inference="Similar to RNA sequence,
                     EST:INSD:Z18147.1,INSD:ES184034.1,INSD:Z34535.1,
                     INSD:DR235658.1,INSD:DR235654.1,INSD:DR235649.1,
                     INSD:EH937272.1,INSD:DR235662.1,INSD:DR235663.1,
                     INSD:DR230943.1,INSD:EL106888.1,INSD:DR235645.1,
                     INSD:BP829605.1,INSD:DR235648.1,INSD:R30487.1,
                     INSD:DR235640.1,INSD:BX839257.1,INSD:DR235644.1,
                     INSD:DR235667.1,INSD:AV550315.1,INSD:DR235660.1,
                     INSD:AV821984.1,INSD:DR235664.1,INSD:DR235657.1,
                     INSD:ES122369.1,INSD:DR235653.1,INSD:DR235656.1,
                     INSD:AV533684.1,INSD:CA781351.1,INSD:EL123048.1,
                     INSD:DR235668.1,INSD:ES065765.1,INSD:CK121381.1,
                     INSD:DR235646.1,INSD:DR235643.1,INSD:EL323127.1,
                     INSD:DR235636.1,INSD:BP819453.1,INSD:EL251648.1,
                     INSD:ES011734.1,INSD:DR235665.1,INSD:DR235650.1,
                     INSD:Z18131.1,INSD:BP860866.1,INSD:BP826699.1,
                     INSD:DR235634.1,INSD:DR235635.1,INSD:DR235637.1,
                     INSD:DR235639.1,INSD:DR235651.1,INSD:DR235647.1,
                     INSD:EH835776.1,INSD:DR235642.1,INSD:DR235659.1,
                     INSD:T76314.1,INSD:DR235638.1,INSD:T88617.1,
                     INSD:BP833297.1,INSD:ES093721.1,INSD:BP842370.1,
                     INSD:DR235655.1,INSD:DR235666.1,INSD:EL143083.1,
                     INSD:DR235661.1"
                     /inference="Similar to RNA sequence, mRNA:INSD:BT025248.1"
                     /note="FUNCTIONS IN: molecular_function unknown; INVOLVED
                     IN: biological_process unknown; LOCATED IN: plant-type
                     cell wall; EXPRESSED IN: 24 plant structures; EXPRESSED
                     DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s:
                     Protein of unknown function DUF642 (InterPro:IPR006946),
                     Galactose-binding domain-like (InterPro:IPR008979); BEST
                     Arabidopsis thaliana protein match is: Protein of unknown
                     function, DUF642 (TAIR:AT5G11420.1); Has 35333 Blast hits
                     to 34131 proteins in 2444 species: Archae - 798; Bacteria
                     - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
                     - 0; Other Eukaryotes - 9610 (source: NCBI BLink)."
                     /db_xref="TAIR:AT4G32460"
                     /db_xref="Araport:AT4G32460"
     intron_pos      25:1 (1/2)
     intron_pos      191:1 (2/2)
BEGIN
        1 MKEMGVIVLL LLHSFFYVAF CFNDGLLPNG DFELGPRHSD MKGTQVINIT AIPNWELSGF
       61 VEYIPSGHKQ GDMILVVPKG AFAVRLGNEA SIKQKISVKK GSYYSITFSA ARTCAQDERL
      121 NVSVAPHHAV MPIQTVYSSS GWDLYSWAFK AQSDYADIVI HNPGVEEDPA CGPLIDGVAM
      181 RALFPPRPTN KNILKNGGFE EGPWVLPNIS SGVLIPPNSI DDHSPLPGWM VESLKAVKYI
      241 DSDHFSVPQG RRAVELVAGK ESAVAQVVRT IPGKTYVLSF SVGDASNACA GSMIVEAFAG
      301 KDTIKVPYES KGKGGFKRSS LRFVAVSSRT RVMFYSTFYA MRNDDFSSLC GPVIDDVKLL
      361 SARRP
//