LOCUS AEE86064.1 365 aa PRT PLN 23-MAR-2023 DEFINITION Arabidopsis thaliana choice-of-anchor C domain protein, putative (Protein of unknown function, DUF642) protein. ACCESSION CP002687-5945 PROTEIN_ID AEE86064.1 SOURCE Arabidopsis thaliana (thale cress) ORGANISM Arabidopsis thaliana Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis. REFERENCE 1 (bases 1 to 18585056) AUTHORS Mayer,K., Schuller,C., Wambutt,R., Murphy,G., Volckaert,G., Pohl,T., Dusterhoft,A., Stiekema,W., Entian,K.D., Terryn,N., Harris,B., Ansorge,W., Brandt,P., Grivell,L., Rieger,M., Weichselgartner,M., de Simone,V., Obermaier,B., Mache,R., Muller,M., Kreis,M., Delseny,M., Puigdomenech,P., Watson,M., Schmidtheini,T., Reichert,B., Portatelle,D., Perez-Alonso,M., Boutry,M., Bancroft,I., Vos,P., Hoheisel,J., Zimmermann,W., Wedler,H., Ridley,P., Langham,S.A., McCullagh,B., Bilham,L., Robben,J., Van der Schueren,J., Grymonprez,B., Chuang,Y.J., Vandenbussche,F., Braeken,M., Weltjens,I., Voet,M., Bastiaens,I., Aert,R., Defoor,E., Weitzenegger,T., Bothe,G., Ramsperger,U., Hilbert,H., Braun,M., Holzer,E., Brandt,A., Peters,S., van Staveren,M., Dirske,W., Mooijman,P., Klein Lankhorst,R., Rose,M., Hauf,J., Kotter,P., Berneiser,S., Hempel,S., Feldpausch,M., Lamberth,S., Van den Daele,H., De Keyser,A., Buysshaert,C., Gielen,J., Villarroel,R., De Clercq,R., Van Montagu,M., Rogers,J., Cronin,A., Quail,M., Bray-Allen,S., Clark,L., Doggett,J., Hall,S., Kay,M., Lennard,N., McLay,K., Mayes,R., Pettett,A., Rajandream,M.A., Lyne,M., Benes,V., Rechmann,S., Borkova,D., Blocker,H., Scharfe,M., Grimm,M., Lohnert,T.H., Dose,S., de Haan,M., Maarse,A., Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M., Fartmann,B., Granderath,K., Dauner,D., Herzl,A., Neumann,S., Argiriou,A., Vitale,D., Liguori,R., Piravandi,E., Massenet,O., Quigley,F., Clabauld,G., Mundlein,A., Felber,R., Schnabl,S., Hiller,R., Schmidt,W., Lecharny,A., Aubourg,S., Chefdor,F., Cooke,R., Berger,C., Montfort,A., Casacuberta,E., Gibbons,T., Weber,N., Vandenbol,M., Bargues,M., Terol,J., Torres,A., Perez-Perez,A., Purnelle,B., Bent,E., Johnson,S., Tacon,D., Jesse,T., Heijnen,L., Schwarz,S., Scholler,P., Heber,S., Francs,P., Bielke,C., Frishman,D., Haase,D., Lemcke,K., Mewes,H.W., Stocker,S., Zaccaria,P., Bevan,M., Wilson,R.K., de la Bastide,M., Habermann,K., Parnell,L., Dedhia,N., Gnoj,L., Schutz,K., Huang,E., Spiegel,L., Sehkon,M., Murray,J., Sheet,P., Cordes,M., Abu-Threideh,J., Stoneking,T., Kalicki,J., Graves,T., Harmon,G., Edwards,J., Latreille,P., Courtney,L., Cloud,J., Abbott,A., Scott,K., Johnson,D., Minx,P., Bentley,D., Fulton,B., Miller,N., Greco,T., Kemp,K., Kramer,J., Fulton,L., Mardis,E., Dante,M., Pepin,K., Hillier,L., Nelson,J., Spieth,J., Ryan,E., Andrews,S., Geisel,C., Layman,D., Du,H., Ali,J., Berghoff,A., Jones,K., Drone,K., Cotton,M., Joshu,C., Antonoiu,B., Zidanic,M., Strong,C., Sun,H., Lamar,B., Yordan,C., Ma,P., Zhong,J., Preston,R., Vil,D., Shekher,M., Matero,A., Shah,R., Swaby,I.K., O'Shaughnessy,A., Rodriguez,M., Hoffmann,J., Till,S., Granat,S., Shohdy,N., Hasegawa,A., Hameed,A., Lodhi,M., Johnson,A., Chen,E., Marra,M., Martienssen,R. and McCombie,W.R. TITLE Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana JOURNAL Nature 402 (6763), 769-777 (1999) PUBMED 10617198 REFERENCE 2 (bases 1 to 18585056) AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E. CONSRTM TAIR TITLE Direct Submission JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie Institution, 260 Panama Street, Stanford, CA, USA REFERENCE 3 (bases 1 to 18585056) AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M., Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R., Vaughn,M. and Town,C.D. TITLE Direct Submission JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute, 9704 Medical Center Dr, Rockville, MD 20850, USA REMARK Protein update by submitter FEATURES Qualifiers source /organism="Arabidopsis thaliana" /mol_type="genomic DNA" /db_xref="taxon:3702" /chromosome="4" /ecotype="Columbia" protein /locus_tag="AT4G32460" /gene_synonym="F8B4.160" /gene_synonym="F8B4_160" /inference="Similar to RNA sequence, EST:INSD:Z18147.1,INSD:ES184034.1,INSD:Z34535.1, INSD:DR235658.1,INSD:DR235654.1,INSD:DR235649.1, INSD:EH937272.1,INSD:DR235662.1,INSD:DR235663.1, INSD:DR230943.1,INSD:EL106888.1,INSD:DR235645.1, INSD:BP829605.1,INSD:DR235648.1,INSD:R30487.1, INSD:DR235640.1,INSD:BX839257.1,INSD:DR235644.1, INSD:DR235667.1,INSD:AV550315.1,INSD:DR235660.1, INSD:AV821984.1,INSD:DR235664.1,INSD:DR235657.1, INSD:ES122369.1,INSD:DR235653.1,INSD:DR235656.1, INSD:AV533684.1,INSD:CA781351.1,INSD:EL123048.1, INSD:DR235668.1,INSD:ES065765.1,INSD:CK121381.1, INSD:DR235646.1,INSD:DR235643.1,INSD:EL323127.1, INSD:DR235636.1,INSD:BP819453.1,INSD:EL251648.1, INSD:ES011734.1,INSD:DR235665.1,INSD:DR235650.1, INSD:Z18131.1,INSD:BP860866.1,INSD:BP826699.1, INSD:DR235634.1,INSD:DR235635.1,INSD:DR235637.1, INSD:DR235639.1,INSD:DR235651.1,INSD:DR235647.1, INSD:EH835776.1,INSD:DR235642.1,INSD:DR235659.1, INSD:T76314.1,INSD:DR235638.1,INSD:T88617.1, INSD:BP833297.1,INSD:ES093721.1,INSD:BP842370.1, INSD:DR235655.1,INSD:DR235666.1,INSD:EL143083.1, INSD:DR235661.1" /inference="Similar to RNA sequence, mRNA:INSD:BT025248.1" /note="FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plant-type cell wall; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF642 (InterPro:IPR006946), Galactose-binding domain-like (InterPro:IPR008979); BEST Arabidopsis thaliana protein match is: Protein of unknown function, DUF642 (TAIR:AT5G11420.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink)." /db_xref="TAIR:AT4G32460" /db_xref="Araport:AT4G32460" intron_pos 25:1 (1/2) intron_pos 191:1 (2/2) BEGIN 1 MKEMGVIVLL LLHSFFYVAF CFNDGLLPNG DFELGPRHSD MKGTQVINIT AIPNWELSGF 61 VEYIPSGHKQ GDMILVVPKG AFAVRLGNEA SIKQKISVKK GSYYSITFSA ARTCAQDERL 121 NVSVAPHHAV MPIQTVYSSS GWDLYSWAFK AQSDYADIVI HNPGVEEDPA CGPLIDGVAM 181 RALFPPRPTN KNILKNGGFE EGPWVLPNIS SGVLIPPNSI DDHSPLPGWM VESLKAVKYI 241 DSDHFSVPQG RRAVELVAGK ESAVAQVVRT IPGKTYVLSF SVGDASNACA GSMIVEAFAG 301 KDTIKVPYES KGKGGFKRSS LRFVAVSSRT RVMFYSTFYA MRNDDFSSLC GPVIDDVKLL 361 SARRP //