LOCUS       AEE85075.1              1081 aa    PRT              PLN 23-MAR-2023
DEFINITION  Arabidopsis thaliana homolog of DNA mismatch repair protein
            MSH3 protein.
ACCESSION   CP002687-4593
PROTEIN_ID  AEE85075.1
SOURCE      Arabidopsis thaliana (thale cress)
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
            Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
            Camelineae; Arabidopsis.
REFERENCE   1  (bases 1 to 18585056)
  AUTHORS   Mayer,K., Schuller,C., Wambutt,R., Murphy,G., Volckaert,G.,
            Pohl,T., Dusterhoft,A., Stiekema,W., Entian,K.D., Terryn,N.,
            Harris,B., Ansorge,W., Brandt,P., Grivell,L., Rieger,M.,
            Weichselgartner,M., de Simone,V., Obermaier,B., Mache,R.,
            Muller,M., Kreis,M., Delseny,M., Puigdomenech,P., Watson,M.,
            Schmidtheini,T., Reichert,B., Portatelle,D., Perez-Alonso,M.,
            Boutry,M., Bancroft,I., Vos,P., Hoheisel,J., Zimmermann,W.,
            Wedler,H., Ridley,P., Langham,S.A., McCullagh,B., Bilham,L.,
            Robben,J., Van der Schueren,J., Grymonprez,B., Chuang,Y.J.,
            Vandenbussche,F., Braeken,M., Weltjens,I., Voet,M., Bastiaens,I.,
            Aert,R., Defoor,E., Weitzenegger,T., Bothe,G., Ramsperger,U.,
            Hilbert,H., Braun,M., Holzer,E., Brandt,A., Peters,S., van
            Staveren,M., Dirske,W., Mooijman,P., Klein Lankhorst,R., Rose,M.,
            Hauf,J., Kotter,P., Berneiser,S., Hempel,S., Feldpausch,M.,
            Lamberth,S., Van den Daele,H., De Keyser,A., Buysshaert,C.,
            Gielen,J., Villarroel,R., De Clercq,R., Van Montagu,M., Rogers,J.,
            Cronin,A., Quail,M., Bray-Allen,S., Clark,L., Doggett,J., Hall,S.,
            Kay,M., Lennard,N., McLay,K., Mayes,R., Pettett,A.,
            Rajandream,M.A., Lyne,M., Benes,V., Rechmann,S., Borkova,D.,
            Blocker,H., Scharfe,M., Grimm,M., Lohnert,T.H., Dose,S., de
            Haan,M., Maarse,A., Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M.,
            Fartmann,B., Granderath,K., Dauner,D., Herzl,A., Neumann,S.,
            Argiriou,A., Vitale,D., Liguori,R., Piravandi,E., Massenet,O.,
            Quigley,F., Clabauld,G., Mundlein,A., Felber,R., Schnabl,S.,
            Hiller,R., Schmidt,W., Lecharny,A., Aubourg,S., Chefdor,F.,
            Cooke,R., Berger,C., Montfort,A., Casacuberta,E., Gibbons,T.,
            Weber,N., Vandenbol,M., Bargues,M., Terol,J., Torres,A.,
            Perez-Perez,A., Purnelle,B., Bent,E., Johnson,S., Tacon,D.,
            Jesse,T., Heijnen,L., Schwarz,S., Scholler,P., Heber,S., Francs,P.,
            Bielke,C., Frishman,D., Haase,D., Lemcke,K., Mewes,H.W.,
            Stocker,S., Zaccaria,P., Bevan,M., Wilson,R.K., de la Bastide,M.,
            Habermann,K., Parnell,L., Dedhia,N., Gnoj,L., Schutz,K., Huang,E.,
            Spiegel,L., Sehkon,M., Murray,J., Sheet,P., Cordes,M.,
            Abu-Threideh,J., Stoneking,T., Kalicki,J., Graves,T., Harmon,G.,
            Edwards,J., Latreille,P., Courtney,L., Cloud,J., Abbott,A.,
            Scott,K., Johnson,D., Minx,P., Bentley,D., Fulton,B., Miller,N.,
            Greco,T., Kemp,K., Kramer,J., Fulton,L., Mardis,E., Dante,M.,
            Pepin,K., Hillier,L., Nelson,J., Spieth,J., Ryan,E., Andrews,S.,
            Geisel,C., Layman,D., Du,H., Ali,J., Berghoff,A., Jones,K.,
            Drone,K., Cotton,M., Joshu,C., Antonoiu,B., Zidanic,M., Strong,C.,
            Sun,H., Lamar,B., Yordan,C., Ma,P., Zhong,J., Preston,R., Vil,D.,
            Shekher,M., Matero,A., Shah,R., Swaby,I.K., O'Shaughnessy,A.,
            Rodriguez,M., Hoffmann,J., Till,S., Granat,S., Shohdy,N.,
            Hasegawa,A., Hameed,A., Lodhi,M., Johnson,A., Chen,E., Marra,M.,
            Martienssen,R. and McCombie,W.R.
  TITLE     Sequence and analysis of chromosome 4 of the plant Arabidopsis
            thaliana
  JOURNAL   Nature 402 (6763), 769-777 (1999)
   PUBMED   10617198
REFERENCE   2  (bases 1 to 18585056)
  AUTHORS   Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
  CONSRTM   TAIR
  TITLE     Direct Submission
  JOURNAL   Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
            Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE   3  (bases 1 to 18585056)
  AUTHORS   Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
            Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
            Vaughn,M. and Town,C.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
            9704 Medical Center Dr, Rockville, MD 20850, USA
  REMARK    Protein update by submitter
FEATURES             Qualifiers
     source          /organism="Arabidopsis thaliana"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:3702"
                     /chromosome="4"
                     /ecotype="Columbia"
     protein         /gene="MSH3"
                     /locus_tag="AT4G25540"
                     /gene_synonym="ATMSH3"
                     /gene_synonym="homolog of DNA mismatch repair protein
                     MSH3"
                     /gene_synonym="M7J2.90"
                     /gene_synonym="M7J2_90"
                     /inference="Similar to RNA sequence,
                     EST:INSD:BP823572.1,INSD:BP643959.1"
                     /inference="Similar to RNA sequence,
                     mRNA:INSD:AJ007791.1,INSD:AK222004.1"
                     /note="homolog of DNA mismatch repair protein MSH3 (MSH3);
                     CONTAINS InterPro DOMAIN/s: DNA mismatch repair protein
                     MutS, clamp (InterPro:IPR007861), DNA mismatch repair
                     protein MutS, connector (InterPro:IPR007860), DNA mismatch
                     repair protein MutS, N-terminal (InterPro:IPR016151), DNA
                     mismatch repair protein MutS, core (InterPro:IPR007696),
                     DNA mismatch repair protein MutS, C-terminal
                     (InterPro:IPR000432), DNA mismatch repair protein
                     MutS-like, N-terminal (InterPro:IPR007695); BEST
                     Arabidopsis thaliana protein match is: MUTS homolog 6
                     (TAIR:AT4G02070.2); Has 14547 Blast hits to 13713 proteins
                     in 2703 species: Archae - 153; Bacteria - 9793; Metazoa -
                     705; Fungi - 864; Plants - 451; Viruses - 3; Other
                     Eukaryotes - 2578 (source: NCBI BLink)."
                     /db_xref="TAIR:AT4G25540"
                     /db_xref="Araport:AT4G25540"
     intron_pos      313:0 (1/11)
     intron_pos      377:0 (2/11)
     intron_pos      432:0 (3/11)
     intron_pos      465:0 (4/11)
     intron_pos      549:0 (5/11)
     intron_pos      634:0 (6/11)
     intron_pos      678:0 (7/11)
     intron_pos      765:0 (8/11)
     intron_pos      789:0 (9/11)
     intron_pos      839:0 (10/11)
     intron_pos      993:0 (11/11)
BEGIN
        1 MGKQKQQTIS RFFAPKPKSP THEPNPVAES STPPPKISAT VSFSPSKRKL LSDHLAAASP
       61 KKPKLSPHTQ NPVPDPNLHQ RFLQRFLEPS PEEYVPETSS SRKYTPLEQQ VVELKSKYPD
      121 VVLMVEVGYR YRFFGEDAEI AARVLGIYAH MDHNFMTASV PTFRLNFHVR RLVNAGYKIG
      181 VVKQTETAAI KSHGANRTGP FFRGLSALYT KATLEAAEDI SGGCGGEEGF GSQSNFLVCV
      241 VDERVKSETL GCGIEMSFDV RVGVVGVEIS TGEVVYEEFN DNFMRSGLEA VILSLSPAEL
      301 LLGQPLSQQT EKFLVAHAGP TSNVRVERAS LDCFSNGNAV DEVISLCEKI SAGNLEDDKE
      361 MKLEAAEKGM SCLTVHTIMN MPHLTVQALA LTFCHLKQFG FERILYQGAS FRSLSSNTEM
      421 TLSANTLQQL EVVKNNSDGS ESGSLFHNMN HTLTVYGSRL LRHWVTHPLC DRNLISARLD
      481 AVSEISACMG SHSSSQLSSE LVEEGSERAI VSPEFYLVLS SVLTAMSRSS DIQRGITRIF
      541 HRTAKATEFI AVMEAILLAG KQIQRLGIKQ DSEMRSMQSA TVRSTLLRKL ISVISSPVVV
      601 DNAGKLLSAL NKEAAVRGDL LDILITSSDQ FPELAEARQA VLVIREKLDS SIASFRKKLA
      661 IRNLEFLQVS GITHLIELPV DSKVPMNWVK VNSTKKTIRY HPPEIVAGLD ELALATEHLA
      721 IVNRASWDSF LKSFSRYYTD FKAAVQALAA LDCLHSLSTL SRNKNYVRPE FVDDCEPVEI
      781 NIQSGRHPVL ETILQDNFVP NDTILHAEGE YCQIITGPNM GGKSCYIRQV ALISIMAQVG
      841 SFVPASFAKL HVLDGVFTRM GASDSIQHGR STFLEELSEA SHIIRTCSSR SLVILDELGR
      901 GTSTHDGVAI AYATLQHLLA EKRCLVLFVT HYPEIAEISN GFPGSVGTYH VSYLTLQKDK
      961 GSYDHDDVTY LYKLVRGLCS RSFGFKVAQL AQIPPSCIRR AISMAAKLEA EVRARERNTR
     1021 MGEPEGHEEP RGAEESISAL GDLFADLKFA LSEEDPWKAF EFLKHAWKIA GKIRLKPTCS
     1081 F
//