LOCUS AEE76889.1 1118 aa PRT PLN 23-MAR-2023
DEFINITION Arabidopsis thaliana MUTL protein homolog 1 protein.
ACCESSION CP002686-4534
PROTEIN_ID AEE76889.1
SOURCE Arabidopsis thaliana (thale cress)
ORGANISM Arabidopsis thaliana
Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
Camelineae; Arabidopsis.
REFERENCE 1 (bases 1 to 23459830)
AUTHORS Salanoubat,M., Lemcke,K., Rieger,M., Ansorge,W., Unseld,M.,
Fartmann,B., Valle,G., Blocker,H., Perez-Alonso,M., Obermaier,B.,
Delseny,M., Boutry,M., Grivell,L.A., Mache,R., Puigdomenech,P., De
Simone,V., Choisne,N., Artiguenave,F., Robert,C., Brottier,P.,
Wincker,P., Cattolico,L., Weissenbach,J., Saurin,W., Quetier,F.,
Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M., Benes,V.,
Wurmbach,E., Drzonek,H., Erfle,H., Jordan,N., Bangert,S.,
Wiedelmann,R., Kranz,H., Voss,H., Holland,R., Brandt,P.,
Nyakatura,G., Vezzi,A., D'Angelo,M., Pallavicini,A., Toppo,S.,
Simionati,B., Conrad,A., Hornischer,K., Kauer,G., Lohnert,T.H.,
Nordsiek,G., Reichelt,J., Scharfe,M., Schon,O., Bargues,M.,
Terol,J., Climent,J., Navarro,P., Collado,C., Perez-Perez,A.,
Ottenwalder,B., Duchemin,D., Cooke,R., Laudie,M., Berger-Llauro,C.,
Purnelle,B., Masuy,D., de Haan,M., Maarse,A.C., Alcaraz,J.P.,
Cottet,A., Casacuberta,E., Monfort,A., Argiriou,A., flores,M.,
Liguori,R., Vitale,D., Mannhaupt,G., Haase,D., Schoof,H., Rudd,S.,
Zaccaria,P., Mewes,H.W., Mayer,K.F., Kaul,S., Town,C.D., Koo,H.L.,
Tallon,L.J., Jenkins,J., Rooney,T., Rizzo,M., Walts,A.,
Utterback,T., Fujii,C.Y., Shea,T.P., Creasy,T.H., Haas,B.,
Maiti,R., Wu,D., Peterson,J., Van Aken,S., Pai,G., Militscher,J.,
Sellers,P., Gill,J.E., Feldblyum,T.V., Preuss,D., Lin,X.,
Nierman,W.C., Salzberg,S.L., White,O., Venter,J.C., Fraser,C.M.,
Kaneko,T., Nakamura,Y., Sato,S., Kato,T., Asamizu,E., Sasamoto,S.,
Kimura,T., Idesawa,K., Kawashima,K., Kishida,Y., Kiyokawa,C.,
Kohara,M., Matsumoto,M., Matsuno,A., Muraki,A., Nakayama,S.,
Nakazaki,N., Shinpo,S., Takeuchi,C., Wada,T., Watanabe,A.,
Yamada,M., Yasuda,M. and Tabata,S.
CONSRTM European Union Chromosome 3 Arabidopsis Sequencing Consortium;
Institute for Genomic Research; Kazusa DNA Research Institute
TITLE Sequence and analysis of chromosome 3 of the plant Arabidopsis
thaliana
JOURNAL Nature 408 (6814), 820-822 (2000)
PUBMED 11130713
REFERENCE 2 (bases 1 to 23459830)
AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
CONSRTM TAIR
TITLE Direct Submission
JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE 3 (bases 1 to 23459830)
AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
Vaughn,M. and Town,C.D.
TITLE Direct Submission
JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
9704 Medical Center Dr, Rockville, MD 20850, USA
REMARK Protein update by submitter
FEATURES Qualifiers
source /organism="Arabidopsis thaliana"
/mol_type="genomic DNA"
/db_xref="taxon:3702"
/chromosome="3"
/ecotype="Columbia"
protein /gene="MSH1"
/locus_tag="AT3G24320"
/gene_synonym="ATMSH1"
/gene_synonym="CHLOROPLAST MUTATOR"
/gene_synonym="CHM"
/gene_synonym="CHM1"
/gene_synonym="MUTL protein homolog 1"
/inference="Similar to RNA sequence,
EST:INSD:AI993280.1,INSD:ES084306.1,INSD:ES132582.1,
INSD:ES019976.1,INSD:AV440284.1,INSD:AI994411.1,
INSD:ES139411.1,INSD:AA404881.1,INSD:ES209437.1,
INSD:ES193349.1"
/inference="Similar to RNA sequence, mRNA:INSD:AY191303.1"
/note="MUTL protein homolog 1 (MSH1); FUNCTIONS IN:
damaged DNA binding, mismatched DNA binding, catalytic
activity, ATP binding, nuclease activity; INVOLVED IN:
mismatch repair, mitochondrial genome maintenance,
mitochondrial DNA metabolic process; LOCATED IN:
mitochondrion, chloroplast; EXPRESSED IN: 13 plant
structures; EXPRESSED DURING: 7 growth stages; CONTAINS
InterPro DOMAIN/s: Serine/cysteine peptidase, trypsin-like
(InterPro:IPR009003), DNA mismatch repair protein MutS,
N-terminal (InterPro:IPR016151), Excinuclease ABC, C
subunit, N-terminal (InterPro:IPR000305), DNA mismatch
repair protein MutS, C-terminal (InterPro:IPR000432), DNA
mismatch repair protein MutS-like, N-terminal
(InterPro:IPR007695); BEST Arabidopsis thaliana protein
match is: MUTS homolog 6 (TAIR:AT4G02070.2); Has 14048
Blast hits to 12806 proteins in 2713 species: Archae -
218; Bacteria - 10179; Metazoa - 621; Fungi - 751; Plants
- 393; Viruses - 3; Other Eukaryotes - 1883 (source: NCBI
BLink)."
/db_xref="TAIR:AT3G24320"
/db_xref="Araport:AT3G24320"
intron_pos 38:2 (1/21)
intron_pos 86:0 (2/21)
intron_pos 118:2 (3/21)
intron_pos 144:0 (4/21)
intron_pos 184:0 (5/21)
intron_pos 200:0 (6/21)
intron_pos 221:2 (7/21)
intron_pos 250:1 (8/21)
intron_pos 305:1 (9/21)
intron_pos 342:0 (10/21)
intron_pos 377:1 (11/21)
intron_pos 402:2 (12/21)
intron_pos 422:1 (13/21)
intron_pos 445:0 (14/21)
intron_pos 503:0 (15/21)
intron_pos 573:0 (16/21)
intron_pos 658:2 (17/21)
intron_pos 706:2 (18/21)
intron_pos 726:0 (19/21)
intron_pos 826:0 (20/21)
intron_pos 1042:0 (21/21)
BEGIN
1 MHWIATRNAV VSFPKWRFFF RSSYRTYSSL KPSSPILLNR RYSEGISCLR DGKSLKRITT
61 ASKKVKTSSD VLTDKDLSHL VWWKERLQTC KKPSTLQLIE RLMYTNLLGL DPSLRNGSLK
121 DGNLNWEMLQ FKSRFPREVL LCRVGEFYEA IGIDACILVE YAGLNPFGGL RSDSIPKAGC
181 PIMNLRQTLD DLTRNGYSVC IVEEVQGPTP ARSRKGRFIS GHAHPGSPYV YGLVGVDHDL
241 DFPDPMPVVG ISRSARGYCM ISIFETMKAY SLDDGLTEEA LVTKLRTRRC HHLFLHASLR
301 HNASGTCRWG EFGEGGLLWG ECSSRNFEWF EGDTLSELLS RVKDVYGLDD EVSFRNVNVP
361 SKNRPRPLHL GTATQIGALP TEGIPCLLKV LLPSTCSGLP SLYVRDLLLN PPAYDIALKI
421 QETCKLMSTV TCSIPEFTCV SSAKLVKLLE QREANYIEFC RIKNVLDDVL HMHRHAELVE
481 ILKLLMDPTW VATGLKIDFD TFVNECHWAS DTIGEMISLD ENESHQNVSK CDNVPNEFFY
541 DMESSWRGRV KGIHIEEEIT QVEKSAEALS LAVAEDFHPI ISRIKATTAS LGGPKGEIAY
601 AREHESVWFK GKRFTPSIWA GTAGEDQIKQ LKPALDSKGK KVGEEWFTTP KVEIALVRYH
661 EASENAKARV LELLRELSVK LQTKINVLVF ASMLLVISKA LFSHACEGRR RKWVFPTLVG
721 FSLDEGAKPL DGASRMKLTG LSPYWFDVSS GTAVHNTVDM QSLFLLTGPN GGGKSSLLRS
781 ICAAALLGIS GLMVPAESAC IPHFDSIMLH MKSYDSPVDG KSSFQVEMSE IRSIVSQATS
841 RSLVLIDEIC RGTETAKGTC IAGSVVESLD TSGCLGIVST HLHGIFSLPL TAKNITYKAM
901 GAENVEGQTK PTWKLTDGVC RESLAFETAK REGVPESVIQ RAEALYLSVY AKDASAEVVK
961 PDQIITSSNN DQQIQKPVSS ERSLEKDLAK AIVKICGKKM IEPEAIECLS IGARELPPPS
1021 TVGSSCVYVM RRPDKRLYIG QTDDLEGRIR AHRAKEGLQG SSFLYLMVQG KSMACQLETL
1081 LINQLHEQGY SLANLADGKH RNFGTSSSLS TSDVVSIL
//