LOCUS AEE78284.1 1171 aa PRT PLN 23-MAR-2023
DEFINITION Arabidopsis thaliana Structural maintenance of chromosomes
(SMC) family protein protein.
ACCESSION CP002686-6472
PROTEIN_ID AEE78284.1
SOURCE Arabidopsis thaliana (thale cress)
ORGANISM Arabidopsis thaliana
Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
Camelineae; Arabidopsis.
REFERENCE 1 (bases 1 to 23459830)
AUTHORS Salanoubat,M., Lemcke,K., Rieger,M., Ansorge,W., Unseld,M.,
Fartmann,B., Valle,G., Blocker,H., Perez-Alonso,M., Obermaier,B.,
Delseny,M., Boutry,M., Grivell,L.A., Mache,R., Puigdomenech,P., De
Simone,V., Choisne,N., Artiguenave,F., Robert,C., Brottier,P.,
Wincker,P., Cattolico,L., Weissenbach,J., Saurin,W., Quetier,F.,
Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M., Benes,V.,
Wurmbach,E., Drzonek,H., Erfle,H., Jordan,N., Bangert,S.,
Wiedelmann,R., Kranz,H., Voss,H., Holland,R., Brandt,P.,
Nyakatura,G., Vezzi,A., D'Angelo,M., Pallavicini,A., Toppo,S.,
Simionati,B., Conrad,A., Hornischer,K., Kauer,G., Lohnert,T.H.,
Nordsiek,G., Reichelt,J., Scharfe,M., Schon,O., Bargues,M.,
Terol,J., Climent,J., Navarro,P., Collado,C., Perez-Perez,A.,
Ottenwalder,B., Duchemin,D., Cooke,R., Laudie,M., Berger-Llauro,C.,
Purnelle,B., Masuy,D., de Haan,M., Maarse,A.C., Alcaraz,J.P.,
Cottet,A., Casacuberta,E., Monfort,A., Argiriou,A., flores,M.,
Liguori,R., Vitale,D., Mannhaupt,G., Haase,D., Schoof,H., Rudd,S.,
Zaccaria,P., Mewes,H.W., Mayer,K.F., Kaul,S., Town,C.D., Koo,H.L.,
Tallon,L.J., Jenkins,J., Rooney,T., Rizzo,M., Walts,A.,
Utterback,T., Fujii,C.Y., Shea,T.P., Creasy,T.H., Haas,B.,
Maiti,R., Wu,D., Peterson,J., Van Aken,S., Pai,G., Militscher,J.,
Sellers,P., Gill,J.E., Feldblyum,T.V., Preuss,D., Lin,X.,
Nierman,W.C., Salzberg,S.L., White,O., Venter,J.C., Fraser,C.M.,
Kaneko,T., Nakamura,Y., Sato,S., Kato,T., Asamizu,E., Sasamoto,S.,
Kimura,T., Idesawa,K., Kawashima,K., Kishida,Y., Kiyokawa,C.,
Kohara,M., Matsumoto,M., Matsuno,A., Muraki,A., Nakayama,S.,
Nakazaki,N., Shinpo,S., Takeuchi,C., Wada,T., Watanabe,A.,
Yamada,M., Yasuda,M. and Tabata,S.
CONSRTM European Union Chromosome 3 Arabidopsis Sequencing Consortium;
Institute for Genomic Research; Kazusa DNA Research Institute
TITLE Sequence and analysis of chromosome 3 of the plant Arabidopsis
thaliana
JOURNAL Nature 408 (6814), 820-822 (2000)
PUBMED 11130713
REFERENCE 2 (bases 1 to 23459830)
AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
CONSRTM TAIR
TITLE Direct Submission
JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE 3 (bases 1 to 23459830)
AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
Vaughn,M. and Town,C.D.
TITLE Direct Submission
JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
9704 Medical Center Dr, Rockville, MD 20850, USA
REMARK Protein update by submitter
FEATURES Qualifiers
source /organism="Arabidopsis thaliana"
/mol_type="genomic DNA"
/db_xref="taxon:3702"
/chromosome="3"
/ecotype="Columbia"
protein /gene="ATSMC2"
/locus_tag="AT3G47460"
/inference="Similar to RNA sequence,
EST:INSD:EL117738.1,INSD:AV566225.1,INSD:EL312243.1,
INSD:EH904265.1,INSD:EH863344.1,INSD:AV788325.1,
INSD:EL052515.1,INSD:AV567722.1,INSD:CB262419.1,
INSD:AU228373.1,INSD:ES142326.1,INSD:ES118208.1,
INSD:EL969894.1,INSD:ES097415.1,INSD:EL988397.1,
INSD:EG512980.1,INSD:EH805380.1,INSD:EL278601.1,
INSD:EH869788.1,INSD:EL972016.1,INSD:ES115252.1,
INSD:EL276355.1,INSD:EH854344.1,INSD:AV566768.1,
INSD:AV566471.1,INSD:ES104022.1,INSD:AU237333.1,
INSD:ES072153.1,INSD:EL135897.1,INSD:BP594651.1,
INSD:AV522450.1,INSD:EG456703.1,INSD:EL246751.1,
INSD:EH885898.1,INSD:ES105628.1,INSD:AV555946.1,
INSD:ES156101.1,INSD:AV562726.1,INSD:AV556075.1,
INSD:ES031643.1,INSD:AV530767.1,INSD:EL065747.1,
INSD:W43039.1,INSD:ES204485.1"
/note="ATSMC2; FUNCTIONS IN: transporter activity;
INVOLVED IN: chromosome organization; LOCATED IN:
chromosome; EXPRESSED IN: 19 plant structures; EXPRESSED
DURING: 10 growth stages; CONTAINS InterPro DOMAIN/s: SMCs
flexible hinge (InterPro:IPR010935), RecF/RecN/SMC
protein, N-terminal (InterPro:IPR003395); BEST Arabidopsis
thaliana protein match is: structural maintenance of
chromosomes 2 (TAIR:AT5G62410.1); Has 105566 Blast hits to
56901 proteins in 3202 species: Archae - 1526; Bacteria -
19204; Metazoa - 44960; Fungi - 8478; Plants - 5252;
Viruses - 457; Other Eukaryotes - 25689 (source: NCBI
BLink)."
/db_xref="TAIR:AT3G47460"
/db_xref="Araport:AT3G47460"
intron_pos 107:0 (1/19)
intron_pos 334:0 (2/19)
intron_pos 382:0 (3/19)
intron_pos 479:0 (4/19)
intron_pos 541:0 (5/19)
intron_pos 593:0 (6/19)
intron_pos 613:0 (7/19)
intron_pos 634:0 (8/19)
intron_pos 664:2 (9/19)
intron_pos 697:0 (10/19)
intron_pos 736:0 (11/19)
intron_pos 809:0 (12/19)
intron_pos 856:0 (13/19)
intron_pos 912:0 (14/19)
intron_pos 973:2 (15/19)
intron_pos 1005:0 (16/19)
intron_pos 1034:2 (17/19)
intron_pos 1110:0 (18/19)
intron_pos 1135:0 (19/19)
BEGIN
1 MHIKEICLEG FKSYATRTVV PGFDPHFNAI TGLNGSGKSN ILDSICFVLG ITNLQQVRAA
61 NLQELVYKQG QAGITRATVS VTFDNSERNR SPLGHEDHSE ITVTRQIVVG GKNKYLINGK
121 LAQPNQVQNL FHSVQLNVNN PHFLIMQGRI TKVLNMKPME ILSMLEEAAG TRMYENKKEA
181 ALKTLEKKQT KVDEINKLLE KDILPALEKL RREKSQYMQW ANGNAELDRL KRFCVAFEYV
241 QAEKIRDNSI HVVEEMKIKM TGIDEQTDKT QGEISELEKQ IKALTQAREA SMGGEVKALS
301 DKVDSLSNEV TRELSKLTNM EDTLQGEEKN AEKMVHNIED LKKSVEERAS ALNKCDEGAA
361 ELKQKFQEFS TTLEECEREH QGILAGKSSG DEEKCLEDQL RDAKISVGTA ETELKQLNTK
421 ISHCEKELKE KKSQLMSKQD EAVAVENELD ARKNDVESVK RAFDSLPYKE GQMEALEKDR
481 ESELEIGHRL KDKVHELSAQ LANVQFTYRD PVKNFDRSKV KGVVAKLIKV NDRSSMTALE
541 VTAGGKLFNV IVDTEDTGKQ LLQKGDLRRR VTIIPLNKIQ SHLVPPRVQQ ATVGKGNAEL
601 ALSLVGYSEE LKNAMEYVFG STFVCKTTDA AKEVAFNREI RTPSVTLEGD VFQPSGLLTG
661 GSRKGGGDLL RQLHDLAEAE TKFRAHQKSL SEIEANIKEL QPLQTKFTDM KAQLELKMYD
721 MSLFLKRAEQ NEHHKLGDAV KKLEEEVEEM RSQIKEKEGL YKSCADTVST LEKSIKDHDK
781 NREGRLKDLE KNIKTLKARI QASSKDLKGH ENVRERLVME QEAVTQEQSY LKSQLTSLRT
841 QISTLASDVG NQRAKVDAIQ KDHDQSLSEL KLIHAKMKEC DTQISGSIAE QEKCLQKISD
901 MKLDRKKLEN EVTRMEMEHK NCSVKVDKLV EKHTWITSEK RLFGNGGTDY DFESRDPHKA
961 REELERLQTD QSSLEKRVNK KVTAMFEKAE DEYNALMTKK NIIETDKSKI KKVIEELDEK
1021 KKETLKVTWV KVNQDFGSIF STLLPGTMSK LEPPEGGTFL DGLEVRVAFG DVWKQSLSEL
1081 SGGQRSLLAL SLILALLLFK PAPIYILDEV DAALDLSHTQ NIGRMIKSHF PHSQFIVVSL
1141 KEGMFSNADV LFRTKFVDGV STVQRTVTKQ S
//