LOCUS AEE78284.1 1171 aa PRT PLN 23-MAR-2023 DEFINITION Arabidopsis thaliana Structural maintenance of chromosomes (SMC) family protein. ACCESSION CP002686-6472 PROTEIN_ID AEE78284.1 SOURCE Arabidopsis thaliana (thale cress) ORGANISM Arabidopsis thaliana Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis. REFERENCE 1 (bases 1 to 23459830) AUTHORS Salanoubat,M., Lemcke,K., Rieger,M., Ansorge,W., Unseld,M., Fartmann,B., Valle,G., Blocker,H., Perez-Alonso,M., Obermaier,B., Delseny,M., Boutry,M., Grivell,L.A., Mache,R., Puigdomenech,P., De Simone,V., Choisne,N., Artiguenave,F., Robert,C., Brottier,P., Wincker,P., Cattolico,L., Weissenbach,J., Saurin,W., Quetier,F., Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M., Benes,V., Wurmbach,E., Drzonek,H., Erfle,H., Jordan,N., Bangert,S., Wiedelmann,R., Kranz,H., Voss,H., Holland,R., Brandt,P., Nyakatura,G., Vezzi,A., D'Angelo,M., Pallavicini,A., Toppo,S., Simionati,B., Conrad,A., Hornischer,K., Kauer,G., Lohnert,T.H., Nordsiek,G., Reichelt,J., Scharfe,M., Schon,O., Bargues,M., Terol,J., Climent,J., Navarro,P., Collado,C., Perez-Perez,A., Ottenwalder,B., Duchemin,D., Cooke,R., Laudie,M., Berger-Llauro,C., Purnelle,B., Masuy,D., de Haan,M., Maarse,A.C., Alcaraz,J.P., Cottet,A., Casacuberta,E., Monfort,A., Argiriou,A., flores,M., Liguori,R., Vitale,D., Mannhaupt,G., Haase,D., Schoof,H., Rudd,S., Zaccaria,P., Mewes,H.W., Mayer,K.F., Kaul,S., Town,C.D., Koo,H.L., Tallon,L.J., Jenkins,J., Rooney,T., Rizzo,M., Walts,A., Utterback,T., Fujii,C.Y., Shea,T.P., Creasy,T.H., Haas,B., Maiti,R., Wu,D., Peterson,J., Van Aken,S., Pai,G., Militscher,J., Sellers,P., Gill,J.E., Feldblyum,T.V., Preuss,D., Lin,X., Nierman,W.C., Salzberg,S.L., White,O., Venter,J.C., Fraser,C.M., Kaneko,T., Nakamura,Y., Sato,S., Kato,T., Asamizu,E., Sasamoto,S., Kimura,T., Idesawa,K., Kawashima,K., 
Kishida,Y., Kiyokawa,C., Kohara,M., Matsumoto,M., Matsuno,A., Muraki,A., Nakayama,S., Nakazaki,N., Shinpo,S., Takeuchi,C., Wada,T., Watanabe,A., Yamada,M., Yasuda,M. and Tabata,S. CONSRTM European Union Chromosome 3 Arabidopsis Sequencing Consortium; Institute for Genomic Research; Kazusa DNA Research Institute TITLE Sequence and analysis of chromosome 3 of the plant Arabidopsis thaliana JOURNAL Nature 408 (6814), 820-822 (2000) PUBMED 11130713 REFERENCE 2 (bases 1 to 23459830) AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E. CONSRTM TAIR TITLE Direct Submission JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie Institution, 260 Panama Street, Stanford, CA, USA REFERENCE 3 (bases 1 to 23459830) AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M., Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R., Vaughn,M. and Town,C.D. TITLE Direct Submission JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute, 9704 Medical Center Dr, Rockville, MD 20850, USA REMARK Protein update by submitter FEATURES Qualifiers source /organism="Arabidopsis thaliana" /mol_type="genomic DNA" /db_xref="taxon:3702" /chromosome="3" /ecotype="Columbia" protein /gene="ATSMC2" /locus_tag="AT3G47460" /inference="Similar to RNA sequence, EST:INSD:EL117738.1,INSD:AV566225.1,INSD:EL312243.1, INSD:EH904265.1,INSD:EH863344.1,INSD:AV788325.1, INSD:EL052515.1,INSD:AV567722.1,INSD:CB262419.1, INSD:AU228373.1,INSD:ES142326.1,INSD:ES118208.1, INSD:EL969894.1,INSD:ES097415.1,INSD:EL988397.1, INSD:EG512980.1,INSD:EH805380.1,INSD:EL278601.1, INSD:EH869788.1,INSD:EL972016.1,INSD:ES115252.1, INSD:EL276355.1,INSD:EH854344.1,INSD:AV566768.1, INSD:AV566471.1,INSD:ES104022.1,INSD:AU237333.1, INSD:ES072153.1,INSD:EL135897.1,INSD:BP594651.1, INSD:AV522450.1,INSD:EG456703.1,INSD:EL246751.1, INSD:EH885898.1,INSD:ES105628.1,INSD:AV555946.1, INSD:ES156101.1,INSD:AV562726.1,INSD:AV556075.1, INSD:ES031643.1,INSD:AV530767.1,INSD:EL065747.1, 
INSD:W43039.1,INSD:ES204485.1" /note="ATSMC2; FUNCTIONS IN: transporter activity; INVOLVED IN: chromosome organization; LOCATED IN: chromosome; EXPRESSED IN: 19 plant structures; EXPRESSED DURING: 10 growth stages; CONTAINS InterPro DOMAIN/s: SMCs flexible hinge (InterPro:IPR010935), RecF/RecN/SMC protein, N-terminal (InterPro:IPR003395); BEST Arabidopsis thaliana protein match is: structural maintenance of chromosomes 2 (TAIR:AT5G62410.1); Has 105566 Blast hits to 56901 proteins in 3202 species: Archae - 1526; Bacteria - 19204; Metazoa - 44960; Fungi - 8478; Plants - 5252; Viruses - 457; Other Eukaryotes - 25689 (source: NCBI BLink)." /db_xref="TAIR:AT3G47460" /db_xref="Araport:AT3G47460" intron_pos 107:0 (1/19) intron_pos 334:0 (2/19) intron_pos 382:0 (3/19) intron_pos 479:0 (4/19) intron_pos 541:0 (5/19) intron_pos 593:0 (6/19) intron_pos 613:0 (7/19) intron_pos 634:0 (8/19) intron_pos 664:2 (9/19) intron_pos 697:0 (10/19) intron_pos 736:0 (11/19) intron_pos 809:0 (12/19) intron_pos 856:0 (13/19) intron_pos 912:0 (14/19) intron_pos 973:2 (15/19) intron_pos 1005:0 (16/19) intron_pos 1034:2 (17/19) intron_pos 1110:0 (18/19) intron_pos 1135:0 (19/19) BEGIN 1 MHIKEICLEG FKSYATRTVV PGFDPHFNAI TGLNGSGKSN ILDSICFVLG ITNLQQVRAA 61 NLQELVYKQG QAGITRATVS VTFDNSERNR SPLGHEDHSE ITVTRQIVVG GKNKYLINGK 121 LAQPNQVQNL FHSVQLNVNN PHFLIMQGRI TKVLNMKPME ILSMLEEAAG TRMYENKKEA 181 ALKTLEKKQT KVDEINKLLE KDILPALEKL RREKSQYMQW ANGNAELDRL KRFCVAFEYV 241 QAEKIRDNSI HVVEEMKIKM TGIDEQTDKT QGEISELEKQ IKALTQAREA SMGGEVKALS 301 DKVDSLSNEV TRELSKLTNM EDTLQGEEKN AEKMVHNIED LKKSVEERAS ALNKCDEGAA 361 ELKQKFQEFS TTLEECEREH QGILAGKSSG DEEKCLEDQL RDAKISVGTA ETELKQLNTK 421 ISHCEKELKE KKSQLMSKQD EAVAVENELD ARKNDVESVK RAFDSLPYKE GQMEALEKDR 481 ESELEIGHRL KDKVHELSAQ LANVQFTYRD PVKNFDRSKV KGVVAKLIKV NDRSSMTALE 541 VTAGGKLFNV IVDTEDTGKQ LLQKGDLRRR VTIIPLNKIQ SHLVPPRVQQ ATVGKGNAEL 601 ALSLVGYSEE LKNAMEYVFG STFVCKTTDA AKEVAFNREI RTPSVTLEGD VFQPSGLLTG 661 GSRKGGGDLL RQLHDLAEAE TKFRAHQKSL SEIEANIKEL 
QPLQTKFTDM KAQLELKMYD 721 MSLFLKRAEQ NEHHKLGDAV KKLEEEVEEM RSQIKEKEGL YKSCADTVST LEKSIKDHDK 781 NREGRLKDLE KNIKTLKARI QASSKDLKGH ENVRERLVME QEAVTQEQSY LKSQLTSLRT 841 QISTLASDVG NQRAKVDAIQ KDHDQSLSEL KLIHAKMKEC DTQISGSIAE QEKCLQKISD 901 MKLDRKKLEN EVTRMEMEHK NCSVKVDKLV EKHTWITSEK RLFGNGGTDY DFESRDPHKA 961 REELERLQTD QSSLEKRVNK KVTAMFEKAE DEYNALMTKK NIIETDKSKI KKVIEELDEK 1021 KKETLKVTWV KVNQDFGSIF STLLPGTMSK LEPPEGGTFL DGLEVRVAFG DVWKQSLSEL 1081 SGGQRSLLAL SLILALLLFK PAPIYILDEV DAALDLSHTQ NIGRMIKSHF PHSQFIVVSL 1141 KEGMFSNADV LFRTKFVDGV STVQRTVTKQ S //