LOCUS       AEE73669.1              2176 aa    PRT              PLN 23-MAR-2023
DEFINITION  Arabidopsis thaliana methyl-CPG-binding domain 9 protein.
ACCESSION   CP002686-108
PROTEIN_ID  AEE73669.1
SOURCE      Arabidopsis thaliana (thale cress)
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
            Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
            Camelineae; Arabidopsis.
REFERENCE   1  (bases 1 to 23459830)
  AUTHORS   Salanoubat,M., Lemcke,K., Rieger,M., Ansorge,W., Unseld,M.,
            Fartmann,B., Valle,G., Blocker,H., Perez-Alonso,M., Obermaier,B.,
            Delseny,M., Boutry,M., Grivell,L.A., Mache,R., Puigdomenech,P., De
            Simone,V., Choisne,N., Artiguenave,F., Robert,C., Brottier,P.,
            Wincker,P., Cattolico,L., Weissenbach,J., Saurin,W., Quetier,F.,
            Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M., Benes,V.,
            Wurmbach,E., Drzonek,H., Erfle,H., Jordan,N., Bangert,S.,
            Wiedelmann,R., Kranz,H., Voss,H., Holland,R., Brandt,P.,
            Nyakatura,G., Vezzi,A., D'Angelo,M., Pallavicini,A., Toppo,S.,
            Simionati,B., Conrad,A., Hornischer,K., Kauer,G., Lohnert,T.H.,
            Nordsiek,G., Reichelt,J., Scharfe,M., Schon,O., Bargues,M.,
            Terol,J., Climent,J., Navarro,P., Collado,C., Perez-Perez,A.,
            Ottenwalder,B., Duchemin,D., Cooke,R., Laudie,M., Berger-Llauro,C.,
            Purnelle,B., Masuy,D., de Haan,M., Maarse,A.C., Alcaraz,J.P.,
            Cottet,A., Casacuberta,E., Monfort,A., Argiriou,A., Flores,M.,
            Liguori,R., Vitale,D., Mannhaupt,G., Haase,D., Schoof,H., Rudd,S.,
            Zaccaria,P., Mewes,H.W., Mayer,K.F., Kaul,S., Town,C.D., Koo,H.L.,
            Tallon,L.J., Jenkins,J., Rooney,T., Rizzo,M., Walts,A.,
            Utterback,T., Fujii,C.Y., Shea,T.P., Creasy,T.H., Haas,B.,
            Maiti,R., Wu,D., Peterson,J., Van Aken,S., Pai,G., Militscher,J.,
            Sellers,P., Gill,J.E., Feldblyum,T.V., Preuss,D., Lin,X.,
            Nierman,W.C., Salzberg,S.L., White,O., Venter,J.C., Fraser,C.M.,
            Kaneko,T., Nakamura,Y., Sato,S., Kato,T., Asamizu,E., Sasamoto,S.,
            Kimura,T., Idesawa,K., Kawashima,K., Kishida,Y., Kiyokawa,C.,
            Kohara,M., Matsumoto,M., Matsuno,A., Muraki,A., Nakayama,S.,
            Nakazaki,N., Shinpo,S., Takeuchi,C., Wada,T., Watanabe,A.,
            Yamada,M., Yasuda,M. and Tabata,S.
  CONSRTM   European Union Chromosome 3 Arabidopsis Sequencing Consortium;
            Institute for Genomic Research; Kazusa DNA Research Institute
  TITLE     Sequence and analysis of chromosome 3 of the plant Arabidopsis
            thaliana
  JOURNAL   Nature 408 (6814), 820-822 (2000)
   PUBMED   11130713
REFERENCE   2  (bases 1 to 23459830)
  AUTHORS   Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
  CONSRTM   TAIR
  TITLE     Direct Submission
  JOURNAL   Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
            Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE   3  (bases 1 to 23459830)
  AUTHORS   Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
            Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
            Vaughn,M. and Town,C.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
            9704 Medical Center Dr, Rockville, MD 20850, USA
  REMARK    Protein update by submitter
FEATURES             Qualifiers
     source          /organism="Arabidopsis thaliana"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:3702"
                     /chromosome="3"
                     /ecotype="Columbia"
     protein         /gene="MBD9"
                     /locus_tag="AT3G01460"
                     /gene_synonym="ATMBD9"
                     /gene_synonym="methyl-CPG-binding domain 9"
                     /gene_synonym="T13O15.10"
                     /gene_synonym="T13O15_10"
                     /inference="Similar to RNA sequence,
                     EST:INSD:AI995518.1,INSD:EL330329.1,INSD:AV529938.1,
                     INSD:CD533698.1,INSD:AV547756.1,INSD:CD534185.1,
                     INSD:BE526698.1,INSD:T43427.1,INSD:EH941628.1,
                     INSD:AV551791.1,INSD:EL980535.1,INSD:AV539612.1,
                     INSD:EL010738.1,INSD:BE524167.1,INSD:BE524168.1,
                     INSD:AV539367.1,INSD:CB518269.1,INSD:ES124976.1,
                     INSD:AV548906.1,INSD:AV546133.1,INSD:ES082180.1,
                     INSD:EH994125.1,INSD:AV530089.1,INSD:EH915085.1,
                     INSD:AV524103.1,INSD:ES155717.1,INSD:EL064697.1,
                     INSD:BU636361.1,INSD:BP822774.1"
                     /inference="Similar to RNA sequence, mRNA:INSD:BX822537.1"
                     /note="methyl-CPG-binding domain 9 (MBD9); FUNCTIONS IN:
                     methyl-CpG binding, DNA binding; INVOLVED IN:
                     photoperiodism, flowering, secondary shoot formation,
                     regulation of transcription, DNA-dependent; LOCATED IN:
                     nucleus; EXPRESSED IN: 23 plant structures; EXPRESSED
                     DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Zinc
                     finger, PHD-type, conserved site (InterPro:IPR019786),
                     Zinc finger, RING-type (InterPro:IPR001841), FY-rich,
                     C-terminal (InterPro:IPR003889), Zinc finger, PHD-type
                     (InterPro:IPR001965), FY-rich, N-terminal
                     (InterPro:IPR003888), DNA-binding, integrase-type
                     (InterPro:IPR016177), Zinc finger, FYVE/PHD-type
                     (InterPro:IPR011011), Methyl-CpG DNA binding
                     (InterPro:IPR001739), Zinc finger, PHD-finger
                     (InterPro:IPR019787); BEST Arabidopsis thaliana protein
                     match is: RING/FYVE/PHD-type zinc finger family protein
                     (TAIR:AT1G77250.1); Has 6416 Blast hits to 3988 proteins
                     in 224 species: Archae - 0; Bacteria - 0; Metazoa - 4085;
                     Fungi - 602; Plants - 1260; Viruses - 0; Other Eukaryotes
                     - 469 (source: NCBI BLink)."
                     /db_xref="TAIR:AT3G01460"
                     /db_xref="Araport:AT3G01460"
     intron_pos      172:2 (1/9)
     intron_pos      402:0 (2/9)
     intron_pos      773:0 (3/9)
     intron_pos      1000:0 (4/9)
     intron_pos      1101:0 (5/9)
     intron_pos      1213:0 (6/9)
     intron_pos      1246:0 (7/9)
     intron_pos      1391:0 (8/9)
     intron_pos      2075:0 (9/9)
BEGIN
        1 MEPTDSTNEQ LGDTKTAAVK EESRSFLGID LNEIPTGATL GGGCTAGQDD DGEYEPVEVV
       61 RSIHDNPDPA PGAPAEVPEP DRDASCGACG RPESIELVVV CDACERGFHM SCVNDGVEAA
      121 PSADWMCSDC RTGGERSKLW PLGVKSKLIL DMNASPPSDA EGYGAEETSD SRKHMLASSS
      181 CIGNSFDYAM MHSSFSSLGR GHASLEASGL MSRNTKMSMD ALGSHNLGFG FPLNLNNSSL
      241 PMRFPSLDPS ELFLQNLRHF ISERHGVLED GWRVEFRQPL NGYQLCAVYC APNGKTFSSI
      301 QEVACYLGLA INGNYSCMDA EIRNENSLLQ ERLHTPKRRK TSRWPNNGFP EQKGSSVSAQ
      361 LRRFPFNGQT MSPFAVKSGT HFQAGGSLSS GNNGCGCEEA KNGCPMQFED FFVLSLGRID
      421 IRQSYHNVNV IYPIGYKSCW HDKITGSLFT CEVSDGNSGP IFKVTRSPCS KSFIPAGSTV
      481 FSCPKIDEMV EQNSDKLSNR RDSTQERDDD ASVEILLSEH CPPLGDDILS CLREKSFSKT
      541 VNSLRSEVDS SRVDFDKNLS YDQDHGVEIG DIVVEEDSLS DAWKKVSQKL VDACSIVLKQ
      601 KGTLNFLCKH VDRETSEINW DTMNEKDNVI LSLSKFCCSL APCSVTCGEK DKSEFAAVVD
      661 ALSRWLDQNR FGLDADFVQE MIEHMPGAES CTNYRTLKSR SSSSVPITVA EGALVVKPKG
      721 GENVKDEVFG EISRKAKKPK LNGGHGVRNL HPPPGRPMCL RLPPGLVGDF LQVSEVFWRF
      781 HEILGFEEAF SPENLEQELI NPVFDGLFLD KPGKDDKRSE INFTDKDSTA TKLFSLFDES
      841 RQPFPAKNTS ASELKEKKAG DSSDFKISDS SRGSCVGALL TRAHISLLQV LICELQSKVA
      901 AFVDPNFDSG ESRSRRGRKK DDSTLSAKRN KLHMLPVNEF TWPELARRYI LSLLSMDGNL
      961 ESAEIAARES GKVFRCLQGD GGLLCGSLTG VAGMEADSML LAEAIKKISG SLTSENDVLS
     1021 VEDDDSDGLD ATETNTCSGD IPEWAQVLEP VKKLPTNVGT RIRKCVYEAL ERNPPEWAKK
     1081 ILEHSISKEI YKGNASGPTK KAVLSLLADI RGGDLVQRSI KGTKKRTYIS VSDVIMKKCR
     1141 AVLRGVAAAD EDKVLCTLLG RKLLNSSDND DDGLLGSPAM VSRPLDFRTI DLRLAAGAYD
     1201 GSTEAFLEDV LELWSSIRVM YADQPDCVDL VATLSEKFKS LYEAEVVPLV QKLKDYRKLE
     1261 CLSAEMKKEI KDIVVSVNKL PKAPWDEGVC KVCGVDKDDD SVLLCDTCDA EYHTYCLNPP
     1321 LIRIPDGNWY CPSCVIAKRM AQEALESYKL VRRRKGRKYQ GELTRASMEL TAHLADVMEE
     1381 KDYWEFSAEE RILLLKLLCD ELLSSSLVHQ HLEQCAEAII EMQQKLRSLS SEWKNAKMRQ
     1441 EFLTAKLAKV EPSILKEVGE PHNSSYFADQ MGCDPQPQEG VGDGVTRDDE TSSTAYLNKN
     1501 QGKSPLETDT QPGESHVNFG ESKISSPETI SSPGRHELPI ADTSPLVTDN LPEKDTSETL
     1561 LKSVGRNHET HSPNSNAVEL PTAHDASSQA SQELQACQQD LSATSNEIQN LQQSIRSIES
     1621 QLLKQSIRRD FLGTDASGRL YWGCCFPDEN PRILVDGSIS LQKPVQADLI GSKVPSPFLH
     1681 TVDHGRLRLS PWTYYETETE ISELVQWLHD DDLKERDLRE SILWWKRLRY GDVQKEKKQA
     1741 QNLSAPVFAT GLETKAAMSM EKRYGPCIKL EMETLKKRGK KTKVAEREKL CRCECLESIL
     1801 PSMIHCLICH KTFASDDEFE DHTESKCIPY SLATEEGKDI SDSSKAKESL KSDYLNVKSS
     1861 AGKDVAEISN VSELDSGLIR YQEEESISPY HFEEICSKFV TKDCNRDLVK EIGLISSNGI
     1921 PTFLPSSSTH LNDSVLISAK SNKPDGGDSG DQVIFAGPET NVEGLNSESN MSFDRSVTDS
     1981 HGGPLDKPSG LGFGFSEQKN KKSSGSGLKS CCVVPQAALK RVTGKALPGF RFLKTNLLDM
     2041 DVALPEEALR PSKSHPNRRR AWRVFVKSSQ SIYELVQATI VVEDMIKTEY LKNEWWYWSS
     2101 LSAAAKISTL SALSVRIFSL DAAIIYDKPI TPSNPIDETK PIISLPDQKS QPVSDSQERS
     2161 SRVRRSGKKR KEPEGS
//