LOCUS       AEE82708.1              1512 aa    PRT              PLN 23-MAR-2023
DEFINITION  Arabidopsis thaliana DNA (cytosine-5-)-methyltransferase
            family protein protein.
ACCESSION   CP002687-1225
PROTEIN_ID  AEE82708.1
SOURCE      Arabidopsis thaliana (thale cress)
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
            Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
            Camelineae; Arabidopsis.
REFERENCE   1  (bases 1 to 18585056)
  AUTHORS   Mayer,K., Schuller,C., Wambutt,R., Murphy,G., Volckaert,G.,
            Pohl,T., Dusterhoft,A., Stiekema,W., Entian,K.D., Terryn,N.,
            Harris,B., Ansorge,W., Brandt,P., Grivell,L., Rieger,M.,
            Weichselgartner,M., de Simone,V., Obermaier,B., Mache,R.,
            Muller,M., Kreis,M., Delseny,M., Puigdomenech,P., Watson,M.,
            Schmidtheini,T., Reichert,B., Portatelle,D., Perez-Alonso,M.,
            Boutry,M., Bancroft,I., Vos,P., Hoheisel,J., Zimmermann,W.,
            Wedler,H., Ridley,P., Langham,S.A., McCullagh,B., Bilham,L.,
            Robben,J., Van der Schueren,J., Grymonprez,B., Chuang,Y.J.,
            Vandenbussche,F., Braeken,M., Weltjens,I., Voet,M., Bastiaens,I.,
            Aert,R., Defoor,E., Weitzenegger,T., Bothe,G., Ramsperger,U.,
            Hilbert,H., Braun,M., Holzer,E., Brandt,A., Peters,S., van
            Staveren,M., Dirske,W., Mooijman,P., Klein Lankhorst,R., Rose,M.,
            Hauf,J., Kotter,P., Berneiser,S., Hempel,S., Feldpausch,M.,
            Lamberth,S., Van den Daele,H., De Keyser,A., Buysshaert,C.,
            Gielen,J., Villarroel,R., De Clercq,R., Van Montagu,M., Rogers,J.,
            Cronin,A., Quail,M., Bray-Allen,S., Clark,L., Doggett,J., Hall,S.,
            Kay,M., Lennard,N., McLay,K., Mayes,R., Pettett,A.,
            Rajandream,M.A., Lyne,M., Benes,V., Rechmann,S., Borkova,D.,
            Blocker,H., Scharfe,M., Grimm,M., Lohnert,T.H., Dose,S., de
            Haan,M., Maarse,A., Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M.,
            Fartmann,B., Granderath,K., Dauner,D., Herzl,A., Neumann,S.,
            Argiriou,A., Vitale,D., Liguori,R., Piravandi,E., Massenet,O.,
            Quigley,F., Clabauld,G., Mundlein,A., Felber,R., Schnabl,S.,
            Hiller,R., Schmidt,W., Lecharny,A., Aubourg,S., Chefdor,F.,
            Cooke,R., Berger,C., Montfort,A., Casacuberta,E., Gibbons,T.,
            Weber,N., Vandenbol,M., Bargues,M., Terol,J., Torres,A.,
            Perez-Perez,A., Purnelle,B., Bent,E., Johnson,S., Tacon,D.,
            Jesse,T., Heijnen,L., Schwarz,S., Scholler,P., Heber,S., Francs,P.,
            Bielke,C., Frishman,D., Haase,D., Lemcke,K., Mewes,H.W.,
            Stocker,S., Zaccaria,P., Bevan,M., Wilson,R.K., de la Bastide,M.,
            Habermann,K., Parnell,L., Dedhia,N., Gnoj,L., Schutz,K., Huang,E.,
            Spiegel,L., Sehkon,M., Murray,J., Sheet,P., Cordes,M.,
            Abu-Threideh,J., Stoneking,T., Kalicki,J., Graves,T., Harmon,G.,
            Edwards,J., Latreille,P., Courtney,L., Cloud,J., Abbott,A.,
            Scott,K., Johnson,D., Minx,P., Bentley,D., Fulton,B., Miller,N.,
            Greco,T., Kemp,K., Kramer,J., Fulton,L., Mardis,E., Dante,M.,
            Pepin,K., Hillier,L., Nelson,J., Spieth,J., Ryan,E., Andrews,S.,
            Geisel,C., Layman,D., Du,H., Ali,J., Berghoff,A., Jones,K.,
            Drone,K., Cotton,M., Joshu,C., Antonoiu,B., Zidanic,M., Strong,C.,
            Sun,H., Lamar,B., Yordan,C., Ma,P., Zhong,J., Preston,R., Vil,D.,
            Shekher,M., Matero,A., Shah,R., Swaby,I.K., O'Shaughnessy,A.,
            Rodriguez,M., Hoffmann,J., Till,S., Granat,S., Shohdy,N.,
            Hasegawa,A., Hameed,A., Lodhi,M., Johnson,A., Chen,E., Marra,M.,
            Martienssen,R. and McCombie,W.R.
  TITLE     Sequence and analysis of chromosome 4 of the plant Arabidopsis
            thaliana
  JOURNAL   Nature 402 (6763), 769-777 (1999)
   PUBMED   10617198
REFERENCE   2  (bases 1 to 18585056)
  AUTHORS   Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
  CONSRTM   TAIR
  TITLE     Direct Submission
  JOURNAL   Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
            Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE   3  (bases 1 to 18585056)
  AUTHORS   Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
            Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
            Vaughn,M. and Town,C.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
            9704 Medical Center Dr, Rockville, MD 20850, USA
  REMARK    Protein update by submitter
FEATURES             Qualifiers
     source          /organism="Arabidopsis thaliana"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:3702"
                     /chromosome="4"
                     /ecotype="Columbia"
     protein         /locus_tag="AT4G08990"
                     /gene_synonym="F23J3.20"
                     /gene_synonym="F23J3_20"
                     /note="DNA (cytosine-5-)-methyltransferase family protein;
                     FUNCTIONS IN: DNA binding, DNA
                     (cytosine-5-)-methyltransferase activity; INVOLVED IN: DNA
                     methylation; LOCATED IN: nucleus; CONTAINS InterPro
                     DOMAIN/s: DNA (cytosine-5)-methyltransferase 1
                     (InterPro:IPR017198), DNA methylase, C-5 cytosine-specific
                     (InterPro:IPR001525), Bromo adjacent homology (BAH) domain
                     (InterPro:IPR001025), DNA methylase, C-5
                     cytosine-specific, active site (InterPro:IPR018117); BEST
                     Arabidopsis thaliana protein match is: DNA
                     methyltransferase 2 (TAIR:AT4G14140.1); Has 7214 Blast
                     hits to 6137 proteins in 1440 species: Archae - 249;
                     Bacteria - 4362; Metazoa - 525; Fungi - 274; Plants - 335;
                     Viruses - 111; Other Eukaryotes - 1358 (source: NCBI
                     BLink)."
                     /db_xref="TAIR:AT4G08990"
                     /db_xref="Araport:AT4G08990"
     intron_pos      465:2 (1/10)
     intron_pos      970:0 (2/10)
     intron_pos      1025:0 (3/10)
     intron_pos      1091:1 (4/10)
     intron_pos      1128:2 (5/10)
     intron_pos      1178:0 (6/10)
     intron_pos      1244:0 (7/10)
     intron_pos      1337:0 (8/10)
     intron_pos      1391:0 (9/10)
     intron_pos      1464:0 (10/10)
BEGIN
        1 METKVGKQKK RSVDSNDDVS KERRPKRAAA CRNFKEKPLR ISDKSETVEA KKEQNVVEEI
       61 VAIQLTSSLE SNDDPRPNRR LTDFVLHNSD GVPQPVEMLE LGDIFLEGVV LPLGDDKNEE
      121 KGVRFQSFGR VENWNISGYE DGSPGIWIST ALADYDCRKP ASKYKKIYDY FFEKACACVE
      181 VFKSLSKNPD TSLDELLAAV ARSMSGSKIF SSGGAIQEFV ISQGEFIYNQ LAGLDETAKN
      241 HETCFVENSV LVSLRDHESS KIHKALSNVA LRIDESQLVK SDHLVDGAEA EDVRYAKLIQ
      301 EEEYRISMER SRNKRSSTTS ASNKFYIKIN EHEIANDYPL PSYYKNTKEE TDELLLFEPG
      361 YEVDTRDLPC RTLHNWALYN SDSRMISLEV LPMRPCAEID VTVFGSGVVA EDDGSGFCLD
      421 DSESSTSTQS NVHDGMNIFL SQIKEWMIEF GAEMIFVTLR TDMAWYRLGK PSKQYAPWFE
      481 TVMKTVRVAI SIFNMLMRES RVAKLSYANV IKRLCGLEEN DKAYISSKLL DVERYVVVHG
      541 QIILQLFEEY PDKDIKRCPF VTGLASKMQD IHHTKWIIKR KKKILQKGKN LNPRAGLAHV
      601 VTRMKPMQAT TTRLVNRIWG EFYSIYSPEV PSEAIHEVEE EEIEEDEEED ENEEDDIEEE
      661 AVEVQKSHTP KKSRGNSEDM EIKWNGEILG ETSDGEPLYG RALVGGETVA VGSAVILEVD
      721 DPDETPAIYF VEFMFESSDQ CKMLHGKLLQ RGSETVIGTA ANERELFLTN ECLTVHLKDI
      781 KGTVSLDIRS RPWGHQYRKE NLVVDKLDRA RAEERKANGL PTEYYCKSLY SPERGGFFSL
      841 PRNDIGLGSG FCSSCKIKEE EEERSKTKLN ISKTGVFSNG IEYYNGDFVY VLPNYITKDG
      901 LKKGTSRRTT LKCGRNVGLK AFVVCQLLDV IVLEESRKAS NASFQVKLTR FYRPEDISEE
      961 KAYASDIQEL YYSHDTYILP PEALQGKCEV RKKNDMPLCR EYPILDHIFF CEVFYDSSTG
     1021 YLKQFPANMK LKFSTIKDET LLREKKGKGV ETGTSSGILM KPDEVPKEMR LATLDIFAGC
     1081 GGLSHGLEKA GVSNTKWAIE YEEPAGHAFK QNHPEATVFV DNCNVILRAI MEKCGDVDDC
     1141 VSTVEAAELV AKLDENQKST LPLPGQADFI SGGPPCQGFS GMNRFSDGSW SKVQCEMILA
     1201 FLSFADYFRP KYFLLENVKK FVTYNKGRTF QLTMASLLEI GYQVRFGILE AGTYGVSQPR
     1261 KRVIIWAASP EEVLPEWPEP MHVFDNPGSK ISLPRGLHYD TVRNTKFGAP FRSITVRDTI
     1321 GDLPLVENGE SKINKEYRTT PVSWFQKKIR GNMSVLTDHI CKGLNELNLI RCKKIPKRPG
     1381 ADWRDLPDEN VTLSNGLVEK LRPLALSKTA KNHNEWKGLY GRLDWQGNLP ISITDPQPMG
     1441 KVGMCFHPEQ DRIITVRECA RSQGFPDSYE FSGTTKHKHR QIGNAVPPPL AFALGRKLKE
     1501 ALYLKSSLQH QS
//