LOCUS       BC011752                2012 bp    mRNA    linear   HUM 23-DEC-2003
DEFINITION  Homo sapiens methyl-CpG binding domain protein 4, mRNA (cDNA clone
            MGC:19710 IMAGE:3534047), complete cds.
ACCESSION   BC011752
VERSION     BC011752.2
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2012)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2012)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (30-JUL-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Dec 19, 2003 this sequence version replaced BC011752.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: DCTD/DTP
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D., McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 27 Row: d Column: 6
            This clone was selected for full length sequencing because it
            passed the following selection criteria: Similarity but not
            identity to protein.
FEATURES             Location/Qualifiers
     source          1..2012
                     /db_xref="H-InvDB:HIT000035425"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:19710 IMAGE:3534047"
                     /tissue_type="Lung, small cell carcinoma"
                     /clone_lib="NIH_MGC_7"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..2012
                     /gene="MBD4"
                     /gene_synonym="MED1"
                     /db_xref="GeneID:8930"
                     /db_xref="MIM:603574"
     CDS             44..1768
                     /gene="MBD4"
                     /gene_synonym="MED1"
                     /codon_start=1
                     /product="MBD4 protein"
                     /protein_id="AAH11752.1"
                     /db_xref="GeneID:8930"
                     /db_xref="MIM:603574"
                     /translation="MGTTGLESLSLGDRGAAPTVTSSERLVPDPPNDLRKEDVAMELE
                     RVGEDEEQMMIKRSSECNPLLQEPIASAQFGATAGTECRKSVPCGWERVVKQRLFGKT
                     AGRFDVYFISPQGLKFRSKSSLANYLHKNGETSLKPEDFDFTVLSKRGIKSRYKDCSM
                     AALTSHLQNQSNNSNWNLRTRSKCKKDVFMPPSSSSELQESRGLSNFTSTHLLLKEDE
                     GVDDVNFRKVRKPKGKVTILKGIPIKKTKKGCRKSCSGFVQSDSKRESVCNKADAESE
                     PVAQKSQLDRTVCISDAGACGETLSVTSEENSLVKKKERSLSSGSNFCSEQKTSGIIN
                     KFCSAKDSEHNEKYEDTFLESEEIGTKVEVVERKEHLHTDILKRGSEMDNNCSPTRKD
                     FTEDTIPRTQIERRKTSLYFSSKYNKEALSPPRRKAFKKWTPPRSPFNLVQETLFHDP
                     WKLLIATIFLNRTSGKMAIPVLWKFLEKYPSAEVARTADWRDVSELLKPLGLYDLRAK
                     TIVKFSDEYLTKQWKYPIELHGIGKYGNDSYRIFCVNEWKQVHPEDHKLNKYHDWLWE
                     NHEKLSLS"
     misc_feature    278..508
                     /gene="MBD4"
                     /gene_synonym="MED1"
                     /note="MBD; Region: Methyl-CpG binding domain. DNA binding
                     domain found in several chromosomal proteins. Also known
                     as the TAM (TTF-IIP5, ARBP, MeCP1) domain. MBD proteins
                     may be involved in recruiting histone deacetylases to
                     methyl CpG-enriched regions in the genome to repress
                     transcription"
                     /db_xref="CDD:cd00122"
     misc_feature    1406..1762
                     /gene="MBD4"
                     /gene_synonym="MED1"
                     /note="HhH-GPD; Region: HhH-GPD superfamily base excision
                     DNA repair protein. This family contains a diverse range
                     of structurally related DNA repair proteins. The
                     superfamily is called the HhH-GPD family after its
                     hallmark Helix-hairpin-helix and Gly/Pro rich loop
                     followed by a conserved aspartate. This includes
                     endonuclease III, EC:4.2.99.18 and MutY an A/G-specific
                     adenine glycosylase, both have a C terminal 4Fe-4S
                     cluster. The family also includes 8-oxoguanine DNA
                     glycosylases. The methyl-CPG binding protein MBD4 also
                     contains a related domain that is a thymine DNA
                     glycosylase. The family also includes DNA-3-methyladenine
                     glycosylase II EC:3.2.2.21 and other members of the AlkA
                     family"
                     /db_xref="CDD:pfam00730"
BASE COUNT          673 a          388 c          440 g          511 t
ORIGIN      
        1 gcgttgcggc gctgggctcg ttgctgcagc cggaccctgc tcgatgggca cgactgggct
       61 ggagagtctg agtctggggg accgcggagc tgcccccacc gtcacctcta gtgagcgcct
      121 agtcccagac ccgccgaatg acctccgcaa agaagatgtt gctatggaat tggaaagagt
      181 gggagaagat gaggaacaaa tgatgataaa aagaagcagt gaatgtaatc ccttgctaca
      241 agaacccatc gcttctgctc agtttggtgc tactgcagga acagaatgcc gtaagtctgt
      301 cccatgtgga tgggaaagag ttgtgaagca aaggttattt gggaagacag caggaagatt
      361 tgatgtgtac tttatcagcc cacaaggact gaagttcaga tccaaaagtt cacttgctaa
      421 ttatcttcac aaaaatggag agacttctct taagccagaa gattttgatt ttactgtact
      481 ttctaaaagg ggtatcaagt caagatataa agactgcagc atggcagccc tgacatccca
      541 tctacaaaac caaagtaaca attcaaactg gaacctcagg acccgaagca agtgcaaaaa
      601 ggatgtgttt atgccgccaa gtagtagttc agagttgcag gagagcagag gactctctaa
      661 ctttacttcc actcatttgc ttttgaaaga agatgagggt gttgatgatg ttaacttcag
      721 aaaggttaga aagcccaaag gaaaggtgac tattttgaaa ggaatcccaa ttaagaaaac
      781 taaaaaagga tgtaggaaga gctgttcagg ttttgttcaa agtgatagca aaagagaatc
      841 tgtgtgtaat aaagcagatg ctgaaagtga acctgttgca caaaaaagtc agcttgatag
      901 aactgtctgc atttctgatg ctggagcatg tggtgagacc ctcagtgtga ccagtgaaga
      961 aaacagcctt gtaaaaaaaa aagaaagatc attgagttca ggatcaaatt tttgttctga
     1021 acaaaaaact tctggcatca taaacaaatt ttgttcagcc aaagactcag aacacaacga
     1081 gaagtatgag gatacctttt tagaatctga agaaatcgga acaaaagtag aagttgtgga
     1141 aaggaaagaa catttgcata ctgacatttt aaaacgtggc tctgaaatgg acaacaactg
     1201 ctcaccaacc aggaaagact tcactgaaga taccatccca cgaacacaga tagaaagaag
     1261 gaaaacaagc ctgtattttt ccagcaaata taacaaagaa gctcttagcc ccccacgacg
     1321 taaagccttt aagaaatgga cacctcctcg gtcacctttt aatctcgttc aagaaacact
     1381 ttttcatgat ccatggaagc ttctcatcgc tactatattt ctcaatcgga cctcaggcaa
     1441 aatggcaata cctgtgcttt ggaagtttct ggagaagtat ccttcagctg aggtagcaag
     1501 aaccgcagac tggagagatg tgtcagaact tcttaaacct cttggtctct acgatcttcg
     1561 ggcaaaaacc attgtcaagt tctcagatga atacctgaca aagcagtgga agtatccaat
     1621 tgagcttcat gggattggta aatatggcaa cgactcttac cgaatttttt gtgtcaatga
     1681 gtggaagcag gtgcaccctg aagaccacaa attaaataaa tatcatgact ggctttggga
     1741 aaatcatgaa aaattaagtc tatcttaaac tctgcagctt tcaagctcat ctgttatgca
     1801 tagctttgca cttcaaaaaa gcttaattaa gtacaaccaa ccacctttcc agccatagag
     1861 attttaatta gcccaactag aagcctagtg tgtgtgcttt cttaatgtgt gtgccaatgg
     1921 tggatctttg ctactgaatg tgtttgaaca tgttttgaga tttttttaaa ataaattatt
     1981 atttgacaac aaaaaaaaaa aaaaaaaaaa aa
//