LOCUS BC011752 2012 bp mRNA linear HUM 23-DEC-2003
DEFINITION Homo sapiens methyl-CpG binding domain protein 4, mRNA (cDNA clone
MGC:19710 IMAGE:3534047), complete cds.
ACCESSION BC011752
VERSION BC011752.2
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2012)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2012)
AUTHORS Strausberg,R.
TITLE Direct Submission
JOURNAL Submitted (30-JUL-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Cancer Genomics Office, National Cancer
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Dec 19, 2003 this sequence version replaced BC011752.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: DCTD/DTP
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: National Institutes of Health Intramural
Sequencing Center (NISC),
Gaithersburg, Maryland;
Web site: http://www.nisc.nih.gov/
Contact: nisc_mgc@nhgri.nih.gov
Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
Young,A., Zhang,L.-H. and Green,E.D.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 27 Row: d Column: 6
This clone was selected for full length sequencing because it
passed the following selection criteria: Similarity but not
identity to protein.
FEATURES Location/Qualifiers
source 1..2012
/db_xref="H-InvDB:HIT000035425"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:19710 IMAGE:3534047"
/tissue_type="Lung, small cell carcinoma"
/clone_lib="NIH_MGC_7"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..2012
/gene="MBD4"
/gene_synonym="MED1"
/db_xref="GeneID:8930"
/db_xref="MIM:603574"
CDS 44..1768
/gene="MBD4"
/gene_synonym="MED1"
/codon_start=1
/product="MBD4 protein"
/protein_id="AAH11752.1"
/db_xref="GeneID:8930"
/db_xref="MIM:603574"
/translation="MGTTGLESLSLGDRGAAPTVTSSERLVPDPPNDLRKEDVAMELE
RVGEDEEQMMIKRSSECNPLLQEPIASAQFGATAGTECRKSVPCGWERVVKQRLFGKT
AGRFDVYFISPQGLKFRSKSSLANYLHKNGETSLKPEDFDFTVLSKRGIKSRYKDCSM
AALTSHLQNQSNNSNWNLRTRSKCKKDVFMPPSSSSELQESRGLSNFTSTHLLLKEDE
GVDDVNFRKVRKPKGKVTILKGIPIKKTKKGCRKSCSGFVQSDSKRESVCNKADAESE
PVAQKSQLDRTVCISDAGACGETLSVTSEENSLVKKKERSLSSGSNFCSEQKTSGIIN
KFCSAKDSEHNEKYEDTFLESEEIGTKVEVVERKEHLHTDILKRGSEMDNNCSPTRKD
FTEDTIPRTQIERRKTSLYFSSKYNKEALSPPRRKAFKKWTPPRSPFNLVQETLFHDP
WKLLIATIFLNRTSGKMAIPVLWKFLEKYPSAEVARTADWRDVSELLKPLGLYDLRAK
TIVKFSDEYLTKQWKYPIELHGIGKYGNDSYRIFCVNEWKQVHPEDHKLNKYHDWLWE
NHEKLSLS"
misc_feature 278..508
/gene="MBD4"
/gene_synonym="MED1"
/note="MBD; Region: Methyl-CpG binding domain. DNA binding
domain found in several chromosomal proteins. Also known
as the TAM (TTF-IIP5, ARBP, MeCP1) domain. MBD proteins
may be involved in recruiting histone deacetylases to
methyl CpG-enriched regions in the genome to repress
transcription"
/db_xref="CDD:cd00122"
misc_feature 1406..1762
/gene="MBD4"
/gene_synonym="MED1"
/note="HhH-GPD; Region: HhH-GPD superfamily base excision
DNA repair protein. This family contains a diverse range
of structurally related DNA repair proteins. The
superfamily is called the HhH-GPD family after its
hallmark Helix-hairpin-helix and Gly/Pro rich loop
followed by a conserved aspartate. This includes
endonuclease III, EC:4.2.99.18 and MutY an A/G-specific
adenine glycosylase, both have a C terminal 4Fe-4S
cluster. The family also includes 8-oxoguanine DNA
glycosylases. The methyl-CPG binding protein MBD4 also
contains a related domain that is a thymine DNA
glycosylase. The family also includes DNA-3-methyladenine
glycosylase II EC:3.2.2.21 and other members of the AlkA
family"
/db_xref="CDD:pfam00730"
BASE COUNT 673 a 388 c 440 g 511 t
ORIGIN
1 gcgttgcggc gctgggctcg ttgctgcagc cggaccctgc tcgatgggca cgactgggct
61 ggagagtctg agtctggggg accgcggagc tgcccccacc gtcacctcta gtgagcgcct
121 agtcccagac ccgccgaatg acctccgcaa agaagatgtt gctatggaat tggaaagagt
181 gggagaagat gaggaacaaa tgatgataaa aagaagcagt gaatgtaatc ccttgctaca
241 agaacccatc gcttctgctc agtttggtgc tactgcagga acagaatgcc gtaagtctgt
301 cccatgtgga tgggaaagag ttgtgaagca aaggttattt gggaagacag caggaagatt
361 tgatgtgtac tttatcagcc cacaaggact gaagttcaga tccaaaagtt cacttgctaa
421 ttatcttcac aaaaatggag agacttctct taagccagaa gattttgatt ttactgtact
481 ttctaaaagg ggtatcaagt caagatataa agactgcagc atggcagccc tgacatccca
541 tctacaaaac caaagtaaca attcaaactg gaacctcagg acccgaagca agtgcaaaaa
601 ggatgtgttt atgccgccaa gtagtagttc agagttgcag gagagcagag gactctctaa
661 ctttacttcc actcatttgc ttttgaaaga agatgagggt gttgatgatg ttaacttcag
721 aaaggttaga aagcccaaag gaaaggtgac tattttgaaa ggaatcccaa ttaagaaaac
781 taaaaaagga tgtaggaaga gctgttcagg ttttgttcaa agtgatagca aaagagaatc
841 tgtgtgtaat aaagcagatg ctgaaagtga acctgttgca caaaaaagtc agcttgatag
901 aactgtctgc atttctgatg ctggagcatg tggtgagacc ctcagtgtga ccagtgaaga
961 aaacagcctt gtaaaaaaaa aagaaagatc attgagttca ggatcaaatt tttgttctga
1021 acaaaaaact tctggcatca taaacaaatt ttgttcagcc aaagactcag aacacaacga
1081 gaagtatgag gatacctttt tagaatctga agaaatcgga acaaaagtag aagttgtgga
1141 aaggaaagaa catttgcata ctgacatttt aaaacgtggc tctgaaatgg acaacaactg
1201 ctcaccaacc aggaaagact tcactgaaga taccatccca cgaacacaga tagaaagaag
1261 gaaaacaagc ctgtattttt ccagcaaata taacaaagaa gctcttagcc ccccacgacg
1321 taaagccttt aagaaatgga cacctcctcg gtcacctttt aatctcgttc aagaaacact
1381 ttttcatgat ccatggaagc ttctcatcgc tactatattt ctcaatcgga cctcaggcaa
1441 aatggcaata cctgtgcttt ggaagtttct ggagaagtat ccttcagctg aggtagcaag
1501 aaccgcagac tggagagatg tgtcagaact tcttaaacct cttggtctct acgatcttcg
1561 ggcaaaaacc attgtcaagt tctcagatga atacctgaca aagcagtgga agtatccaat
1621 tgagcttcat gggattggta aatatggcaa cgactcttac cgaatttttt gtgtcaatga
1681 gtggaagcag gtgcaccctg aagaccacaa attaaataaa tatcatgact ggctttggga
1741 aaatcatgaa aaattaagtc tatcttaaac tctgcagctt tcaagctcat ctgttatgca
1801 tagctttgca cttcaaaaaa gcttaattaa gtacaaccaa ccacctttcc agccatagag
1861 attttaatta gcccaactag aagcctagtg tgtgtgcttt cttaatgtgt gtgccaatgg
1921 tggatctttg ctactgaatg tgtttgaaca tgttttgaga tttttttaaa ataaattatt
1981 atttgacaac aaaaaaaaaa aaaaaaaaaa aa
//