LOCUS BC011752 2012 bp mRNA linear HUM 23-DEC-2003 DEFINITION Homo sapiens methyl-CpG binding domain protein 4, mRNA (cDNA clone MGC:19710 IMAGE:3534047), complete cds. ACCESSION BC011752 VERSION BC011752.2 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2012) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2012) AUTHORS Strausberg,R. TITLE Direct Submission JOURNAL Submitted (30-JUL-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Cancer Genomics Office, National Cancer Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Dec 19, 2003 this sequence version replaced BC011752.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: DCTD/DTP cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 27 Row: d Column: 6 This clone was selected for full length sequencing because it passed the following selection criteria: Similarity but not identity to protein. FEATURES Location/Qualifiers source 1..2012 /db_xref="H-InvDB:HIT000035425" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:19710 IMAGE:3534047" /tissue_type="Lung, small cell carcinoma" /clone_lib="NIH_MGC_7" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..2012 /gene="MBD4" /gene_synonym="MED1" /db_xref="GeneID:8930" /db_xref="MIM:603574" CDS 44..1768 /gene="MBD4" /gene_synonym="MED1" /codon_start=1 /product="MBD4 protein" /protein_id="AAH11752.1" /db_xref="GeneID:8930" /db_xref="MIM:603574" /translation="MGTTGLESLSLGDRGAAPTVTSSERLVPDPPNDLRKEDVAMELE RVGEDEEQMMIKRSSECNPLLQEPIASAQFGATAGTECRKSVPCGWERVVKQRLFGKT AGRFDVYFISPQGLKFRSKSSLANYLHKNGETSLKPEDFDFTVLSKRGIKSRYKDCSM AALTSHLQNQSNNSNWNLRTRSKCKKDVFMPPSSSSELQESRGLSNFTSTHLLLKEDE GVDDVNFRKVRKPKGKVTILKGIPIKKTKKGCRKSCSGFVQSDSKRESVCNKADAESE PVAQKSQLDRTVCISDAGACGETLSVTSEENSLVKKKERSLSSGSNFCSEQKTSGIIN KFCSAKDSEHNEKYEDTFLESEEIGTKVEVVERKEHLHTDILKRGSEMDNNCSPTRKD FTEDTIPRTQIERRKTSLYFSSKYNKEALSPPRRKAFKKWTPPRSPFNLVQETLFHDP WKLLIATIFLNRTSGKMAIPVLWKFLEKYPSAEVARTADWRDVSELLKPLGLYDLRAK TIVKFSDEYLTKQWKYPIELHGIGKYGNDSYRIFCVNEWKQVHPEDHKLNKYHDWLWE NHEKLSLS" misc_feature 278..508 /gene="MBD4" /gene_synonym="MED1" /note="MBD; Region: Methyl-CpG binding domain. DNA binding domain found in several chromosomal proteins. Also known as the TAM (TTF-IIP5, ARBP, MeCP1) domain. MBD proteins may be involved in recruiting histone deacetylases to methyl CpG-enriched regions in the genome to repress transcription" /db_xref="CDD:cd00122" misc_feature 1406..1762 /gene="MBD4" /gene_synonym="MED1" /note="HhH-GPD; Region: HhH-GPD superfamily base excision DNA repair protein. This family contains a diverse range of structurally related DNA repair proteins. The superfamily is called the HhH-GPD family after its hallmark Helix-hairpin-helix and Gly/Pro rich loop followed by a conserved aspartate. This includes endonuclease III, EC:4.2.99.18 and MutY an A/G-specific adenine glycosylase, both have a C terminal 4Fe-4S cluster. The family also includes 8-oxoguanine DNA glycosylases. The methyl-CPG binding protein MBD4 also contains a related domain that is a thymine DNA glycosylase. The family also includes DNA-3-methyladenine glycosylase II EC:3.2.2.21 and other members of the AlkA family" /db_xref="CDD:pfam00730" BASE COUNT 673 a 388 c 440 g 511 t ORIGIN 1 gcgttgcggc gctgggctcg ttgctgcagc cggaccctgc tcgatgggca cgactgggct 61 ggagagtctg agtctggggg accgcggagc tgcccccacc gtcacctcta gtgagcgcct 121 agtcccagac ccgccgaatg acctccgcaa agaagatgtt gctatggaat tggaaagagt 181 gggagaagat gaggaacaaa tgatgataaa aagaagcagt gaatgtaatc ccttgctaca 241 agaacccatc gcttctgctc agtttggtgc tactgcagga acagaatgcc gtaagtctgt 301 cccatgtgga tgggaaagag ttgtgaagca aaggttattt gggaagacag caggaagatt 361 tgatgtgtac tttatcagcc cacaaggact gaagttcaga tccaaaagtt cacttgctaa 421 ttatcttcac aaaaatggag agacttctct taagccagaa gattttgatt ttactgtact 481 ttctaaaagg ggtatcaagt caagatataa agactgcagc atggcagccc tgacatccca 541 tctacaaaac caaagtaaca attcaaactg gaacctcagg acccgaagca agtgcaaaaa 601 ggatgtgttt atgccgccaa gtagtagttc agagttgcag gagagcagag gactctctaa 661 ctttacttcc actcatttgc ttttgaaaga agatgagggt gttgatgatg ttaacttcag 721 aaaggttaga aagcccaaag gaaaggtgac tattttgaaa ggaatcccaa ttaagaaaac 781 taaaaaagga tgtaggaaga gctgttcagg ttttgttcaa agtgatagca aaagagaatc 841 tgtgtgtaat aaagcagatg ctgaaagtga acctgttgca caaaaaagtc agcttgatag 901 aactgtctgc atttctgatg ctggagcatg tggtgagacc ctcagtgtga ccagtgaaga 961 aaacagcctt gtaaaaaaaa aagaaagatc attgagttca ggatcaaatt tttgttctga 1021 acaaaaaact tctggcatca taaacaaatt ttgttcagcc aaagactcag aacacaacga 1081 gaagtatgag gatacctttt tagaatctga agaaatcgga acaaaagtag aagttgtgga 1141 aaggaaagaa catttgcata ctgacatttt aaaacgtggc tctgaaatgg acaacaactg 1201 ctcaccaacc aggaaagact tcactgaaga taccatccca cgaacacaga tagaaagaag 1261 gaaaacaagc ctgtattttt ccagcaaata taacaaagaa gctcttagcc ccccacgacg 1321 taaagccttt aagaaatgga cacctcctcg gtcacctttt aatctcgttc aagaaacact 1381 ttttcatgat ccatggaagc ttctcatcgc tactatattt ctcaatcgga cctcaggcaa 1441 aatggcaata cctgtgcttt ggaagtttct ggagaagtat ccttcagctg aggtagcaag 1501 aaccgcagac tggagagatg tgtcagaact tcttaaacct cttggtctct acgatcttcg 1561 ggcaaaaacc attgtcaagt tctcagatga atacctgaca aagcagtgga agtatccaat 1621 tgagcttcat gggattggta aatatggcaa cgactcttac cgaatttttt gtgtcaatga 1681 gtggaagcag gtgcaccctg aagaccacaa attaaataaa tatcatgact ggctttggga 1741 aaatcatgaa aaattaagtc tatcttaaac tctgcagctt tcaagctcat ctgttatgca 1801 tagctttgca cttcaaaaaa gcttaattaa gtacaaccaa ccacctttcc agccatagag 1861 attttaatta gcccaactag aagcctagtg tgtgtgcttt cttaatgtgt gtgccaatgg 1921 tggatctttg ctactgaatg tgtttgaaca tgttttgaga tttttttaaa ataaattatt 1981 atttgacaac aaaaaaaaaa aaaaaaaaaa aa //