LOCUS BC028671 4420 bp mRNA linear HUM 15-JUL-2006 DEFINITION Homo sapiens SET domain, bifurcated 1, mRNA (cDNA clone MGC:16563 IMAGE:4098658), complete cds. ACCESSION BC028671 VERSION BC028671.2 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4420) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 4420) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (29-APR-2002) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Aug 20, 2003 this sequence version replaced BC028671.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Institute for Systems Biology http://www.systemsbiology.org contact: amadan@systemsbiology.org Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 25 Row: p Column: 12 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 41281392. FEATURES Location/Qualifiers source 1..4420 /db_xref="H-InvDB:HIT000040598" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:16563 IMAGE:4098658" /tissue_type="Muscle, rhabdomyosarcoma" /clone_lib="NIH_MGC_17" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..4420 /gene="SETDB1" /gene_synonym="ESET" /gene_synonym="KG1T" /gene_synonym="KIAA0067" /db_xref="GeneID:9869" /db_xref="HGNC:HGNC:10761" /db_xref="MIM:604396" CDS 118..3990 /gene="SETDB1" /gene_synonym="ESET" /gene_synonym="KG1T" /gene_synonym="KIAA0067" /codon_start=1 /product="SET domain, bifurcated 1" /protein_id="AAH28671.1" /db_xref="GeneID:9869" /db_xref="HGNC:HGNC:10761" /db_xref="MIM:604396" /translation="MSSLPGCIGLDAATATVESEEIAELQQAVVEELGISMEELRHFI DEELEKMDCVQQRKKQLAELETWVIQKESEVAHVDQLFDDASRAVTNCESLVKDFYSK LGLQYRDSSSEDESSRPTEIIEIPDEDDDVLSIDSGDAGSRTPKDQKLREAMAALRKS AQDVQKFMDAVNKKSSSQDLHKGTLSQMSGELSKDGDLIVSMRILGKKRTKTWHKGTL IAIQTVGPGKKYKVKFDNKGKSLLSGNHIAYDYHPPADKLYVGSRVVAKYKDGNQVWL YAGIVAETPNVKNKLRFLIFFDDGYASYVTQSELYPICRPLKKTWEDIEDISCRDFIE EYVTAYPNRPMVLLKSGQLIKTEWEGTWWKSRVEEVDGSLVRILFLDDKRCEWIYRGS TRLEPMFSMKTSSASALEKKQGQLRTRPNMGAVRSKGPVVQYTQDLTGTGTQFKPVEP PQPTAPPAPPFPPAPPLSPQAGDSDLESQLAQSRKQVAKKSTSFRPGSVGSGHSSSTS PALSENVSGGKPGINQTYRSPLGSTASAPAPSALPAPPAPPVFHGMLERAPAEPSYRA PMEKLFYLPHVCSYTCLSRVRPMRNEQYRGKNPLLVPLLYDFRRMTARRRVNRKMGFH VIYKTPCGLCLRTMQEIERYLFETGCDFLFLEMFCLDPYVLVDRKFQPYKPFYYILDI TYGKEDVPLSCVNEIDTTPPPQVAYSKERIPGKGVFINTGPEFLVGCDCKDGCRDKSK CACHQLTIQATACTPGGQINPNSGYQYKRLEECLPTGVYECNKRCKCDPNMCTNRLVQ HGLQVRLQLFKTQNKGWGIRCLDDIAKGSFVCIYAGKILTDDFADKEGLEMGDEYFAN LDHIESVENFKEGYESDAPCSSDSSGVDLKDQEDGNSGTEDPEESNDDSSDDNFCKDE DFSTSSVWRSYATRRQTRGQKENGLSETTSKDSHPPDLGPPHIPVPPSIPVGGCNPPS SEETPKNKVASWLSCNSVSEGGFADSDSHSSFKTNEGGEGRAGGSRMEAEKASTSGLG IKDEGDIKQAKKEDTDDRNKMSVVTESSRNYGYNPSPVKPEGLRRPPSKTSMHQSRRL MASAQSNPDDVLTLSSSTESEGESGTSRKPTAGQTSATAVDSDDIQTISSGSEGDDFE DKKNMTGPMKRQVAVKSTRGFALKSTHGIAIKSTNMASVDKGESAPVRKNTRQFYDGE ESCYIIDAKLEGNLGRYLNHSCSPNLFVQNVFVDTHDLRFPWVAFFASKIRAGTELTW DYNYEVGSVEGKELLCCCGAIECRGRLL" BASE COUNT 1200 a 1099 c 1117 g 1004 t ORIGIN 1 ctcccttatc ccttcgcttt cgctcttttc cgtcgaggcc gacccctgag ttgtgagtct 61 ggggtctggt tggtgaaaaa gagcccttga agctggaaga cgggagagga caaaagcatg 121 tcttcccttc ctgggtgcat tggtttggat gcagcaacag ctacagtgga gtctgaagag 181 attgcagagc tgcaacaggc agtggttgag gaactgggta tctctatgga ggaacttcgg 241 catttcatcg atgaggaact ggagaagatg gattgtgtac agcaacgcaa gaagcagcta 301 gcagagttag agacatgggt aatacagaaa gaatctgagg tggctcacgt tgaccaactc 361 tttgatgatg catccagggc agtgactaat tgtgagtctt tggtgaagga cttctactcc 421 aagctgggac tacaataccg ggacagtagc tctgaggacg aatcttcccg gcctacagaa 481 ataattgaga ttcctgatga agatgatgat gtcctcagta ttgattcagg tgatgctggg 541 agcagaactc caaaagacca gaagctccgt gaagctatgg ctgccttaag aaagtcagct 601 caagatgttc agaagttcat ggatgctgtc aacaagaaga gcagttccca ggatctgcat 661 aaaggaacct tgagtcagat gtctggagaa ctaagcaaag atggtgacct gatagtcagc 721 atgcgaattc tgggcaagaa gagaactaag acttggcaca aaggcaccct tattgccatc 781 cagacagttg ggccagggaa gaaatacaag gtgaaatttg acaacaaagg aaagagtcta 841 ctgtcgggga accatattgc ctatgattac caccctcctg ctgacaagct gtatgtgggc 901 agtcgggtgg tcgccaaata caaagatggg aatcaggtct ggctctatgc tggcattgta 961 gctgagacac caaacgtcaa aaacaagctc aggtttctca ttttctttga tgatggctat 1021 gcttcctatg tcacacagtc ggaactgtat cccatttgcc ggccactgaa aaagacttgg 1081 gaggacatag aagacatctc ctgccgtgac ttcatagagg agtatgtcac tgcctacccc 1141 aaccgcccca tggtactgct caagagtggc cagcttatca agactgagtg ggaaggcacg 1201 tggtggaagt cccgagttga ggaggtggat ggcagcctag tcaggatcct cttcctggat 1261 gacaaaagat gtgagtggat ctatcgaggc tctacacggc tggagcccat gttcagcatg 1321 aaaacatcct cagcctctgc actggagaag aagcaaggac agctcaggac acgtccaaat 1381 atgggtgctg tgaggagcaa aggccctgtt gtccagtaca cacaggatct gaccggtact 1441 ggaacccagt tcaagccagt ggaaccccca cagcctacag ctccacctgc cccacctttc 1501 ccacctgctc cacctctatc cccccaagca ggtgacagtg acttggaaag ccagcttgcc 1561 cagtcacgga agcaggtagc caaaaagagc acgtcctttc gaccaggatc tgtgggctct 1621 ggtcattcct cctctacatc tcctgcactc agtgaaaatg tctctggtgg gaaacctggg 1681 atcaaccaga catatagatc acctttaggc tccacagcct ctgccccagc accctcagca 1741 ctcccggccc ctccagcacc cccagtcttc catggcatgc tggagcgggc cccagcagag 1801 ccctcctacc gtgctcccat ggagaagctt ttctacttac ctcatgtctg cagctatacc 1861 tgcctgtctc gagtcagacc tatgaggaat gagcagtacc ggggcaagaa ccctctgctg 1921 gtcccgttac tatatgactt ccggcggatg acagcccggc gtcgagttaa ccgcaagatg 1981 ggctttcatg ttatctataa gacaccttgt ggtctctgcc ttcggacaat gcaggagata 2041 gaacgctacc ttttcgagac tggctgtgac ttcctcttcc tggagatgtt ctgtttggat 2101 ccatatgttc ttgtggaccg aaagtttcag ccctataagc ctttttacta tattttggac 2161 atcacttatg ggaaggaaga tgttccccta tcctgtgtca atgagattga cacaacccct 2221 ccaccccagg tggcctacag caaggaacgt atcccgggca agggtgtttt cattaacaca 2281 ggccctgaat ttctggttgg ctgtgactgc aaggatgggt gtcgggacaa gtccaagtgt 2341 gcctgccatc aactaactat ccaggctaca gcctgtaccc caggaggcca aatcaaccct 2401 aactctggct accagtacaa gagactagaa gagtgtctac ccacaggggt atatgagtgt 2461 aacaaacgct gcaaatgtga cccaaacatg tgcacaaacc ggttggtgca acatggacta 2521 caagttcggc tacagctatt caagacacag aacaagggct ggggtatccg ctgcttggat 2581 gacattgcca aaggctcttt tgtttgtatt tatgcaggca aaatcctgac agatgacttt 2641 gcagacaagg agggtctgga aatgggtgat gagtactttg caaatctgga ccatatcgag 2701 agcgtggaga acttcaaaga aggatatgag agtgatgccc cctgttcctc tgacagcagt 2761 ggtgtagact tgaaggacca ggaagatggc aacagcggta cagaggaccc tgaagagtcc 2821 aatgatgata gctcagatga taacttctgt aaggatgagg acttcagcac cagttcagtg 2881 tggcggagct atgctacccg gaggcagacc cggggccaga aagagaacgg actctctgag 2941 acaacttcca aggactccca ccccccagat cttggacccc cacatattcc tgttcctccc 3001 tcaatccctg taggtggctg caatccacct tcctccgaag agacacccaa gaacaaggtg 3061 gcctcatggt tgagctgcaa tagtgtcagt gaaggtggtt ttgctgactc tgatagccat 3121 tcatccttca agactaatga aggtggggag ggccgggctg ggggaagccg aatggaggct 3181 gagaaggcct ccacctcagg actaggcatc aaggatgagg gagacatcaa acaggccaag 3241 aaagaggaca ctgacgaccg aaacaagatg tcagtagtta ctgaaagctc tcgaaattac 3301 ggttacaatc cttctcctgt gaagcctgaa ggacttcgcc gcccacctag taagactagt 3361 atgcatcaaa gccgaagact catggcttct gctcagtcca accctgatga tgtcctgaca 3421 ctgtccagca gcacagaaag tgagggggaa agtgggacca gccgaaagcc cactgctggt 3481 cagacttcgg ctacagcggt tgacagtgat gatatccaga ccatatcctc tggctctgaa 3541 ggggatgact ttgaggacaa gaagaacatg actggtccaa tgaagcgtca agtggcagta 3601 aaatcaaccc gaggctttgc tcttaaatca acccatggga ttgcaattaa atcaaccaac 3661 atggcctctg tggacaaggg ggagagcgca cctgttcgta agaacacacg ccaattctat 3721 gatggcgagg agtcttgcta catcattgat gccaagcttg aaggcaacct gggccgctac 3781 ctcaaccaca gttgcagccc caacctgttt gtccagaatg tcttcgtgga tacccatgat 3841 cttcgcttcc cctgggtggc cttctttgcc agcaaaatcc gggctgggac agaacttact 3901 tgggactaca actacgaggt gggcagtgtg gaaggcaagg agctactctg ttgctgtggg 3961 gccattgaat gcagaggacg tcttctttag aggacagcct tcttcccaac ccttcttgaa 4021 ctgtcgtttc ctcaggaact gggtcttcct gattgttgaa ccctgacccg aagtctctgg 4081 gctagctact ccccccagct cctagttgat agaaatgggg gttctggacc agatgatccc 4141 ttccaatgtg gtgctagcag gcaggatccc ttctccacct ccaaaggccc taaagggtgg 4201 ggagagatca ccactctaac ctcggcctga catccctccc atcccatatt tgtccaagtg 4261 ttcctgcttc taacagactt tgttcttaga atggagcctg tgtatctact atctccagtt 4321 tgtattattt cttgaaagtc ttttaacaat atgataaaac taaaaaaaaa aaaaaaaaaa 4381 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa //