LOCUS       BC028671                4420 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens SET domain, bifurcated 1, mRNA (cDNA clone MGC:16563
            IMAGE:4098658), complete cds.
ACCESSION   BC028671
VERSION     BC028671.2
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4420)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 4420)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (29-APR-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Aug 20, 2003 this sequence version replaced BC028671.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
            Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 25 Row: p Column: 12
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 41281392.
FEATURES             Location/Qualifiers
     source          1..4420
                     /db_xref="H-InvDB:HIT000040598"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:16563 IMAGE:4098658"
                     /tissue_type="Muscle, rhabdomyosarcoma"
                     /clone_lib="NIH_MGC_17"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..4420
                     /gene="SETDB1"
                     /gene_synonym="ESET"
                     /gene_synonym="KG1T"
                     /gene_synonym="KIAA0067"
                     /db_xref="GeneID:9869"
                     /db_xref="HGNC:HGNC:10761"
                     /db_xref="MIM:604396"
     CDS             118..3990
                     /gene="SETDB1"
                     /gene_synonym="ESET"
                     /gene_synonym="KG1T"
                     /gene_synonym="KIAA0067"
                     /codon_start=1
                     /product="SET domain, bifurcated 1"
                     /protein_id="AAH28671.1"
                     /db_xref="GeneID:9869"
                     /db_xref="HGNC:HGNC:10761"
                     /db_xref="MIM:604396"
                     /translation="MSSLPGCIGLDAATATVESEEIAELQQAVVEELGISMEELRHFI
                     DEELEKMDCVQQRKKQLAELETWVIQKESEVAHVDQLFDDASRAVTNCESLVKDFYSK
                     LGLQYRDSSSEDESSRPTEIIEIPDEDDDVLSIDSGDAGSRTPKDQKLREAMAALRKS
                     AQDVQKFMDAVNKKSSSQDLHKGTLSQMSGELSKDGDLIVSMRILGKKRTKTWHKGTL
                     IAIQTVGPGKKYKVKFDNKGKSLLSGNHIAYDYHPPADKLYVGSRVVAKYKDGNQVWL
                     YAGIVAETPNVKNKLRFLIFFDDGYASYVTQSELYPICRPLKKTWEDIEDISCRDFIE
                     EYVTAYPNRPMVLLKSGQLIKTEWEGTWWKSRVEEVDGSLVRILFLDDKRCEWIYRGS
                     TRLEPMFSMKTSSASALEKKQGQLRTRPNMGAVRSKGPVVQYTQDLTGTGTQFKPVEP
                     PQPTAPPAPPFPPAPPLSPQAGDSDLESQLAQSRKQVAKKSTSFRPGSVGSGHSSSTS
                     PALSENVSGGKPGINQTYRSPLGSTASAPAPSALPAPPAPPVFHGMLERAPAEPSYRA
                     PMEKLFYLPHVCSYTCLSRVRPMRNEQYRGKNPLLVPLLYDFRRMTARRRVNRKMGFH
                     VIYKTPCGLCLRTMQEIERYLFETGCDFLFLEMFCLDPYVLVDRKFQPYKPFYYILDI
                     TYGKEDVPLSCVNEIDTTPPPQVAYSKERIPGKGVFINTGPEFLVGCDCKDGCRDKSK
                     CACHQLTIQATACTPGGQINPNSGYQYKRLEECLPTGVYECNKRCKCDPNMCTNRLVQ
                     HGLQVRLQLFKTQNKGWGIRCLDDIAKGSFVCIYAGKILTDDFADKEGLEMGDEYFAN
                     LDHIESVENFKEGYESDAPCSSDSSGVDLKDQEDGNSGTEDPEESNDDSSDDNFCKDE
                     DFSTSSVWRSYATRRQTRGQKENGLSETTSKDSHPPDLGPPHIPVPPSIPVGGCNPPS
                     SEETPKNKVASWLSCNSVSEGGFADSDSHSSFKTNEGGEGRAGGSRMEAEKASTSGLG
                     IKDEGDIKQAKKEDTDDRNKMSVVTESSRNYGYNPSPVKPEGLRRPPSKTSMHQSRRL
                     MASAQSNPDDVLTLSSSTESEGESGTSRKPTAGQTSATAVDSDDIQTISSGSEGDDFE
                     DKKNMTGPMKRQVAVKSTRGFALKSTHGIAIKSTNMASVDKGESAPVRKNTRQFYDGE
                     ESCYIIDAKLEGNLGRYLNHSCSPNLFVQNVFVDTHDLRFPWVAFFASKIRAGTELTW
                     DYNYEVGSVEGKELLCCCGAIECRGRLL"
BASE COUNT         1200 a         1099 c         1117 g         1004 t
ORIGIN      
        1 ctcccttatc ccttcgcttt cgctcttttc cgtcgaggcc gacccctgag ttgtgagtct
       61 ggggtctggt tggtgaaaaa gagcccttga agctggaaga cgggagagga caaaagcatg
      121 tcttcccttc ctgggtgcat tggtttggat gcagcaacag ctacagtgga gtctgaagag
      181 attgcagagc tgcaacaggc agtggttgag gaactgggta tctctatgga ggaacttcgg
      241 catttcatcg atgaggaact ggagaagatg gattgtgtac agcaacgcaa gaagcagcta
      301 gcagagttag agacatgggt aatacagaaa gaatctgagg tggctcacgt tgaccaactc
      361 tttgatgatg catccagggc agtgactaat tgtgagtctt tggtgaagga cttctactcc
      421 aagctgggac tacaataccg ggacagtagc tctgaggacg aatcttcccg gcctacagaa
      481 ataattgaga ttcctgatga agatgatgat gtcctcagta ttgattcagg tgatgctggg
      541 agcagaactc caaaagacca gaagctccgt gaagctatgg ctgccttaag aaagtcagct
      601 caagatgttc agaagttcat ggatgctgtc aacaagaaga gcagttccca ggatctgcat
      661 aaaggaacct tgagtcagat gtctggagaa ctaagcaaag atggtgacct gatagtcagc
      721 atgcgaattc tgggcaagaa gagaactaag acttggcaca aaggcaccct tattgccatc
      781 cagacagttg ggccagggaa gaaatacaag gtgaaatttg acaacaaagg aaagagtcta
      841 ctgtcgggga accatattgc ctatgattac caccctcctg ctgacaagct gtatgtgggc
      901 agtcgggtgg tcgccaaata caaagatggg aatcaggtct ggctctatgc tggcattgta
      961 gctgagacac caaacgtcaa aaacaagctc aggtttctca ttttctttga tgatggctat
     1021 gcttcctatg tcacacagtc ggaactgtat cccatttgcc ggccactgaa aaagacttgg
     1081 gaggacatag aagacatctc ctgccgtgac ttcatagagg agtatgtcac tgcctacccc
     1141 aaccgcccca tggtactgct caagagtggc cagcttatca agactgagtg ggaaggcacg
     1201 tggtggaagt cccgagttga ggaggtggat ggcagcctag tcaggatcct cttcctggat
     1261 gacaaaagat gtgagtggat ctatcgaggc tctacacggc tggagcccat gttcagcatg
     1321 aaaacatcct cagcctctgc actggagaag aagcaaggac agctcaggac acgtccaaat
     1381 atgggtgctg tgaggagcaa aggccctgtt gtccagtaca cacaggatct gaccggtact
     1441 ggaacccagt tcaagccagt ggaaccccca cagcctacag ctccacctgc cccacctttc
     1501 ccacctgctc cacctctatc cccccaagca ggtgacagtg acttggaaag ccagcttgcc
     1561 cagtcacgga agcaggtagc caaaaagagc acgtcctttc gaccaggatc tgtgggctct
     1621 ggtcattcct cctctacatc tcctgcactc agtgaaaatg tctctggtgg gaaacctggg
     1681 atcaaccaga catatagatc acctttaggc tccacagcct ctgccccagc accctcagca
     1741 ctcccggccc ctccagcacc cccagtcttc catggcatgc tggagcgggc cccagcagag
     1801 ccctcctacc gtgctcccat ggagaagctt ttctacttac ctcatgtctg cagctatacc
     1861 tgcctgtctc gagtcagacc tatgaggaat gagcagtacc ggggcaagaa ccctctgctg
     1921 gtcccgttac tatatgactt ccggcggatg acagcccggc gtcgagttaa ccgcaagatg
     1981 ggctttcatg ttatctataa gacaccttgt ggtctctgcc ttcggacaat gcaggagata
     2041 gaacgctacc ttttcgagac tggctgtgac ttcctcttcc tggagatgtt ctgtttggat
     2101 ccatatgttc ttgtggaccg aaagtttcag ccctataagc ctttttacta tattttggac
     2161 atcacttatg ggaaggaaga tgttccccta tcctgtgtca atgagattga cacaacccct
     2221 ccaccccagg tggcctacag caaggaacgt atcccgggca agggtgtttt cattaacaca
     2281 ggccctgaat ttctggttgg ctgtgactgc aaggatgggt gtcgggacaa gtccaagtgt
     2341 gcctgccatc aactaactat ccaggctaca gcctgtaccc caggaggcca aatcaaccct
     2401 aactctggct accagtacaa gagactagaa gagtgtctac ccacaggggt atatgagtgt
     2461 aacaaacgct gcaaatgtga cccaaacatg tgcacaaacc ggttggtgca acatggacta
     2521 caagttcggc tacagctatt caagacacag aacaagggct ggggtatccg ctgcttggat
     2581 gacattgcca aaggctcttt tgtttgtatt tatgcaggca aaatcctgac agatgacttt
     2641 gcagacaagg agggtctgga aatgggtgat gagtactttg caaatctgga ccatatcgag
     2701 agcgtggaga acttcaaaga aggatatgag agtgatgccc cctgttcctc tgacagcagt
     2761 ggtgtagact tgaaggacca ggaagatggc aacagcggta cagaggaccc tgaagagtcc
     2821 aatgatgata gctcagatga taacttctgt aaggatgagg acttcagcac cagttcagtg
     2881 tggcggagct atgctacccg gaggcagacc cggggccaga aagagaacgg actctctgag
     2941 acaacttcca aggactccca ccccccagat cttggacccc cacatattcc tgttcctccc
     3001 tcaatccctg taggtggctg caatccacct tcctccgaag agacacccaa gaacaaggtg
     3061 gcctcatggt tgagctgcaa tagtgtcagt gaaggtggtt ttgctgactc tgatagccat
     3121 tcatccttca agactaatga aggtggggag ggccgggctg ggggaagccg aatggaggct
     3181 gagaaggcct ccacctcagg actaggcatc aaggatgagg gagacatcaa acaggccaag
     3241 aaagaggaca ctgacgaccg aaacaagatg tcagtagtta ctgaaagctc tcgaaattac
     3301 ggttacaatc cttctcctgt gaagcctgaa ggacttcgcc gcccacctag taagactagt
     3361 atgcatcaaa gccgaagact catggcttct gctcagtcca accctgatga tgtcctgaca
     3421 ctgtccagca gcacagaaag tgagggggaa agtgggacca gccgaaagcc cactgctggt
     3481 cagacttcgg ctacagcggt tgacagtgat gatatccaga ccatatcctc tggctctgaa
     3541 ggggatgact ttgaggacaa gaagaacatg actggtccaa tgaagcgtca agtggcagta
     3601 aaatcaaccc gaggctttgc tcttaaatca acccatggga ttgcaattaa atcaaccaac
     3661 atggcctctg tggacaaggg ggagagcgca cctgttcgta agaacacacg ccaattctat
     3721 gatggcgagg agtcttgcta catcattgat gccaagcttg aaggcaacct gggccgctac
     3781 ctcaaccaca gttgcagccc caacctgttt gtccagaatg tcttcgtgga tacccatgat
     3841 cttcgcttcc cctgggtggc cttctttgcc agcaaaatcc gggctgggac agaacttact
     3901 tgggactaca actacgaggt gggcagtgtg gaaggcaagg agctactctg ttgctgtggg
     3961 gccattgaat gcagaggacg tcttctttag aggacagcct tcttcccaac ccttcttgaa
     4021 ctgtcgtttc ctcaggaact gggtcttcct gattgttgaa ccctgacccg aagtctctgg
     4081 gctagctact ccccccagct cctagttgat agaaatgggg gttctggacc agatgatccc
     4141 ttccaatgtg gtgctagcag gcaggatccc ttctccacct ccaaaggccc taaagggtgg
     4201 ggagagatca ccactctaac ctcggcctga catccctccc atcccatatt tgtccaagtg
     4261 ttcctgcttc taacagactt tgttcttaga atggagcctg tgtatctact atctccagtt
     4321 tgtattattt cttgaaagtc ttttaacaat atgataaaac taaaaaaaaa aaaaaaaaaa
     4381 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
//