LOCUS BC028671 4420 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens SET domain, bifurcated 1, mRNA (cDNA clone MGC:16563
IMAGE:4098658), complete cds.
ACCESSION BC028671
VERSION BC028671.2
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 4420)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 4420)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (29-APR-2002) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Aug 20, 2003 this sequence version replaced BC028671.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Institute for Systems Biology
http://www.systemsbiology.org
contact: amadan@systemsbiology.org
Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 25 Row: p Column: 12
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 41281392.
FEATURES Location/Qualifiers
source 1..4420
/db_xref="H-InvDB:HIT000040598"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:16563 IMAGE:4098658"
/tissue_type="Muscle, rhabdomyosarcoma"
/clone_lib="NIH_MGC_17"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..4420
/gene="SETDB1"
/gene_synonym="ESET"
/gene_synonym="KG1T"
/gene_synonym="KIAA0067"
/db_xref="GeneID:9869"
/db_xref="HGNC:HGNC:10761"
/db_xref="MIM:604396"
CDS 118..3990
/gene="SETDB1"
/gene_synonym="ESET"
/gene_synonym="KG1T"
/gene_synonym="KIAA0067"
/codon_start=1
/product="SET domain, bifurcated 1"
/protein_id="AAH28671.1"
/db_xref="GeneID:9869"
/db_xref="HGNC:HGNC:10761"
/db_xref="MIM:604396"
/translation="MSSLPGCIGLDAATATVESEEIAELQQAVVEELGISMEELRHFI
DEELEKMDCVQQRKKQLAELETWVIQKESEVAHVDQLFDDASRAVTNCESLVKDFYSK
LGLQYRDSSSEDESSRPTEIIEIPDEDDDVLSIDSGDAGSRTPKDQKLREAMAALRKS
AQDVQKFMDAVNKKSSSQDLHKGTLSQMSGELSKDGDLIVSMRILGKKRTKTWHKGTL
IAIQTVGPGKKYKVKFDNKGKSLLSGNHIAYDYHPPADKLYVGSRVVAKYKDGNQVWL
YAGIVAETPNVKNKLRFLIFFDDGYASYVTQSELYPICRPLKKTWEDIEDISCRDFIE
EYVTAYPNRPMVLLKSGQLIKTEWEGTWWKSRVEEVDGSLVRILFLDDKRCEWIYRGS
TRLEPMFSMKTSSASALEKKQGQLRTRPNMGAVRSKGPVVQYTQDLTGTGTQFKPVEP
PQPTAPPAPPFPPAPPLSPQAGDSDLESQLAQSRKQVAKKSTSFRPGSVGSGHSSSTS
PALSENVSGGKPGINQTYRSPLGSTASAPAPSALPAPPAPPVFHGMLERAPAEPSYRA
PMEKLFYLPHVCSYTCLSRVRPMRNEQYRGKNPLLVPLLYDFRRMTARRRVNRKMGFH
VIYKTPCGLCLRTMQEIERYLFETGCDFLFLEMFCLDPYVLVDRKFQPYKPFYYILDI
TYGKEDVPLSCVNEIDTTPPPQVAYSKERIPGKGVFINTGPEFLVGCDCKDGCRDKSK
CACHQLTIQATACTPGGQINPNSGYQYKRLEECLPTGVYECNKRCKCDPNMCTNRLVQ
HGLQVRLQLFKTQNKGWGIRCLDDIAKGSFVCIYAGKILTDDFADKEGLEMGDEYFAN
LDHIESVENFKEGYESDAPCSSDSSGVDLKDQEDGNSGTEDPEESNDDSSDDNFCKDE
DFSTSSVWRSYATRRQTRGQKENGLSETTSKDSHPPDLGPPHIPVPPSIPVGGCNPPS
SEETPKNKVASWLSCNSVSEGGFADSDSHSSFKTNEGGEGRAGGSRMEAEKASTSGLG
IKDEGDIKQAKKEDTDDRNKMSVVTESSRNYGYNPSPVKPEGLRRPPSKTSMHQSRRL
MASAQSNPDDVLTLSSSTESEGESGTSRKPTAGQTSATAVDSDDIQTISSGSEGDDFE
DKKNMTGPMKRQVAVKSTRGFALKSTHGIAIKSTNMASVDKGESAPVRKNTRQFYDGE
ESCYIIDAKLEGNLGRYLNHSCSPNLFVQNVFVDTHDLRFPWVAFFASKIRAGTELTW
DYNYEVGSVEGKELLCCCGAIECRGRLL"
BASE COUNT 1200 a 1099 c 1117 g 1004 t
ORIGIN
1 ctcccttatc ccttcgcttt cgctcttttc cgtcgaggcc gacccctgag ttgtgagtct
61 ggggtctggt tggtgaaaaa gagcccttga agctggaaga cgggagagga caaaagcatg
121 tcttcccttc ctgggtgcat tggtttggat gcagcaacag ctacagtgga gtctgaagag
181 attgcagagc tgcaacaggc agtggttgag gaactgggta tctctatgga ggaacttcgg
241 catttcatcg atgaggaact ggagaagatg gattgtgtac agcaacgcaa gaagcagcta
301 gcagagttag agacatgggt aatacagaaa gaatctgagg tggctcacgt tgaccaactc
361 tttgatgatg catccagggc agtgactaat tgtgagtctt tggtgaagga cttctactcc
421 aagctgggac tacaataccg ggacagtagc tctgaggacg aatcttcccg gcctacagaa
481 ataattgaga ttcctgatga agatgatgat gtcctcagta ttgattcagg tgatgctggg
541 agcagaactc caaaagacca gaagctccgt gaagctatgg ctgccttaag aaagtcagct
601 caagatgttc agaagttcat ggatgctgtc aacaagaaga gcagttccca ggatctgcat
661 aaaggaacct tgagtcagat gtctggagaa ctaagcaaag atggtgacct gatagtcagc
721 atgcgaattc tgggcaagaa gagaactaag acttggcaca aaggcaccct tattgccatc
781 cagacagttg ggccagggaa gaaatacaag gtgaaatttg acaacaaagg aaagagtcta
841 ctgtcgggga accatattgc ctatgattac caccctcctg ctgacaagct gtatgtgggc
901 agtcgggtgg tcgccaaata caaagatggg aatcaggtct ggctctatgc tggcattgta
961 gctgagacac caaacgtcaa aaacaagctc aggtttctca ttttctttga tgatggctat
1021 gcttcctatg tcacacagtc ggaactgtat cccatttgcc ggccactgaa aaagacttgg
1081 gaggacatag aagacatctc ctgccgtgac ttcatagagg agtatgtcac tgcctacccc
1141 aaccgcccca tggtactgct caagagtggc cagcttatca agactgagtg ggaaggcacg
1201 tggtggaagt cccgagttga ggaggtggat ggcagcctag tcaggatcct cttcctggat
1261 gacaaaagat gtgagtggat ctatcgaggc tctacacggc tggagcccat gttcagcatg
1321 aaaacatcct cagcctctgc actggagaag aagcaaggac agctcaggac acgtccaaat
1381 atgggtgctg tgaggagcaa aggccctgtt gtccagtaca cacaggatct gaccggtact
1441 ggaacccagt tcaagccagt ggaaccccca cagcctacag ctccacctgc cccacctttc
1501 ccacctgctc cacctctatc cccccaagca ggtgacagtg acttggaaag ccagcttgcc
1561 cagtcacgga agcaggtagc caaaaagagc acgtcctttc gaccaggatc tgtgggctct
1621 ggtcattcct cctctacatc tcctgcactc agtgaaaatg tctctggtgg gaaacctggg
1681 atcaaccaga catatagatc acctttaggc tccacagcct ctgccccagc accctcagca
1741 ctcccggccc ctccagcacc cccagtcttc catggcatgc tggagcgggc cccagcagag
1801 ccctcctacc gtgctcccat ggagaagctt ttctacttac ctcatgtctg cagctatacc
1861 tgcctgtctc gagtcagacc tatgaggaat gagcagtacc ggggcaagaa ccctctgctg
1921 gtcccgttac tatatgactt ccggcggatg acagcccggc gtcgagttaa ccgcaagatg
1981 ggctttcatg ttatctataa gacaccttgt ggtctctgcc ttcggacaat gcaggagata
2041 gaacgctacc ttttcgagac tggctgtgac ttcctcttcc tggagatgtt ctgtttggat
2101 ccatatgttc ttgtggaccg aaagtttcag ccctataagc ctttttacta tattttggac
2161 atcacttatg ggaaggaaga tgttccccta tcctgtgtca atgagattga cacaacccct
2221 ccaccccagg tggcctacag caaggaacgt atcccgggca agggtgtttt cattaacaca
2281 ggccctgaat ttctggttgg ctgtgactgc aaggatgggt gtcgggacaa gtccaagtgt
2341 gcctgccatc aactaactat ccaggctaca gcctgtaccc caggaggcca aatcaaccct
2401 aactctggct accagtacaa gagactagaa gagtgtctac ccacaggggt atatgagtgt
2461 aacaaacgct gcaaatgtga cccaaacatg tgcacaaacc ggttggtgca acatggacta
2521 caagttcggc tacagctatt caagacacag aacaagggct ggggtatccg ctgcttggat
2581 gacattgcca aaggctcttt tgtttgtatt tatgcaggca aaatcctgac agatgacttt
2641 gcagacaagg agggtctgga aatgggtgat gagtactttg caaatctgga ccatatcgag
2701 agcgtggaga acttcaaaga aggatatgag agtgatgccc cctgttcctc tgacagcagt
2761 ggtgtagact tgaaggacca ggaagatggc aacagcggta cagaggaccc tgaagagtcc
2821 aatgatgata gctcagatga taacttctgt aaggatgagg acttcagcac cagttcagtg
2881 tggcggagct atgctacccg gaggcagacc cggggccaga aagagaacgg actctctgag
2941 acaacttcca aggactccca ccccccagat cttggacccc cacatattcc tgttcctccc
3001 tcaatccctg taggtggctg caatccacct tcctccgaag agacacccaa gaacaaggtg
3061 gcctcatggt tgagctgcaa tagtgtcagt gaaggtggtt ttgctgactc tgatagccat
3121 tcatccttca agactaatga aggtggggag ggccgggctg ggggaagccg aatggaggct
3181 gagaaggcct ccacctcagg actaggcatc aaggatgagg gagacatcaa acaggccaag
3241 aaagaggaca ctgacgaccg aaacaagatg tcagtagtta ctgaaagctc tcgaaattac
3301 ggttacaatc cttctcctgt gaagcctgaa ggacttcgcc gcccacctag taagactagt
3361 atgcatcaaa gccgaagact catggcttct gctcagtcca accctgatga tgtcctgaca
3421 ctgtccagca gcacagaaag tgagggggaa agtgggacca gccgaaagcc cactgctggt
3481 cagacttcgg ctacagcggt tgacagtgat gatatccaga ccatatcctc tggctctgaa
3541 ggggatgact ttgaggacaa gaagaacatg actggtccaa tgaagcgtca agtggcagta
3601 aaatcaaccc gaggctttgc tcttaaatca acccatggga ttgcaattaa atcaaccaac
3661 atggcctctg tggacaaggg ggagagcgca cctgttcgta agaacacacg ccaattctat
3721 gatggcgagg agtcttgcta catcattgat gccaagcttg aaggcaacct gggccgctac
3781 ctcaaccaca gttgcagccc caacctgttt gtccagaatg tcttcgtgga tacccatgat
3841 cttcgcttcc cctgggtggc cttctttgcc agcaaaatcc gggctgggac agaacttact
3901 tgggactaca actacgaggt gggcagtgtg gaaggcaagg agctactctg ttgctgtggg
3961 gccattgaat gcagaggacg tcttctttag aggacagcct tcttcccaac ccttcttgaa
4021 ctgtcgtttc ctcaggaact gggtcttcct gattgttgaa ccctgacccg aagtctctgg
4081 gctagctact ccccccagct cctagttgat agaaatgggg gttctggacc agatgatccc
4141 ttccaatgtg gtgctagcag gcaggatccc ttctccacct ccaaaggccc taaagggtgg
4201 ggagagatca ccactctaac ctcggcctga catccctccc atcccatatt tgtccaagtg
4261 ttcctgcttc taacagactt tgttcttaga atggagcctg tgtatctact atctccagtt
4321 tgtattattt cttgaaagtc ttttaacaat atgataaaac taaaaaaaaa aaaaaaaaaa
4381 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
//