LOCUS       BC017078                1202 bp    mRNA    linear   HUM 30-SEP-2003
DEFINITION  Homo sapiens SET domain, bifurcated 2, mRNA (cDNA clone
            IMAGE:3847827), partial cds.
ACCESSION   BC017078
VERSION     BC017078.1
KEYWORDS    .
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1202)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E.,
            Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C.,
            Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J.,
            Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H.,
            Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A.,
            Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G.,
            Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C.,
            Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S.,
            Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E.,
            Jones,S.J. and Marra,M.A.
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 1202)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (05-NOV-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA 94305
            Web site: http://www-shgc.stanford.edu
            Contact: (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
            R. M.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 20 Row: j Column: 21.
FEATURES             Location/Qualifiers
     source          1..1202
                     /db_xref="H-InvDB:HIT000089180"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:3847827"
                     /tissue_type="Colon, adenocarcinoma"
                     /clone_lib="NIH_MGC_65"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            <1..1202
                     /gene="SETDB2"
                     /gene_synonym="C13orf4"
                     /gene_synonym="CLLD8"
                     /gene_synonym="CLLL8"
                     /db_xref="GeneID:83852"
                     /db_xref="MIM:607865"
     CDS             <1..868
                     /gene="SETDB2"
                     /gene_synonym="C13orf4"
                     /gene_synonym="CLLD8"
                     /gene_synonym="CLLL8"
                     /codon_start=2
                     /product="SETDB2 protein"
                     /protein_id="AAH17078.1"
                     /db_xref="GeneID:83852"
                     /db_xref="MIM:607865"
                     /translation="LEVACSDCEVEVLPLGLETHPRTAKTEKCPPKFSNNPKELTVET
                     KYDNISRIQYHSVIRDPESKTAIFQHNGKKMEFVSSESVTPEDNDGFKPPREHLNSKT
                     KGAQKDSSSNHVDEFEDNLLIESDVIDITKYREETPPRSRCNQATTLDNQNIKKAIEV
                     QIQKPQEGRSTACQRQQVFCDEELLSETKNTSSDSLTKFNKGNVFLLDATKEGNVGRF
                     LNHSCCPNLLVQNVFVETHNRNFPLVAFFTNRYVKARTELTWDYGYEAGTVPEKEIFC
                     QCGVNKCRKKIL"
     misc_feature    596..796
                     /gene="SETDB2"
                     /gene_synonym="C13orf4"
                     /gene_synonym="CLLD8"
                     /gene_synonym="CLLL8"
                     /note="SET; Region: SET
                     domain. SET domains appear to be protein-protein
                     interaction domains. It has been demonstrated that SET
                     domains mediate interactions with a family of proteins
                     that display similarity with dual-specificity phosphatases
                     (dsPTPases). A subset of SET domains have been called PR
                     domains. These domains are divergent in sequence from
                     other SET domains, but also appear to mediate
                     protein-protein interaction"
                     /db_xref="CDD:pfam00856"
BASE COUNT      435 a    194 c    234 g    339 t
ORIGIN
        1 attagaagtt gcatgttcag attgtgaagt tgaagttctc ccattaggat tggaaacaca
       61 tcctagaact gctaaaactg agaaatgtcc accaaagttc agtaataatc ccaaggagct
      121 tactgtggaa acgaaatatg ataatatttc aagaattcaa tatcattcag ttattagaga
      181 tcctgaatcc aagacagcca tttttcaaca caatgggaaa aaaatggaat ttgtttcctc
      241 ggagtctgtc actccagaag ataatgatgg atttaaacca ccccgagagc atctgaactc
      301 taaaaccaag ggagcacaaa aggactcaag ttcaaaccat gttgatgagt ttgaagataa
      361 tctgctgatt gaatcagatg tgatagatat aactaaatat agagaagaaa ctccaccaag
      421 gagcagatgt aaccaggcga ccacattgga taatcagaat attaaaaagg caattgaggt
      481 tcaaattcag aaaccccaag agggacgatc tacagcatgt caaagacagc aggtattttg
      541 tgatgaagag ttgctaagtg aaaccaagaa tacttcatct gattctctaa caaagttcaa
      601 taaagggaat gtgtttttat tggatgccac aaaagaagga aatgtcggcc gcttccttaa
      661 tcatagttgt tgcccaaatc tcttggtaca gaatgttttt gtagaaacac acaacaggaa
      721 ttttccattg gtggcattct tcaccaacag gtatgtgaaa gcaagaacag agctaacatg
      781 ggattatggc tatgaagctg ggactgtgcc tgagaaggaa atcttctgcc aatgtggggt
      841 taataaatgt agaaaaaaaa tattataaat atgtaactaa cgcctgtttg tgaaattagc
      901 ttatcaggct gaaattaaag ccatgcaaaa gaaggtctag gtccatcaag gaaattcccc
      961 tccgttttcc tttgtcatgg ggtttatgtt ttatttcaga ttttatttgt gtgacttaga
     1021 aattccagga acacaattag gatattttca tacacatagg gtatcttgtt cactgctgtg
     1081 ctactttaca tgagtaggat ggaagtgtat attttatatg aaataccact gtacaattta
     1141 taatttattt acaaattata tattaagaga aacaaatgtc ataacagaaa aaaaaaaaaa
     1201 aa
//