LOCUS       AAH28671.1 1290 aa PRT HUM 15-JUL-2006
DEFINITION  Homo sapiens SET domain, bifurcated 1 protein.
ACCESSION   BC028671-1
PROTEIN_ID  AAH28671.1
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1 (bases 1 to 4420)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
  PUBMED    12477932
REFERENCE   2 (bases 1 to 4420)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (29-APR-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Aug 20, 2003 this sequence version replaced BC028671.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
            Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            http://image.llnl.gov
            Series: IRAL Plate: 25 Row: p Column: 12
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi:
            41281392.
FEATURES             Qualifiers
     source          /db_xref="H-InvDB:HIT000040598"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:16563 IMAGE:4098658"
                     /tissue_type="Muscle, rhabdomyosarcoma"
                     /clone_lib="NIH_MGC_17"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     protein         /gene="SETDB1"
                     /gene_synonym="ESET"
                     /gene_synonym="KG1T"
                     /gene_synonym="KIAA0067"
                     /db_xref="GeneID:9869"
                     /db_xref="HGNC:HGNC:10761"
                     /db_xref="MIM:604396"
BEGIN
        1 MSSLPGCIGL DAATATVESE EIAELQQAVV EELGISMEEL RHFIDEELEK MDCVQQRKKQ
       61 LAELETWVIQ KESEVAHVDQ LFDDASRAVT NCESLVKDFY SKLGLQYRDS SSEDESSRPT
      121 EIIEIPDEDD DVLSIDSGDA GSRTPKDQKL REAMAALRKS AQDVQKFMDA VNKKSSSQDL
      181 HKGTLSQMSG ELSKDGDLIV SMRILGKKRT KTWHKGTLIA IQTVGPGKKY KVKFDNKGKS
      241 LLSGNHIAYD YHPPADKLYV GSRVVAKYKD GNQVWLYAGI VAETPNVKNK LRFLIFFDDG
      301 YASYVTQSEL YPICRPLKKT WEDIEDISCR DFIEEYVTAY PNRPMVLLKS GQLIKTEWEG
      361 TWWKSRVEEV DGSLVRILFL DDKRCEWIYR GSTRLEPMFS MKTSSASALE KKQGQLRTRP
      421 NMGAVRSKGP VVQYTQDLTG TGTQFKPVEP PQPTAPPAPP FPPAPPLSPQ AGDSDLESQL
      481 AQSRKQVAKK STSFRPGSVG SGHSSSTSPA LSENVSGGKP GINQTYRSPL GSTASAPAPS
      541 ALPAPPAPPV FHGMLERAPA EPSYRAPMEK LFYLPHVCSY TCLSRVRPMR NEQYRGKNPL
      601 LVPLLYDFRR MTARRRVNRK MGFHVIYKTP CGLCLRTMQE IERYLFETGC DFLFLEMFCL
      661 DPYVLVDRKF QPYKPFYYIL DITYGKEDVP LSCVNEIDTT PPPQVAYSKE RIPGKGVFIN
      721 TGPEFLVGCD CKDGCRDKSK CACHQLTIQA TACTPGGQIN PNSGYQYKRL EECLPTGVYE
      781 CNKRCKCDPN MCTNRLVQHG LQVRLQLFKT QNKGWGIRCL DDIAKGSFVC IYAGKILTDD
      841 FADKEGLEMG DEYFANLDHI ESVENFKEGY ESDAPCSSDS SGVDLKDQED GNSGTEDPEE
      901 SNDDSSDDNF CKDEDFSTSS VWRSYATRRQ TRGQKENGLS ETTSKDSHPP DLGPPHIPVP
      961 PSIPVGGCNP PSSEETPKNK VASWLSCNSV SEGGFADSDS HSSFKTNEGG EGRAGGSRME
     1021 AEKASTSGLG IKDEGDIKQA KKEDTDDRNK MSVVTESSRN YGYNPSPVKP EGLRRPPSKT
     1081 SMHQSRRLMA SAQSNPDDVL TLSSSTESEG ESGTSRKPTA GQTSATAVDS DDIQTISSGS
     1141 EGDDFEDKKN MTGPMKRQVA VKSTRGFALK STHGIAIKST NMASVDKGES APVRKNTRQF
     1201 YDGEESCYII DAKLEGNLGR YLNHSCSPNL FVQNVFVDTH DLRFPWVAFF ASKIRAGTEL
     1261 TWDYNYEVGS VEGKELLCCC GAIECRGRLL
//