LOCUS       AAH28671.1              1290 aa    PRT              HUM 15-JUL-2006
DEFINITION  Homo sapiens SET domain, bifurcated 1 protein.
ACCESSION   BC028671-1
PROTEIN_ID  AAH28671.1
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4420)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 4420)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (29-APR-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Aug 20, 2003 this sequence version replaced BC028671.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
            Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 25 Row: p Column: 12
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 41281392.
FEATURES             Qualifiers
     source          /db_xref="H-InvDB:HIT000040598"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:16563 IMAGE:4098658"
                     /tissue_type="Muscle, rhabdomyosarcoma"
                     /clone_lib="NIH_MGC_17"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     protein         /gene="SETDB1"
                     /gene_synonym="ESET"
                     /gene_synonym="KG1T"
                     /gene_synonym="KIAA0067"
                     /db_xref="GeneID:9869"
                     /db_xref="HGNC:HGNC:10761"
                     /db_xref="MIM:604396"
BEGIN
        1 MSSLPGCIGL DAATATVESE EIAELQQAVV EELGISMEEL RHFIDEELEK MDCVQQRKKQ
       61 LAELETWVIQ KESEVAHVDQ LFDDASRAVT NCESLVKDFY SKLGLQYRDS SSEDESSRPT
      121 EIIEIPDEDD DVLSIDSGDA GSRTPKDQKL REAMAALRKS AQDVQKFMDA VNKKSSSQDL
      181 HKGTLSQMSG ELSKDGDLIV SMRILGKKRT KTWHKGTLIA IQTVGPGKKY KVKFDNKGKS
      241 LLSGNHIAYD YHPPADKLYV GSRVVAKYKD GNQVWLYAGI VAETPNVKNK LRFLIFFDDG
      301 YASYVTQSEL YPICRPLKKT WEDIEDISCR DFIEEYVTAY PNRPMVLLKS GQLIKTEWEG
      361 TWWKSRVEEV DGSLVRILFL DDKRCEWIYR GSTRLEPMFS MKTSSASALE KKQGQLRTRP
      421 NMGAVRSKGP VVQYTQDLTG TGTQFKPVEP PQPTAPPAPP FPPAPPLSPQ AGDSDLESQL
      481 AQSRKQVAKK STSFRPGSVG SGHSSSTSPA LSENVSGGKP GINQTYRSPL GSTASAPAPS
      541 ALPAPPAPPV FHGMLERAPA EPSYRAPMEK LFYLPHVCSY TCLSRVRPMR NEQYRGKNPL
      601 LVPLLYDFRR MTARRRVNRK MGFHVIYKTP CGLCLRTMQE IERYLFETGC DFLFLEMFCL
      661 DPYVLVDRKF QPYKPFYYIL DITYGKEDVP LSCVNEIDTT PPPQVAYSKE RIPGKGVFIN
      721 TGPEFLVGCD CKDGCRDKSK CACHQLTIQA TACTPGGQIN PNSGYQYKRL EECLPTGVYE
      781 CNKRCKCDPN MCTNRLVQHG LQVRLQLFKT QNKGWGIRCL DDIAKGSFVC IYAGKILTDD
      841 FADKEGLEMG DEYFANLDHI ESVENFKEGY ESDAPCSSDS SGVDLKDQED GNSGTEDPEE
      901 SNDDSSDDNF CKDEDFSTSS VWRSYATRRQ TRGQKENGLS ETTSKDSHPP DLGPPHIPVP
      961 PSIPVGGCNP PSSEETPKNK VASWLSCNSV SEGGFADSDS HSSFKTNEGG EGRAGGSRME
     1021 AEKASTSGLG IKDEGDIKQA KKEDTDDRNK MSVVTESSRN YGYNPSPVKP EGLRRPPSKT
     1081 SMHQSRRLMA SAQSNPDDVL TLSSSTESEG ESGTSRKPTA GQTSATAVDS DDIQTISSGS
     1141 EGDDFEDKKN MTGPMKRQVA VKSTRGFALK STHGIAIKST NMASVDKGES APVRKNTRQF
     1201 YDGEESCYII DAKLEGNLGR YLNHSCSPNL FVQNVFVDTH DLRFPWVAFF ASKIRAGTEL
     1261 TWDYNYEVGS VEGKELLCCC GAIECRGRLL
//