LOCUS       BC020956                3380 bp    mRNA    linear   HUM 06-JUN-2006
DEFINITION  Homo sapiens SET domain containing 5, mRNA (cDNA clone MGC:8816
            IMAGE:3851178), complete cds.
ACCESSION   BC020956
VERSION     BC020956.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3380)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 3380)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (03-JAN-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA  94305
            Web site:       http://www-shgc.stanford.edu
            Contact:  (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
            R. M.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 20 Row: m Column: 22
            This clone was selected for full length sequencing because it
            passed the following selection criteria: GenomeScan gene
            prediction.
FEATURES             Location/Qualifiers
     source          1..3380
                     /db_xref="H-InvDB:HIT000039044"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:8816 IMAGE:3851178"
                     /tissue_type="Colon, adenocarcinoma"
                     /clone_lib="NIH_MGC_65"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..3380
                     /gene="SETD5"
                     /gene_synonym="FLJ10707"
                     /gene_synonym="KIAA1757"
                     /db_xref="GeneID:55209"
                     /db_xref="HGNC:HGNC:25566"
     CDS             223..2997
                     /gene="SETD5"
                     /gene_synonym="FLJ10707"
                     /gene_synonym="KIAA1757"
                     /codon_start=1
                     /product="SETD5 protein"
                     /protein_id="AAH20956.1"
                     /db_xref="GeneID:55209"
                     /db_xref="HGNC:HGNC:25566"
                     /translation="MHAFENLEKRKKRRDQPLEQSNSDVEITTTTSETPVGEETKTEA
                     PESEVSNSVSNVTIPSTPQSVGVNTRRSSQAGDIAAEKLVPKPPPAKPSRPRPKSRIS
                     RYRTSSAQRLKRQKQANAQQAELSQAALEEGGSNSLVTPTEAGSLDSSGENRPLTGSD
                     PTVVSITGSHVNRAASKYPKTKKYLVTEWLNDKAEKQECPVECPLRITTDPTVLATTL
                     NMLPGLIHSPLICTTPKHYIRFGSPFIPERRRRPLLPDGTFSSCKKRWIKQALEEGMT
                     QTSSVPQETRTQHLYQSNENSSSSSICKDNADLLSPLKKWKSRYLMEQNVTKLLRPLS
                     PVTPPPPNSGSKSPQLATPGSSHPGEEECRNGYSLMFSPVTSLTTASRCNTPLQFELC
                     HRKDLDLAKVGYLDSNTNSCADRPSLLNSGHSDLAPHPSLGPTSETGFPSRSGDGHQT
                     LVRNSDQAFRTEFNLMYAYSPLNAMPRADGLYRGSPLVGDRKPLHLDGGYCSPAEGFS
                     SRYEHGLMKDLSRGSLSPGGERACEGVPSAPQNPPQRKKVSLLEYRKRKQEAKENSAG
                     GGGDSAQSKSKSAGAGQGSSNSVSDTGAHGVQGSSARTPSSPHKKFSPSHSSMSHLEA
                     VSPSDSRGTSSSHCRPQENISSRWMVPTSVERLREGGSIPKVLRSSVRVAQKGEPSPT
                     WESNITEKDSDPADGEGPETLSSALSKGATVYSPSRYSYQLLQCDSPRTESQSLLQQS
                     SSPFRGHPTQSPGYSYRTTALRPGNPPSHGSSESSLSSTSYSSPAHPVSTDSLAPFTG
                     TPGYFSSQPHSGNSTGSNLPRRSCPSSAASPTLQGPSDSPTSDSVSQSSTGTLSSTSF
                     PQNSRSSLPSDLRTISLPSAGQSAVYQASRVSAVSNSQHYPHRGSGGVHQYRLQPLQG
                     SGVKTQTGLS"
BASE COUNT          961 a          900 c          790 g          729 t
ORIGIN      
        1 cgacggaaag agctagagat ggagcagcag aatgaggctt cagaggagaa taatgaccag
       61 caatcacaag aagttccaga aaaagtaact gtatccagtg atcatgagga agtagacaat
      121 ccagaagaaa aaccagaaga agagaaagaa gaggttatag atgaccagga gaacctagct
      181 catagcagga ggaccaggga agatagaaag gtagaagcca tcatgcatgc ttttgaaaac
      241 ttagagaaaa gaaagaagcg gcgggatcag cccttggaac agagcaactc tgatgtagag
      301 attactacaa ccacctcaga gactcctgtt ggtgaagaga caaaaactga agcccctgaa
      361 tctgaagtta gcaactctgt ttcaaatgtt accatcccaa gcaccccaca gagtgttggt
      421 gtgaataccc ggaggtcttc ccaagcaggg gatattgctg cagaaaaact agtccccaag
      481 ccacctccag caaagccttc taggccccgg ccgaagagtc gaatttctcg gtacaggacc
      541 agttcagccc aaagactaaa gcgtcagaag caggccaatg cacagcaggc agaattgtca
      601 caagctgcct tggaagaggg aggaagtaac agtttagtaa ctcctactga agctggaagt
      661 ctagacagtt caggagaaaa caggccatta acagggtctg acccaactgt ggtgtcaatt
      721 actggatccc atgtcaaccg tgctgcatct aaatacccca aaaccaaaaa gtatctagtt
      781 acagaatggt tgaatgacaa agcagagaag caagagtgcc ctgttgagtg ccctttacgt
      841 atcacaacgg atccaactgt actggcaacg accctaaaca tgttaccagg tcttatccat
      901 tccccgttaa tttgcaccac ccccaaacac tacattcgct ttggctcacc ctttatccct
      961 gagagacgtc gaaggcccct tctgcctgat ggcacattca gctcctgtaa gaagcgctgg
     1021 ataaaacaag ccttagaaga agggatgact caaacatcat ctgtacccca agagactaga
     1081 actcagcacc tataccaaag caatgagaat agtagctctt ctagtatctg caaagacaat
     1141 gcagacttgt tgagcccatt aaagaaatgg aagtctcgct atctgatgga gcagaatgtc
     1201 accaagttac ttcggcctct gtctccagtc acaccacccc ctcccaattc aggctcaaag
     1261 agtccccagc tggccacacc tggctcatct cacccaggag aagaggagtg tcgaaatgga
     1321 tacagcctca tgttttcacc agtcacatct cttactactg ctagtcgctg caacactcct
     1381 ctacagtttg agctttgtca ccgaaaagac ctggatttgg caaaagtagg ataccttgac
     1441 tccaacacta acagctgtgc tgatagacct tccctactca actcaggtca ttctgacctg
     1501 gctcctcatc cctccctcgg acccacttct gagactggtt tcccaagcag aagtggagat
     1561 ggacatcaga ccctcgtgag aaactcagac caggcatttc ggacagagtt caacttgatg
     1621 tatgcctact cccctttgaa tgctatgcct cgagcagatg gactgtatcg aggatctcct
     1681 ctagtggggg ataggaagcc tttacatttg gatgggggat attgttcccc tgcagaagga
     1741 ttttccagca gatatgaaca tggcttaatg aaagacctct ctcgtggatc cttgtcacct
     1801 ggtggtgaaa gggcctgtga aggagtccca tctgcccccc agaacccacc acagaggaaa
     1861 aaagtatccc tgctggagta ccgaaaacgg aaacaagaag ctaaggaaaa ttctgctggt
     1921 gggggaggtg actctgcaca gagcaaaagc aagtctgcag gagctgggca aggcagcagt
     1981 aactccgttt ccgacactgg tgcccatggt gtgcagggat cctcagcccg aactccatct
     2041 tcccctcaca aaaaattctc cccatctcat tcctctatgt cccatttgga ggcggtaagc
     2101 ccatcagatt ccagaggcac ttcttcatct cactgcagac ctcaagagaa tatcagcagt
     2161 aggtggatgg ttcccacatc agtagaacga ctccgagaag gagggagcat ccccaaggtc
     2221 ctccgaagca gcgtgagggt ggcccaaaag ggagagccct ctcccacatg ggagagtaac
     2281 atcacagaga aagactcaga ccctgcagat ggagaaggcc cagagacatt aagctcagca
     2341 ctctctaaag gagcaacagt ttacagccct tccagataca gctaccagct cctgcagtgt
     2401 gatagtcctc ggacagaatc acaaagcctc cttcagcaga gttcctcccc cttcagagga
     2461 catcctacac agtctccagg atacagttat cgaactactg cactgagacc tggaaacccc
     2521 ccctctcacg gttcttcaga atcatccctc tcttccacgt cctattccag ccccgcccac
     2581 cctgtgtcca cagactcgtt ggccccattt acggggacac cagggtattt tagcagccag
     2641 ccacattctg gaaacagcac tggcagcaat cttccaagga ggagctgccc ttctagtgct
     2701 gctagcccta ccctgcaggg accctcagac tcgccaacct cagattcagt ttctcagtcc
     2761 agcacaggaa ctctgagttc cacctccttt cctcagaact ctaggtcgtc attgccatca
     2821 gacttacgga ctatcagtct gcccagtgct gggcagtcag ctgtctacca ggcctccagg
     2881 gtatctgcgg tttccaattc acagcactac ccacaccgtg ggagtggggg tgtgcaccag
     2941 taccgactcc agccactgca agggtcagga gtcaagactc agacgggact ttcctagggc
     3001 ttctggattt gggcaaacag aactgaatga gcccatagct gcttccttcc agctgcctct
     3061 ggaacctagg ccgagcatat tgctgaggaa cggggggtac aaggtgccag aggattgggt
     3121 ctggtggaca agaaacaaga cttgtggtca caattggcct ctggccttgg agaaagctgt
     3181 aaatcttgtc tgaagcagag actataaaga agtttctccc tgctgtcaag ggtacattgt
     3241 tgacaagcaa atggtgtttc ggttagtaac ggttctaagt gcaatgagtt gtgttgaagc
     3301 ctccgtctcc catccttgcc tgtagcccgt agtcacttgt gcagtgagga catcttttta
     3361 aatttaaaaa aaaaaaaaaa
//