LOCUS       BC020956                3380 bp    mRNA    linear   HUM 06-JUN-2006
DEFINITION  Homo sapiens SET domain containing 5, mRNA (cDNA clone MGC:8816
            IMAGE:3851178), complete cds.
ACCESSION   BC020956
VERSION     BC020956.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3380)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 3380)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (03-JAN-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA 94305
            Web site: http://www-shgc.stanford.edu
            Contact: (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
            R. M.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 20 Row: m Column: 22
            This clone was selected for full length sequencing because it
            passed the following selection criteria: GenomeScan gene
            prediction.
FEATURES             Location/Qualifiers
     source          1..3380
                     /db_xref="H-InvDB:HIT000039044"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:8816 IMAGE:3851178"
                     /tissue_type="Colon, adenocarcinoma"
                     /clone_lib="NIH_MGC_65"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..3380
                     /gene="SETD5"
                     /gene_synonym="FLJ10707"
                     /gene_synonym="KIAA1757"
                     /db_xref="GeneID:55209"
                     /db_xref="HGNC:HGNC:25566"
     CDS             223..2997
                     /gene="SETD5"
                     /gene_synonym="FLJ10707"
                     /gene_synonym="KIAA1757"
                     /codon_start=1
                     /product="SETD5 protein"
                     /protein_id="AAH20956.1"
                     /db_xref="GeneID:55209"
                     /db_xref="HGNC:HGNC:25566"
                     /translation="MHAFENLEKRKKRRDQPLEQSNSDVEITTTTSETPVGEETKTEA
                     PESEVSNSVSNVTIPSTPQSVGVNTRRSSQAGDIAAEKLVPKPPPAKPSRPRPKSRIS
                     RYRTSSAQRLKRQKQANAQQAELSQAALEEGGSNSLVTPTEAGSLDSSGENRPLTGSD
                     PTVVSITGSHVNRAASKYPKTKKYLVTEWLNDKAEKQECPVECPLRITTDPTVLATTL
                     NMLPGLIHSPLICTTPKHYIRFGSPFIPERRRRPLLPDGTFSSCKKRWIKQALEEGMT
                     QTSSVPQETRTQHLYQSNENSSSSSICKDNADLLSPLKKWKSRYLMEQNVTKLLRPLS
                     PVTPPPPNSGSKSPQLATPGSSHPGEEECRNGYSLMFSPVTSLTTASRCNTPLQFELC
                     HRKDLDLAKVGYLDSNTNSCADRPSLLNSGHSDLAPHPSLGPTSETGFPSRSGDGHQT
                     LVRNSDQAFRTEFNLMYAYSPLNAMPRADGLYRGSPLVGDRKPLHLDGGYCSPAEGFS
                     SRYEHGLMKDLSRGSLSPGGERACEGVPSAPQNPPQRKKVSLLEYRKRKQEAKENSAG
                     GGGDSAQSKSKSAGAGQGSSNSVSDTGAHGVQGSSARTPSSPHKKFSPSHSSMSHLEA
                     VSPSDSRGTSSSHCRPQENISSRWMVPTSVERLREGGSIPKVLRSSVRVAQKGEPSPT
                     WESNITEKDSDPADGEGPETLSSALSKGATVYSPSRYSYQLLQCDSPRTESQSLLQQS
                     SSPFRGHPTQSPGYSYRTTALRPGNPPSHGSSESSLSSTSYSSPAHPVSTDSLAPFTG
                     TPGYFSSQPHSGNSTGSNLPRRSCPSSAASPTLQGPSDSPTSDSVSQSSTGTLSSTSF
                     PQNSRSSLPSDLRTISLPSAGQSAVYQASRVSAVSNSQHYPHRGSGGVHQYRLQPLQG
                     SGVKTQTGLS"
BASE COUNT      961 a    900 c    790 g    729 t
ORIGIN
        1 cgacggaaag agctagagat ggagcagcag aatgaggctt cagaggagaa taatgaccag
       61 caatcacaag aagttccaga aaaagtaact gtatccagtg atcatgagga agtagacaat
      121 ccagaagaaa aaccagaaga agagaaagaa gaggttatag atgaccagga gaacctagct
      181 catagcagga ggaccaggga agatagaaag gtagaagcca tcatgcatgc ttttgaaaac
      241 ttagagaaaa gaaagaagcg gcgggatcag cccttggaac agagcaactc tgatgtagag
      301 attactacaa ccacctcaga gactcctgtt ggtgaagaga caaaaactga agcccctgaa
      361 tctgaagtta gcaactctgt ttcaaatgtt accatcccaa gcaccccaca gagtgttggt
      421 gtgaataccc ggaggtcttc ccaagcaggg gatattgctg cagaaaaact agtccccaag
      481 ccacctccag caaagccttc taggccccgg ccgaagagtc gaatttctcg gtacaggacc
      541 agttcagccc aaagactaaa gcgtcagaag caggccaatg cacagcaggc agaattgtca
      601 caagctgcct tggaagaggg aggaagtaac agtttagtaa ctcctactga agctggaagt
      661 ctagacagtt caggagaaaa caggccatta acagggtctg acccaactgt ggtgtcaatt
      721 actggatccc atgtcaaccg tgctgcatct aaatacccca aaaccaaaaa gtatctagtt
      781 acagaatggt tgaatgacaa agcagagaag caagagtgcc ctgttgagtg ccctttacgt
      841 atcacaacgg atccaactgt actggcaacg accctaaaca tgttaccagg tcttatccat
      901 tccccgttaa tttgcaccac ccccaaacac tacattcgct ttggctcacc ctttatccct
      961 gagagacgtc gaaggcccct tctgcctgat ggcacattca gctcctgtaa gaagcgctgg
     1021 ataaaacaag ccttagaaga agggatgact caaacatcat ctgtacccca agagactaga
     1081 actcagcacc tataccaaag caatgagaat agtagctctt ctagtatctg caaagacaat
     1141 gcagacttgt tgagcccatt aaagaaatgg aagtctcgct atctgatgga gcagaatgtc
     1201 accaagttac ttcggcctct gtctccagtc acaccacccc ctcccaattc aggctcaaag
     1261 agtccccagc tggccacacc tggctcatct cacccaggag aagaggagtg tcgaaatgga
     1321 tacagcctca tgttttcacc agtcacatct cttactactg ctagtcgctg caacactcct
     1381 ctacagtttg agctttgtca ccgaaaagac ctggatttgg caaaagtagg ataccttgac
     1441 tccaacacta acagctgtgc tgatagacct tccctactca actcaggtca ttctgacctg
     1501 gctcctcatc cctccctcgg acccacttct gagactggtt tcccaagcag aagtggagat
     1561 ggacatcaga ccctcgtgag aaactcagac caggcatttc ggacagagtt caacttgatg
     1621 tatgcctact cccctttgaa tgctatgcct cgagcagatg gactgtatcg aggatctcct
     1681 ctagtggggg ataggaagcc tttacatttg gatgggggat attgttcccc tgcagaagga
     1741 ttttccagca gatatgaaca tggcttaatg aaagacctct ctcgtggatc cttgtcacct
     1801 ggtggtgaaa gggcctgtga aggagtccca tctgcccccc agaacccacc acagaggaaa
     1861 aaagtatccc tgctggagta ccgaaaacgg aaacaagaag ctaaggaaaa ttctgctggt
     1921 gggggaggtg actctgcaca gagcaaaagc aagtctgcag gagctgggca aggcagcagt
     1981 aactccgttt ccgacactgg tgcccatggt gtgcagggat cctcagcccg aactccatct
     2041 tcccctcaca aaaaattctc cccatctcat tcctctatgt cccatttgga ggcggtaagc
     2101 ccatcagatt ccagaggcac ttcttcatct cactgcagac ctcaagagaa tatcagcagt
     2161 aggtggatgg ttcccacatc agtagaacga ctccgagaag gagggagcat ccccaaggtc
     2221 ctccgaagca gcgtgagggt ggcccaaaag ggagagccct ctcccacatg ggagagtaac
     2281 atcacagaga aagactcaga ccctgcagat ggagaaggcc cagagacatt aagctcagca
     2341 ctctctaaag gagcaacagt ttacagccct tccagataca gctaccagct cctgcagtgt
     2401 gatagtcctc ggacagaatc acaaagcctc cttcagcaga gttcctcccc cttcagagga
     2461 catcctacac agtctccagg atacagttat cgaactactg cactgagacc tggaaacccc
     2521 ccctctcacg gttcttcaga atcatccctc tcttccacgt cctattccag ccccgcccac
     2581 cctgtgtcca cagactcgtt ggccccattt acggggacac cagggtattt tagcagccag
     2641 ccacattctg gaaacagcac tggcagcaat cttccaagga ggagctgccc ttctagtgct
     2701 gctagcccta ccctgcaggg accctcagac tcgccaacct cagattcagt ttctcagtcc
     2761 agcacaggaa ctctgagttc cacctccttt cctcagaact ctaggtcgtc attgccatca
     2821 gacttacgga ctatcagtct gcccagtgct gggcagtcag ctgtctacca ggcctccagg
     2881 gtatctgcgg tttccaattc acagcactac ccacaccgtg ggagtggggg tgtgcaccag
     2941 taccgactcc agccactgca agggtcagga gtcaagactc agacgggact ttcctagggc
     3001 ttctggattt gggcaaacag aactgaatga gcccatagct gcttccttcc agctgcctct
     3061 ggaacctagg ccgagcatat tgctgaggaa cggggggtac aaggtgccag aggattgggt
     3121 ctggtggaca agaaacaaga cttgtggtca caattggcct ctggccttgg agaaagctgt
     3181 aaatcttgtc tgaagcagag actataaaga agtttctccc tgctgtcaag ggtacattgt
     3241 tgacaagcaa atggtgtttc ggttagtaac ggttctaagt gcaatgagtt gtgttgaagc
     3301 ctccgtctcc catccttgcc tgtagcccgt agtcacttgt gcagtgagga catcttttta
     3361 aatttaaaaa aaaaaaaaaa
//