LOCUS BC027450 2135 bp mRNA linear HUM 18-AUG-2006 DEFINITION Homo sapiens SET domain containing 1A, mRNA (cDNA clone IMAGE:4156088), partial cds. ACCESSION BC027450 VERSION BC027450.1 KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2135) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2135) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (04-APR-2002) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: David N. Louis, M.D. cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Baylor College of Medicine Human Genome Sequencing Center Center code: BCM-HGSC Web site: http://www.hgsc.bcm.tmc.edu/cdna/ Contact: amg@bcm.tmc.edu Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H., Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati, A.N., Gibbs, R.A. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 52 Row: a Column: 11. FEATURES Location/Qualifiers source 1..2135 /db_xref="H-InvDB:HIT000091330" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="IMAGE:4156088" /tissue_type="Brain, anaplastic oligodendroglioma with 1p/19q loss" /clone_lib="NCI_CGAP_Brn67" /lab_host="DH10B" /note="Vector: pCMV-SPORT6" gene <1..2135 /gene="SETD1A" /gene_synonym="KIAA0339" /gene_synonym="Set1" /db_xref="GeneID:9739" /db_xref="HGNC:HGNC:29010" CDS <1..1410 /gene="SETD1A" /gene_synonym="KIAA0339" /gene_synonym="Set1" /codon_start=1 /product="SETD1A protein" /protein_id="AAH27450.1" /db_xref="GeneID:9739" /db_xref="HGNC:HGNC:29010" /translation="LFLGEEAEPGTEVDLAVLADLALTPARRGLPALPAVEDSEATET SDEAERPRPLLSHILLEHNYALAVKPTPPAPALRPPEPVPAPAALFSSPADEVLEAPE VVVAEAEEPKPQQLQQQREEGEEEGEEEGEEEEEESSDSSSSSDGEGALRRRSLRSHA RRRRPPPPPPPPPPRAYEPRSEFEQMTILYDIWNSGLDSEDMSYLRLTYERLLQQTSG ADWLNDTHWVHHTITNLTTPKRKRRPQDGPREHQTGSARSEGYYPISKKEKDKYLDVC PVSARQLEGVDTQGTNRVLSERRSEQRRLLSAIGTSAIMDSDLLKLNQLKFRKKKLRF GRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIGSSYLFRVDHD TIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEITYDYKFPL EDNKIPCLCGTESCRGSLN" BASE COUNT 487 a 675 c 616 g 357 t ORIGIN 1 ctcttcctcg gtgaggaggc tgagccaggg acagaggtgg acctggcggt cctggccgac 61 ctggccctga cccctgcccg gcgcgggctg cctgccctgc ctgctgttga agactcagag 121 gccacagaga catcggacga ggccgagcgc cctaggcccc tgctcagcca catcctcctg 181 gagcacaact atgccctggc cgtcaagccc acgccccctg cgccagccct gcggcccccg 241 gagccagtgc ccgcacccgc cgccctcttc agttccccag ctgatgaggt cctggaggcc 301 cccgaggtgg tggtggctga ggcggaggag cccaagccgc agcaactgca gcagcagcgg 361 gaggagggcg aagaggaggg ggaggaagag ggggaggaag aggaggagga gtcctctgac 421 agcagcagca gcagcgatgg ggagggcgcc ctccggaggc gcagcctccg ctcccacgcc 481 cggcgccgcc gccctccgcc cccacccccg ccgccaccgc cccgcgccta cgagccacgc 541 agtgagtttg aacagatgac catcctgtat gacatttgga actcgggcct ggactcagag 601 gacatgagtt acctgcggct tacgtacgag cggctgctgc agcagacaag cggggctgac 661 tggctcaacg acactcactg ggtccatcac acaatcacca acctgaccac cccaaaacgc 721 aagcggcggc cccaggatgg gccccgggag caccagacag gctcagcccg cagcgaaggc 781 tactacccca tcagcaagaa ggagaaggac aagtacctgg acgtgtgccc agtctcggcc 841 cggcagctgg agggcgtgga cactcagggg acgaaccgcg tgctgtccga gcgccggtcc 901 gagcagcggc ggctgctgag cgccatcggt acctccgcca tcatggacag tgacctgctg 961 aaactcaacc agctcaagtt ccggaagaag aagctccgat ttggccggag ccggatccac 1021 gagtggggtc tgtttgccat ggaacccatt gctgctgacg agatggtcat cgaatacgtg 1081 ggtcagaaca tccgtcagat ggtggccgac atgcgggaga agcgctacgt gcaggagggc 1141 attggcagca gctacctgtt ccgggtggac cacgacacca tcatcgatgc caccaagtgt 1201 ggcaacctgg ccagattcat caaccactgc tgcacgccta actgctacgc caaggtcatc 1261 accatcgagt cccagaagaa gatcgtgatc tactccaagc agcccattgg cgtggacgag 1321 gagatcacct acgactacaa gttcccactg gaagacaaca agatcccgtg tctgtgtggc 1381 acagagagct gccggggctc cctaaactga ggtggggcag gatgggtgcc cacaccccta 1441 tttattcccc ctggtgccct gagctcccag caccccccca gccttagtgg gctcagcagg 1501 gcccacatgc ccccatctcc aagcgtgggg ttgggggccc caagcccagc gagggagcct 1561 cagtccctgg aggcagcttc tgcctctcct gtcacccctg cccaccaccc cctgattgtt 1621 tttctttgcg gagaagaagc tgtaaatgtt ttgtagcagc cagcagctgt ttcctgtgga 1681 aacctggggt gccggcctgt acagattctg tcctgggggg ctacacagtc ctcttgcttt 1741 gtgttaatgg ggacttcccc ttacgccctg cgtgtacccc tccccagttt aggggtctct 1801 ggggcagtgg ccatgttctc cccctggggg ggctctgcac ccccagtcct ggggactccg 1861 tgcctggaac cctgcctcat ctgttcctgc cagaccctga gggtcaccct tccaccctgg 1921 tgtcactccc cggctcagcc aggccaggat ggcggggtgg gtcccttttg ctgggctgga 1981 ctgtacatat gttaatagcg caaacccgac gccacatttt tataattgtg attaaacttt 2041 attgtacaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2101 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaa //