LOCUS       BC050674                1393 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens THO complex 6 homolog (Drosophila), mRNA (cDNA clone
            MGC:60218 IMAGE:6067702), complete cds.
ACCESSION   BC050674
VERSION     BC050674.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1393)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J.,
            Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S.,
            Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M.,
            Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M.,
            Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A.,
            Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W.,
            Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C.,
            Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S.,
            Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E.,
            Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 1393)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (08-APR-2003) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA 94305
            Web site: http://www-shgc.stanford.edu
            Contact: (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
            R. M.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 110 Row: i Column: 18
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 31543163.
FEATURES             Location/Qualifiers
     source          1..1393
                     /db_xref="H-InvDB:HIT000053548"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:60218 IMAGE:6067702"
                     /tissue_type="Testis, embryonal carcinoma"
                     /clone_lib="NIH_MGC_92"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..1393
                     /gene="THOC6"
                     /gene_synonym="MGC2655"
                     /db_xref="GeneID:79228"
                     /db_xref="HGNC:HGNC:28369"
     CDS             236..1261
                     /gene="THOC6"
                     /gene_synonym="MGC2655"
                     /codon_start=1
                     /product="THO complex 6 homolog (Drosophila)"
                     /protein_id="AAH50674.1"
                     /db_xref="GeneID:79228"
                     /db_xref="HGNC:HGNC:28369"
                     /translation="MERAVPLAVPLGQTEVFQALQRLHMTIFSQSVSPCGKFLAAGNN
                     YGQIAIFSLSSALSSEAKEESKKPVVTFQAHDGPVYSMVSTDRHLLSAGDGEVKAWLW
                     AEMLKKGCKELWRRQPPYRTSLEVPEINALLLVPKENSLILAGGDCQLHTMDLETGTF
                     TRVLRGHTDYIHCLALRERSPEVLSGGEDGAVRLWDLRTAKEVQTIEVYKHEECSRPH
                     NGRWIGCLATDSDWMVCGGGPALTLWHLRSSTPTTIFPIRAPQKHVTFYQDLILSAGQ
                     GRCVNQWQLSGELKAQVPGSSPGLLSLSLNQQPAAPECKVLTAAGNSCRVDVFTNLGY
                     RAFSLSF"
BASE COUNT      293 a    388 c    414 g    298 t
ORIGIN
        1 cttgctcctc ggggtggggg agggtatccg gcttaagggg gctgcggtgg acaccacttc
       61 ttaatgtcgg gggtcttcgc ggcgctcacc tcggctccta gggttcggga cggtacgcac
      121 cagccacctt cgcgccgaag gcggtagggc gccacggaga ggaaccgctc taggcacgta
      181 aggcctcgtg aggttgcgtc gcgcgcggag cactctggga cttgtagttc tggagatgga
      241 gcgagctgtg ccgctcgcgg tgcctctggg tcagacagag gtgttccagg ccttgcagcg
      301 gctccatatg accatcttct cccagagcgt ctcaccatgt gggaagtttc tggcggctgg
      361 caacaattac gggcagattg ccatcttcag cttgtcctct gctttgagct cagaagccaa
      421 agaggaaagt aagaagccgg tggtgacttt ccaagcccat gatgggcccg tctatagcat
      481 ggtttccacc gatcgacatc tgcttagtgc tggggatggg gaggtgaagg cctggctttg
      541 ggcggagatg ctcaagaagg gctgtaagga gctgtggcgt cgtcagcctc catacaggac
      601 cagcctggaa gtgcctgaga tcaacgcttt gctgctggtc cccaaggaga attccctcat
      661 cctggctggg ggagactgtc agttgcacac tatggacctt gaaactggga ctttcacgag
      721 ggtcctccgg ggccacacag actacatcca ctgcctggca ctgcgggaaa ggagcccaga
      781 ggtgctgtca ggtggcgagg atggagctgt tcgactttgg gacctgcgca cagccaagga
      841 ggtccagacg atcgaggtct ataagcacga ggagtgctcg aggccccaca atgggcgctg
      901 gattggatgt ttggcaactg attccgactg gatggtctgt ggagggggcc cagccctcac
      961 cctctggcac ctccgatcct ccacacccac caccatcttc cccatccggg cgccacagaa
     1021 gcacgtcacc ttctaccagg acctgattct gtcagctggc cagggccgct gcgtcaacca
     1081 gtggcagctg agcggggagc tgaaggccca ggtgcctggc tcctccccag ggctgctcag
     1141 cctcagcctc aaccagcagc ctgccgcgcc tgagtgcaag gtcctgacag ctgcaggcaa
     1201 cagctgccgg gtggatgtct tcaccaacct gggttaccga gccttctccc tgtccttctg
     1261 atctctgacg acacccccag ccagctcagg gttttagagt gtttttcatt ttcttttttt
     1321 ttttttttac aataaagttt caggcttttt taaaaaaaaa aaaaaaaaaa aaaaaaaaaa
     1381 aaaaaaaaaa aaa
//