LOCUS BC004935 1708 bp mRNA linear HUM 15-JUL-2006 DEFINITION Homo sapiens general transcription factor IIH, polypeptide 4, 52kDa, mRNA (cDNA clone MGC:10768 IMAGE:3606718), complete cds. ACCESSION BC004935 VERSION BC004935.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1708) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 1708) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (21-MAR-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Genome Sequence Centre, BC Cancer Agency, Vancouver, BC, Canada info@bcgsc.bc.ca Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao, Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang, Jacquie Schein, Asim Siddiqui,Steven Jones, Rob Holt, Marco Marra. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 13 Row: e Column: 9 This clone was selected for full length sequencing because it passed the following selection criteria: Hexamer frequency ORF analysis. FEATURES Location/Qualifiers source 1..1708 /db_xref="H-InvDB:HIT000032096_07" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:10768 IMAGE:3606718" /tissue_type="Uterus, endometrium adenocarcinoma" /clone_lib="NIH_MGC_44" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..1708 /gene="GTF2H4" /gene_synonym="TFIIH" /db_xref="GeneID:2968" /db_xref="HGNC:HGNC:4658" /db_xref="MIM:601760" CDS 180..1568 /gene="GTF2H4" /gene_synonym="TFIIH" /codon_start=1 /product="general transcription factor IIH, polypeptide 4, 52kDa" /protein_id="AAH04935.1" /db_xref="GeneID:2968" /db_xref="HGNC:HGNC:4658" /db_xref="MIM:601760" /translation="MESTPSRGLNRVHLQCRNLQEFLGGLSPGVLDRLYGHPATCLAV FRELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLP GGLQGLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVL HFMVGSPSAAVSQDLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQ YLQTAQSRGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKR KSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYTESELQIALIALFSEM LYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIR LWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSD VKRFWKRQKHSS" BASE COUNT 353 a 474 c 485 g 396 t ORIGIN 1 ctcttctgaa ttctccattc tgggctcttg cctgtgaaat ctttctttgc tttccccatc 61 ttttcctcgc attttttcac catctttccc tcaatctcca ggagccaatg cgagactttg 121 gctccgatta agcgacggcc cgagactcgg ggtgcgcgag gaggatcgac agagtggtga 181 tggagagcac cccttcaagg ggactgaacc gagtacacct acaatgcagg aatctgcagg 241 aattcttagg gggcctgagc cctggggtat tggaccgatt gtatgggcac cctgccacat 301 gtctggctgt cttcagggag ctcccatcct tggctaagaa ctgggtgatg cggatgctct 361 ttctggagca gcctttgcca caggctgctg tagctctgtg ggtaaagaag gaattcagca 421 aggctcagga ggaaagtaca gggctgctga gcggcctccg gatctggcac acacagctgc 481 tcccaggcgg gctccagggc ctcatcctca accccatttt ccgccagaac ctccgcattg 541 cccttctggg tggggggaag gcctggtctg atgacacaag tcagctggga ccagacaagc 601 atgcccggga cgttccctcc cttgacaagt acgccgagga gcgatgggag gtggtcttgc 661 acttcatggt gggctccccc agtgcagctg tcagccagga cttggctcag ctcctcagcc 721 aggctgggct catgaagagt actgaacctg gagagccgcc ctgcattact tccgctggct 781 tccagttcct gttgctggac accccggctc agctctggta ctttatgttg cagtatttgc 841 agacagccca gagccggggc atggacctgg tagagattct ctccttcctc ttccagctca 901 gcttctctac tctgggcaag gattactctg tggaaggtat gagtgattct ctgttgaact 961 tcctgcaaca tctgcgtgag tttgggcttg ttttccagag gaagaggaaa tctcggcgtt 1021 actaccccac acgcctggcc atcaatctct catcaggtgt ctctggagct gggggcactg 1081 tgcatcagcc aggtttcatt gtcgtggaaa ccaattaccg actgtatgcc tacacggagt 1141 cggagctgca gattgccctc attgccctct tctctgagat gctctatcgg ttccccaaca 1201 tggtggtggc gcaggtgacc cgggagagtg tgcagcaggc aatcgccagt ggcatcacag 1261 cccagcagat aatccatttc ctaaggacaa gagcccaccc agtgatgctc aaacagacac 1321 ctgtgctgcc ccccaccatc accgaccaga tccggctctg ggagctggaa agggacagac 1381 tccggttcac tgagggtgtc ctgtataacc agttcctgtc gcaagtggac tttgagctgc 1441 tgctggccca cgcgcgggag ctgggcgtgc tcgtgttcga gaactcggcc aagcggctca 1501 tggtggtgac cccggccggg cacagcgacg tcaagcgctt ttggaagcgg cagaaacata 1561 gctcctgaga gcgcgggact tggacacgga cctcggcggg cgggactggg cggggcgggg 1621 catcagaact caggtgtttt ttatttacgc gtcagggctt ttcttgttta ataaagttat 1681 gatagctaaa aaaaaaaaaa aaaaaaaa //