LOCUS       BC016302                1708 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens general transcription factor IIH, polypeptide 4,
            52kDa, mRNA (cDNA clone MGC:16269 IMAGE:3830902), complete cds.
ACCESSION   BC016302
VERSION     BC016302.2
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1708)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 1708)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (29-OCT-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Aug 19, 2003 this sequence version replaced BC016302.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC/DCTD/DTP
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
            Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 24 Row: b Column: 4
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 54144651.
FEATURES             Location/Qualifiers
     source          1..1708
                     /db_xref="H-InvDB:HIT000037522_07"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:16269 IMAGE:3830902"
                     /tissue_type="Skin, melanotic melanoma."
                     /clone_lib="NIH_MGC_20"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..1708
                     /gene="GTF2H4"
                     /gene_synonym="TFIIH"
                     /db_xref="GeneID:2968"
                     /db_xref="HGNC:HGNC:4658"
                     /db_xref="MIM:601760"
     CDS             180..1568
                     /gene="GTF2H4"
                     /gene_synonym="TFIIH"
                     /codon_start=1
                     /product="general transcription factor IIH, polypeptide 4,
                     52kDa"
                     /protein_id="AAH16302.1"
                     /db_xref="GeneID:2968"
                     /db_xref="HGNC:HGNC:4658"
                     /db_xref="MIM:601760"
                     /translation="MESTPSRGLNRVHLQCRNLQEFLGGLSPGVLDRLYGHPATCLAV
                     FRELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLP
                     GGLQGLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVL
                     HFMVGSPSAAVSQDLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQ
                     YLQTAQSRGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKR
                     KSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYTESELQIALIALFSEM
                     LYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIR
                     LWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSD
                     VKRFWKRQKHSS"
BASE COUNT      353 a    474 c    485 g    396 t
ORIGIN
        1 ctcttctgaa ttctccattc tgggctcttg cctgtgaaat ctttctttgc tttccccatc
       61 ttttcctcgc attttttcac catctttccc tcaatctcca ggagccaatg cgagactttg
      121 gctccgatta agcgacggcc cgagactcgg ggtgcgcgag gaggatcgac agagtggtga
      181 tggagagcac cccttcaagg ggactgaacc gagtacacct acaatgcagg aatctgcagg
      241 aattcttagg gggcctgagc cctggggtat tggaccgatt gtatgggcac cctgccacat
      301 gtctggctgt cttcagggag ctcccatcct tggctaagaa ctgggtgatg cggatgctct
      361 ttctggagca gcctttgcca caggctgctg tagctctgtg ggtaaagaag gaattcagca
      421 aggctcagga ggaaagtaca gggctgctga gcggcctccg gatctggcac acacagctgc
      481 tcccaggcgg gctccagggc ctcatcctca accccatttt ccgccagaac ctccgcattg
      541 cccttctggg tggggggaag gcctggtctg atgacacaag tcagctggga ccagacaagc
      601 atgcccggga cgttccctcc cttgacaagt acgccgagga gcgatgggag gtggtcttgc
      661 acttcatggt gggctccccc agtgcagctg tcagccagga cttggctcag ctcctcagcc
      721 aggctgggct catgaagagt actgaacctg gagagccgcc ctgcattact tccgctggct
      781 tccagttcct gttgctggac accccggctc agctctggta ctttatgttg cagtatttgc
      841 agacagccca gagccggggc atggacctgg tagagattct ctccttcctc ttccagctca
      901 gcttctctac tctgggcaag gattactctg tggaaggtat gagtgattct ctgttgaact
      961 tcctgcaaca tctgcgtgag tttgggcttg ttttccagag gaagaggaaa tctcggcgtt
     1021 actaccccac acgcctggcc atcaatctct catcaggtgt ctctggagct gggggcactg
     1081 tgcatcagcc aggtttcatt gtcgtggaaa ccaattaccg actgtatgcc tacacggagt
     1141 cggagctgca gattgccctc attgccctct tctctgagat gctctatcgg ttccccaaca
     1201 tggtggtggc gcaggtgacc cgggagagtg tgcagcaggc aatcgccagt ggcatcacag
     1261 cccagcagat aatccatttc ctaaggacaa gagcccaccc agtgatgctc aaacagacac
     1321 ctgtgctgcc ccccaccatc accgaccaga tccggctctg ggagctggaa agggacagac
     1381 tccggttcac tgagggtgtc ctgtataacc agttcctgtc gcaagtggac tttgagctgc
     1441 tgctggccca cgcgcgggag ctgggcgtgc tcgtgttcga gaactcggcc aagcggctca
     1501 tggtggtgac cccggccggg cacagcgacg tcaagcgctt ttggaagcgg cagaaacata
     1561 gctcctgaga gcgcgggact tggacacgga cctcggcggg cgggactggg cggggcgggg
     1621 catcagaact caggtgtttt ttatttacgc gtcagggctt ttcttgttta ataaagttat
     1681 gatagctaaa aaaaaaaaaa aaaaaaaa
//