LOCUS       BC039726                2334 bp    mRNA    linear   HUM 16-SEP-2003
DEFINITION  Homo sapiens general transcription factor IIH, polypeptide 3,
            34kDa, mRNA (cDNA clone IMAGE:5582960), partial cds.
ACCESSION   BC039726
VERSION     BC039726.2
KEYWORDS    .
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2334)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2334)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (12-NOV-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Sep 16, 2003 this sequence version replaced BC039726.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D., McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 82 Row: h Column: 18
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 28376643.
FEATURES             Location/Qualifiers
     source          1..2334
                     /db_xref="H-InvDB:HIT000095895"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:5582960"
                     /tissue_type="Testis, embryonal carcinoma"
                     /clone_lib="NIH_MGC_92"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            <1..2334
                     /gene="GTF2H3"
                     /gene_synonym="BTF2"
                     /gene_synonym="TFIIH"
                     /db_xref="GeneID:2967"
                     /db_xref="MIM:601750"
     CDS             <1..914
                     /gene="GTF2H3"
                     /gene_synonym="BTF2"
                     /gene_synonym="TFIIH"
                     /codon_start=3
                     /product="GTF2H3 protein"
                     /protein_id="AAH39726.1"
                     /db_xref="GeneID:2967"
                     /db_xref="MIM:601750"
                     /translation="DELNLLVIVVDANPIWWGKQALKESQFTLSKCIDAVMVLGNSHL
                     FMNRSNKLAVIASHIQESRFLYPGKNGRLGDFFGDPGNPPEFNPSGSKDGKYELLTSA
                     NEVIVEEIKDLMTKSDIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIK
                     AAEDSALQYMNFMNVIFAAQKQNILIDACVLDSDSGLLQQACDITGGLYLKVPQMPSL
                     LQYLLWVFLPDQDQRSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPI
                     CTTCETAFKISLPPVLKAKKKKLKVSA"
     misc_feature    6..845
                     /gene="GTF2H3"
                     /gene_synonym="BTF2"
                     /gene_synonym="TFIIH"
                     /note="Tfb4; Region: Transcription factor Tfb4"
                     /db_xref="CDD:pfam03850"
BASE COUNT          722 a          422 c          493 g          697 t
ORIGIN      
        1 aagatgaatt gaatcttctg gttattgtag ttgatgccaa cccaatttgg tggggaaagc
       61 aagcattaaa ggaatctcag ttcactttat ccaaatgcat agatgccgtg atggtgctgg
      121 gaaattcgca tttattcatg aatcgttcca acaaacttgc tgtgatagca agtcacattc
      181 aagaaagccg attcttatat cctggaaaga atggcagact tggagacttc ttcggagacc
      241 ctggcaaccc tcctgaattt aatccctctg ggagtaaaga tggaaaatac gaacttttaa
      301 cctcagcaaa tgaagttatt gttgaagaga ttaaagatct aatgaccaaa agtgacataa
      361 agggtcaaca tacagaaact ttgctggcag gatccctggc caaagccctt tgctacattc
      421 atagaatgaa caaggaagtt aaagacaatc aggaaatgaa atcaaggata ttggtgatta
      481 aggctgcaga agacagtgcg ttgcagtata tgaacttcat gaatgtcatc tttgcagcac
      541 agaaacagaa tattttgatt gatgcctgtg ttttagactc cgactcaggg ctcctccaac
      601 aggcttgtga catcacggga ggactgtacc tgaaggtgcc tcagatgcct tctcttctgc
      661 agtatttgct gtgggtgttt cttcccgatc aagatcagag atctcagtta atcctcccac
      721 ccccagttca tgttgactac agggctgctt gcttctgtca tcgaaatctc attgaaattg
      781 gttatgtctg ttctgtgtgt ttgtcaatat tctgcaattt cagccccatt tgtactacgt
      841 gcgagacagc ctttaaaatt tctctgcctc cagtgctgaa agccaagaaa aagaaactga
      901 aagtgtctgc ctgaggataa aatattttcc ccatctttta gagctgttaa tagaaattat
      961 atagcagatt ctttgttggg aagactgaaa aaaataaaga taggtatagg ataattttta
     1021 atatggtgac cttacagaaa atatttccca aacatccttt tcatcctgtg cttctggagg
     1081 actgatttgt ttgagggaat cattctatgc attatatcct aaaatattct atgactggtt
     1141 tctgtccatg tttgtggctt tcattttttt aatgggatga ctattagtca aagtcagctt
     1201 gtcatgactc atcataggct ttctaaccta ctccctgaat ccgggtcctc attgtgaaat
     1261 gcatgccata cgaaatttga acgtagcttt ggaaaaaggg actatttgtg gagtaatggc
     1321 attaatcaac atagaacatc ttatttgaat caacagttaa cttcagtagt catgtgaata
     1381 aaattcttat tgtctaaatt gagacagcct cagatatttg cagatattta ctttttgtct
     1441 gatatcagta catatttgga caaagtcatc taaataatag tttgtcacca aataactaca
     1501 aaatctcatt ttaaatgagt aaggagaacg tgtacagaag caaattttct tcaaaatagt
     1561 tgtgggaaga gcttatatgt gaaagcttat gactggtttt gagggagaac ttactggaga
     1621 aaatggactc tatgttaagt atggttttca gatagaattc tttccttttt taatgaggaa
     1681 aaaaaatcca cattaatatt gaaactgcac ctgtaatccc agcactttgg gaggctgagg
     1741 acagaggatt gcttgagccc aggagttcga gagcagcctg ggcagcaaag tgagacccca
     1801 tctctactaa aaatttaaat gtatttatta aaactgttct ctagaagctt tggactgaat
     1861 cccaaaagtg tttataagtt caaaagcaaa agtatttgta atttcaacaa caaaaaatgt
     1921 atttctttat gtaatcttga aattattaaa agtcctttta gcttctagca catatttgta
     1981 caaagagttt aaggaatggt ggctggtttg gtttgttttt taaaaatgtt tactgacgag
     2041 gccgggcgtg gtggctcacc cctgcaatcc cagcactttg ggaggccgag gcaggcagat
     2101 cacaaggtca ggagttcaag atcagcctgg ccagtatggt gaaaccctgt ctctactaaa
     2161 aatagaaaaa ttagccatgc gaagtagcag gtgcctgtag ttccagctac tcgggaggct
     2221 gaggcaagag aattgcttga atccaggagg cagaggttgc agtgagccaa gatagcgcct
     2281 ctgtactcca gcctgggtga cagagcgaga ctctgtatca aaaaaaaaaa aaaa
//