LOCUS       BC047868                1460 bp    mRNA    linear   HUM 16-SEP-2003
DEFINITION  Homo sapiens general transcription factor IIH, polypeptide 3,
            34kDa, mRNA (cDNA clone IMAGE:5197419), partial cds.
ACCESSION   BC047868
VERSION     BC047868.2
KEYWORDS    .
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1460)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 1460)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (03-MAR-2003) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Sep 16, 2003 this sequence version replaced BC047868.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Life Technologies, Inc.
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA  94305
            Web site:       http://www-shgc.stanford.edu
            Contact:  (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriguez, A., and Myers,
            R. M.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 107 Row: d Column: 19
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 28376643.
FEATURES             Location/Qualifiers
     source          1..1460
                     /db_xref="H-InvDB:HIT000053260"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:5197419"
                     /tissue_type="Brain, adult, 6 pooled whole brains"
                     /clone_lib="NIH_MGC_114"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            <1..1460
                     /gene="GTF2H3"
                     /gene_synonym="BTF2"
                     /gene_synonym="TFIIH"
                     /db_xref="GeneID:2967"
                     /db_xref="MIM:601750"
     CDS             <1..914
                     /gene="GTF2H3"
                     /gene_synonym="BTF2"
                     /gene_synonym="TFIIH"
                     /codon_start=3
                     /product="GTF2H3 protein"
                     /protein_id="AAH47868.2"
                     /db_xref="GeneID:2967"
                     /db_xref="MIM:601750"
                     /translation="DELNLLVIVVDANPIWWGKQALKESQFTLSKCIDAVMVLGNSHL
                     FMNRSNKLAVIASHIQESRFLYPGKNGRLGDFFGDPGNPPEFNPSGSKDGKYELLTSA
                     NEVIVEEIKDLMTKSDIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIK
                     AAEDSALQYMNFMNVIFAAQKQNILIDACVLDSDSGLLQQACDITGGLYLKVPQMPSL
                     LQYLLWVFLPDQDQRSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPI
                     CTTCETAFKISLPPVLKAKKKKLKVSA"
     misc_feature    6..845
                     /gene="GTF2H3"
                     /gene_synonym="BTF2"
                     /gene_synonym="TFIIH"
                     /note="Tfb4; Region: Transcription factor Tfb4"
                     /db_xref="CDD:pfam03850"
BASE COUNT          478 a          263 c          285 g          434 t
ORIGIN      
        1 aagatgaatt gaatcttctg gttattgtag ttgatgccaa cccaatttgg tggggaaagc
       61 aagcattaaa ggaatctcag ttcactttat ccaaatgcat agatgccgtg atggtgctgg
      121 gaaattcgca tttattcatg aatcgttcca acaaacttgc tgtgatagca agtcacattc
      181 aagaaagccg attcttatat cctggaaaga atggcagact tggagacttc ttcggagacc
      241 ctggcaaccc tcctgaattt aatccctctg ggagtaaaga tggaaaatac gaacttttaa
      301 cctcagcaaa tgaagttatt gttgaagaga ttaaagatct aatgaccaaa agtgacataa
      361 agggtcaaca tacagaaact ttgctggcag gatccctggc caaagccctt tgctacattc
      421 atagaatgaa caaggaagtt aaagacaatc aggaaatgaa atcaaggata ttggtgatta
      481 aggctgcaga agacagtgcg ttgcagtata tgaacttcat gaatgtcatc tttgcagcac
      541 agaaacagaa tattttgatt gatgcctgtg ttttagactc cgactcaggg ctcctccaac
      601 aggcttgtga catcacggga ggactgtacc tgaaggtgcc tcagatgcct tctcttctgc
      661 agtatttgct gtgggtgttt cttcccgatc aagatcagag atctcagtta atcctcccac
      721 ccccagttca tgttgactac agggctgctt gcttctgtca tcgaaatctc attgaaattg
      781 gttatgtctg ttctgtgtgt ttgtcaatat tctgcaattt cagccccatt tgtactacgt
      841 gcgagacagc ctttaaaatt tctctgcctc cagtgctgaa agccaagaaa aagaaactga
      901 aagtgtctgc ctgaggataa aatattttcc ccatctttta gagctgttaa tagaaattat
      961 atagcagatt ctttgttggg aagactgaaa aaaataaaga taggtatagg ataattttta
     1021 atatggtgac cttacagaaa atatttccca aacatccttt tcatcctgtg cttctggagg
     1081 actgatttgt ttgagggaat cattctatgc attatatcct aaaatattct atgactggtt
     1141 tctgtccatg tttgtggctt tcattttttt aatgggatga ctattagtca aagtcagctt
     1201 gtcatgactc atcataggct ttctaaccta ctccctgaat ccgggtcctc attgtgaaat
     1261 gcatgccata cgaaatttga acgtagcttt ggaaaaaggg actatttgtg gagtaatggc
     1321 attaatcaac atagaacatc ttatttgaat caacagttaa cttcagtagt catgtgaata
     1381 aaattcttat tgtctaaatt gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
     1441 aaaaaaaaaa aaaaaaaaaa
//