LOCUS       BC000365                2989 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens general transcription factor IIH, polypeptide 1,
            62kDa, mRNA (cDNA clone MGC:8323 IMAGE:2819217), complete cds.
ACCESSION   BC000365
VERSION     BC000365.2
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2989)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2989)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (15-NOV-2000) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Aug 20, 2003 this sequence version replaced BC000365.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: DCTD/DTP
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D., McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 1 Row: a Column: 2
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 19923304.
FEATURES             Location/Qualifiers
     source          1..2989
                     /db_xref="H-InvDB:HIT000029556"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:8323 IMAGE:2819217"
                     /tissue_type="Lung, small cell carcinoma"
                     /clone_lib="NIH_MGC_7"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..2989
                     /gene="GTF2H1"
                     /gene_synonym="BTF2"
                     /gene_synonym="TFIIH"
                     /db_xref="GeneID:2965"
                     /db_xref="HGNC:HGNC:4655"
                     /db_xref="MIM:189972"
     CDS             161..1807
                     /gene="GTF2H1"
                     /gene_synonym="BTF2"
                     /gene_synonym="TFIIH"
                     /codon_start=1
                     /product="general transcription factor IIH, polypeptide 1,
                     62kDa"
                     /protein_id="AAH00365.1"
                     /db_xref="GeneID:2965"
                     /db_xref="HGNC:HGNC:4655"
                     /db_xref="MIM:189972"
                     /translation="MATSSEEVLLIVKKVRQKKQDGALYLMAERIAWAPEGKDRFTIS
                     HMYADIKCQKISPEGKAKIQLQLVLHAGDTTNFHFSNESTAVKERDAVKDLLQQLLPK
                     FKRKANKELEEKNRMLQEDPVLFQLYKDLVVSQVISAEEFWANRLNVNATDSSSTSNH
                     KQDVGISAAFLADVRPQTDGCNGLRYNLTSDIIESIFRTYPAVKMKYAENVPHNMTEK
                     EFWTRFFQSHYFHRDRLNTGSKDLFAECAKIDEKGLKTMVSLGVKNPLLDLTALEDKP
                     LDEGYGISSVPSASNSKSIKENSNAAIIKRFNHHSAMVLAAGLRKQEAQNEQTSEPSN
                     MDGNSGDADCFQPAVKRAKLQESIEYEDLGKNNSVKTIALNLKKSDRYYHGPTPIQSL
                     QYATSQDIINSFQSIRQEMEAYTPKLTQVLSSSAASSTITALSPGGALMQGGTQQAIN
                     QMVPNDIQSELKHLYVAVGELLRHFWSCFPVNTPFLEEKVVKMKSNLERFQVTKLCPF
                     QEKIRRQYLSTNLVSHIEEMLQTAYNKLHTWQSRRLMKKT"
BASE COUNT         1004 a          567 c          604 g          814 t
ORIGIN      
        1 gaccccctag taacagaggc ggtggctact gctgcggcca ctgggtttcg gcctcttccc
       61 agcagcggct ctaagaagcg cagcggaact cgaccggatc caacccagtt agttacttcc
      121 tgtctagagt tgtagcttcc acctgcacct tctagccacc atggcaacct catctgaaga
      181 agttttgctg attgtaaaga aagtgcgtca aaagaagcag gatggagctc tgtacctcat
      241 ggcagaaaga attgcttggg cacctgaagg caaagataga tttacaatca gccatatgta
      301 tgcagatatt aaatgccaga aaattagtcc agaaggaaaa gctaaaattc agcttcagct
      361 ggtcctacat gcaggggaca caactaactt ccatttttcc aatgaaagca cagcagtgaa
      421 agagcgagat gcagtaaaag accttcttca gcagctgctg cccaaattca agaggaaagc
      481 aaataaagaa ctggaagaga agaacagaat gctgcaagaa gatcctgttt tgtttcagct
      541 ttataaagac cttgttgtga gtcaagtgat cagtgctgag gaattctggg ccaatcgttt
      601 aaatgtgaat gcaacagata gttcttccac atccaatcat aagcaggatg ttggcatttc
      661 tgctgcattt ctggctgatg tccggcccca aactgatggc tgtaacggtc taagatataa
      721 tttaacttct gatatcattg agtccatatt taggacctat ccagcagtaa aaatgaaata
      781 tgcagaaaat gttccccaca acatgacaga gaaggaattc tggacacgtt ttttccagtc
      841 ccattatttt cacagggatc ggctgaatac agggtcaaag gatctctttg cagaatgtgc
      901 caaaatagat gaaaaaggcc taaaaacaat ggtttcatta ggagtgaaaa acccactact
      961 agatttaaca gctttggaag ataaaccatt agatgagggc tatggcattt cctctgtgcc
     1021 atctgcttcc aattctaaat ccataaaaga gaatagtaat gctgccatca tcaagagatt
     1081 taaccatcac agtgccatgg tcctggcagc tggactcaga aaacaagaag cacaaaatga
     1141 acaaactagt gagcccagca acatggatgg aaattccgga gatgcagact gctttcagcc
     1201 agcagtcaaa agggcgaaat tacaagagtc cattgaatat gaagacttgg ggaaaaataa
     1261 ttctgtaaaa acgattgcac taaacctcaa gaagtcagat aggtattatc atggtccaac
     1321 tccaatccag tcactacagt atgcaacaag tcaggacatt attaattctt ttcaaagtat
     1381 tagacaagaa atggaagctt atacacccaa gttaactcag gttctctcaa gtagtgctgc
     1441 cagtagtacc atcacagcac tgtcacctgg aggggcactt atgcagggag gaacacagca
     1501 agccataaac cagatggtgc caaatgatat tcaatctgaa ttgaaacact tatatgtagc
     1561 tgttggagaa cttctacgac atttctggtc ctgctttcct gttaatacgc cattcctaga
     1621 agaaaaggta gtgaaaatga aaagtaattt ggaacgattc caagttacga agctctgtcc
     1681 attccaagaa aagattcgga gacagtattt aagcacaaat ttggtaagtc acatagaaga
     1741 gatgctccag acagcctaca acaagctcca cacatggcag tcacggcgtc tgatgaagaa
     1801 aacgtgaggt ggccatgatg cttacaggtt ttgtgagatt gagagaacta tgacctgcag
     1861 caactctgga aacctggcct gacagacaag cagatgacct cacaggagtg ataagaaaca
     1921 tctgctccac gccaactccc agagctgatg ctattgtact tgcacattgg agactgaaag
     1981 gaaagaaggg actaaatgct ggggaggtaa attaagacag aaccaaatga gctaagttgc
     2041 aaatatatat atatacacac acacacatat atgtacatgt gtatgtacat atatatttta
     2101 aaagactgtt tactgcagtt gctcaggaac tgcttttgat tcacattaag ctgctttcag
     2161 aaattaaaaa aacacttttt aaagggtgca ttgataaaat ctgaggtttt ttggttgtcg
     2221 tttttttctg tgtacatttt tttcctaagt ttatggcaca gggtagacct taagtattcc
     2281 tcctccatcc ttcattcttc accctccatt ggatcctcaa gttttaatga attccaatta
     2341 taccttacat cagcaagtta aaaaaagtac tttaaaataa agcaaaggga gactgttgct
     2401 caaccatcag gaaacagttg tcagaagaca tcattggttc tgtgtttcct acggaaataa
     2461 gaaacgataa atattgcact gaatgtttgt ggtttggagt ccctgaataa taaagaggga
     2521 atatatttgc agaaagtcgc atagggtttt ttaatgcaga attttgtcag aagacaatgg
     2581 cgctgcatgt ttttctttga gtgcaaatgt acattgctaa gattttttta agatggcatg
     2641 tgctttgaaa agaagatatt gcatttttaa gagtttaaaa atcttatgag tgagaaatat
     2701 taaaaaaatc ttattttcac ctctttagaa gaaataaaag atgtttctcc tatctccttt
     2761 tctctagtat ttgactgtta ctgtccttgg cgaatcgata atcattgcat agtgactgaa
     2821 aagcctaagt gcaaaaaaaa aaaaaaaaag atgttcttgt ttctgaactt cgtgccatat
     2881 tttgttcctg atgggatcaa cttaatgttt aagactttag atgtcttgta ttaaaaatta
     2941 cacaaaaaaa gtaaaacttt ttatacttaa aaaaaaaaaa aaaaaaaaa
//