LOCUS BC000365 2989 bp mRNA linear HUM 15-JUL-2006 DEFINITION Homo sapiens general transcription factor IIH, polypeptide 1, 62kDa, mRNA (cDNA clone MGC:8323 IMAGE:2819217), complete cds. ACCESSION BC000365 VERSION BC000365.2 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2989) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2989) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (15-NOV-2000) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Aug 20, 2003 this sequence version replaced BC000365.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: DCTD/DTP cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 1 Row: a Column: 2 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 19923304. FEATURES Location/Qualifiers source 1..2989 /db_xref="H-InvDB:HIT000029556" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:8323 IMAGE:2819217" /tissue_type="Lung, small cell carcinoma" /clone_lib="NIH_MGC_7" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..2989 /gene="GTF2H1" /gene_synonym="BTF2" /gene_synonym="TFIIH" /db_xref="GeneID:2965" /db_xref="HGNC:HGNC:4655" /db_xref="MIM:189972" CDS 161..1807 /gene="GTF2H1" /gene_synonym="BTF2" /gene_synonym="TFIIH" /codon_start=1 /product="general transcription factor IIH, polypeptide 1, 62kDa" /protein_id="AAH00365.1" /db_xref="GeneID:2965" /db_xref="HGNC:HGNC:4655" /db_xref="MIM:189972" /translation="MATSSEEVLLIVKKVRQKKQDGALYLMAERIAWAPEGKDRFTIS HMYADIKCQKISPEGKAKIQLQLVLHAGDTTNFHFSNESTAVKERDAVKDLLQQLLPK FKRKANKELEEKNRMLQEDPVLFQLYKDLVVSQVISAEEFWANRLNVNATDSSSTSNH KQDVGISAAFLADVRPQTDGCNGLRYNLTSDIIESIFRTYPAVKMKYAENVPHNMTEK EFWTRFFQSHYFHRDRLNTGSKDLFAECAKIDEKGLKTMVSLGVKNPLLDLTALEDKP LDEGYGISSVPSASNSKSIKENSNAAIIKRFNHHSAMVLAAGLRKQEAQNEQTSEPSN MDGNSGDADCFQPAVKRAKLQESIEYEDLGKNNSVKTIALNLKKSDRYYHGPTPIQSL QYATSQDIINSFQSIRQEMEAYTPKLTQVLSSSAASSTITALSPGGALMQGGTQQAIN QMVPNDIQSELKHLYVAVGELLRHFWSCFPVNTPFLEEKVVKMKSNLERFQVTKLCPF QEKIRRQYLSTNLVSHIEEMLQTAYNKLHTWQSRRLMKKT" BASE COUNT 1004 a 567 c 604 g 814 t ORIGIN 1 gaccccctag taacagaggc ggtggctact gctgcggcca ctgggtttcg gcctcttccc 61 agcagcggct ctaagaagcg cagcggaact cgaccggatc caacccagtt agttacttcc 121 tgtctagagt tgtagcttcc acctgcacct tctagccacc atggcaacct catctgaaga 181 agttttgctg attgtaaaga aagtgcgtca aaagaagcag gatggagctc tgtacctcat 241 ggcagaaaga attgcttggg cacctgaagg caaagataga tttacaatca gccatatgta 301 tgcagatatt aaatgccaga aaattagtcc agaaggaaaa gctaaaattc agcttcagct 361 ggtcctacat gcaggggaca caactaactt ccatttttcc aatgaaagca cagcagtgaa 421 agagcgagat gcagtaaaag accttcttca gcagctgctg cccaaattca agaggaaagc 481 aaataaagaa ctggaagaga agaacagaat gctgcaagaa gatcctgttt tgtttcagct 541 ttataaagac cttgttgtga gtcaagtgat cagtgctgag gaattctggg ccaatcgttt 601 aaatgtgaat gcaacagata gttcttccac atccaatcat aagcaggatg ttggcatttc 661 tgctgcattt ctggctgatg tccggcccca aactgatggc tgtaacggtc taagatataa 721 tttaacttct gatatcattg agtccatatt taggacctat ccagcagtaa aaatgaaata 781 tgcagaaaat gttccccaca acatgacaga gaaggaattc tggacacgtt ttttccagtc 841 ccattatttt cacagggatc ggctgaatac agggtcaaag gatctctttg cagaatgtgc 901 caaaatagat gaaaaaggcc taaaaacaat ggtttcatta ggagtgaaaa acccactact 961 agatttaaca gctttggaag ataaaccatt agatgagggc tatggcattt cctctgtgcc 1021 atctgcttcc aattctaaat ccataaaaga gaatagtaat gctgccatca tcaagagatt 1081 taaccatcac agtgccatgg tcctggcagc tggactcaga aaacaagaag cacaaaatga 1141 acaaactagt gagcccagca acatggatgg aaattccgga gatgcagact gctttcagcc 1201 agcagtcaaa agggcgaaat tacaagagtc cattgaatat gaagacttgg ggaaaaataa 1261 ttctgtaaaa acgattgcac taaacctcaa gaagtcagat aggtattatc atggtccaac 1321 tccaatccag tcactacagt atgcaacaag tcaggacatt attaattctt ttcaaagtat 1381 tagacaagaa atggaagctt atacacccaa gttaactcag gttctctcaa gtagtgctgc 1441 cagtagtacc atcacagcac tgtcacctgg aggggcactt atgcagggag gaacacagca 1501 agccataaac cagatggtgc caaatgatat tcaatctgaa ttgaaacact tatatgtagc 1561 tgttggagaa cttctacgac atttctggtc ctgctttcct gttaatacgc cattcctaga 1621 agaaaaggta gtgaaaatga aaagtaattt ggaacgattc caagttacga agctctgtcc 1681 attccaagaa aagattcgga gacagtattt aagcacaaat ttggtaagtc acatagaaga 1741 gatgctccag acagcctaca acaagctcca cacatggcag tcacggcgtc tgatgaagaa 1801 aacgtgaggt ggccatgatg cttacaggtt ttgtgagatt gagagaacta tgacctgcag 1861 caactctgga aacctggcct gacagacaag cagatgacct cacaggagtg ataagaaaca 1921 tctgctccac gccaactccc agagctgatg ctattgtact tgcacattgg agactgaaag 1981 gaaagaaggg actaaatgct ggggaggtaa attaagacag aaccaaatga gctaagttgc 2041 aaatatatat atatacacac acacacatat atgtacatgt gtatgtacat atatatttta 2101 aaagactgtt tactgcagtt gctcaggaac tgcttttgat tcacattaag ctgctttcag 2161 aaattaaaaa aacacttttt aaagggtgca ttgataaaat ctgaggtttt ttggttgtcg 2221 tttttttctg tgtacatttt tttcctaagt ttatggcaca gggtagacct taagtattcc 2281 tcctccatcc ttcattcttc accctccatt ggatcctcaa gttttaatga attccaatta 2341 taccttacat cagcaagtta aaaaaagtac tttaaaataa agcaaaggga gactgttgct 2401 caaccatcag gaaacagttg tcagaagaca tcattggttc tgtgtttcct acggaaataa 2461 gaaacgataa atattgcact gaatgtttgt ggtttggagt ccctgaataa taaagaggga 2521 atatatttgc agaaagtcgc atagggtttt ttaatgcaga attttgtcag aagacaatgg 2581 cgctgcatgt ttttctttga gtgcaaatgt acattgctaa gattttttta agatggcatg 2641 tgctttgaaa agaagatatt gcatttttaa gagtttaaaa atcttatgag tgagaaatat 2701 taaaaaaatc ttattttcac ctctttagaa gaaataaaag atgtttctcc tatctccttt 2761 tctctagtat ttgactgtta ctgtccttgg cgaatcgata atcattgcat agtgactgaa 2821 aagcctaagt gcaaaaaaaa aaaaaaaaag atgttcttgt ttctgaactt cgtgccatat 2881 tttgttcctg atgggatcaa cttaatgttt aagactttag atgtcttgta ttaaaaatta 2941 cacaaaaaaa gtaaaacttt ttatacttaa aaaaaaaaaa aaaaaaaaa //