LOCUS BC047868 1460 bp mRNA linear HUM 16-SEP-2003 DEFINITION Homo sapiens general transcription factor IIH, polypeptide 3, 34kDa, mRNA (cDNA clone IMAGE:5197419), partial cds. ACCESSION BC047868 VERSION BC047868.2 KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1460) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 1460) AUTHORS Strausberg,R. TITLE Direct Submission JOURNAL Submitted (03-MAR-2003) National Institutes of Health, Mammalian Gene Collection (MGC), Cancer Genomics Office, National Cancer Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Sep 16, 2003 this sequence version replaced BC047868.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Life Technologies, Inc. cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Sequencing Group at the Stanford Human Genome Center, Stanford University School of Medicine, Stanford, CA 94305 Web site: http://www-shgc.stanford.edu Contact: (Dickson, Mark) mcd@paxil.stanford.edu Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers, R. M. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 107 Row: d Column: 19 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 28376643. FEATURES Location/Qualifiers source 1..1460 /db_xref="H-InvDB:HIT000053260" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="IMAGE:5197419" /tissue_type="Brain, adult, 6 pooled whole brains" /clone_lib="NIH_MGC_114" /lab_host="DH10B" /note="Vector: pCMV-SPORT6" gene <1..1460 /gene="GTF2H3" /gene_synonym="BTF2" /gene_synonym="TFIIH" /db_xref="GeneID:2967" /db_xref="MIM:601750" CDS <1..914 /gene="GTF2H3" /gene_synonym="BTF2" /gene_synonym="TFIIH" /codon_start=3 /product="GTF2H3 protein" /protein_id="AAH47868.2" /db_xref="GeneID:2967" /db_xref="MIM:601750" /translation="DELNLLVIVVDANPIWWGKQALKESQFTLSKCIDAVMVLGNSHL FMNRSNKLAVIASHIQESRFLYPGKNGRLGDFFGDPGNPPEFNPSGSKDGKYELLTSA NEVIVEEIKDLMTKSDIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIK AAEDSALQYMNFMNVIFAAQKQNILIDACVLDSDSGLLQQACDITGGLYLKVPQMPSL LQYLLWVFLPDQDQRSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPI CTTCETAFKISLPPVLKAKKKKLKVSA" misc_feature 6..845 /gene="GTF2H3" /gene_synonym="BTF2" /gene_synonym="TFIIH" /note="Tfb4; Region: Transcription factor Tfb4" /db_xref="CDD:pfam03850" BASE COUNT 478 a 263 c 285 g 434 t ORIGIN 1 aagatgaatt gaatcttctg gttattgtag ttgatgccaa cccaatttgg tggggaaagc 61 aagcattaaa ggaatctcag ttcactttat ccaaatgcat agatgccgtg atggtgctgg 121 gaaattcgca tttattcatg aatcgttcca acaaacttgc tgtgatagca agtcacattc 181 aagaaagccg attcttatat cctggaaaga atggcagact tggagacttc ttcggagacc 241 ctggcaaccc tcctgaattt aatccctctg ggagtaaaga tggaaaatac gaacttttaa 301 cctcagcaaa tgaagttatt gttgaagaga ttaaagatct aatgaccaaa agtgacataa 361 agggtcaaca tacagaaact ttgctggcag gatccctggc caaagccctt tgctacattc 421 atagaatgaa caaggaagtt aaagacaatc aggaaatgaa atcaaggata ttggtgatta 481 aggctgcaga agacagtgcg ttgcagtata tgaacttcat gaatgtcatc tttgcagcac 541 agaaacagaa tattttgatt gatgcctgtg ttttagactc cgactcaggg ctcctccaac 601 aggcttgtga catcacggga ggactgtacc tgaaggtgcc tcagatgcct tctcttctgc 661 agtatttgct gtgggtgttt cttcccgatc aagatcagag atctcagtta atcctcccac 721 ccccagttca tgttgactac agggctgctt gcttctgtca tcgaaatctc attgaaattg 781 gttatgtctg ttctgtgtgt ttgtcaatat tctgcaattt cagccccatt tgtactacgt 841 gcgagacagc ctttaaaatt tctctgcctc cagtgctgaa agccaagaaa aagaaactga 901 aagtgtctgc ctgaggataa aatattttcc ccatctttta gagctgttaa tagaaattat 961 atagcagatt ctttgttggg aagactgaaa aaaataaaga taggtatagg ataattttta 1021 atatggtgac cttacagaaa atatttccca aacatccttt tcatcctgtg cttctggagg 1081 actgatttgt ttgagggaat cattctatgc attatatcct aaaatattct atgactggtt 1141 tctgtccatg tttgtggctt tcattttttt aatgggatga ctattagtca aagtcagctt 1201 gtcatgactc atcataggct ttctaaccta ctccctgaat ccgggtcctc attgtgaaat 1261 gcatgccata cgaaatttga acgtagcttt ggaaaaaggg actatttgtg gagtaatggc 1321 attaatcaac atagaacatc ttatttgaat caacagttaa cttcagtagt catgtgaata 1381 aaattcttat tgtctaaatt gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1441 aaaaaaaaaa aaaaaaaaaa //