LOCUS BC047868 1460 bp mRNA linear HUM 16-SEP-2003
DEFINITION Homo sapiens general transcription factor IIH, polypeptide 3,
34kDa, mRNA (cDNA clone IMAGE:5197419), partial cds.
ACCESSION BC047868
VERSION BC047868.2
KEYWORDS .
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 1460)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 1460)
AUTHORS Strausberg,R.
TITLE Direct Submission
JOURNAL Submitted (03-MAR-2003) National Institutes of Health, Mammalian
Gene Collection (MGC), Cancer Genomics Office, National Cancer
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Sep 16, 2003 this sequence version replaced BC047868.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: Life Technologies, Inc.
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Sequencing Group at the Stanford Human Genome
Center, Stanford University School of Medicine, Stanford, CA 94305
Web site: http://www-shgc.stanford.edu
Contact: (Dickson, Mark) mcd@paxil.stanford.edu
Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
R. M.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 107 Row: d Column: 19
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 28376643.
FEATURES Location/Qualifiers
source 1..1460
/db_xref="H-InvDB:HIT000053260"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="IMAGE:5197419"
/tissue_type="Brain, adult, 6 pooled whole brains"
/clone_lib="NIH_MGC_114"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene <1..1460
/gene="GTF2H3"
/gene_synonym="BTF2"
/gene_synonym="TFIIH"
/db_xref="GeneID:2967"
/db_xref="MIM:601750"
CDS <1..914
/gene="GTF2H3"
/gene_synonym="BTF2"
/gene_synonym="TFIIH"
/codon_start=3
/product="GTF2H3 protein"
/protein_id="AAH47868.2"
/db_xref="GeneID:2967"
/db_xref="MIM:601750"
/translation="DELNLLVIVVDANPIWWGKQALKESQFTLSKCIDAVMVLGNSHL
FMNRSNKLAVIASHIQESRFLYPGKNGRLGDFFGDPGNPPEFNPSGSKDGKYELLTSA
NEVIVEEIKDLMTKSDIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIK
AAEDSALQYMNFMNVIFAAQKQNILIDACVLDSDSGLLQQACDITGGLYLKVPQMPSL
LQYLLWVFLPDQDQRSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPI
CTTCETAFKISLPPVLKAKKKKLKVSA"
misc_feature 6..845
/gene="GTF2H3"
/gene_synonym="BTF2"
/gene_synonym="TFIIH"
/note="Tfb4; Region: Transcription factor Tfb4"
/db_xref="CDD:pfam03850"
BASE COUNT 478 a 263 c 285 g 434 t
ORIGIN
1 aagatgaatt gaatcttctg gttattgtag ttgatgccaa cccaatttgg tggggaaagc
61 aagcattaaa ggaatctcag ttcactttat ccaaatgcat agatgccgtg atggtgctgg
121 gaaattcgca tttattcatg aatcgttcca acaaacttgc tgtgatagca agtcacattc
181 aagaaagccg attcttatat cctggaaaga atggcagact tggagacttc ttcggagacc
241 ctggcaaccc tcctgaattt aatccctctg ggagtaaaga tggaaaatac gaacttttaa
301 cctcagcaaa tgaagttatt gttgaagaga ttaaagatct aatgaccaaa agtgacataa
361 agggtcaaca tacagaaact ttgctggcag gatccctggc caaagccctt tgctacattc
421 atagaatgaa caaggaagtt aaagacaatc aggaaatgaa atcaaggata ttggtgatta
481 aggctgcaga agacagtgcg ttgcagtata tgaacttcat gaatgtcatc tttgcagcac
541 agaaacagaa tattttgatt gatgcctgtg ttttagactc cgactcaggg ctcctccaac
601 aggcttgtga catcacggga ggactgtacc tgaaggtgcc tcagatgcct tctcttctgc
661 agtatttgct gtgggtgttt cttcccgatc aagatcagag atctcagtta atcctcccac
721 ccccagttca tgttgactac agggctgctt gcttctgtca tcgaaatctc attgaaattg
781 gttatgtctg ttctgtgtgt ttgtcaatat tctgcaattt cagccccatt tgtactacgt
841 gcgagacagc ctttaaaatt tctctgcctc cagtgctgaa agccaagaaa aagaaactga
901 aagtgtctgc ctgaggataa aatattttcc ccatctttta gagctgttaa tagaaattat
961 atagcagatt ctttgttggg aagactgaaa aaaataaaga taggtatagg ataattttta
1021 atatggtgac cttacagaaa atatttccca aacatccttt tcatcctgtg cttctggagg
1081 actgatttgt ttgagggaat cattctatgc attatatcct aaaatattct atgactggtt
1141 tctgtccatg tttgtggctt tcattttttt aatgggatga ctattagtca aagtcagctt
1201 gtcatgactc atcataggct ttctaaccta ctccctgaat ccgggtcctc attgtgaaat
1261 gcatgccata cgaaatttga acgtagcttt ggaaaaaggg actatttgtg gagtaatggc
1321 attaatcaac atagaacatc ttatttgaat caacagttaa cttcagtagt catgtgaata
1381 aaattcttat tgtctaaatt gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
1441 aaaaaaaaaa aaaaaaaaaa
//