LOCUS BC000365 2989 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens general transcription factor IIH, polypeptide 1,
62kDa, mRNA (cDNA clone MGC:8323 IMAGE:2819217), complete cds.
ACCESSION BC000365
VERSION BC000365.2
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2989)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2989)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (15-NOV-2000) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Aug 20, 2003 this sequence version replaced BC000365.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: DCTD/DTP
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: National Institutes of Health Intramural
Sequencing Center (NISC),
Gaithersburg, Maryland;
Web site: http://www.nisc.nih.gov/
Contact: nisc_mgc@nhgri.nih.gov
Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
Young,A., Zhang,L.-H. and Green,E.D.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 1 Row: a Column: 2
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 19923304.
FEATURES Location/Qualifiers
source 1..2989
/db_xref="H-InvDB:HIT000029556"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:8323 IMAGE:2819217"
/tissue_type="Lung, small cell carcinoma"
/clone_lib="NIH_MGC_7"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..2989
/gene="GTF2H1"
/gene_synonym="BTF2"
/gene_synonym="TFIIH"
/db_xref="GeneID:2965"
/db_xref="HGNC:HGNC:4655"
/db_xref="MIM:189972"
CDS 161..1807
/gene="GTF2H1"
/gene_synonym="BTF2"
/gene_synonym="TFIIH"
/codon_start=1
/product="general transcription factor IIH, polypeptide 1,
62kDa"
/protein_id="AAH00365.1"
/db_xref="GeneID:2965"
/db_xref="HGNC:HGNC:4655"
/db_xref="MIM:189972"
/translation="MATSSEEVLLIVKKVRQKKQDGALYLMAERIAWAPEGKDRFTIS
HMYADIKCQKISPEGKAKIQLQLVLHAGDTTNFHFSNESTAVKERDAVKDLLQQLLPK
FKRKANKELEEKNRMLQEDPVLFQLYKDLVVSQVISAEEFWANRLNVNATDSSSTSNH
KQDVGISAAFLADVRPQTDGCNGLRYNLTSDIIESIFRTYPAVKMKYAENVPHNMTEK
EFWTRFFQSHYFHRDRLNTGSKDLFAECAKIDEKGLKTMVSLGVKNPLLDLTALEDKP
LDEGYGISSVPSASNSKSIKENSNAAIIKRFNHHSAMVLAAGLRKQEAQNEQTSEPSN
MDGNSGDADCFQPAVKRAKLQESIEYEDLGKNNSVKTIALNLKKSDRYYHGPTPIQSL
QYATSQDIINSFQSIRQEMEAYTPKLTQVLSSSAASSTITALSPGGALMQGGTQQAIN
QMVPNDIQSELKHLYVAVGELLRHFWSCFPVNTPFLEEKVVKMKSNLERFQVTKLCPF
QEKIRRQYLSTNLVSHIEEMLQTAYNKLHTWQSRRLMKKT"
BASE COUNT 1004 a 567 c 604 g 814 t
ORIGIN
1 gaccccctag taacagaggc ggtggctact gctgcggcca ctgggtttcg gcctcttccc
61 agcagcggct ctaagaagcg cagcggaact cgaccggatc caacccagtt agttacttcc
121 tgtctagagt tgtagcttcc acctgcacct tctagccacc atggcaacct catctgaaga
181 agttttgctg attgtaaaga aagtgcgtca aaagaagcag gatggagctc tgtacctcat
241 ggcagaaaga attgcttggg cacctgaagg caaagataga tttacaatca gccatatgta
301 tgcagatatt aaatgccaga aaattagtcc agaaggaaaa gctaaaattc agcttcagct
361 ggtcctacat gcaggggaca caactaactt ccatttttcc aatgaaagca cagcagtgaa
421 agagcgagat gcagtaaaag accttcttca gcagctgctg cccaaattca agaggaaagc
481 aaataaagaa ctggaagaga agaacagaat gctgcaagaa gatcctgttt tgtttcagct
541 ttataaagac cttgttgtga gtcaagtgat cagtgctgag gaattctggg ccaatcgttt
601 aaatgtgaat gcaacagata gttcttccac atccaatcat aagcaggatg ttggcatttc
661 tgctgcattt ctggctgatg tccggcccca aactgatggc tgtaacggtc taagatataa
721 tttaacttct gatatcattg agtccatatt taggacctat ccagcagtaa aaatgaaata
781 tgcagaaaat gttccccaca acatgacaga gaaggaattc tggacacgtt ttttccagtc
841 ccattatttt cacagggatc ggctgaatac agggtcaaag gatctctttg cagaatgtgc
901 caaaatagat gaaaaaggcc taaaaacaat ggtttcatta ggagtgaaaa acccactact
961 agatttaaca gctttggaag ataaaccatt agatgagggc tatggcattt cctctgtgcc
1021 atctgcttcc aattctaaat ccataaaaga gaatagtaat gctgccatca tcaagagatt
1081 taaccatcac agtgccatgg tcctggcagc tggactcaga aaacaagaag cacaaaatga
1141 acaaactagt gagcccagca acatggatgg aaattccgga gatgcagact gctttcagcc
1201 agcagtcaaa agggcgaaat tacaagagtc cattgaatat gaagacttgg ggaaaaataa
1261 ttctgtaaaa acgattgcac taaacctcaa gaagtcagat aggtattatc atggtccaac
1321 tccaatccag tcactacagt atgcaacaag tcaggacatt attaattctt ttcaaagtat
1381 tagacaagaa atggaagctt atacacccaa gttaactcag gttctctcaa gtagtgctgc
1441 cagtagtacc atcacagcac tgtcacctgg aggggcactt atgcagggag gaacacagca
1501 agccataaac cagatggtgc caaatgatat tcaatctgaa ttgaaacact tatatgtagc
1561 tgttggagaa cttctacgac atttctggtc ctgctttcct gttaatacgc cattcctaga
1621 agaaaaggta gtgaaaatga aaagtaattt ggaacgattc caagttacga agctctgtcc
1681 attccaagaa aagattcgga gacagtattt aagcacaaat ttggtaagtc acatagaaga
1741 gatgctccag acagcctaca acaagctcca cacatggcag tcacggcgtc tgatgaagaa
1801 aacgtgaggt ggccatgatg cttacaggtt ttgtgagatt gagagaacta tgacctgcag
1861 caactctgga aacctggcct gacagacaag cagatgacct cacaggagtg ataagaaaca
1921 tctgctccac gccaactccc agagctgatg ctattgtact tgcacattgg agactgaaag
1981 gaaagaaggg actaaatgct ggggaggtaa attaagacag aaccaaatga gctaagttgc
2041 aaatatatat atatacacac acacacatat atgtacatgt gtatgtacat atatatttta
2101 aaagactgtt tactgcagtt gctcaggaac tgcttttgat tcacattaag ctgctttcag
2161 aaattaaaaa aacacttttt aaagggtgca ttgataaaat ctgaggtttt ttggttgtcg
2221 tttttttctg tgtacatttt tttcctaagt ttatggcaca gggtagacct taagtattcc
2281 tcctccatcc ttcattcttc accctccatt ggatcctcaa gttttaatga attccaatta
2341 taccttacat cagcaagtta aaaaaagtac tttaaaataa agcaaaggga gactgttgct
2401 caaccatcag gaaacagttg tcagaagaca tcattggttc tgtgtttcct acggaaataa
2461 gaaacgataa atattgcact gaatgtttgt ggtttggagt ccctgaataa taaagaggga
2521 atatatttgc agaaagtcgc atagggtttt ttaatgcaga attttgtcag aagacaatgg
2581 cgctgcatgt ttttctttga gtgcaaatgt acattgctaa gattttttta agatggcatg
2641 tgctttgaaa agaagatatt gcatttttaa gagtttaaaa atcttatgag tgagaaatat
2701 taaaaaaatc ttattttcac ctctttagaa gaaataaaag atgtttctcc tatctccttt
2761 tctctagtat ttgactgtta ctgtccttgg cgaatcgata atcattgcat agtgactgaa
2821 aagcctaagt gcaaaaaaaa aaaaaaaaag atgttcttgt ttctgaactt cgtgccatat
2881 tttgttcctg atgggatcaa cttaatgttt aagactttag atgtcttgta ttaaaaatta
2941 cacaaaaaaa gtaaaacttt ttatacttaa aaaaaaaaaa aaaaaaaaa
//