LOCUS BC000120 1762 bp mRNA linear HUM 24-JUL-2006
DEFINITION Homo sapiens general transcription factor IIF, polypeptide 1,
74kDa, mRNA (cDNA clone MGC:1732 IMAGE:3352627), complete cds.
ACCESSION BC000120
VERSION BC000120.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 1762)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 1762)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (03-NOV-2000) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Genome Sequence Centre,
BC Cancer Agency, Vancouver, BC, Canada
info@bcgsc.bc.ca
Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson
Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen
Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel
Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave
Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth
Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao,
Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR
Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang,
Jacquie Schein, Asim Siddiqui,Steven Jones, Rob Holt, Marco Marra.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 6 Row: k Column: 21
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 4504196.
FEATURES Location/Qualifiers
source 1..1762
/db_xref="H-InvDB:HIT000029360"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:1732 IMAGE:3352627"
/tissue_type="Eye, retinoblastoma"
/clone_lib="NIH_MGC_16"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..1762
/gene="GTF2F1"
/gene_synonym="BTF4"
/gene_synonym="RAP74"
/gene_synonym="TF2F1"
/gene_synonym="TFIIF"
/db_xref="GeneID:2962"
/db_xref="HGNC:HGNC:4652"
/db_xref="MIM:189968"
CDS 148..1701
/gene="GTF2F1"
/gene_synonym="BTF4"
/gene_synonym="RAP74"
/gene_synonym="TF2F1"
/gene_synonym="TFIIF"
/codon_start=1
/product="general transcription factor IIF, polypeptide 1,
74kDa"
/protein_id="AAH00120.1"
/db_xref="GeneID:2962"
/db_xref="HGNC:HGNC:4652"
/db_xref="MIM:189968"
/translation="MAALGPSSQNVTEYVVRVPKNTTKKYNIMAFNAADKVNFATWNQ
ARLERDLSNKKIYQEEEMPESGAGSEFNRKLREEARRKKYGIVLKEFRPEDQPWLLRV
NGKSGRKFKGIKKGGVTENTSYYIFTQCPDGAFEAFPVHNWYNFTPLARHRTLTAEEA
EEEWERRNKVLNHFSIMQQRRLKDQDQDEDEEEKEKRGRRKASELRIHDLEDDLEMSS
DASDASGEEGGRVPKAKKKAPLAKGGRKKKKKKGSDDEAFEDSDDGDFEGQEVDYMSD
GSSSSQEEPESKAKAPQQEEGPKGVDEQSDSSEESEEEKPPEEDKEEEEEKKAPTPQE
KKRRKDSSEESDSSEESDIDSEASSALFMAKKKTPPKRERKPSGGSSRGNSRPGTPSA
EGGSTSSTLRAAASKLEQGKRVSEMPAAKRLRLDTGPQSLSGKSTPQPPSGKTTPNSG
DVQVTEDAVRRYLTRKPMTTKDLLKKFQTKKTGLSSEQTVNVLAQILKRLNPERKMIN
DKMHFSLKE"
BASE COUNT 502 a 474 c 579 g 207 t
ORIGIN
1 cagccagagg cgcctagggt tggggtcctc gctcaggcac agagacccga caccgagcgg
61 cggcttcccc gggatcgagg gacgcgcacg ccagaggaga cgaaaggaac ccgggtcgga
121 ccagatcgga accactgacc attgcccatg gcggccctag gccctagcag ccagaatgtc
181 actgaatacg tcgttcgagt tcctaagaat acaaccaaaa aatataacat catggctttt
241 aatgcagccg acaaagtcaa ctttgctacg tggaatcagg ctcggctgga gcgggacttg
301 agcaacaaga aaatctacca agaggaggag atgcccgaat cgggcgcggg cagtgagttc
361 aaccgcaagc ttcgggagga ggctcggagg aagaagtacg gcatcgtcct caaggagttc
421 cggcccgagg accagccctg gctgctccgg gtcaacggca aatcaggcag gaagttcaag
481 ggcatcaaga agggaggcgt aacagagaac acgtcctact acatcttcac ccagtgcccc
541 gacggggcct tcgaggcctt ccccgtgcac aactggtaca atttcacacc gctggcccgg
601 catcgcacgc tcactgccga ggaggccgag gaggagtggg agaggaggaa caaggtgctg
661 aaccacttca gcatcatgca gcagcggcgg ctcaaggatc aggaccagga cgaggatgag
721 gaggagaagg agaaacgtgg ccgcaggaag gcgagcgagc tgcgcatcca cgacctggag
781 gacgacctgg agatgtcgtc cgatgccagt gatgccagtg gtgaggaggg gggcagagtc
841 cccaaggcca agaagaaggc gccgctggcc aagggcggca ggaaaaagaa gaagaagaag
901 ggttcagacg acgaggcctt cgaggacagc gatgatgggg acttcgaggg ccaagaggtg
961 gactacatgt cagacggctc cagtagctcc caagaagagc ctgagagcaa ggccaaggcg
1021 ccgcagcagg aggaggggcc caagggtgtc gatgagcaga gcgacagtag tgaggagagt
1081 gaggaggaga agccgcctga ggaggacaag gaggaggagg aggagaagaa ggcacccacc
1141 ccgcaggaga agaagcgcag gaaagacagc agcgaggagt cggacagctc agaggagagc
1201 gacattgaca gcgaggcctc ctcagccctc ttcatggcga agaagaagac gccacccaag
1261 agagagcgga agccgtcggg agggagctca aggggcaaca gccgcccagg cacgcccagc
1321 gcagagggtg gcagcacctc ctccaccctg cgggcggctg ccagcaaact cgagcaaggg
1381 aagcgggtga gcgagatgcc tgcagccaag cggttgcggc tggacacggg accccagagc
1441 ctgtctggga agtcgacacc ccagccacca tcaggcaaga caacacccaa cagcggcgac
1501 gtgcaggtga ctgaggatgc cgtgcgccgc tacctgacac ggaagcccat gaccactaag
1561 gacctgctga aaaagttcca gaccaagaag acagggctga gcagcgagca gacagtgaac
1621 gtgttggccc agatcctcaa gcgactcaac cccgagcgca agatgatcaa cgacaaaatg
1681 cacttctccc tcaaggagtg aggcttggtc caatacatgg ctctgccccc caaaaaaaaa
1741 aaaaaaaaaa aaaaaaaaaa aa
//