LOCUS BC000120 1762 bp mRNA linear HUM 24-JUL-2006 DEFINITION Homo sapiens general transcription factor IIF, polypeptide 1, 74kDa, mRNA (cDNA clone MGC:1732 IMAGE:3352627), complete cds. ACCESSION BC000120 VERSION BC000120.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1762) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 1762) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (03-NOV-2000) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Genome Sequence Centre, BC Cancer Agency, Vancouver, BC, Canada info@bcgsc.bc.ca Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao, Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang, Jacquie Schein, Asim Siddiqui,Steven Jones, Rob Holt, Marco Marra. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 6 Row: k Column: 21 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 4504196. FEATURES Location/Qualifiers source 1..1762 /db_xref="H-InvDB:HIT000029360" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:1732 IMAGE:3352627" /tissue_type="Eye, retinoblastoma" /clone_lib="NIH_MGC_16" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..1762 /gene="GTF2F1" /gene_synonym="BTF4" /gene_synonym="RAP74" /gene_synonym="TF2F1" /gene_synonym="TFIIF" /db_xref="GeneID:2962" /db_xref="HGNC:HGNC:4652" /db_xref="MIM:189968" CDS 148..1701 /gene="GTF2F1" /gene_synonym="BTF4" /gene_synonym="RAP74" /gene_synonym="TF2F1" /gene_synonym="TFIIF" /codon_start=1 /product="general transcription factor IIF, polypeptide 1, 74kDa" /protein_id="AAH00120.1" /db_xref="GeneID:2962" /db_xref="HGNC:HGNC:4652" /db_xref="MIM:189968" /translation="MAALGPSSQNVTEYVVRVPKNTTKKYNIMAFNAADKVNFATWNQ ARLERDLSNKKIYQEEEMPESGAGSEFNRKLREEARRKKYGIVLKEFRPEDQPWLLRV NGKSGRKFKGIKKGGVTENTSYYIFTQCPDGAFEAFPVHNWYNFTPLARHRTLTAEEA EEEWERRNKVLNHFSIMQQRRLKDQDQDEDEEEKEKRGRRKASELRIHDLEDDLEMSS DASDASGEEGGRVPKAKKKAPLAKGGRKKKKKKGSDDEAFEDSDDGDFEGQEVDYMSD GSSSSQEEPESKAKAPQQEEGPKGVDEQSDSSEESEEEKPPEEDKEEEEEKKAPTPQE KKRRKDSSEESDSSEESDIDSEASSALFMAKKKTPPKRERKPSGGSSRGNSRPGTPSA EGGSTSSTLRAAASKLEQGKRVSEMPAAKRLRLDTGPQSLSGKSTPQPPSGKTTPNSG DVQVTEDAVRRYLTRKPMTTKDLLKKFQTKKTGLSSEQTVNVLAQILKRLNPERKMIN DKMHFSLKE" BASE COUNT 502 a 474 c 579 g 207 t ORIGIN 1 cagccagagg cgcctagggt tggggtcctc gctcaggcac agagacccga caccgagcgg 61 cggcttcccc gggatcgagg gacgcgcacg ccagaggaga cgaaaggaac ccgggtcgga 121 ccagatcgga accactgacc attgcccatg gcggccctag gccctagcag ccagaatgtc 181 actgaatacg tcgttcgagt tcctaagaat acaaccaaaa aatataacat catggctttt 241 aatgcagccg acaaagtcaa ctttgctacg tggaatcagg ctcggctgga gcgggacttg 301 agcaacaaga aaatctacca agaggaggag atgcccgaat cgggcgcggg cagtgagttc 361 aaccgcaagc ttcgggagga ggctcggagg aagaagtacg gcatcgtcct caaggagttc 421 cggcccgagg accagccctg gctgctccgg gtcaacggca aatcaggcag gaagttcaag 481 ggcatcaaga agggaggcgt aacagagaac acgtcctact acatcttcac ccagtgcccc 541 gacggggcct tcgaggcctt ccccgtgcac aactggtaca atttcacacc gctggcccgg 601 catcgcacgc tcactgccga ggaggccgag gaggagtggg agaggaggaa caaggtgctg 661 aaccacttca gcatcatgca gcagcggcgg ctcaaggatc aggaccagga cgaggatgag 721 gaggagaagg agaaacgtgg ccgcaggaag gcgagcgagc tgcgcatcca cgacctggag 781 gacgacctgg agatgtcgtc cgatgccagt gatgccagtg gtgaggaggg gggcagagtc 841 cccaaggcca agaagaaggc gccgctggcc aagggcggca ggaaaaagaa gaagaagaag 901 ggttcagacg acgaggcctt cgaggacagc gatgatgggg acttcgaggg ccaagaggtg 961 gactacatgt cagacggctc cagtagctcc caagaagagc ctgagagcaa ggccaaggcg 1021 ccgcagcagg aggaggggcc caagggtgtc gatgagcaga gcgacagtag tgaggagagt 1081 gaggaggaga agccgcctga ggaggacaag gaggaggagg aggagaagaa ggcacccacc 1141 ccgcaggaga agaagcgcag gaaagacagc agcgaggagt cggacagctc agaggagagc 1201 gacattgaca gcgaggcctc ctcagccctc ttcatggcga agaagaagac gccacccaag 1261 agagagcgga agccgtcggg agggagctca aggggcaaca gccgcccagg cacgcccagc 1321 gcagagggtg gcagcacctc ctccaccctg cgggcggctg ccagcaaact cgagcaaggg 1381 aagcgggtga gcgagatgcc tgcagccaag cggttgcggc tggacacggg accccagagc 1441 ctgtctggga agtcgacacc ccagccacca tcaggcaaga caacacccaa cagcggcgac 1501 gtgcaggtga ctgaggatgc cgtgcgccgc tacctgacac ggaagcccat gaccactaag 1561 gacctgctga aaaagttcca gaccaagaag acagggctga gcagcgagca gacagtgaac 1621 gtgttggccc agatcctcaa gcgactcaac cccgagcgca agatgatcaa cgacaaaatg 1681 cacttctccc tcaaggagtg aggcttggtc caatacatgg ctctgccccc caaaaaaaaa 1741 aaaaaaaaaa aaaaaaaaaa aa //