LOCUS BC013007 2433 bp mRNA linear HUM 15-JUL-2006 DEFINITION Homo sapiens general transcription factor IIF, polypeptide 1, 74kDa, mRNA (cDNA clone MGC:4144 IMAGE:3009685), complete cds. ACCESSION BC013007 VERSION BC013007.2 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2433) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2433) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (20-AUG-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Aug 19, 2003 this sequence version replaced BC013007.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Institute for Systems Biology http://www.systemsbiology.org contact: amadan@systemsbiology.org Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 10 Row: g Column: 14 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 4504196. FEATURES Location/Qualifiers source 1..2433 /db_xref="H-InvDB:HIT000036077" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:4144 IMAGE:3009685" /tissue_type="Muscle, rhabdomyosarcoma" /clone_lib="NIH_MGC_17" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..2433 /gene="GTF2F1" /gene_synonym="BTF4" /gene_synonym="RAP74" /gene_synonym="TF2F1" /gene_synonym="TFIIF" /db_xref="GeneID:2962" /db_xref="HGNC:HGNC:4652" /db_xref="MIM:189968" CDS 148..1701 /gene="GTF2F1" /gene_synonym="BTF4" /gene_synonym="RAP74" /gene_synonym="TF2F1" /gene_synonym="TFIIF" /codon_start=1 /product="general transcription factor IIF, polypeptide 1, 74kDa" /protein_id="AAH13007.1" /db_xref="GeneID:2962" /db_xref="HGNC:HGNC:4652" /db_xref="MIM:189968" /translation="MAALGPSSQNVTEYVVRVPKNTTKKYNIMAFNAADKVNFATWNQ ARLERDLSNKKIYQEEEMPESGAGSEFNRKLREEARRKKYGIVLKEFRPEDQPWLLRV NGKSGRKFKGIKKGGVTENTSYYIFTQCPDGAFEAFPVHNWYNFTPLARHRTLTAEEA EEEWERRNKVLNHFSIMQQRRLKDQDQDEDEEEKEKRGRRKASELRIHDLEDDLEMSS DASDASGEEGGRVPKAKKKAPLAKGGRKKKKKKGSDDEAFEDSDDGDFEGQEVDYMSD GSSSSQEEPESKAKAPQQEEGPKGVDEQSDSSEESEEEKPPEEDKEEEEEKKAPTPQE KKRRKDSSEESDSSEESDIDSEASSALFMAKKKTPPKRERKPSGGSSRGNSRPGTPSA EGGSTSSTLRAAASKLEQGKRVSEMPAAKRLRLDTGPQSLSGKSTPQPPSGKTTPNSG DVQVTEDAVRRYLTRKPMTTKDLLKKFQTKKTGLSSEQTVNVLAQILKRLNPERKMIN DKMHFSLKE" BASE COUNT 643 a 673 c 736 g 381 t ORIGIN 1 gtgccagagg cgcctagggt tggggtcctc gctcaggcac agagacccga caccgagcgg 61 cggcttcccc gggatcgagg gacgcgcacg ccagaggaga cgaaaggaac ccgggtcgga 121 ccagatcgga accactgacc attgcccatg gcggccctag gccctagcag ccagaatgtc 181 actgaatacg tcgttcgagt tcctaagaat acaaccaaaa aatataacat catggctttt 241 aatgcagccg acaaagtcaa ctttgctacg tggaatcagg ctcggctgga gcgggacttg 301 agcaacaaga aaatctacca agaggaggag atgcccgaat cgggcgcggg cagtgagttc 361 aaccgcaagc ttcgggagga ggctcggagg aagaagtacg gcatcgtcct caaggagttc 421 cggcccgagg accagccctg gctgctccgg gtcaacggca aatcaggcag gaagttcaag 481 ggcatcaaga agggaggcgt aacagagaac acgtcctact acatcttcac ccagtgcccc 541 gacggggcct tcgaggcctt ccccgtgcac aactggtaca atttcacacc gctggcccgg 601 catcgcacgc tcactgccga ggaggccgag gaggagtggg agaggaggaa caaggtgctg 661 aaccacttca gcatcatgca gcagcggcgg ctcaaggatc aggaccagga cgaggatgag 721 gaggagaagg agaaacgtgg ccgcaggaag gcgagcgagc tgcgcatcca cgacctggag 781 gacgacctgg agatgtcgtc cgatgccagt gatgccagtg gtgaggaggg gggcagagtc 841 cccaaggcca agaagaaggc gccgctggcc aagggcggca ggaaaaagaa gaagaagaag 901 ggttcagacg acgaggcctt cgaggacagc gatgatgggg acttcgaggg ccaagaggta 961 gactacatgt cagacggctc cagtagctcc caagaagagc ctgagagcaa ggccaaggcg 1021 ccgcagcagg aggaggggcc caagggtgtc gatgagcaga gcgacagtag tgaggagagt 1081 gaggaggaga agccgcctga ggaggacaag gaggaggagg aggagaagaa ggcacccacc 1141 ccgcaggaga agaagcgcag gaaagacagc agcgaggagt cggacagctc agaggagagc 1201 gacattgaca gcgaggcctc ctcagccctc ttcatggcga agaagaagac gccacccaag 1261 agagagcgga agccgtcggg agggagctca aggggcaaca gccgcccagg cacgcccagc 1321 gcagagggtg gcagcacctc ctccaccctg cgggcggctg ccagcaaact cgagcaaggg 1381 aagcgggtga gcgagatgcc tgcagccaag cggttgcggc tggacacggg accccagagc 1441 ctgtctggga agtcgacacc ccagccacca tcaggcaaga caacacccaa cagcggcgac 1501 gtgcaggtga ctgaggatgc cgtgcgccgc tacctgacac ggaagcccat gaccactaag 1561 gacctgctga aaaagttcca gaccaagaag acagggctga gcagcgagca gacagtgaac 1621 gtgttggccc agatcctcaa gcgactcaac cccgagcgca agatgatcaa cgacaaaatg 1681 cacttctccc tcaaggagtg aggcttggtc caatacatgg ctctgccccc cagaacttaa 1741 ggctctcact gccccttcgc catcctagag tgaggctctg tccaatacat ggctctgccc 1801 tccagaactt caggctctca gtgacccttc gacatcctgc ttgctccctg actcccaggg 1861 ccccgtagtt agcaattctg gaaaagttaa gccatctcct cctctggccc ttccttctgg 1921 aatcttcaga tgcctgttag gccttcttat tgtcctcctc ctcctggctc ggcctccctc 1981 acactgacca agggcctgtg ctgcccactg ggtaacttct acagttctcc cttccacttc 2041 cctaagtctc tcttcagctg tgacttatcc accacatagc ccagtaagtc ttcagttttg 2101 gctgggcatg atggtgagcg cctgcagtcc cagctacttg ggaggctaag cccagttcaa 2161 ggctgcagtg aactatgatg gtgccactgc attccagcct gggtgacaga atgaaatcct 2221 ggcacaaaaa aaaaaaaaag tagccaggca tggtggcggg agcctgttgt cccagctgtt 2281 ccgtaggctg aggcacgaga ttcacttgaa cctgggaggt ggaggttgct gtgagctgac 2341 accatgccac tgcactccag cctgggtgac agtgagactc tgtctcaata aataaaaaat 2401 aataataaat tgaaaaaaaa aaaaaaaaaa aaa //