LOCUS BC013007 2433 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens general transcription factor IIF, polypeptide 1,
74kDa, mRNA (cDNA clone MGC:4144 IMAGE:3009685), complete cds.
ACCESSION BC013007
VERSION BC013007.2
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2433)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2433)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (20-AUG-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Aug 19, 2003 this sequence version replaced BC013007.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Institute for Systems Biology
http://www.systemsbiology.org
contact: amadan@systemsbiology.org
Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 10 Row: g Column: 14
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 4504196.
FEATURES Location/Qualifiers
source 1..2433
/db_xref="H-InvDB:HIT000036077"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:4144 IMAGE:3009685"
/tissue_type="Muscle, rhabdomyosarcoma"
/clone_lib="NIH_MGC_17"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..2433
/gene="GTF2F1"
/gene_synonym="BTF4"
/gene_synonym="RAP74"
/gene_synonym="TF2F1"
/gene_synonym="TFIIF"
/db_xref="GeneID:2962"
/db_xref="HGNC:HGNC:4652"
/db_xref="MIM:189968"
CDS 148..1701
/gene="GTF2F1"
/gene_synonym="BTF4"
/gene_synonym="RAP74"
/gene_synonym="TF2F1"
/gene_synonym="TFIIF"
/codon_start=1
/product="general transcription factor IIF, polypeptide 1,
74kDa"
/protein_id="AAH13007.1"
/db_xref="GeneID:2962"
/db_xref="HGNC:HGNC:4652"
/db_xref="MIM:189968"
/translation="MAALGPSSQNVTEYVVRVPKNTTKKYNIMAFNAADKVNFATWNQ
ARLERDLSNKKIYQEEEMPESGAGSEFNRKLREEARRKKYGIVLKEFRPEDQPWLLRV
NGKSGRKFKGIKKGGVTENTSYYIFTQCPDGAFEAFPVHNWYNFTPLARHRTLTAEEA
EEEWERRNKVLNHFSIMQQRRLKDQDQDEDEEEKEKRGRRKASELRIHDLEDDLEMSS
DASDASGEEGGRVPKAKKKAPLAKGGRKKKKKKGSDDEAFEDSDDGDFEGQEVDYMSD
GSSSSQEEPESKAKAPQQEEGPKGVDEQSDSSEESEEEKPPEEDKEEEEEKKAPTPQE
KKRRKDSSEESDSSEESDIDSEASSALFMAKKKTPPKRERKPSGGSSRGNSRPGTPSA
EGGSTSSTLRAAASKLEQGKRVSEMPAAKRLRLDTGPQSLSGKSTPQPPSGKTTPNSG
DVQVTEDAVRRYLTRKPMTTKDLLKKFQTKKTGLSSEQTVNVLAQILKRLNPERKMIN
DKMHFSLKE"
BASE COUNT 643 a 673 c 736 g 381 t
ORIGIN
1 gtgccagagg cgcctagggt tggggtcctc gctcaggcac agagacccga caccgagcgg
61 cggcttcccc gggatcgagg gacgcgcacg ccagaggaga cgaaaggaac ccgggtcgga
121 ccagatcgga accactgacc attgcccatg gcggccctag gccctagcag ccagaatgtc
181 actgaatacg tcgttcgagt tcctaagaat acaaccaaaa aatataacat catggctttt
241 aatgcagccg acaaagtcaa ctttgctacg tggaatcagg ctcggctgga gcgggacttg
301 agcaacaaga aaatctacca agaggaggag atgcccgaat cgggcgcggg cagtgagttc
361 aaccgcaagc ttcgggagga ggctcggagg aagaagtacg gcatcgtcct caaggagttc
421 cggcccgagg accagccctg gctgctccgg gtcaacggca aatcaggcag gaagttcaag
481 ggcatcaaga agggaggcgt aacagagaac acgtcctact acatcttcac ccagtgcccc
541 gacggggcct tcgaggcctt ccccgtgcac aactggtaca atttcacacc gctggcccgg
601 catcgcacgc tcactgccga ggaggccgag gaggagtggg agaggaggaa caaggtgctg
661 aaccacttca gcatcatgca gcagcggcgg ctcaaggatc aggaccagga cgaggatgag
721 gaggagaagg agaaacgtgg ccgcaggaag gcgagcgagc tgcgcatcca cgacctggag
781 gacgacctgg agatgtcgtc cgatgccagt gatgccagtg gtgaggaggg gggcagagtc
841 cccaaggcca agaagaaggc gccgctggcc aagggcggca ggaaaaagaa gaagaagaag
901 ggttcagacg acgaggcctt cgaggacagc gatgatgggg acttcgaggg ccaagaggta
961 gactacatgt cagacggctc cagtagctcc caagaagagc ctgagagcaa ggccaaggcg
1021 ccgcagcagg aggaggggcc caagggtgtc gatgagcaga gcgacagtag tgaggagagt
1081 gaggaggaga agccgcctga ggaggacaag gaggaggagg aggagaagaa ggcacccacc
1141 ccgcaggaga agaagcgcag gaaagacagc agcgaggagt cggacagctc agaggagagc
1201 gacattgaca gcgaggcctc ctcagccctc ttcatggcga agaagaagac gccacccaag
1261 agagagcgga agccgtcggg agggagctca aggggcaaca gccgcccagg cacgcccagc
1321 gcagagggtg gcagcacctc ctccaccctg cgggcggctg ccagcaaact cgagcaaggg
1381 aagcgggtga gcgagatgcc tgcagccaag cggttgcggc tggacacggg accccagagc
1441 ctgtctggga agtcgacacc ccagccacca tcaggcaaga caacacccaa cagcggcgac
1501 gtgcaggtga ctgaggatgc cgtgcgccgc tacctgacac ggaagcccat gaccactaag
1561 gacctgctga aaaagttcca gaccaagaag acagggctga gcagcgagca gacagtgaac
1621 gtgttggccc agatcctcaa gcgactcaac cccgagcgca agatgatcaa cgacaaaatg
1681 cacttctccc tcaaggagtg aggcttggtc caatacatgg ctctgccccc cagaacttaa
1741 ggctctcact gccccttcgc catcctagag tgaggctctg tccaatacat ggctctgccc
1801 tccagaactt caggctctca gtgacccttc gacatcctgc ttgctccctg actcccaggg
1861 ccccgtagtt agcaattctg gaaaagttaa gccatctcct cctctggccc ttccttctgg
1921 aatcttcaga tgcctgttag gccttcttat tgtcctcctc ctcctggctc ggcctccctc
1981 acactgacca agggcctgtg ctgcccactg ggtaacttct acagttctcc cttccacttc
2041 cctaagtctc tcttcagctg tgacttatcc accacatagc ccagtaagtc ttcagttttg
2101 gctgggcatg atggtgagcg cctgcagtcc cagctacttg ggaggctaag cccagttcaa
2161 ggctgcagtg aactatgatg gtgccactgc attccagcct gggtgacaga atgaaatcct
2221 ggcacaaaaa aaaaaaaaag tagccaggca tggtggcggg agcctgttgt cccagctgtt
2281 ccgtaggctg aggcacgaga ttcacttgaa cctgggaggt ggaggttgct gtgagctgac
2341 accatgccac tgcactccag cctgggtgac agtgagactc tgtctcaata aataaaaaat
2401 aataataaat tgaaaaaaaa aaaaaaaaaa aaa
//