LOCUS BC136348 3191 bp mRNA linear HUM 12-MAY-2008 DEFINITION Homo sapiens TAF5 RNA polymerase II, TATA box binding protein (TBP)-associated factor, 100kDa, mRNA (cDNA clone MGC:167958 IMAGE:9020335), complete cds. ACCESSION BC136348 VERSION BC136348.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3191) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 3191) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (29-MAR-2007) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Mike Brownstein cDNA Library Preparation: British Columbia Cancer Research Center cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Genome Sequence Centre, BC Cancer Agency, Vancouver, BC, Canada info@bcgsc.bc.ca Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao, Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang, Jacquie Schein, Asim Siddiqui,Steven Jones, Rob Holt, Marco Marra. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: LLDM Plate: 670 Row: e Column: 5. FEATURES Location/Qualifiers source 1..3191 /db_xref="H-InvDB:HIT000500373" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:167958 IMAGE:9020335" /tissue_type="Testis, PCR rescued clones" /clone_lib="NIH_MGC_374" /lab_host="DH10B" /note="Vector: pCR4-TOPO with reversed insert; Clone identification sequence tag: TGGGCAAC" gene 1..3191 /gene="TAF5" /gene_synonym="TAFII100" /db_xref="GeneID:6877" /db_xref="HGNC:HGNC:11539" /db_xref="MIM:601787" CDS 19..2421 /gene="TAF5" /gene_synonym="TAFII100" /codon_start=1 /product="TAF5 RNA polymerase II, TATA box binding protein (TBP)-associated factor, 100kDa" /protein_id="AAI36349.1" /db_xref="GeneID:6877" /db_xref="HGNC:HGNC:11539" /db_xref="MIM:601787" /translation="MAALAEEQTEVAVKLEPEGPPTLLPPQAGDGAGEGSGGTTNNGP NGGGGNVAASSSTGGDGGTPKPTVAVSAAAPAGAAPVPAAAPDAGAPHDRQTLLAVLQ FLRQSKLREAEEALRREAGLLEEAVAGAGAPGEVDSAGAEVTSALLSRVTASAPGPAA PDPPGTGASGATVVSGSASGPAAPGKVGSVAVEDQPDVSAVLSAYNQQGDPTMYEEYY SGLKHFIECSLDCHRAELSQLFYPLFVHMYLELVYNQHENEAKSFFEKFHGDQECYYQ DDLRVLSSLTKKEHMKGNETMLDFRTSKFVLRISRDSYQLLKRHLQEKQNNQIWNIVQ EHLYIDIFDGMPRSKQQIDAMVGSLAGEAKREANKSKVFFGLLKEPEIEVPLDDEDEE GENEEGKPKKKKPKKDSIGSKSKKQDPNAPPQNRIPLPELKDSDKLDKIMNMKETTKR VRLGPDCLPSICFYTFLNAYQGLTAVDVTDDSSLIAGGFADSTVRVWSVTPKKLRSVK QASDLSLIDKESDDVLERIMDEKTASELKILYGHSGPVYGASFSPDRNYLLSSSEDGT VRLWSLQTFTCLVGYKGHNYPVWDTQFSPYGYYFVSGGHDRVARLWATDHYQPLRIFA GHLADVNCTRFHPNSNYVATGSADRTVRLWDVLNGNCVRIFTGHKGPIHSLTFSPNGR FLATGATDGRVLLWDIGHGLMVGELKGHTDTVCSLRFSRDGEILASGSMDNTVRLWDA IKAFEDLETDDFTTATGHINLPENSQELLLGTYMTKSTPVVHLHFTRRNLVLAAGAYS PQ" BASE COUNT 888 a 684 c 796 g 823 t ORIGIN 1 aggtggctca gccgcaagat ggcggcgctg gcggaggagc agacggaggt ggcggtcaag 61 ctagagcctg agggaccgcc aacgctgcta cctccgcagg cgggggacgg cgcaggcgag 121 ggtagcggcg gcactaccaa caacggcccc aacggcggcg gcgggaacgt tgcggcgtcg 181 tcgtccactg gcggggatgg cgggaccccc aagcccacgg tggctgtctc cgccgctgcc 241 ccggcggggg cggccccggt gcccgccgct gctccggacg ccggcgctcc gcatgaccga 301 cagactctac tggccgtgct gcagttccta cggcagagca aactccgcga ggccgaagag 361 gcgctgcgcc gtgaggccgg gctgctggag gaggcagtgg cgggcgccgg agccccggga 421 gaggtggaca gcgccggcgc tgaggtgacc agcgcgcttc tcagccgggt gaccgcctcg 481 gcccctggcc ctgcggcccc cgaccctccg ggcactggcg cttcgggggc cacggtcgtc 541 tcaggttcag cctcaggtcc tgcggctccg ggtaaagttg gaagtgttgc tgtggaagac 601 cagccagatg tcagtgccgt gttgtcagcc tacaaccaac aaggagatcc cacaatgtat 661 gaagaatact atagtggact gaaacacttc attgaatgtt ccctggactg ccatcgggca 721 gagttgtccc aactttttta tcctctgttt gtgcacatgt acttggagct agtctacaat 781 caacatgaga atgaagcaaa gtcattcttt gagaagttcc atggagatca ggaatgttat 841 taccaggatg acctacgagt attatctagt cttaccaaaa aggaacacat gaaagggaat 901 gagaccatgt tggattttcg aacaagtaaa tttgttctgc gtatttcccg tgactcgtac 961 caactcttga agaggcatct tcaggagaaa cagaacaatc agatatggaa catagttcag 1021 gagcacctct acattgacat ctttgatggg atgccgcgta gtaagcaaca gatagatgcg 1081 atggtgggaa gtttggcagg agaggctaaa cgagaggcaa acaaatcaaa ggtatttttt 1141 ggtttattaa aagaaccaga aattgaggta cctttggatg acgaggatga agagggagaa 1201 aatgaagaag gaaaacctaa aaagaagaag cctaaaaaag atagtattgg atccaaaagc 1261 aaaaaacaag atcccaatgc tccacctcag aacagaatcc ctcttcctga gttgaaagat 1321 tcagataagt tggataagat aatgaatatg aaagaaacca ccaaacgagt gcgccttggg 1381 ccggactgct taccctccat ttgtttctat acatttctca atgcttacca gggtctcact 1441 gcagtggatg tcactgatga ttctagtctg attgctggag gttttgcaga ttcaactgtc 1501 agagtgtggt cggtaacacc caaaaagctt cgtagtgtca aacaagcatc agatcttagt 1561 cttatagaca aagaatcaga tgatgtctta gaaagaatca tggatgagaa aacagcaagt 1621 gagttgaaga ttttgtatgg tcacagtggg cctgtctacg gagccagctt cagtccggat 1681 aggaactatc tgctttcctc ttcagaggac ggaactgtta gattgtggag ccttcaaaca 1741 tttacttgtt tggtgggata taaaggacac aactatccag tatgggacac acaattttct 1801 ccatatggat attattttgt gtcagggggc catgaccgag tagctcggct ctgggctaca 1861 gaccactatc agcctttaag aatatttgcc ggccatcttg ctgatgtgaa ttgtaccaga 1921 ttccatccaa attctaatta tgttgctacg ggctctgcag acagaactgt gcggctctgg 1981 gacgtcctga atggtaactg tgtaaggatc ttcactggac acaagggacc aattcattcc 2041 ttgacatttt ctcccaatgg gagattcctg gctacaggag caacagatgg cagagtgctt 2101 ctttgggata ttggacatgg tttgatggtt ggagaattaa aaggccacac tgatacagtc 2161 tgttcactta ggtttagtag agatggtgaa attttggcat caggttcaat ggataataca 2221 gttcgattat gggatgctat caaagccttt gaagatttag agaccgatga ctttactaca 2281 gccactgggc atataaattt acctgagaat tcacaggagt tattgttggg aacatatatg 2341 accaaatcaa caccagttgt acaccttcat tttactcgaa gaaacctggt tctagctgca 2401 ggagcttata gtccacaata aaccatcggt attaaagacc ttttggaagc tactgttttt 2461 aaaaagggag actaaaagca aatacctcag tgattaatat ttaagctaca gagaatgttt 2521 ttgtctatat ggatctggaa gtatgctgct tggaaaaatc tgaacaggac agttccacgt 2581 ttctatagca accacatttg actaatttcc gttagttgaa taagaggtat tatgatcatg 2641 gaggggacat ttatggtgct ttggattgtg tggaaactat gcattttctg ttcaaatgct 2701 attttaattt attacattta gaaaaaaagt tgatttcaat aattcatcct gcttcaagat 2761 tcaaattcag aaatatacta tcatcttgaa ttttagctga agaatcctat gagcatgtat 2821 gtttctgctg taaaaacgta gttactgtat ggcactcaaa aactatgtta aatgatccac 2881 taactttttt tttcttggcc catgattaat ggaatgtatg taactaggta gggttccttt 2941 cttagatcta gaggaagtac agccacccac tgacatctga atttatatac ctgttgagtt 3001 ttgagtgcac ccaaacactc gataaaccag gtgaagaaat ttagcttcca tgttctactt 3061 cagctaaaac agctacatac aacctagtac acttgaagtc agacagacat ttcagttgct 3121 tacctccagt actgagcctt gctttgggaa actaaaagat ttagaccaag tcactgccag 3181 tttttgcctt t //