LOCUS       BC136340                3031 bp    mRNA    linear   HUM 18-MAR-2009
DEFINITION  Homo sapiens TAF5 RNA polymerase II, TATA box binding protein
            (TBP)-associated factor, 100kDa, mRNA (cDNA clone MGC:167950
            IMAGE:9020327), complete cds.
ACCESSION   BC136340
VERSION     BC136340.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3031)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 3031)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (29-MAR-2007) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Ambion
            cDNA Library Preparation: British Columbia Cancer Research Center
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Genome Sequence Centre,
            BC Cancer Agency, Vancouver, BC, Canada
            info@bcgsc.bc.ca
            Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa,
            Johnson Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah,
            Allen Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber,
            Mabel Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff,
            Dave Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio,
            Ruth Featherstone, Malachi Griffith, Obi Griffith, Ran Guin,
            Nancy Liao, Kim MacDonald, Mike R. Mayo, Josh Moran,
            Diana Palmquist, JR Santos, Duane Smailus, Jeff Stott,
            Miranda Tsai, George Yang, Jacquie Schein,
            Asim Siddiqui,Steven Jones, Rob Holt, Marco Marra.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRCB Plate: 16 Row: N Column: 2.
FEATURES             Location/Qualifiers
     source          1..3031
                     /db_xref="H-InvDB:HIT000500366"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:167950 IMAGE:9020327"
                     /tissue_type="Testicle, PCR rescued clones"
                     /clone_lib="NIH_MGC_382"
                     /lab_host="DH10B"
                     /note="Vector: pCR4-TOPO with reversed insert; Clone
                     identification sequence tag: TGTAGGGG"
     gene            1..3031
                     /gene="TAF5"
                     /gene_synonym="TAFII100"
                     /db_xref="GeneID:6877"
                     /db_xref="HGNC:HGNC:11539"
                     /db_xref="MIM:601787"
     CDS             19..2256
                     /gene="TAF5"
                     /gene_synonym="TAFII100"
                     /codon_start=1
                     /product="TAF5 protein"
                     /protein_id="AAI36341.1"
                     /db_xref="GeneID:6877"
                     /db_xref="HGNC:HGNC:11539"
                     /db_xref="MIM:601787"
                     /translation="MAALAEEQTEVAVKLEPEGPPTLLPPQAGDGAGEGSGGTTNNGP
                     NGGGGNVAASSSTGGDGGTPKPTVAVSAAAPAGAAPVPAAAPDAGAPHDRQTLLAVLQ
                     FLRQSKLREAEEALRREAGLLEEAVAGSGAPGEVDSAGAEVTSALLSRVTASAPGPAA
                     PDPPGTGASGATVVSGSASGPAAPGKVGSVAVEDQPDVSAVLSAYNQQGDPTMYEEYY
                     SGLKHFIECSLDCHRAELSQLFYPLFVHMYLELVYNQHENEAKSFFEKFHGDQECYYQ
                     DDLRVLSSLTKKEHMKGNETMLDFRTSKFVLRISRDSYQLLKRHLQEKQNNQIWNIVQ
                     EHLYIDIFDGMPRSKQQIDAMVGSLAGEAKREANKSKVFFGLLKEPEIEVPLDDEDEE
                     GENEEGKPKKKKPKKDSIGSKSKKQDPNAPPQNRIPLPELKDSDKLDKIMNMKETTKR
                     VRLGPDCLPSICFYTFLNAYQGLTAVDVTDDSSLIAGGFADSTVRVWSVTPKKLRSVK
                     QASDLSLIDKESDDVLERIMDEKTASELKILYGHSGPVYGASFSPDRLWATDHYQPLR
                     IFAGHLADVNCTRFHPNSNYVATGSADRTVRLWDVLNGNCVRIFTGHKGPIHSLTFSP
                     NGRFLATGATDGRVLLWDIGHGLMVGELKGHTDTVCSLRFSRDGEILASGSMDNTVRL
                     WDAIKAFEDLETDDFTTATGHINLPENSQELLLGTYMTKSTPVVHLHFTRRNLVLAAG
                     AYSPQ"
BASE COUNT      845 a    653 c    758 g    775 t
ORIGIN
        1 aggtggctca gccgcaagat ggcggcgctg gcggaggagc agacggaggt ggcggtcaag
       61 ctagagcctg agggaccgcc aacgctgcta cctccgcagg cgggggacgg cgcaggcgag
      121 ggtagcggcg gcactaccaa caacggcccc aacggcggcg gcgggaacgt tgcggcgtcg
      181 tcgtccactg gcggggatgg cgggaccccc aagcccacgg tggctgtctc cgccgctgcc
      241 ccggcggggg cggccccggt gcccgccgct gctccggacg ccggcgctcc gcatgaccga
      301 cagactctac tggccgtgct gcagttccta cggcagagca aactccgcga ggccgaagag
      361 gcgctgcgcc gtgaggccgg gctgctggag gaggcagtgg cgggctccgg agccccggga
      421 gaggtggaca gcgccggcgc tgaggtgacc agcgcgcttc tcagccgggt gaccgcctcg
      481 gcccctggcc ctgcggcccc cgaccctccg ggcactggcg cttcgggggc cacggtcgtc
      541 tcaggttcag cctcaggtcc tgcggctccg ggtaaagttg gaagtgttgc tgtggaagac
      601 cagccagatg tcagtgccgt gttgtcagcc tacaaccaac aaggagatcc cacaatgtat
      661 gaagaatact atagtggact gaaacacttc attgaatgtt ccctggactg ccatcgggca
      721 gagttgtccc aactttttta tcctctgttt gtgcacatgt acttggagct agtctacaat
      781 caacatgaga atgaagcaaa gtcattcttt gagaagttcc atggagatca ggaatgttat
      841 taccaggatg acctacgagt attatctagt cttaccaaaa aggaacacat gaaagggaat
      901 gagaccatgt tggattttcg aacaagtaaa tttgttctgc gtatttcccg tgactcgtac
      961 caactcttga agaggcatct tcaggagaaa cagaacaatc agatatggaa catagttcag
     1021 gagcacctct acattgacat ctttgatggg atgccgcgta gtaagcaaca gatagatgcg
     1081 atggtgggaa gtttggcagg agaggctaaa cgagaggcaa acaaatcaaa ggtatttttt
     1141 ggtttattaa aagaaccaga aattgaggta cctttggatg acgaggatga agagggagaa
     1201 aatgaagaag gaaaacctaa aaagaagaag cctaaaaaag atagtattgg atccaaaagc
     1261 aaaaaacaag atcccaatgc tccacctcag aacagaatcc ctcttcctga gttgaaagat
     1321 tcagataagt tggataagat aatgaatatg aaagaaacca ccaaacgagt gcgccttggg
     1381 ccggactgct taccctccat ttgtttctat acatttctca atgcttacca gggtctcact
     1441 gcagtggatg tcactgatga ttctagtctg attgctggag gttttgcaga ttcaactgtc
     1501 agagtgtggt cggtaacacc caaaaagctt cgtagtgtca aacaagcatc agatcttagt
     1561 cttatagaca aagaatcaga tgatgtctta gaaagaatca tggatgagaa aacagcaagt
     1621 gagttgaaga ttttgtatgg tcacagtggg cctgtctacg gagccagctt cagtccggat
     1681 aggctctggg ctacagacca ctatcagcct ttaagaatat ttgccggcca tcttgctgat
     1741 gtgaattgta ccagattcca tccaaattct aattatgttg ctacgggctc tgcagacaga
     1801 actgtgcggc tctgggacgt cctgaatggt aactgtgtaa ggatcttcac tggacacaag
     1861 ggaccaattc attccttgac attttctccc aatgggagat tcctggctac aggagcaaca
     1921 gatggcagag tgcttctttg ggatattgga catggtttga tggttggaga attaaaaggc
     1981 cacactgata cagtctgttc acttaggttt agtagagatg gtgaaatttt ggcatcaggt
     2041 tcaatggata atacagttcg attatgggat gctatcaaag cctttgaaga tttagagacc
     2101 gatgacttta ctacagccac tgggcatata aatttacctg agaattcaca ggagttattg
     2161 ttgggaacat atatgaccaa atcaacacca gttgtacacc ttcattttac tcgaagaaac
     2221 ctggttctag ctgcaggagc ttatagtcca caataaacca tcggtattaa agaccttttg
     2281 gaagctactg tttttaaaaa gggagactaa aagcaaatac ctcagtgatt aatatttaag
     2341 ctacagagaa tgtttttgtc tatatggatc tggaagtatg ctgcttggaa aaatctgaac
     2401 aggacagttc cacgtttcta tagcaaccac atttgactaa tttccgttag ttgaataaga
     2461 ggtattatga tcatggaggg gacatttatg gtgctttgga ttgtgtggaa actatgcatt
     2521 ttctgttcaa atgctatttt aatttattac atttagaaaa aaagttgatt tcaataattc
     2581 atcctgcttc aagattcaaa ttcagaaata tactatcatc ttgaatttta gctgaagaat
     2641 cctatgagca tgtatgtttc tgctgtaaaa acgtagttac tgtatggcac tcaaaaacta
     2701 tgttaaatga tccactaact ttttttttct tggcccatga ttaatggaat gtatgtaact
     2761 aggtagggtt cctttcttag atctagagga agtacagcca cccactgaca tctgaattta
     2821 tatacctgtt gagttttgag tgcacccaaa cactcgataa accaggtgaa gaaatttagc
     2881 ttccatgttc tacttcagct aaaacagcta catacaacct agtacacttg aagtcagaca
     2941 gacatttcag ttgcttacct ccagtactga gccttgcttt gggaaactaa aagatttaga
     3001 ccaagtcact gccagttttt gcctttgttg c
//