LOCUS BC052268 3271 bp mRNA linear HUM 09-MAR-2007 DEFINITION Homo sapiens TAF5 RNA polymerase II, TATA box binding protein (TBP)-associated factor, 100kDa, mRNA (cDNA clone MGC:59660 IMAGE:6575596), complete cds. ACCESSION BC052268 VERSION BC052268.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3271) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 3271) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (02-MAY-2003) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: DCTD/DTP cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 47 Row: n Column: 13 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 50363367. FEATURES Location/Qualifiers source 1..3271 /db_xref="H-InvDB:HIT000099490" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:59660 IMAGE:6575596" /tissue_type="Prostate, carcinoma" /clone_lib="NIH_MGC_40" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..3271 /gene="TAF5" /gene_synonym="TAFII100" /db_xref="GeneID:6877" /db_xref="HGNC:HGNC:11539" /db_xref="MIM:601787" CDS 16..2418 /gene="TAF5" /gene_synonym="TAFII100" /codon_start=1 /product="TAF5 RNA polymerase II, TATA box binding protein (TBP)-associated factor, 100kDa" /protein_id="AAH52268.2" /db_xref="GeneID:6877" /db_xref="HGNC:HGNC:11539" /db_xref="MIM:601787" /translation="MAALAEEQTEVAVKLEPEGPPTLLPPQAGDGAGEGSGGTTNNGP NGGGGNVAASSSTGGDGGTPKPTVAVSAAAPAGAAPVPAAAPDAGAPHDRQTLLAVLQ FLRQSKLREAEEALRREAGLLEEAVAGSGAPGEVDSAGAEVTSALLSRVTASAPGPAA PDPPGTGASGATVVSGSASGPAAPGKVGSVAVEDQPDVSAVLSAYNQQGDPTMYEEYY SGLKHFIECSLDCHRAELSQLFYPLFVHMYLELVYNQHENEAKSFFEKFHGDQECYYQ DDLRVLSSLTKKEHMKGNETMLDFRTSKFVLRISRDSYQLLKRHLQEKQNNQIWNIVQ EHLYIDIFDGMPRSKQQIDAMVGSLAGEAKREANKSKVFFGLLKEPEIEVPLDDEDEE GENEEGKPKKKKPKKDSIGSKSKKQDPNAPPQNRIPLPELKDSDKLDKIMNMKETTKR VRLGPDCLPSICFYTFLNAYQGLTAVDVTDDSSLIAGGFADSTVRVWSVTPKKLRSVK QASDLSLIDKESDDVLERIMDEKTASELKILYGHSGPVYGASFSPDRNYLLSSSEDGT VRLWSLQTFTCLVGYKGHNYPVWDTQFSPYGYYFVSGGHDRVARLWATDHYQPLRIFA GHLADVNCTRFHPNSNYVATGSADRTVRLWDVLNGNCVRIFTGHKGPIHSLTFSPNGR FLATGATDGRVLLWDIGHGLMVGELKGHTDTVCSLRFSRDGEILASGSMDNTVRLWDA IKAFEDLETDDFTTATGHINLPENSQELLLGTYMTKSTPVVHLHFTLRNLVLAAGAYS PQ" BASE COUNT 923 a 693 c 802 g 853 t ORIGIN 1 tggctcagcc gcaagatggc ggcgctggcg gaggagcaga cggaggtggc ggtcaagcta 61 gagcctgagg gaccgccaac gctgctacct ccgcaggcgg gggacggcgc aggcgagggt 121 agcggcggca ctaccaacaa cggccccaac ggcggcggcg ggaacgttgc ggcgtcgtcg 181 tccactggcg gggatggcgg gacccccaag cccacggtgg ctgtctccgc cgctgccccg 241 gcgggggcgg ccccggtgcc cgccgctgct ccggacgccg gcgctccgca tgaccgacag 301 actctactgg ccgtgctgca gttcctacgg cagagcaaac tccgcgaggc cgaagaggcg 361 ctgcgccgtg aggccgggct gctggaggag gcagtggcgg gctccggagc cccgggagag 421 gtggacagcg ccggcgctga ggtgaccagc gcgcttctca gccgggtgac cgcctcggcc 481 cctggccctg cggcccccga ccctccgggc actggcgctt cgggggccac ggtcgtctca 541 ggttcagcct caggtcctgc ggctccgggt aaagttggaa gtgttgctgt ggaagaccag 601 ccagatgtca gtgccgtgtt gtcagcctac aaccaacaag gagatcccac aatgtatgaa 661 gaatactata gtggactgaa acacttcatt gaatgttccc tggactgcca tcgggcagag 721 ttgtcccaac ttttttatcc tctgtttgtg cacatgtact tggagctagt ctacaatcaa 781 catgagaatg aagcaaagtc attctttgag aagttccatg gagatcagga atgttattac 841 caggatgacc tacgagtatt atctagtctt accaaaaagg aacacatgaa agggaatgag 901 accatgttgg attttcgaac aagtaaattt gttctgcgta tttcccgtga ctcgtaccaa 961 ctcttgaaga ggcatcttca ggagaaacag aacaatcaga tatggaacat agttcaggag 1021 cacctctaca ttgacatctt tgatgggatg ccgcgtagta agcaacagat agatgcgatg 1081 gtgggaagtt tggcaggaga ggctaaacga gaggcaaaca aatcaaaggt attttttggt 1141 ttattaaaag aaccagaaat tgaggtacct ttggatgacg aggatgaaga gggagaaaat 1201 gaagaaggaa aacctaaaaa gaagaagcct aaaaaagata gtattggatc caaaagcaaa 1261 aaacaagatc ccaatgctcc acctcagaac agaatccctc ttcctgagtt gaaagattca 1321 gataagttgg ataagataat gaatatgaaa gaaaccacca aacgagtgcg ccttgggccg 1381 gactgcttac cctccatttg tttctataca tttctcaatg cttaccaggg tctcactgca 1441 gtggatgtca ctgatgattc tagtctgatt gctggaggtt ttgcagattc aactgtcaga 1501 gtgtggtcgg taacacccaa aaagcttcgt agtgtcaaac aagcatcaga tcttagtctt 1561 atagacaaag aatcagatga tgtcttagaa agaatcatgg atgagaaaac agcaagtgag 1621 ttgaagattt tgtatggtca cagtgggcct gtctacggag ccagcttcag tccggatagg 1681 aactatctgc tttcctcttc agaggacgga actgttagat tgtggagcct tcaaacattt 1741 acttgtttgg tgggatataa aggacacaac tatccagtat gggacacaca attttctcca 1801 tatggatatt attttgtgtc agggggccat gaccgagtag ctcggctctg ggctacagac 1861 cactatcagc ctttaagaat atttgccggc catcttgctg atgtgaattg taccagattc 1921 catccaaatt ctaattatgt tgctacgggc tctgcagaca gaactgtgcg gctctgggac 1981 gtcctgaatg gtaactgtgt aaggatcttc actggacaca agggaccaat tcattccttg 2041 acattttctc ccaatgggag attcctggct acaggagcaa cagatggcag agtgcttctt 2101 tgggatattg gacatggttt gatggttgga gaattaaaag gccacactga tacagtctgt 2161 tcacttaggt ttagtagaga tggtgaaatt ttggcatcag gttcaatgga taatacagtt 2221 cgattatggg atgctatcaa agcctttgaa gatttagaga ccgatgactt tactacagcc 2281 actgggcata taaatttacc tgagaattca caggagttat tgttgggaac atatatgacc 2341 aaatcaacac cagttgtaca ccttcatttt actctaagaa acctggttct agctgcagga 2401 gcttatagtc cacaataaac catcggtatt aaagaccttt tggaagctac tgtttttaaa 2461 aagggagact aaaagcaaat acctcagtga ttaatattta agctacagag aatgtttttg 2521 tctatatgga tctggaagta tgctgcttgg aaaaatctga acaggacagt tccacgtttc 2581 tatagcaacc acatttgact aatttccgtt agttgaataa gaggtattat gatcatggag 2641 gggacattta tggtgctttg gattgtgtgg aaactatgca ttttctgttc aaatgctatt 2701 ttaatttatt acatttagaa aaaaagttga tttcaataat tcatcctgct tcaagattca 2761 aattcagaaa tatactatca tcttgaattt tagctgaaga atcctatgag catgtatgtt 2821 tctgctgtaa aaacgtagtt actgtatggc actcaaaaac tatgttaaat gatccactaa 2881 cttttttttt cttggcccat gattaatgga atgtatgtaa ctaggtaggg ttcctttctt 2941 agatctagag gaagtacagc cacccactga catctgaatt tatatacctg ttgagttttg 3001 agtgcaccca aacactcgat aaaccaggtg aagaaattta gcttccatgt tctacttcag 3061 ctaaaacagc tacatacaac ctagtacact tgaagtcaga cagacatttc agttgcttac 3121 ctccagtact gagccttgct ttgggaaact aaaagattta gaccaagtca ctgccagttt 3181 ttgcctttgt tgcattttgt acagttttta tatttttgat atcttgtaaa taaagacaac 3241 cagcttttcc aggaaaaaaa aaaaaaaaaa a //