LOCUS       BC052268                3271 bp    mRNA    linear   HUM 09-MAR-2007
DEFINITION  Homo sapiens TAF5 RNA polymerase II, TATA box binding protein
            (TBP)-associated factor, 100kDa, mRNA (cDNA clone MGC:59660
            IMAGE:6575596), complete cds.
ACCESSION   BC052268
VERSION     BC052268.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3271)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 3271)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (02-MAY-2003) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: DCTD/DTP
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 47 Row: n Column: 13
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 50363367.
FEATURES             Location/Qualifiers
     source          1..3271
                     /db_xref="H-InvDB:HIT000099490"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:59660 IMAGE:6575596"
                     /tissue_type="Prostate, carcinoma"
                     /clone_lib="NIH_MGC_40"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..3271
                     /gene="TAF5"
                     /gene_synonym="TAFII100"
                     /db_xref="GeneID:6877"
                     /db_xref="HGNC:HGNC:11539"
                     /db_xref="MIM:601787"
     CDS             16..2418
                     /gene="TAF5"
                     /gene_synonym="TAFII100"
                     /codon_start=1
                     /product="TAF5 RNA polymerase II, TATA box binding protein
                     (TBP)-associated factor, 100kDa"
                     /protein_id="AAH52268.2"
                     /db_xref="GeneID:6877"
                     /db_xref="HGNC:HGNC:11539"
                     /db_xref="MIM:601787"
                     /translation="MAALAEEQTEVAVKLEPEGPPTLLPPQAGDGAGEGSGGTTNNGP
                     NGGGGNVAASSSTGGDGGTPKPTVAVSAAAPAGAAPVPAAAPDAGAPHDRQTLLAVLQ
                     FLRQSKLREAEEALRREAGLLEEAVAGSGAPGEVDSAGAEVTSALLSRVTASAPGPAA
                     PDPPGTGASGATVVSGSASGPAAPGKVGSVAVEDQPDVSAVLSAYNQQGDPTMYEEYY
                     SGLKHFIECSLDCHRAELSQLFYPLFVHMYLELVYNQHENEAKSFFEKFHGDQECYYQ
                     DDLRVLSSLTKKEHMKGNETMLDFRTSKFVLRISRDSYQLLKRHLQEKQNNQIWNIVQ
                     EHLYIDIFDGMPRSKQQIDAMVGSLAGEAKREANKSKVFFGLLKEPEIEVPLDDEDEE
                     GENEEGKPKKKKPKKDSIGSKSKKQDPNAPPQNRIPLPELKDSDKLDKIMNMKETTKR
                     VRLGPDCLPSICFYTFLNAYQGLTAVDVTDDSSLIAGGFADSTVRVWSVTPKKLRSVK
                     QASDLSLIDKESDDVLERIMDEKTASELKILYGHSGPVYGASFSPDRNYLLSSSEDGT
                     VRLWSLQTFTCLVGYKGHNYPVWDTQFSPYGYYFVSGGHDRVARLWATDHYQPLRIFA
                     GHLADVNCTRFHPNSNYVATGSADRTVRLWDVLNGNCVRIFTGHKGPIHSLTFSPNGR
                     FLATGATDGRVLLWDIGHGLMVGELKGHTDTVCSLRFSRDGEILASGSMDNTVRLWDA
                     IKAFEDLETDDFTTATGHINLPENSQELLLGTYMTKSTPVVHLHFTLRNLVLAAGAYS
                     PQ"
BASE COUNT          923 a          693 c          802 g          853 t
ORIGIN      
        1 tggctcagcc gcaagatggc ggcgctggcg gaggagcaga cggaggtggc ggtcaagcta
       61 gagcctgagg gaccgccaac gctgctacct ccgcaggcgg gggacggcgc aggcgagggt
      121 agcggcggca ctaccaacaa cggccccaac ggcggcggcg ggaacgttgc ggcgtcgtcg
      181 tccactggcg gggatggcgg gacccccaag cccacggtgg ctgtctccgc cgctgccccg
      241 gcgggggcgg ccccggtgcc cgccgctgct ccggacgccg gcgctccgca tgaccgacag
      301 actctactgg ccgtgctgca gttcctacgg cagagcaaac tccgcgaggc cgaagaggcg
      361 ctgcgccgtg aggccgggct gctggaggag gcagtggcgg gctccggagc cccgggagag
      421 gtggacagcg ccggcgctga ggtgaccagc gcgcttctca gccgggtgac cgcctcggcc
      481 cctggccctg cggcccccga ccctccgggc actggcgctt cgggggccac ggtcgtctca
      541 ggttcagcct caggtcctgc ggctccgggt aaagttggaa gtgttgctgt ggaagaccag
      601 ccagatgtca gtgccgtgtt gtcagcctac aaccaacaag gagatcccac aatgtatgaa
      661 gaatactata gtggactgaa acacttcatt gaatgttccc tggactgcca tcgggcagag
      721 ttgtcccaac ttttttatcc tctgtttgtg cacatgtact tggagctagt ctacaatcaa
      781 catgagaatg aagcaaagtc attctttgag aagttccatg gagatcagga atgttattac
      841 caggatgacc tacgagtatt atctagtctt accaaaaagg aacacatgaa agggaatgag
      901 accatgttgg attttcgaac aagtaaattt gttctgcgta tttcccgtga ctcgtaccaa
      961 ctcttgaaga ggcatcttca ggagaaacag aacaatcaga tatggaacat agttcaggag
     1021 cacctctaca ttgacatctt tgatgggatg ccgcgtagta agcaacagat agatgcgatg
     1081 gtgggaagtt tggcaggaga ggctaaacga gaggcaaaca aatcaaaggt attttttggt
     1141 ttattaaaag aaccagaaat tgaggtacct ttggatgacg aggatgaaga gggagaaaat
     1201 gaagaaggaa aacctaaaaa gaagaagcct aaaaaagata gtattggatc caaaagcaaa
     1261 aaacaagatc ccaatgctcc acctcagaac agaatccctc ttcctgagtt gaaagattca
     1321 gataagttgg ataagataat gaatatgaaa gaaaccacca aacgagtgcg ccttgggccg
     1381 gactgcttac cctccatttg tttctataca tttctcaatg cttaccaggg tctcactgca
     1441 gtggatgtca ctgatgattc tagtctgatt gctggaggtt ttgcagattc aactgtcaga
     1501 gtgtggtcgg taacacccaa aaagcttcgt agtgtcaaac aagcatcaga tcttagtctt
     1561 atagacaaag aatcagatga tgtcttagaa agaatcatgg atgagaaaac agcaagtgag
     1621 ttgaagattt tgtatggtca cagtgggcct gtctacggag ccagcttcag tccggatagg
     1681 aactatctgc tttcctcttc agaggacgga actgttagat tgtggagcct tcaaacattt
     1741 acttgtttgg tgggatataa aggacacaac tatccagtat gggacacaca attttctcca
     1801 tatggatatt attttgtgtc agggggccat gaccgagtag ctcggctctg ggctacagac
     1861 cactatcagc ctttaagaat atttgccggc catcttgctg atgtgaattg taccagattc
     1921 catccaaatt ctaattatgt tgctacgggc tctgcagaca gaactgtgcg gctctgggac
     1981 gtcctgaatg gtaactgtgt aaggatcttc actggacaca agggaccaat tcattccttg
     2041 acattttctc ccaatgggag attcctggct acaggagcaa cagatggcag agtgcttctt
     2101 tgggatattg gacatggttt gatggttgga gaattaaaag gccacactga tacagtctgt
     2161 tcacttaggt ttagtagaga tggtgaaatt ttggcatcag gttcaatgga taatacagtt
     2221 cgattatggg atgctatcaa agcctttgaa gatttagaga ccgatgactt tactacagcc
     2281 actgggcata taaatttacc tgagaattca caggagttat tgttgggaac atatatgacc
     2341 aaatcaacac cagttgtaca ccttcatttt actctaagaa acctggttct agctgcagga
     2401 gcttatagtc cacaataaac catcggtatt aaagaccttt tggaagctac tgtttttaaa
     2461 aagggagact aaaagcaaat acctcagtga ttaatattta agctacagag aatgtttttg
     2521 tctatatgga tctggaagta tgctgcttgg aaaaatctga acaggacagt tccacgtttc
     2581 tatagcaacc acatttgact aatttccgtt agttgaataa gaggtattat gatcatggag
     2641 gggacattta tggtgctttg gattgtgtgg aaactatgca ttttctgttc aaatgctatt
     2701 ttaatttatt acatttagaa aaaaagttga tttcaataat tcatcctgct tcaagattca
     2761 aattcagaaa tatactatca tcttgaattt tagctgaaga atcctatgag catgtatgtt
     2821 tctgctgtaa aaacgtagtt actgtatggc actcaaaaac tatgttaaat gatccactaa
     2881 cttttttttt cttggcccat gattaatgga atgtatgtaa ctaggtaggg ttcctttctt
     2941 agatctagag gaagtacagc cacccactga catctgaatt tatatacctg ttgagttttg
     3001 agtgcaccca aacactcgat aaaccaggtg aagaaattta gcttccatgt tctacttcag
     3061 ctaaaacagc tacatacaac ctagtacact tgaagtcaga cagacatttc agttgcttac
     3121 ctccagtact gagccttgct ttgggaaact aaaagattta gaccaagtca ctgccagttt
     3181 ttgcctttgt tgcattttgt acagttttta tatttttgat atcttgtaaa taaagacaac
     3241 cagcttttcc aggaaaaaaa aaaaaaaaaa a
//