LOCUS       BC136340                3031 bp    mRNA    linear   HUM 18-MAR-2009
DEFINITION  Homo sapiens TAF5 RNA polymerase II, TATA box binding protein
            (TBP)-associated factor, 100kDa, mRNA (cDNA clone MGC:167950
            IMAGE:9020327), complete cds.
ACCESSION   BC136340
VERSION     BC136340.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3031)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 3031)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (29-MAR-2007) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Ambion
            cDNA Library Preparation: British Columbia Cancer Research Center
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Genome Sequence Centre,
            BC Cancer Agency, Vancouver, BC, Canada
            info@bcgsc.bc.ca
            Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson
            Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen
            Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel
            Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave
            Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth
            Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao,
            Kim MacDonald,  Mike R. Mayo, Josh Moran, Diana Palmquist, JR
            Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang,
            Jacquie Schein, Asim Siddiqui,Steven Jones, Rob Holt, Marco Marra.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRCB Plate: 16 Row: N Column: 2.
FEATURES             Location/Qualifiers
     source          1..3031
                     /db_xref="H-InvDB:HIT000500366"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:167950 IMAGE:9020327"
                     /tissue_type="Testicle, PCR rescued clones"
                     /clone_lib="NIH_MGC_382"
                     /lab_host="DH10B"
                     /note="Vector: pCR4-TOPO with reversed insert; Clone
                     identification sequence tag: TGTAGGGG"
     gene            1..3031
                     /gene="TAF5"
                     /gene_synonym="TAFII100"
                     /db_xref="GeneID:6877"
                     /db_xref="HGNC:HGNC:11539"
                     /db_xref="MIM:601787"
     CDS             19..2256
                     /gene="TAF5"
                     /gene_synonym="TAFII100"
                     /codon_start=1
                     /product="TAF5 protein"
                     /protein_id="AAI36341.1"
                     /db_xref="GeneID:6877"
                     /db_xref="HGNC:HGNC:11539"
                     /db_xref="MIM:601787"
                     /translation="MAALAEEQTEVAVKLEPEGPPTLLPPQAGDGAGEGSGGTTNNGP
                     NGGGGNVAASSSTGGDGGTPKPTVAVSAAAPAGAAPVPAAAPDAGAPHDRQTLLAVLQ
                     FLRQSKLREAEEALRREAGLLEEAVAGSGAPGEVDSAGAEVTSALLSRVTASAPGPAA
                     PDPPGTGASGATVVSGSASGPAAPGKVGSVAVEDQPDVSAVLSAYNQQGDPTMYEEYY
                     SGLKHFIECSLDCHRAELSQLFYPLFVHMYLELVYNQHENEAKSFFEKFHGDQECYYQ
                     DDLRVLSSLTKKEHMKGNETMLDFRTSKFVLRISRDSYQLLKRHLQEKQNNQIWNIVQ
                     EHLYIDIFDGMPRSKQQIDAMVGSLAGEAKREANKSKVFFGLLKEPEIEVPLDDEDEE
                     GENEEGKPKKKKPKKDSIGSKSKKQDPNAPPQNRIPLPELKDSDKLDKIMNMKETTKR
                     VRLGPDCLPSICFYTFLNAYQGLTAVDVTDDSSLIAGGFADSTVRVWSVTPKKLRSVK
                     QASDLSLIDKESDDVLERIMDEKTASELKILYGHSGPVYGASFSPDRLWATDHYQPLR
                     IFAGHLADVNCTRFHPNSNYVATGSADRTVRLWDVLNGNCVRIFTGHKGPIHSLTFSP
                     NGRFLATGATDGRVLLWDIGHGLMVGELKGHTDTVCSLRFSRDGEILASGSMDNTVRL
                     WDAIKAFEDLETDDFTTATGHINLPENSQELLLGTYMTKSTPVVHLHFTRRNLVLAAG
                     AYSPQ"
BASE COUNT          845 a          653 c          758 g          775 t
ORIGIN      
        1 aggtggctca gccgcaagat ggcggcgctg gcggaggagc agacggaggt ggcggtcaag
       61 ctagagcctg agggaccgcc aacgctgcta cctccgcagg cgggggacgg cgcaggcgag
      121 ggtagcggcg gcactaccaa caacggcccc aacggcggcg gcgggaacgt tgcggcgtcg
      181 tcgtccactg gcggggatgg cgggaccccc aagcccacgg tggctgtctc cgccgctgcc
      241 ccggcggggg cggccccggt gcccgccgct gctccggacg ccggcgctcc gcatgaccga
      301 cagactctac tggccgtgct gcagttccta cggcagagca aactccgcga ggccgaagag
      361 gcgctgcgcc gtgaggccgg gctgctggag gaggcagtgg cgggctccgg agccccggga
      421 gaggtggaca gcgccggcgc tgaggtgacc agcgcgcttc tcagccgggt gaccgcctcg
      481 gcccctggcc ctgcggcccc cgaccctccg ggcactggcg cttcgggggc cacggtcgtc
      541 tcaggttcag cctcaggtcc tgcggctccg ggtaaagttg gaagtgttgc tgtggaagac
      601 cagccagatg tcagtgccgt gttgtcagcc tacaaccaac aaggagatcc cacaatgtat
      661 gaagaatact atagtggact gaaacacttc attgaatgtt ccctggactg ccatcgggca
      721 gagttgtccc aactttttta tcctctgttt gtgcacatgt acttggagct agtctacaat
      781 caacatgaga atgaagcaaa gtcattcttt gagaagttcc atggagatca ggaatgttat
      841 taccaggatg acctacgagt attatctagt cttaccaaaa aggaacacat gaaagggaat
      901 gagaccatgt tggattttcg aacaagtaaa tttgttctgc gtatttcccg tgactcgtac
      961 caactcttga agaggcatct tcaggagaaa cagaacaatc agatatggaa catagttcag
     1021 gagcacctct acattgacat ctttgatggg atgccgcgta gtaagcaaca gatagatgcg
     1081 atggtgggaa gtttggcagg agaggctaaa cgagaggcaa acaaatcaaa ggtatttttt
     1141 ggtttattaa aagaaccaga aattgaggta cctttggatg acgaggatga agagggagaa
     1201 aatgaagaag gaaaacctaa aaagaagaag cctaaaaaag atagtattgg atccaaaagc
     1261 aaaaaacaag atcccaatgc tccacctcag aacagaatcc ctcttcctga gttgaaagat
     1321 tcagataagt tggataagat aatgaatatg aaagaaacca ccaaacgagt gcgccttggg
     1381 ccggactgct taccctccat ttgtttctat acatttctca atgcttacca gggtctcact
     1441 gcagtggatg tcactgatga ttctagtctg attgctggag gttttgcaga ttcaactgtc
     1501 agagtgtggt cggtaacacc caaaaagctt cgtagtgtca aacaagcatc agatcttagt
     1561 cttatagaca aagaatcaga tgatgtctta gaaagaatca tggatgagaa aacagcaagt
     1621 gagttgaaga ttttgtatgg tcacagtggg cctgtctacg gagccagctt cagtccggat
     1681 aggctctggg ctacagacca ctatcagcct ttaagaatat ttgccggcca tcttgctgat
     1741 gtgaattgta ccagattcca tccaaattct aattatgttg ctacgggctc tgcagacaga
     1801 actgtgcggc tctgggacgt cctgaatggt aactgtgtaa ggatcttcac tggacacaag
     1861 ggaccaattc attccttgac attttctccc aatgggagat tcctggctac aggagcaaca
     1921 gatggcagag tgcttctttg ggatattgga catggtttga tggttggaga attaaaaggc
     1981 cacactgata cagtctgttc acttaggttt agtagagatg gtgaaatttt ggcatcaggt
     2041 tcaatggata atacagttcg attatgggat gctatcaaag cctttgaaga tttagagacc
     2101 gatgacttta ctacagccac tgggcatata aatttacctg agaattcaca ggagttattg
     2161 ttgggaacat atatgaccaa atcaacacca gttgtacacc ttcattttac tcgaagaaac
     2221 ctggttctag ctgcaggagc ttatagtcca caataaacca tcggtattaa agaccttttg
     2281 gaagctactg tttttaaaaa gggagactaa aagcaaatac ctcagtgatt aatatttaag
     2341 ctacagagaa tgtttttgtc tatatggatc tggaagtatg ctgcttggaa aaatctgaac
     2401 aggacagttc cacgtttcta tagcaaccac atttgactaa tttccgttag ttgaataaga
     2461 ggtattatga tcatggaggg gacatttatg gtgctttgga ttgtgtggaa actatgcatt
     2521 ttctgttcaa atgctatttt aatttattac atttagaaaa aaagttgatt tcaataattc
     2581 atcctgcttc aagattcaaa ttcagaaata tactatcatc ttgaatttta gctgaagaat
     2641 cctatgagca tgtatgtttc tgctgtaaaa acgtagttac tgtatggcac tcaaaaacta
     2701 tgttaaatga tccactaact ttttttttct tggcccatga ttaatggaat gtatgtaact
     2761 aggtagggtt cctttcttag atctagagga agtacagcca cccactgaca tctgaattta
     2821 tatacctgtt gagttttgag tgcacccaaa cactcgataa accaggtgaa gaaatttagc
     2881 ttccatgttc tacttcagct aaaacagcta catacaacct agtacacttg aagtcagaca
     2941 gacatttcag ttgcttacct ccagtactga gccttgcttt gggaaactaa aagatttaga
     3001 ccaagtcact gccagttttt gcctttgttg c
//