LOCUS BC064830 2083 bp mRNA linear HUM 13-JAN-2004
DEFINITION Homo sapiens TAF2 RNA polymerase II, TATA box binding protein
(TBP)-associated factor, 150kDa, mRNA (cDNA clone IMAGE:6453698),
partial cds.
ACCESSION BC064830
VERSION BC064830.1
KEYWORDS .
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2083)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2083)
AUTHORS Strausberg,R.
TITLE Direct Submission
JOURNAL Submitted (02-JAN-2004) National Institutes of Health, Mammalian
Gene Collection (MGC), Cancer Genomics Office, National Cancer
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: NCI
cDNA Library Preparation: Michael Brownstein / Ted Usdin
Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Genome Sequence Centre,
BC Cancer Agency, Vancouver, BC, Canada
info@bcgsc.bc.ca
Steven Jones, Jennifer Asano, Ian Bosdet, Yaron Butterfield,
Susanna Chan, Readman Chiu, Chris Fjell, Erin Garland, Ran Guin,
Letticia Hsiao, Martin Krzywinski, Reta Kutsche, Oliver Lee, Soo
Sen Lee, Victor Ling, Carrie Mathewson, Candice McLeavy, Steven
Ness, Pawan Pandoh, Anna-Liisa Prabhu, Parvaneh Saeedi, Jacqueline
Schein, Duane Smailus, Michael Smith, Lorraine Spence, Jeff Stott,
Michael Thorne, Miranada Tsai, Natasja van den Bosch, Jill Vardy,
George Yang, Scott Zuyderduyn, Marco Marra.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 51 Row: m Column: 9
This clone was selected for full length sequencing because it
passed the following selection criteria: Similarity but not
identity to protein.
FEATURES Location/Qualifiers
source 1..2083
/db_xref="H-InvDB:HIT000261587"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="IMAGE:6453698"
/tissue_type="Pooled, 40 cell lines"
/clone_lib="NIH_MGC_142"
/lab_host="DH10B"
/note="Vector: pDNR-LIB"
gene 1..>2083
/gene="TAF2"
/gene_synonym="CIF150"
/gene_synonym="TAF2B"
/gene_synonym="TAFII150"
/db_xref="GeneID:6873"
/db_xref="MIM:604912"
CDS 275..>2083
/gene="TAF2"
/gene_synonym="CIF150"
/gene_synonym="TAF2B"
/gene_synonym="TAFII150"
/codon_start=1
/product="TAF2 protein"
/protein_id="AAH64830.1"
/db_xref="GeneID:6873"
/db_xref="MIM:604912"
/translation="MPLTGVEPARMNRKKGDKGFESPRPYKLTHQVVCINNINFQRKS
VVGFVELTIFPTVANLNRIKLNSKQCRIYRVRINDLEAAFIYNDPTLEVCHSESKQRN
LNYFSNAYAAAVSAVDPDAGNGELCIKVPSELWKHVDELKVLKIHINFSLDQPKGCLH
FVVPSVEGSMAERGAHVFSCGYQNSTRFWFPCVDSYSELCTWKLEFTVDAAMVAVSNG
DLVETVYTHDMRKKTFHYMLTIPTAASNISLAIGPFEILVDPYMHEVTHFCLPQLLPL
LKHTTSYLHEVFEFYEEILTCRYPYSCFKTVFIDEAYVEVAAYASMSIFSTNLLHSAM
IIDETPLTRRCLAQSLAQQFFGCFISRMSWSDEWVLKGISGYIYGLWMKKTFGVNEYR
HWIKEELDKIVAYELKTGGVLLHPIFGGGKEKDNPASHLHFSIKHPHTLSWEYYTMFQ
CKAHLVMRLIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSIS
NVSGKDIQPLIKQWVDQSGVVKFYGSFAFNRKRNVLELEIKQDYTSPGTQKYVGPLKV
TVQELDGSFNHTLQIEENSLKHDIPCHKKKKKKKKKK"
misc_feature 341..1525
/gene="TAF2"
/gene_synonym="CIF150"
/gene_synonym="TAF2B"
/gene_synonym="TAFII150"
/note="Peptidase_M1; Region: Peptidase family M1. Members
of this family are aminopeptidases. The members differ
widely in specificity, hydrolysing acidic, basic or
neutral N-terminal residues. This family includes
leukotriene-A4 hydrolase, which also has
aminopeptidase activity"
/db_xref="CDD:pfam01433"
BASE COUNT 627 a 389 c 483 g 584 t
ORIGIN
1 ggggacagac aagatgtcgg cggatggtag cttcgagccc ttgcggagag gagcatctct
61 gtgacagaag cttgtcgacg gcggcttcta ggagctagtc gaaggagcga ggttgaggcg
121 ggcagcgacc cgtcaggtcg ctcacctggg caccggccag ctgcgagacg tgacttgggg
181 accgcagggg agtggagagt gtgaggtgcc aaagactagt aatgccccgt atccccctag
241 gaagccggga agccaagctc cgcgggaccg cttcatgccg ctgactggtg tagagcccgc
301 cagaatgaac aggaagaaag gagacaaggg ctttgaaagc ccaaggccat ataaattaac
361 ccatcaggtc gtctgcatca acaacataaa tttccagaga aaatctgttg tgggatttgt
421 ggaactgact atatttccca cagttgcaaa cttgaataga atcaagttga acagcaaaca
481 gtgtagaata taccgagtaa ggatcaatga tttagaggct gcttttattt ataatgaccc
541 aaccttggaa gtttgtcaca gtgaatcaaa acagagaaac ctcaattatt tttccaatgc
601 ttatgcagct gcagttagtg ctgtggaccc tgatgcagga aatggagaac tttgcattaa
661 ggttccatca gagctatgga aacacgttga tgagttaaag gtcctgaaga tacacatcaa
721 tttttctttg gatcagccca aaggatgtct tcattttgtg gtacccagtg tagagggaag
781 tatggcagag agaggtgctc atgttttctc ttgtgggtat caaaattcta caagattttg
841 gttcccttgt gttgattcat actctgaatt gtgtacatgg aaattagaat ttacagtaga
901 tgctgcaatg gttgctgttt ctaatggcga tttggtggag acagtgtata ctcatgatat
961 gaggaagaaa actttccatt atatgcttac cattcctaca gcagcgtcaa atatctcctt
1021 ggccattgga ccatttgaaa tactggtaga tccatacatg catgaggtta ctcatttttg
1081 tttgccccaa cttcttccat tgctgaaaca taccacatca taccttcatg aagtctttga
1141 attttatgaa gaaattctta catgtcgtta cccatactcc tgttttaaga ctgtcttcat
1201 tgatgaggct tatgttgaag tggctgctta tgcttccatg agcattttta gcacaaatct
1261 tttacacagt gccatgatta tagatgagac acctttgact agaaggtgtt tagcccaatc
1321 cttggcccag cagttttttg gttgtttcat atctagaatg tcttggtctg atgaatgggt
1381 gctgaaggga atttcaggct atatctatgg actttggatg aaaaaaactt ttggtgttaa
1441 tgagtaccgc cattggatta aagaggagct agacaaaata gtggcatatg aactaaaaac
1501 tggtggggtt ttactacatc ccatatttgg tggaggaaaa gagaaggata atccggcttc
1561 ccatctacac ttttcaataa agcatccaca tacactgtcc tgggaatact acactatgtt
1621 tcagtgtaaa gcccaccttg tgatgagatt gattgaaaat aggatcagta tggaatttat
1681 gctacaagtt ttcaataaac tgctaagtct ggctagtact gcttcatctc agaagttcca
1741 gtcacatatg tggagtcaga tgttggtttc cacatctggg tttttgaaat ccatttcaaa
1801 tgtctctggc aaagatattc agccgttaat aaagcagtgg gtagatcaga gtggagtggt
1861 aaaattttat ggaagttttg catttaatag aaaacgaaat gtcttggaac tggaaataaa
1921 acaggactat acatctcctg gaactcagaa atacgtggga ccacttaaag tgacagtgca
1981 ggagttagat ggatccttca atcatacact gcaaattgaa gaaaacagcc ttaaacatga
2041 tataccctgc cataaaaaaa aaaaaaaaaa aaaaaaaaaa aaa
//