LOCUS BC004346 1242 bp mRNA linear HUM 15-JUL-2006 DEFINITION Homo sapiens THAP domain containing 7, mRNA (cDNA clone MGC:10963 IMAGE:3633743), complete cds. ACCESSION BC004346 VERSION BC004346.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1242) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 1242) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (01-MAR-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Louis M. Staudt, M.D., Ph.D. cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Genome Sequence Centre, BC Cancer Agency, Vancouver, BC, Canada info@bcgsc.bc.ca Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao, Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang, Jacquie Schein, Asim Siddiqui,Steven Jones, Rob Holt, Marco Marra. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 13 Row: e Column: 24 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 56788350. FEATURES Location/Qualifiers source 1..1242 /db_xref="H-InvDB:HIT000031875" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:10963 IMAGE:3633743" /tissue_type="Lymph, Burkitt lymphoma" /clone_lib="NIH_MGC_8" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..1242 /gene="THAP7" /gene_synonym="MGC10963" /db_xref="GeneID:80764" /db_xref="HGNC:HGNC:23190" /db_xref="MIM:609518" CDS 176..1105 /gene="THAP7" /gene_synonym="MGC10963" /codon_start=1 /product="THAP domain containing 7" /protein_id="AAH04346.1" /db_xref="GeneID:80764" /db_xref="HGNC:HGNC:23190" /db_xref="MIM:609518" /translation="MPRHCSAAGCCTRDTRETRNRGISFHRLPKKDNPRRGLWLANCQ RLDPSGQGLWDPASEYIYFCSKHFEEDCFELVGISGYHRLKEGAVPTIFESFSKLRRT TKTKGHSYPPGPPEVSRLRRCRKRCSEGRGPTTPFSPPPPADVTCFPVEEASAPATLP ASPAGRLEPGLSSPFSDLLGPLGAQADEAGCSAQPSPERQPSPLEPRPVSPSAYMLRL PPPAGAYIQNEHSYQVGSALLWKRRAEAALDALDKAQRQLQACKRREQRLRLRLTKLQ QERAREKRAQADARQTLKEHVQDFAMQLSSSMA" BASE COUNT 273 a 407 c 374 g 188 t ORIGIN 1 cgtgagtgcc gctgacagaa gtcaagagaa tcggctggga cggggttggg gcgacaacgg 61 gccggggggg acccgacagg ccagagcccc ttggggagga gcggcggctg gaggcgcgag 121 gctcctccgg atgcccggag agccgcttgc gacttaactc ccgcctcttt cccagatgcc 181 gcgtcactgc tccgccgccg gctgctgcac acgggacacg cgcgagacgc gcaaccgcgg 241 catctccttc cacagacttc ccaagaagga caacccgagg cgaggcttgt ggctggccaa 301 ctgccagcgg ctggacccca gcggccaggg cctgtgggac ccggcatccg agtacatcta 361 cttctgctcc aaacactttg aggaggactg ctttgagctg gtgggaatca gtggatatca 421 caggctaaag gagggggcag tccccaccat atttgagtct ttctccaagt tgcgccggac 481 aaccaagacc aaaggacaca gttacccacc tggcccccct gaagtcagcc ggctcagacg 541 atgcaggaag cgctgctccg agggccgagg gcccacaact ccattttctc cacctccacc 601 tgctgatgtc acctgctttc ctgtggaaga ggcctcagca cctgccactt tgccggcctc 661 cccagctggg aggctggagc ctggccttag cagccccttt tcagacctac tgggcccctt 721 gggtgcccag gcagatgaag caggctgcag cgcccagcct tcaccagagc ggcagccctc 781 ccctctcgaa ccacggccag tctccccctc agcgtatatg ctgcgcctgc ccccacccgc 841 cggagcctac atccagaatg aacacagcta ccaggtgggc agcgccttac tctggaagcg 901 gcgagccgag gcagcccttg atgcccttga caaggcccag cgccagctgc aggcctgcaa 961 gcggcgggag cagcggctgc ggttgagact gaccaagctg cagcaggagc gggcacggga 1021 gaagcgggca caggcagatg cccgccagac tctgaaggag catgtgcagg actttgccat 1081 gcagctgagc agcagcatgg cctgaggggc tgctggactg accgaggggc tgcccagcaa 1141 gactgcagcc tcttcctccc tcagatccca ccagacccac caggtgccat aataaagcgg 1201 attctagacg gagaaaaaaa aaaaaaaaaa aaaaaaaaaa aa //