LOCUS BC022989 2248 bp mRNA linear HUM 19-OCT-2006 DEFINITION Homo sapiens THAP domain containing 6, mRNA (cDNA clone MGC:30052 IMAGE:5113206), complete cds. ACCESSION BC022989 VERSION BC022989.2 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2248) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2248) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (04-FEB-2002) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Sep 16, 2003 this sequence version replaced BC022989.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Baylor College of Medicine Human Genome Sequencing Center Center code: BCM-HGSC Web site: http://www.hgsc.bcm.tmc.edu/cdna/ Contact: amg@bcm.tmc.edu Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H., Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati, A.N., Gibbs, R.A. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 42 Row: m Column: 20 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 62821788. FEATURES Location/Qualifiers source 1..2248 /db_xref="H-InvDB:HIT000039689" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:30052 IMAGE:5113206" /tissue_type="Cervix, carcinoma" /clone_lib="NIH_MGC_12" /lab_host="DH10B" /note="Vector: pCMV-SPORT6" gene 1..2248 /gene="THAP6" /gene_synonym="MGC30052" /db_xref="GeneID:152815" /db_xref="HGNC:HGNC:23189" CDS 23..691 /gene="THAP6" /gene_synonym="MGC30052" /codon_start=1 /product="THAP domain containing 6" /protein_id="AAH22989.1" /db_xref="GeneID:152815" /db_xref="HGNC:HGNC:23189" /translation="MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDENIKRKWVLAMKR LDVNAAGIWEPKKGDVLCSRHFKKTDFDRSAPNIKLKPGVIPSIFDSPYHLQGKREKL HCRKNFTLKTVPATNYNHHLVGASSCIEEFQSQFIFEHSYSVMDSPKKLKHKLDHVIG ELEDTKESLRNVLDREKRFQKSLRKTIRELKDECLISQETANRLDTFCWDCCQESIEQ DYIS" BASE COUNT 766 a 361 c 400 g 721 t ORIGIN 1 cgttttgtta tgagttgcta aaatggtgaa atgctgctcc gccattggat gtgcttctcg 61 ctgcttgcca aattcgaagt taaaaggact gacatttcac gtattcccca cagatgaaaa 121 catcaaaagg aaatgggtat tagcaatgaa aagacttgat gtgaatgcag ccggcatttg 181 ggagcctaaa aaaggagatg tgttgtgttc gaggcacttt aagaagacag attttgacag 241 aagtgctcca aatattaaac tgaaacctgg agtcatacct tctatctttg attctccata 301 tcacctacag gggaaaagag aaaaacttca ttgtagaaaa aacttcaccc tcaaaaccgt 361 tccagccact aactacaatc accatcttgt tggtgcttcc tcatgtattg aagaattcca 421 atcccagttc atttttgaac atagctacag tgtaatggac agtccaaaga aacttaagca 481 taaattagat catgtgatcg gcgagctaga ggatacaaag gaaagtctac ggaatgtttt 541 agaccgagaa aaacgttttc agaaatcatt gaggaagaca atcagggaat taaaggatga 601 atgtctgatc agccaagaaa cagcaaatag actggacact ttctgttggg actgttgtca 661 ggagagcata gaacaggact atatttcatg aaataatttc atgttacgtt ccacctaaaa 721 ttgtcattgg tacaaatttt tataaaatct catttaccat cactaaataa tatccatcat 781 ttaaagtgct gctttggatt ctctggagca ttatgcatta tagttgttat ccaaagactt 841 ttttgaaaat atgcagaaat ttgtggtaat tatgtatttg tgtcttgtga caattatgtt 901 ttatagacct acactagtgc caggtcacta ttgtaagatg ttaaaatctc aagaaaattt 961 cacagagcta aagaaatgat gtcaaattag tcacattaag ctatagtaga aggaattgga 1021 cacttctcca gatatttggc ttcaaaggag tacctttact tacatgtgct ttatggtaag 1081 tacattgaat tttactttaa atgcatttta ctacaaagca caattcattt gtaatgcata 1141 tccatcttgg attcaatcca aggtgcttta gctatcagta gtaccaaagg atctttttac 1201 aaggcttcct gtggtattga ctctgagaat aacacatagt gaagatctgt gggcttttaa 1261 aattgttcac agccaattta agaagacccc tcatgaagtc tcagttttca gtacagtaca 1321 tcattcctcc tcactaggag cactttgatg taaaccagaa tagctttaaa aagacaaaaa 1381 ggatcgtaga tctgattttt aaatggttgg ttgctttgac agatctgaac actttgcttc 1441 atgactattt cgtcataaag gtatatgttt aaaatctgaa tggcagtact agctctatac 1501 ttttaatact gctttgtatt ttatatgtaa agtagtattg ctgacatttt aaaaaaatac 1561 aaaatacaaa agaaaccatt agaaattaat aactgtggct cttccagttg aaataggaat 1621 tggagagaaa ggattagaat attttaatta ggggagtaga ttattgtcca aaggctttta 1681 tttagagaaa cgggtaatta aaacagcagc tttagaatag cttcttactg aatatgcaaa 1741 agaataattc cttgttattt cctaattgat ccaagtctca taaatttagc ttttgtcata 1801 attccttacc gaaaacaact gaaattgaga gtcataaata ctgtgggtta gaataaaaac 1861 catttgccaa agcaacactc tacttagaag cacatgtaca tacatggacc tcattcagaa 1921 gtccatgttg tagcagttag aatttgagta tcagccattt cattgtagta acaaaaattg 1981 aattgcattt tgtgctcagt tgtttattgt aattttattt ttgttacatt aatattagtt 2041 aagatatggt cacttgaatt ttttgtattt aagaattttc tgttttaatg catgttatac 2101 ttttatgtag gattccaaac cttccctcta aatgggattt aacccacatc tgcgagatca 2161 gcgttatgct aagaggaaat cactgaggcc atatcttttt acaatctgaa aaaaaagtag 2221 taaaaaggta gttaaaaaaa aaaaaaaa //