LOCUS       BC022989                2248 bp    mRNA    linear   HUM 19-OCT-2006
DEFINITION  Homo sapiens THAP domain containing 6, mRNA (cDNA clone MGC:30052
            IMAGE:5113206), complete cds.
ACCESSION   BC022989
VERSION     BC022989.2
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2248)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2248)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (04-FEB-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Sep 16, 2003 this sequence version replaced BC022989.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Baylor College of Medicine Human Genome
            Sequencing Center
            Center code: BCM-HGSC
            Web site: http://www.hgsc.bcm.tmc.edu/cdna/
            Contact: amg@bcm.tmc.edu
            Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
            Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
            A.N., Gibbs, R.A.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 42 Row: m Column: 20
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 62821788.
FEATURES             Location/Qualifiers
     source          1..2248
                     /db_xref="H-InvDB:HIT000039689"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:30052 IMAGE:5113206"
                     /tissue_type="Cervix, carcinoma"
                     /clone_lib="NIH_MGC_12"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..2248
                     /gene="THAP6"
                     /gene_synonym="MGC30052"
                     /db_xref="GeneID:152815"
                     /db_xref="HGNC:HGNC:23189"
     CDS             23..691
                     /gene="THAP6"
                     /gene_synonym="MGC30052"
                     /codon_start=1
                     /product="THAP domain containing 6"
                     /protein_id="AAH22989.1"
                     /db_xref="GeneID:152815"
                     /db_xref="HGNC:HGNC:23189"
                     /translation="MVKCCSAIGCASRCLPNSKLKGLTFHVFPTDENIKRKWVLAMKR
                     LDVNAAGIWEPKKGDVLCSRHFKKTDFDRSAPNIKLKPGVIPSIFDSPYHLQGKREKL
                     HCRKNFTLKTVPATNYNHHLVGASSCIEEFQSQFIFEHSYSVMDSPKKLKHKLDHVIG
                     ELEDTKESLRNVLDREKRFQKSLRKTIRELKDECLISQETANRLDTFCWDCCQESIEQ
                     DYIS"
BASE COUNT          766 a          361 c          400 g          721 t
ORIGIN      
        1 cgttttgtta tgagttgcta aaatggtgaa atgctgctcc gccattggat gtgcttctcg
       61 ctgcttgcca aattcgaagt taaaaggact gacatttcac gtattcccca cagatgaaaa
      121 catcaaaagg aaatgggtat tagcaatgaa aagacttgat gtgaatgcag ccggcatttg
      181 ggagcctaaa aaaggagatg tgttgtgttc gaggcacttt aagaagacag attttgacag
      241 aagtgctcca aatattaaac tgaaacctgg agtcatacct tctatctttg attctccata
      301 tcacctacag gggaaaagag aaaaacttca ttgtagaaaa aacttcaccc tcaaaaccgt
      361 tccagccact aactacaatc accatcttgt tggtgcttcc tcatgtattg aagaattcca
      421 atcccagttc atttttgaac atagctacag tgtaatggac agtccaaaga aacttaagca
      481 taaattagat catgtgatcg gcgagctaga ggatacaaag gaaagtctac ggaatgtttt
      541 agaccgagaa aaacgttttc agaaatcatt gaggaagaca atcagggaat taaaggatga
      601 atgtctgatc agccaagaaa cagcaaatag actggacact ttctgttggg actgttgtca
      661 ggagagcata gaacaggact atatttcatg aaataatttc atgttacgtt ccacctaaaa
      721 ttgtcattgg tacaaatttt tataaaatct catttaccat cactaaataa tatccatcat
      781 ttaaagtgct gctttggatt ctctggagca ttatgcatta tagttgttat ccaaagactt
      841 ttttgaaaat atgcagaaat ttgtggtaat tatgtatttg tgtcttgtga caattatgtt
      901 ttatagacct acactagtgc caggtcacta ttgtaagatg ttaaaatctc aagaaaattt
      961 cacagagcta aagaaatgat gtcaaattag tcacattaag ctatagtaga aggaattgga
     1021 cacttctcca gatatttggc ttcaaaggag tacctttact tacatgtgct ttatggtaag
     1081 tacattgaat tttactttaa atgcatttta ctacaaagca caattcattt gtaatgcata
     1141 tccatcttgg attcaatcca aggtgcttta gctatcagta gtaccaaagg atctttttac
     1201 aaggcttcct gtggtattga ctctgagaat aacacatagt gaagatctgt gggcttttaa
     1261 aattgttcac agccaattta agaagacccc tcatgaagtc tcagttttca gtacagtaca
     1321 tcattcctcc tcactaggag cactttgatg taaaccagaa tagctttaaa aagacaaaaa
     1381 ggatcgtaga tctgattttt aaatggttgg ttgctttgac agatctgaac actttgcttc
     1441 atgactattt cgtcataaag gtatatgttt aaaatctgaa tggcagtact agctctatac
     1501 ttttaatact gctttgtatt ttatatgtaa agtagtattg ctgacatttt aaaaaaatac
     1561 aaaatacaaa agaaaccatt agaaattaat aactgtggct cttccagttg aaataggaat
     1621 tggagagaaa ggattagaat attttaatta ggggagtaga ttattgtcca aaggctttta
     1681 tttagagaaa cgggtaatta aaacagcagc tttagaatag cttcttactg aatatgcaaa
     1741 agaataattc cttgttattt cctaattgat ccaagtctca taaatttagc ttttgtcata
     1801 attccttacc gaaaacaact gaaattgaga gtcataaata ctgtgggtta gaataaaaac
     1861 catttgccaa agcaacactc tacttagaag cacatgtaca tacatggacc tcattcagaa
     1921 gtccatgttg tagcagttag aatttgagta tcagccattt cattgtagta acaaaaattg
     1981 aattgcattt tgtgctcagt tgtttattgt aattttattt ttgttacatt aatattagtt
     2041 aagatatggt cacttgaatt ttttgtattt aagaattttc tgttttaatg catgttatac
     2101 ttttatgtag gattccaaac cttccctcta aatgggattt aacccacatc tgcgagatca
     2161 gcgttatgct aagaggaaat cactgaggcc atatcttttt acaatctgaa aaaaaagtag
     2221 taaaaaggta gttaaaaaaa aaaaaaaa
//