LOCUS       BC010381                2131 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens THO complex 1, mRNA (cDNA clone MGC:13557
            IMAGE:4046908), complete cds.
ACCESSION   BC010381
VERSION     BC010381.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2131)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2131)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (09-JUL-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: CLONTECH Laboratories, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
            Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 19 Row: b Column: 4
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 4826881.
FEATURES             Location/Qualifiers
     source          1..2131
                     /db_xref="H-InvDB:HIT000034910"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:13557 IMAGE:4046908"
                     /tissue_type="Bone marrow, chronic myelogenous leukemia"
                     /clone_lib="NIH_MGC_54"
                     /lab_host="DH10B"
                     /note="Vector: pDNR-LIB"
     gene            1..2131
                     /gene="THOC1"
                     /gene_synonym="HPR1"
                     /gene_synonym="P84"
                     /gene_synonym="P84N5"
                     /db_xref="GeneID:9984"
                     /db_xref="HGNC:HGNC:19070"
                     /db_xref="MIM:606930"
     CDS             32..2005
                     /gene="THOC1"
                     /gene_synonym="HPR1"
                     /gene_synonym="P84"
                     /gene_synonym="P84N5"
                     /codon_start=1
                     /product="THO complex 1"
                     /protein_id="AAH10381.1"
                     /db_xref="GeneID:9984"
                     /db_xref="HGNC:HGNC:19070"
                     /db_xref="MIM:606930"
                     /translation="MSPTPPLFSLPEARTRFTKSTREALNNKNIKPLLSTFSQVPGSE
                     NEKKCTLDQAFRGILEEEIINHSSCENVLAIISLAIGGVTEGICTASTPFVLLGDVLD
                     CLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCGRIQ
                     LFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGEMG
                     DEEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL
                     DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKG
                     QVKFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWK
                     NEGCPSFVKERTSDTKPTRIIRKRTAPEDFLGKGPTKKILMGNEELTRLWNLCPDNME
                     ACKSETREHMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFF
                     QPTNQQFKSLPEYLENMVIKLAKELPPPSEEIKTGEDEDEEDNDALLKENESPDVRRD
                     KPVTGEQIEVFANKLGEQWKILAPYLEMKDSEIRQIECDSEDMKMRAKQLLVAWQDQE
                     GVHATPENLINALNKSGLSDLAESLTNDNETNS"
BASE COUNT          740 a          371 c          464 g          556 t
ORIGIN      
        1 ggggcagtgg cgggcacgcg cagccgagaa gatgtctccg acgccgccgc tcttcagttt
       61 gcccgaagcg cggacgcggt ttacgaagtc taccagagag gccttgaaca acaaaaacat
      121 caagccattg ttaagtacct tcagccaggt acctggcagt gaaaatgaaa aaaaatgtac
      181 ccttgaccaa gctttcagag gtattctaga agaagaaatt ataaatcatt catcatgtga
      241 aaacgtttta gctattattt ctcttgctat tgggggagta actgaaggta tttgtaccgc
      301 atctacacct tttgtattgt tgggagatgt tttggattgt cttcctttgg atcagtgtga
      361 cacaatattc acttttgtgg aaaaaaatgt tgctacttgg aaatcaaata cattctattc
      421 tgctgggaaa aattacttac tacgtatgtg caatgatctc ctaagaagat tgtctaaatc
      481 ccagaataca gtcttctgtg gacggattca gctctttttg gccaggcttt tccctctgtc
      541 tgagaaatca ggtcttaact tgcagagtca gtttaatctg gaaaatgtca ctgttttcaa
      601 tacaaatgag caggaaagca ccctgggtca gaagcacact gaagatagag aagaaggaat
      661 ggatgtagaa gaaggcgaaa tgggagatga ggaagctcca acaacgtgct ctattccaat
      721 tgattacaac ctgtatcgaa aattctggtc acttcaggat tacttcagga accctgtgca
      781 atgctatgag aagatttcat ggaaaacttt tctcaagtat tctgaagaag ttttagctgt
      841 ttttaagagt tataaattag atgatactca ggcctcaaga aaaaagatgg aagaattgaa
      901 aacaggagga gaacatgtat attttgcaaa atttttaaca agtgaaaagc tgatggattt
      961 acaactgagt gacagtaact ttcgtcgaca catcctgttg cagtatctca ttttattcca
     1021 atatctcaag gggcaggtca aattcaaaag ttcaaactat gttttaactg atgagcaatc
     1081 actttggatt gaagatacta caaaatcagt ttatcaacta ctatctgaaa acccccccga
     1141 tggagaaaga ttttcaaaga tggtagagca tatattaaac actgaagaaa actggaactc
     1201 gtggaaaaat gaaggttgcc caagttttgt gaaagaaaga acatcagata ccaaacctac
     1261 gagaataatt cggaagagaa cagcacccga ggacttccta gggaaaggac ccaccaaaaa
     1321 aattctgatg ggaaatgagg agttaacaag gctttggaat ctttgccctg ataatatgga
     1381 agcctgtaaa tcagagacaa gggaacacat gcccactttg gaggaattct ttgaagaagc
     1441 cattgaacag gcagaccctg aaaatatggt ggaaaatgaa tataaggctg tgaacaattc
     1501 aaattatggt tggagagccc tgagactatt agcacggaga agccctcact tcttccagcc
     1561 aaccaaccag cagtttaaaa gtttaccaga atatcttgaa aatatggtaa taaagctagc
     1621 caaggaatta ccgcctcctt ctgaagaaat aaaaacaggt gaggatgaag atgaggaaga
     1681 taatgatgct ctactgaagg aaaatgaaag tcctgatgtt cggcgagaca aacctgtaac
     1741 aggagaacaa atagaggtat ttgccaacaa gctgggtgaa caatggaaga ttctggctcc
     1801 ctacttggaa atgaaagact cagaaattag gcagattgag tgtgacagtg aagacatgaa
     1861 gatgagagct aagcagctcc tggttgcctg gcaagatcaa gagggagttc atgcaacacc
     1921 tgagaatctg attaatgcac tgaataagtc tggattaagt gaccttgcag aaagtctaac
     1981 taatgacaat gagacaaata gttagcttct ttttttttct ttttattaaa actgtgatag
     2041 attttgttac caagcagcat ttgataagag gtccactggt tttggtaaac aataaacatt
     2101 tttaaaaaaa aaaaaaaaaa aaaaaaaaaa a
//