LOCUS       BC010381                2131 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens THO complex 1, mRNA (cDNA clone MGC:13557
            IMAGE:4046908), complete cds.
ACCESSION   BC010381
VERSION     BC010381.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2131)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2131)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (09-JUL-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: CLONTECH Laboratories, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
            Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 19 Row: b Column: 4
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 4826881.
FEATURES             Location/Qualifiers
     source          1..2131
                     /db_xref="H-InvDB:HIT000034910"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:13557 IMAGE:4046908"
                     /tissue_type="Bone marrow, chronic myelogenous leukemia"
                     /clone_lib="NIH_MGC_54"
                     /lab_host="DH10B"
                     /note="Vector: pDNR-LIB"
     gene            1..2131
                     /gene="THOC1"
                     /gene_synonym="HPR1"
                     /gene_synonym="P84"
                     /gene_synonym="P84N5"
                     /db_xref="GeneID:9984"
                     /db_xref="HGNC:HGNC:19070"
                     /db_xref="MIM:606930"
     CDS             32..2005
                     /gene="THOC1"
                     /gene_synonym="HPR1"
                     /gene_synonym="P84"
                     /gene_synonym="P84N5"
                     /codon_start=1
                     /product="THO complex 1"
                     /protein_id="AAH10381.1"
                     /db_xref="GeneID:9984"
                     /db_xref="HGNC:HGNC:19070"
                     /db_xref="MIM:606930"
                     /translation="MSPTPPLFSLPEARTRFTKSTREALNNKNIKPLLSTFSQVPGSE
                     NEKKCTLDQAFRGILEEEIINHSSCENVLAIISLAIGGVTEGICTASTPFVLLGDVLD
                     CLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCGRIQ
                     LFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGEMG
                     DEEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL
                     DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKG
                     QVKFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWK
                     NEGCPSFVKERTSDTKPTRIIRKRTAPEDFLGKGPTKKILMGNEELTRLWNLCPDNME
                     ACKSETREHMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFF
                     QPTNQQFKSLPEYLENMVIKLAKELPPPSEEIKTGEDEDEEDNDALLKENESPDVRRD
                     KPVTGEQIEVFANKLGEQWKILAPYLEMKDSEIRQIECDSEDMKMRAKQLLVAWQDQE
                     GVHATPENLINALNKSGLSDLAESLTNDNETNS"
BASE COUNT      740 a    371 c    464 g    556 t
ORIGIN
        1 ggggcagtgg cgggcacgcg cagccgagaa gatgtctccg acgccgccgc tcttcagttt
       61 gcccgaagcg cggacgcggt ttacgaagtc taccagagag gccttgaaca acaaaaacat
      121 caagccattg ttaagtacct tcagccaggt acctggcagt gaaaatgaaa aaaaatgtac
      181 ccttgaccaa gctttcagag gtattctaga agaagaaatt ataaatcatt catcatgtga
      241 aaacgtttta gctattattt ctcttgctat tgggggagta actgaaggta tttgtaccgc
      301 atctacacct tttgtattgt tgggagatgt tttggattgt cttcctttgg atcagtgtga
      361 cacaatattc acttttgtgg aaaaaaatgt tgctacttgg aaatcaaata cattctattc
      421 tgctgggaaa aattacttac tacgtatgtg caatgatctc ctaagaagat tgtctaaatc
      481 ccagaataca gtcttctgtg gacggattca gctctttttg gccaggcttt tccctctgtc
      541 tgagaaatca ggtcttaact tgcagagtca gtttaatctg gaaaatgtca ctgttttcaa
      601 tacaaatgag caggaaagca ccctgggtca gaagcacact gaagatagag aagaaggaat
      661 ggatgtagaa gaaggcgaaa tgggagatga ggaagctcca acaacgtgct ctattccaat
      721 tgattacaac ctgtatcgaa aattctggtc acttcaggat tacttcagga accctgtgca
      781 atgctatgag aagatttcat ggaaaacttt tctcaagtat tctgaagaag ttttagctgt
      841 ttttaagagt tataaattag atgatactca ggcctcaaga aaaaagatgg aagaattgaa
      901 aacaggagga gaacatgtat attttgcaaa atttttaaca agtgaaaagc tgatggattt
      961 acaactgagt gacagtaact ttcgtcgaca catcctgttg cagtatctca ttttattcca
     1021 atatctcaag gggcaggtca aattcaaaag ttcaaactat gttttaactg atgagcaatc
     1081 actttggatt gaagatacta caaaatcagt ttatcaacta ctatctgaaa acccccccga
     1141 tggagaaaga ttttcaaaga tggtagagca tatattaaac actgaagaaa actggaactc
     1201 gtggaaaaat gaaggttgcc caagttttgt gaaagaaaga acatcagata ccaaacctac
     1261 gagaataatt cggaagagaa cagcacccga ggacttccta gggaaaggac ccaccaaaaa
     1321 aattctgatg ggaaatgagg agttaacaag gctttggaat ctttgccctg ataatatgga
     1381 agcctgtaaa tcagagacaa gggaacacat gcccactttg gaggaattct ttgaagaagc
     1441 cattgaacag gcagaccctg aaaatatggt ggaaaatgaa tataaggctg tgaacaattc
     1501 aaattatggt tggagagccc tgagactatt agcacggaga agccctcact tcttccagcc
     1561 aaccaaccag cagtttaaaa gtttaccaga atatcttgaa aatatggtaa taaagctagc
     1621 caaggaatta ccgcctcctt ctgaagaaat aaaaacaggt gaggatgaag atgaggaaga
     1681 taatgatgct ctactgaagg aaaatgaaag tcctgatgtt cggcgagaca aacctgtaac
     1741 aggagaacaa atagaggtat ttgccaacaa gctgggtgaa caatggaaga ttctggctcc
     1801 ctacttggaa atgaaagact cagaaattag gcagattgag tgtgacagtg aagacatgaa
     1861 gatgagagct aagcagctcc tggttgcctg gcaagatcaa gagggagttc atgcaacacc
     1921 tgagaatctg attaatgcac tgaataagtc tggattaagt gaccttgcag aaagtctaac
     1981 taatgacaat gagacaaata gttagcttct ttttttttct ttttattaaa actgtgatag
     2041 attttgttac caagcagcat ttgataagag gtccactggt tttggtaaac aataaacatt
     2101 tttaaaaaaa aaaaaaaaaa aaaaaaaaaa a
//