LOCUS BC010381 2131 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens THO complex 1, mRNA (cDNA clone MGC:13557
IMAGE:4046908), complete cds.
ACCESSION BC010381
VERSION BC010381.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2131)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2131)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (09-JUL-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: CLONTECH Laboratories, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Institute for Systems Biology
http://www.systemsbiology.org
contact: amadan@systemsbiology.org
Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 19 Row: b Column: 4
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 4826881.
FEATURES Location/Qualifiers
source 1..2131
/db_xref="H-InvDB:HIT000034910"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:13557 IMAGE:4046908"
/tissue_type="Bone marrow, chronic myelogenous leukemia"
/clone_lib="NIH_MGC_54"
/lab_host="DH10B"
/note="Vector: pDNR-LIB"
gene 1..2131
/gene="THOC1"
/gene_synonym="HPR1"
/gene_synonym="P84"
/gene_synonym="P84N5"
/db_xref="GeneID:9984"
/db_xref="HGNC:HGNC:19070"
/db_xref="MIM:606930"
CDS 32..2005
/gene="THOC1"
/gene_synonym="HPR1"
/gene_synonym="P84"
/gene_synonym="P84N5"
/codon_start=1
/product="THO complex 1"
/protein_id="AAH10381.1"
/db_xref="GeneID:9984"
/db_xref="HGNC:HGNC:19070"
/db_xref="MIM:606930"
/translation="MSPTPPLFSLPEARTRFTKSTREALNNKNIKPLLSTFSQVPGSE
NEKKCTLDQAFRGILEEEIINHSSCENVLAIISLAIGGVTEGICTASTPFVLLGDVLD
CLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCGRIQ
LFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGEMG
DEEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL
DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKG
QVKFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWK
NEGCPSFVKERTSDTKPTRIIRKRTAPEDFLGKGPTKKILMGNEELTRLWNLCPDNME
ACKSETREHMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFF
QPTNQQFKSLPEYLENMVIKLAKELPPPSEEIKTGEDEDEEDNDALLKENESPDVRRD
KPVTGEQIEVFANKLGEQWKILAPYLEMKDSEIRQIECDSEDMKMRAKQLLVAWQDQE
GVHATPENLINALNKSGLSDLAESLTNDNETNS"
BASE COUNT 740 a 371 c 464 g 556 t
ORIGIN
1 ggggcagtgg cgggcacgcg cagccgagaa gatgtctccg acgccgccgc tcttcagttt
61 gcccgaagcg cggacgcggt ttacgaagtc taccagagag gccttgaaca acaaaaacat
121 caagccattg ttaagtacct tcagccaggt acctggcagt gaaaatgaaa aaaaatgtac
181 ccttgaccaa gctttcagag gtattctaga agaagaaatt ataaatcatt catcatgtga
241 aaacgtttta gctattattt ctcttgctat tgggggagta actgaaggta tttgtaccgc
301 atctacacct tttgtattgt tgggagatgt tttggattgt cttcctttgg atcagtgtga
361 cacaatattc acttttgtgg aaaaaaatgt tgctacttgg aaatcaaata cattctattc
421 tgctgggaaa aattacttac tacgtatgtg caatgatctc ctaagaagat tgtctaaatc
481 ccagaataca gtcttctgtg gacggattca gctctttttg gccaggcttt tccctctgtc
541 tgagaaatca ggtcttaact tgcagagtca gtttaatctg gaaaatgtca ctgttttcaa
601 tacaaatgag caggaaagca ccctgggtca gaagcacact gaagatagag aagaaggaat
661 ggatgtagaa gaaggcgaaa tgggagatga ggaagctcca acaacgtgct ctattccaat
721 tgattacaac ctgtatcgaa aattctggtc acttcaggat tacttcagga accctgtgca
781 atgctatgag aagatttcat ggaaaacttt tctcaagtat tctgaagaag ttttagctgt
841 ttttaagagt tataaattag atgatactca ggcctcaaga aaaaagatgg aagaattgaa
901 aacaggagga gaacatgtat attttgcaaa atttttaaca agtgaaaagc tgatggattt
961 acaactgagt gacagtaact ttcgtcgaca catcctgttg cagtatctca ttttattcca
1021 atatctcaag gggcaggtca aattcaaaag ttcaaactat gttttaactg atgagcaatc
1081 actttggatt gaagatacta caaaatcagt ttatcaacta ctatctgaaa acccccccga
1141 tggagaaaga ttttcaaaga tggtagagca tatattaaac actgaagaaa actggaactc
1201 gtggaaaaat gaaggttgcc caagttttgt gaaagaaaga acatcagata ccaaacctac
1261 gagaataatt cggaagagaa cagcacccga ggacttccta gggaaaggac ccaccaaaaa
1321 aattctgatg ggaaatgagg agttaacaag gctttggaat ctttgccctg ataatatgga
1381 agcctgtaaa tcagagacaa gggaacacat gcccactttg gaggaattct ttgaagaagc
1441 cattgaacag gcagaccctg aaaatatggt ggaaaatgaa tataaggctg tgaacaattc
1501 aaattatggt tggagagccc tgagactatt agcacggaga agccctcact tcttccagcc
1561 aaccaaccag cagtttaaaa gtttaccaga atatcttgaa aatatggtaa taaagctagc
1621 caaggaatta ccgcctcctt ctgaagaaat aaaaacaggt gaggatgaag atgaggaaga
1681 taatgatgct ctactgaagg aaaatgaaag tcctgatgtt cggcgagaca aacctgtaac
1741 aggagaacaa atagaggtat ttgccaacaa gctgggtgaa caatggaaga ttctggctcc
1801 ctacttggaa atgaaagact cagaaattag gcagattgag tgtgacagtg aagacatgaa
1861 gatgagagct aagcagctcc tggttgcctg gcaagatcaa gagggagttc atgcaacacc
1921 tgagaatctg attaatgcac tgaataagtc tggattaagt gaccttgcag aaagtctaac
1981 taatgacaat gagacaaata gttagcttct ttttttttct ttttattaaa actgtgatag
2041 attttgttac caagcagcat ttgataagag gtccactggt tttggtaaac aataaacatt
2101 tttaaaaaaa aaaaaaaaaa aaaaaaaaaa a
//