LOCUS BC052302 1102 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens THO complex 4, mRNA (cDNA clone MGC:59943
IMAGE:6381528), complete cds.
ACCESSION BC052302
VERSION BC052302.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 1102)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 1102)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (02-MAY-2003) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: DCTD/DTP/Gazdar
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: National Institutes of Health Intramural
Sequencing Center (NISC),
Gaithersburg, Maryland;
Web site: http://www.nisc.nih.gov/
Contact: nisc_mgc@nhgri.nih.gov
Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
Young,A., Zhang,L.-H. and Green,E.D.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 47 Row: l Column: 16
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 55770863.
FEATURES Location/Qualifiers
source 1..1102
/db_xref="H-InvDB:HIT000053776"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:59943 IMAGE:6381528"
/tissue_type="Lung, large cell carcinoma"
/clone_lib="NIH_MGC_18"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..1102
/gene="THOC4"
/gene_synonym="ALY"
/gene_synonym="BEF"
/db_xref="GeneID:10189"
/db_xref="HGNC:HGNC:19071"
/db_xref="MIM:604171"
CDS 28..801
/gene="THOC4"
/gene_synonym="ALY"
/gene_synonym="BEF"
/codon_start=1
/product="THO complex 4"
/protein_id="AAH52302.1"
/db_xref="GeneID:10189"
/db_xref="HGNC:HGNC:19071"
/db_xref="MIM:604171"
/translation="MADKMDMSLDDIIKLNRSQRGGRGGGRGRGRAGSQGGRGGGAQA
AARVNRGGGPIRNRPAIARGAAGGGGRNRPAPYSRPKQLPDKWQHDLFDSGFGGGAGV
ETGGKLLVSNLDFGVSDADIQELFAEFGTLKKAAVHYDRSGRSLGTADVHFERKADAL
KAMKQYNGVPLDGRPMNIQLVTSQIDAQRRPAQSVNRGGMTRNRGAGGFGGGGGTRRG
TRGGARGRGRGAGRNSKQQLSAEELDAQLDAYNARMDTS"
BASE COUNT 250 a 271 c 355 g 226 t
ORIGIN
1 gagccgatgc ccgattccgc gcccgccatg gccgacaaaa tggacatgtc tctggacgac
61 atcattaaac tgaaccggag ccagcgaggc ggccggggcg ggggccgggg ccgcggccgg
121 gccggctccc agggcggccg cggcggtggg gcgcaggccg ccgcgcgagt gaatcgaggc
181 ggcgggccca tccggaaccg gccggccatc gcccgcggcg cggccggcgg aggcggcagg
241 aaccgaccgg cgccctacag caggccaaaa caacttcccg acaagtggca gcacgatctt
301 ttcgacagtg gcttcggcgg tggtgccggc gtggagacag gtgggaaact gctggtgtcc
361 aatctggatt ttggagtctc agacgccgat attcaggaac tctttgctga atttggaacg
421 ctgaagaagg cggctgtgca ctatgatcgc tctggtcgca gcttaggaac agcagacgtg
481 cactttgagc ggaaggcaga tgccctgaag gccatgaagc agtacaacgg cgtccctctg
541 gatggccgcc ccatgaacat tcagcttgtc acgtcacaga ttgacgcaca gcggaggcct
601 gcacagagcg taaacagagg tggcatgact agaaaccgtg gcgctggagg ttttggtggt
661 ggtggaggca cccggagagg cacccgcgga ggcgcccgtg gaagaggcag aggtgccggc
721 aggaattcaa agcagcagct ttcggcagag gagctggatg cccagctgga cgcctataat
781 gcgagaatgg acaccagtta aacagaccag caaatccgcg tgcggaacag gacccaggcg
841 tctcctcttg ctccctggtt ggggggcggt ggctggggct gtgcggccaa tgatggattt
901 gtttctttta tgttttaaaa taggatttaa aaactcatgt aaaggttttt ttttttcttt
961 tttttttttt aattctgaaa cagacctgtt ttgtaccgag ttatttttgg gataaatttt
1021 actggttgct gttgtggaga aggtggcgtt tccacctttt ccataataaa atagaaatgt
1081 gtgtaaaaaa aaaaaaaaaa aa
//