LOCUS BC052302 1102 bp mRNA linear HUM 15-JUL-2006 DEFINITION Homo sapiens THO complex 4, mRNA (cDNA clone MGC:59943 IMAGE:6381528), complete cds. ACCESSION BC052302 VERSION BC052302.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1102) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 1102) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (02-MAY-2003) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: DCTD/DTP/Gazdar cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 47 Row: l Column: 16 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 55770863. FEATURES Location/Qualifiers source 1..1102 /db_xref="H-InvDB:HIT000053776" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:59943 IMAGE:6381528" /tissue_type="Lung, large cell carcinoma" /clone_lib="NIH_MGC_18" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..1102 /gene="THOC4" /gene_synonym="ALY" /gene_synonym="BEF" /db_xref="GeneID:10189" /db_xref="HGNC:HGNC:19071" /db_xref="MIM:604171" CDS 28..801 /gene="THOC4" /gene_synonym="ALY" /gene_synonym="BEF" /codon_start=1 /product="THO complex 4" /protein_id="AAH52302.1" /db_xref="GeneID:10189" /db_xref="HGNC:HGNC:19071" /db_xref="MIM:604171" /translation="MADKMDMSLDDIIKLNRSQRGGRGGGRGRGRAGSQGGRGGGAQA AARVNRGGGPIRNRPAIARGAAGGGGRNRPAPYSRPKQLPDKWQHDLFDSGFGGGAGV ETGGKLLVSNLDFGVSDADIQELFAEFGTLKKAAVHYDRSGRSLGTADVHFERKADAL KAMKQYNGVPLDGRPMNIQLVTSQIDAQRRPAQSVNRGGMTRNRGAGGFGGGGGTRRG TRGGARGRGRGAGRNSKQQLSAEELDAQLDAYNARMDTS" BASE COUNT 250 a 271 c 355 g 226 t ORIGIN 1 gagccgatgc ccgattccgc gcccgccatg gccgacaaaa tggacatgtc tctggacgac 61 atcattaaac tgaaccggag ccagcgaggc ggccggggcg ggggccgggg ccgcggccgg 121 gccggctccc agggcggccg cggcggtggg gcgcaggccg ccgcgcgagt gaatcgaggc 181 ggcgggccca tccggaaccg gccggccatc gcccgcggcg cggccggcgg aggcggcagg 241 aaccgaccgg cgccctacag caggccaaaa caacttcccg acaagtggca gcacgatctt 301 ttcgacagtg gcttcggcgg tggtgccggc gtggagacag gtgggaaact gctggtgtcc 361 aatctggatt ttggagtctc agacgccgat attcaggaac tctttgctga atttggaacg 421 ctgaagaagg cggctgtgca ctatgatcgc tctggtcgca gcttaggaac agcagacgtg 481 cactttgagc ggaaggcaga tgccctgaag gccatgaagc agtacaacgg cgtccctctg 541 gatggccgcc ccatgaacat tcagcttgtc acgtcacaga ttgacgcaca gcggaggcct 601 gcacagagcg taaacagagg tggcatgact agaaaccgtg gcgctggagg ttttggtggt 661 ggtggaggca cccggagagg cacccgcgga ggcgcccgtg gaagaggcag aggtgccggc 721 aggaattcaa agcagcagct ttcggcagag gagctggatg cccagctgga cgcctataat 781 gcgagaatgg acaccagtta aacagaccag caaatccgcg tgcggaacag gacccaggcg 841 tctcctcttg ctccctggtt ggggggcggt ggctggggct gtgcggccaa tgatggattt 901 gtttctttta tgttttaaaa taggatttaa aaactcatgt aaaggttttt ttttttcttt 961 tttttttttt aattctgaaa cagacctgtt ttgtaccgag ttatttttgg gataaatttt 1021 actggttgct gttgtggaga aggtggcgtt tccacctttt ccataataaa atagaaatgt 1081 gtgtaaaaaa aaaaaaaaaa aa //