LOCUS       BC052302                1102 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens THO complex 4, mRNA (cDNA clone MGC:59943
            IMAGE:6381528), complete cds.
ACCESSION   BC052302
VERSION     BC052302.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1102)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 1102)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (02-MAY-2003) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: DCTD/DTP/Gazdar
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 47 Row: l Column: 16
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 55770863.
FEATURES             Location/Qualifiers
     source          1..1102
                     /db_xref="H-InvDB:HIT000053776"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:59943 IMAGE:6381528"
                     /tissue_type="Lung, large cell carcinoma"
                     /clone_lib="NIH_MGC_18"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..1102
                     /gene="THOC4"
                     /gene_synonym="ALY"
                     /gene_synonym="BEF"
                     /db_xref="GeneID:10189"
                     /db_xref="HGNC:HGNC:19071"
                     /db_xref="MIM:604171"
     CDS             28..801
                     /gene="THOC4"
                     /gene_synonym="ALY"
                     /gene_synonym="BEF"
                     /codon_start=1
                     /product="THO complex 4"
                     /protein_id="AAH52302.1"
                     /db_xref="GeneID:10189"
                     /db_xref="HGNC:HGNC:19071"
                     /db_xref="MIM:604171"
                     /translation="MADKMDMSLDDIIKLNRSQRGGRGGGRGRGRAGSQGGRGGGAQA
                     AARVNRGGGPIRNRPAIARGAAGGGGRNRPAPYSRPKQLPDKWQHDLFDSGFGGGAGV
                     ETGGKLLVSNLDFGVSDADIQELFAEFGTLKKAAVHYDRSGRSLGTADVHFERKADAL
                     KAMKQYNGVPLDGRPMNIQLVTSQIDAQRRPAQSVNRGGMTRNRGAGGFGGGGGTRRG
                     TRGGARGRGRGAGRNSKQQLSAEELDAQLDAYNARMDTS"
BASE COUNT          250 a          271 c          355 g          226 t
ORIGIN      
        1 gagccgatgc ccgattccgc gcccgccatg gccgacaaaa tggacatgtc tctggacgac
       61 atcattaaac tgaaccggag ccagcgaggc ggccggggcg ggggccgggg ccgcggccgg
      121 gccggctccc agggcggccg cggcggtggg gcgcaggccg ccgcgcgagt gaatcgaggc
      181 ggcgggccca tccggaaccg gccggccatc gcccgcggcg cggccggcgg aggcggcagg
      241 aaccgaccgg cgccctacag caggccaaaa caacttcccg acaagtggca gcacgatctt
      301 ttcgacagtg gcttcggcgg tggtgccggc gtggagacag gtgggaaact gctggtgtcc
      361 aatctggatt ttggagtctc agacgccgat attcaggaac tctttgctga atttggaacg
      421 ctgaagaagg cggctgtgca ctatgatcgc tctggtcgca gcttaggaac agcagacgtg
      481 cactttgagc ggaaggcaga tgccctgaag gccatgaagc agtacaacgg cgtccctctg
      541 gatggccgcc ccatgaacat tcagcttgtc acgtcacaga ttgacgcaca gcggaggcct
      601 gcacagagcg taaacagagg tggcatgact agaaaccgtg gcgctggagg ttttggtggt
      661 ggtggaggca cccggagagg cacccgcgga ggcgcccgtg gaagaggcag aggtgccggc
      721 aggaattcaa agcagcagct ttcggcagag gagctggatg cccagctgga cgcctataat
      781 gcgagaatgg acaccagtta aacagaccag caaatccgcg tgcggaacag gacccaggcg
      841 tctcctcttg ctccctggtt ggggggcggt ggctggggct gtgcggccaa tgatggattt
      901 gtttctttta tgttttaaaa taggatttaa aaactcatgt aaaggttttt ttttttcttt
      961 tttttttttt aattctgaaa cagacctgtt ttgtaccgag ttatttttgg gataaatttt
     1021 actggttgct gttgtggaga aggtggcgtt tccacctttt ccataataaa atagaaatgt
     1081 gtgtaaaaaa aaaaaaaaaa aa
//