LOCUS       BC053577                2639 bp    mRNA    linear   HUM 16-SEP-2003
DEFINITION  Homo sapiens splicing factor 3b, subunit 2, 145kDa, mRNA (cDNA
            clone IMAGE:4478742), partial cds.
ACCESSION   BC053577
VERSION     BC053577.1
KEYWORDS    .
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2639)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2639)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (09-JUN-2003) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 115 Row: a Column: 24.
FEATURES             Location/Qualifiers
     source          1..2639
                     /db_xref="H-InvDB:HIT000259295"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:4478742"
                     /tissue_type="Testis, embryonal carcinoma"
                     /clone_lib="NIH_MGC_92"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..>2639
                     /gene="SF3B2"
                     /gene_synonym="SAP145"
                     /gene_synonym="SF3b1"
                     /gene_synonym="SF3B145"
                     /gene_synonym="SF3b150"
                     /db_xref="GeneID:10992"
                     /db_xref="MIM:605591"
     CDS             8..>2639
                     /gene="SF3B2"
                     /gene_synonym="SAP145"
                     /gene_synonym="SF3b1"
                     /gene_synonym="SF3B145"
                     /gene_synonym="SF3b150"
                     /codon_start=1
                     /product="SF3B2 protein"
                     /protein_id="AAH53577.1"
                     /db_xref="GeneID:10992"
                     /db_xref="MIM:605591"
                     /translation="MATEHPEPPKAELQLPPPPPPGHYGAWAAQELQAKLAEIGAPIQ
                     GNREELVERLQSYTRQTGIVLNRPVLRGEDGDKAAPPPMSAQLPGIPMPPPPLGLPPL
                     QPPPPPPPPPPGLGLGFPMAHPPNLGPPPPLRVGEPVALSEEERLKLAQQQAALLMQQ
                     EERAKQQGDHSLKEHELLEQQKRAAVLLEQERQQEIAKMGTPVPRPPQDMGQIGVRTP
                     LGPRVAAPVGPVGPTPTVLPMGAPVPRPRGPPPPPGDENREMDDPSVGPKIPQALEKI
                     LQLKESRQEEMNSQQEEEEMETDARSSLGQSASETEEDTVSVSKKEKNRKRRNRKKKK
                     KPQRVRGVSSESSGDREKDSTRSRGSDSPAADVEIEYVTEEPEIYEPNFIFFKRIFEA
                     FKLTDDVKKEKEKEPEKLDKLENSAAPKKKGFEEEHKDSDDDSSDDEQEKKPEAPKLS
                     KKKLRRMNRFTVAELKQLVARPDVVEMHDVTAQDPKLLVHLKATRNSVPVPRHWCFKR
                     KYLQGKRGIEKPPFELPDFIKRTGIQEMREALQEKEEQKTMKSKMREKVRPKMGKIDI
                     DYQKLHDAFFKWQTKPKLTIHGDLYYEGKEFETRLKEKKPGDLSDELRISLGMPVGPN
                     AHKVPPPWLIAMQRYGPPPSYPNLKIPGLNSPIPESCSFGYHAGGWGKPPVDETGKPL
                     YGDVFGTNAAEFQTKTEEEEIDRTPWGELEPSDEESSEEEEEEESDEDKPDETGFITP
                     ADSGLITPGGFSSVPAGMETPELIELRKKKIEEAMDGSETPQLFTVLPEKRTATVGGA
                     MMGSTHIYDMSTVMSRKGPAPELQGVEVALAPEELELDPMAMTQKYEEHVREQQAQVE
                     KEDFSDMVAEHAAKQKQKKKK"
     misc_feature    77..181
                     /gene="SF3B2"
                     /gene_synonym="SAP145"
                     /gene_synonym="SF3b1"
                     /gene_synonym="SF3B145"
                     /gene_synonym="SF3b150"
                     /note="SAP; Region: SAP domain. The SAP (after SAF-A/B,
                     Acinus and PIAS) motif is a putative DNA binding domain
                     found in diverse nuclear proteins"
                     /db_xref="CDD:pfam02037"
     misc_feature    1370..2107
                     /gene="SF3B2"
                     /gene_synonym="SAP145"
                     /gene_synonym="SF3b1"
                     /gene_synonym="SF3B145"
                     /gene_synonym="SF3b150"
                     /note="CUS1; Region: COG5182, CUS1, Splicing factor 3b,
                     subunit 2 [RNA processing and modification]"
                     /db_xref="CDD:COG5182"
BASE COUNT          697 a          690 c          789 g          463 t
ORIGIN      
        1 ggctaagatg gcgacggagc atcccgagcc tcccaaagca gaattgcagc tgccgccgcc
       61 gccacctcca ggccactatg gcgcctgggc tgcccaggag cttcaggcca agttggcaga
      121 gatcggagct ccgatccagg gtaatcgcga ggagctggtg gagcggctgc agagctacac
      181 ccgccagact ggcatcgtgc tgaatcggcc ggttttgaga ggggaagatg gggacaaagc
      241 cgctccacct cccatgtcgg cacagctccc tggaattccc atgccaccac cacctttggg
      301 actcccccct ctgcagcctc ctccgccacc cccaccacct ccaccaggcc ttggccttgg
      361 ctttcctatg gcccacccac caaatttggg gcccccgcct cctctccgtg tgggtgagcc
      421 agtggcactg tcagaggagg agcggctgaa gttggctcag cagcaggcgg cattgctgat
      481 gcagcaggag gagcgtgcca agcagcaggg agatcattcg ctgaaggaac atgagctctt
      541 ggagcagcag aagcgggcag ctgtgttact ggagcaggaa cgacagcagg agattgccaa
      601 gatgggcacc ccagtccctc ggcccccaca agacatgggc cagattggtg tgcgcactcc
      661 tctgggtcct cgagtagctg ctccagtggg cccagtgggc cccactccta cagttttgcc
      721 catgggagcc cctgttcccc ggcctcgtgg tcccccaccg ccccctggag atgagaacag
      781 agagatggat gacccctctg tgggccccaa gatcccccag gctttggaga agatcctgca
      841 gctgaaggag agccgccagg aagagatgaa ttctcagcag gaggaagagg aaatggaaac
      901 agatgctcgc tcgtccctgg gccagtcagc gtcagagact gaggaggaca cagtgtccgt
      961 atctaaaaag gagaaaaacc ggaagcgtag gaaccgaaag aagaagaaaa agccccagcg
     1021 ggtgcgaggg gtgtcctctg agagctctgg ggaccgggag aaagactcaa cccggtcccg
     1081 tggctctgat tccccagcag ctgatgttga gattgagtat gtgactgaag aacctgaaat
     1141 ttacgagccc aactttatct tctttaagag gatctttgag gcttttaagc tcactgatga
     1201 tgtgaagaag gagaaagaga aggagccaga gaaacttgac aaactggaga actctgcagc
     1261 ccccaagaag aagggatttg aagaggagca caaggacagt gatgatgaca gcagtgatga
     1321 cgagcaggaa aagaagccag aagcccccaa gctgtccaag aagaagttgc gccgaatgaa
     1381 ccgcttcact gtggctgaac tcaagcagct ggtggctcgg cccgatgtcg tggagatgca
     1441 cgatgtgaca gcgcaggacc ctaagctctt ggttcacctc aaggccactc ggaactctgt
     1501 gcctgtgcca cgccactggt gttttaagcg caaatacctg cagggcaaac ggggcattga
     1561 gaagcccccc ttcgagctgc cagacttcat caaacgcaca ggcatccagg agatgcgaga
     1621 ggccctgcag gagaaggaag aacagaagac catgaagtca aaaatgcgag agaaagttcg
     1681 gcctaagatg ggcaaaattg acatcgacta ccagaaactg catgatgcct tcttcaagtg
     1741 gcagaccaag ccaaagctga ccatccatgg ggacctgtac tatgagggga aggagttcga
     1801 gacacgactg aaggagaaga agccaggaga tctgtctgat gagctaagga tttccttggg
     1861 gatgccagta ggaccaaatg cccacaaggt ccctccccca tggctgattg ccatgcagcg
     1921 atatggacca cccccatcgt atcccaacct gaaaatccct gggctgaact cgcccatccc
     1981 tgagagctgt tcctttgggt accatgctgg tggctggggc aaacctccag tggatgagac
     2041 tgggaaaccg ctctatgggg acgtgtttgg aaccaatgct gctgaatttc agaccaagac
     2101 tgaggaagaa gagattgatc ggaccccttg gggggaactg gaaccatctg atgaagaatc
     2161 ctcagaagaa gaggaagagg aagaaagtga tgaagacaaa ccagatgaga caggctttat
     2221 tacccctgca gacagtggcc ttatcactcc tggaggcttt tcatcagtgc ctgctggaat
     2281 ggagacccct gaactcattg agctgaggaa gaagaagatt gaggaggcga tggacggaag
     2341 tgagacacct cagctcttca ctgtgttgcc agagaagaga acagccactg ttggaggggc
     2401 catgatggga tcaacccaca tttatgacat gtccacggtt atgagccgga agggcccggc
     2461 tcctgagctg caaggtgtgg aagtggcgct ggcgcctgaa gagttggagc tggatcctat
     2521 ggccatgacc cagaagtatg aggagcatgt gcgggagcag caggctcaag tagagaagga
     2581 ggacttcagt gacatggtgg ctgagcacgc tgccaaacag aagcaaaaaa aaaaaaaaa
//