LOCUS       BC000401                2864 bp    mRNA    linear   HUM 16-SEP-2003
DEFINITION  Homo sapiens splicing factor 3b, subunit 2, 145kDa, mRNA (cDNA
            clone IMAGE:2822659), partial cds.
ACCESSION   BC000401
VERSION     BC000401.2
KEYWORDS    .
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2864)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2864)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (15-NOV-2000) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Aug 19, 2003 this sequence version replaced BC000401.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: DCTD/DTP
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D., McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 1 Row: i Column: 22.
FEATURES             Location/Qualifiers
     source          1..2864
                     /db_xref="H-InvDB:HIT000085833"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:2822659"
                     /tissue_type="Lung, small cell carcinoma"
                     /clone_lib="NIH_MGC_7"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            <1..2864
                     /gene="SF3B2"
                     /gene_synonym="SAP145"
                     /gene_synonym="SF3b1"
                     /gene_synonym="SF3B145"
                     /gene_synonym="SF3b150"
                     /db_xref="GeneID:10992"
                     /db_xref="MIM:605591"
     CDS             <1..2687
                     /gene="SF3B2"
                     /gene_synonym="SAP145"
                     /gene_synonym="SF3b1"
                     /gene_synonym="SF3B145"
                     /gene_synonym="SF3b150"
                     /codon_start=3
                     /product="SF3B2 protein"
                     /protein_id="AAH00401.2"
                     /db_xref="GeneID:10992"
                     /db_xref="MIM:605591"
                     /translation="ATEHPEPPKAELQLPPPPPPGHYGAWAAQELQAKLAEIGAPIQG
                     NREELVERLQSYTRQTGIVLNRPVLRGEDGDKAAPPPMSAQLPGIPMPPPPLGLPPLQ
                     PPPPPPPPPPGLGLGFPMAHPPNLGPPPPLRVGEPVALSEEERLKLAQQQAALLMQQE
                     ERAKQQGDHSLKEHELLEQQKRAAVLLEQERQQEIAKMGTPVPRPPQDMGQIGVRTPL
                     GPRVAAPVGPVGPTPTVLPMGAPVPRPRGPPPPPGDENREMDDPSVGPKIPQALEKIL
                     QLKESRQEEMNSQQEEEEMETDARSSLGQSASETEEDTVSVSKKEKNRKRRNRKKKKK
                     PQRVRGVSSESSGDREKDSTRSRGSDSPAADVEIEYVTEEPEIYEPNFIFFKRIFEAF
                     KLTDDVKKEKEKEPEKLDKLENSAAPKKKGFEEEHKDSDDDSSDDEQEKKPEAPKLSK
                     KKLRRMNRFTVAELKQLVARPDVVEMHDVTAQDPKLLVHLKATRNSVPVPRHWCFKRK
                     YLQGKRGIEKPPFELPDFIKRTGIQEMREALQEKEEQKTMKSKMREKVRPKMGKIDID
                     YQKLHDAFFKWQTKPKLTIHGDLYYEGKEFETRLKEKKPGDLSDELRISLGMPVGPNA
                     HKVPPPWLIAMQRYGPPPSYPNLKIPGLNSPIPESCSFGYHAGGWGKPPVDETGKPLY
                     GDVFGTNAAEFQTKTEEEEIDRTPWGELEPSDEESSEEEEEEESDEDKPDETGFITPA
                     DSGLITPGGFSSVPAGMETPELIELRKKKIEEAMDGSETPQLFTVLPEKRTATVGGAM
                     MGSTHIYDMSTVMSRKGPAPELQGVEVALAPEELELDPMAMTQKYEEHVREQQAQVEK
                     EDFSDMVAEHAAKQKQKKRKAQPQDSRGGSKKYKEFKF"
     misc_feature    69..173
                     /gene="SF3B2"
                     /gene_synonym="SAP145"
                     /gene_synonym="SF3b1"
                     /gene_synonym="SF3B145"
                     /gene_synonym="SF3b150"
                     /note="SAP; Region: SAP domain. The SAP (after SAF-A/B,
                     Acinus and PIAS) motif is a putative DNA binding domain
                     found in diverse nuclear proteins"
                     /db_xref="CDD:pfam02037"
     misc_feature    1362..2099
                     /gene="SF3B2"
                     /gene_synonym="SAP145"
                     /gene_synonym="SF3b1"
                     /gene_synonym="SF3B145"
                     /gene_synonym="SF3b150"
                     /note="CUS1; Region: COG5182, CUS1, Splicing factor 3b,
                     subunit 2 [RNA processing and modification]"
                     /db_xref="CDD:COG5182"
BASE COUNT          766 a          745 c          833 g          520 t
ORIGIN      
        1 tggcgacgga gcatcccgag cctcccaaag cagaattgca gctgccgccg ccgccacctc
       61 caggccacta tggcgcctgg gctgcccagg agcttcaggc caagttggca gagatcggag
      121 ctccgatcca gggtaatcgc gaggagctgg tggagcggct gcagagctac acccgccaga
      181 ctggcatcgt gctgaatcgg ccggttttga gaggggaaga tggggacaaa gccgctccac
      241 ctcccatgtc ggcacagctc cctggaattc ccatgccacc accacctttg ggactccccc
      301 ctctgcagcc tcctccgcca cccccaccac ctccaccagg ccttggcctt ggctttccta
      361 tggcccaccc accaaatttg gggcccccgc ctcctctccg tgtgggtgag ccagtggcac
      421 tgtcagagga ggagcggctg aagttggctc agcagcaggc ggcattgctg atgcagcagg
      481 aggagcgtgc caagcagcag ggagatcatt cgctgaagga acatgagctc ttggagcagc
      541 agaagcgggc agctgtgtta ctggagcagg aacgacagca ggagattgcc aagatgggca
      601 ccccagtccc tcggccccca caagacatgg gccagattgg tgtgcgcact cctctgggtc
      661 ctcgagtagc tgctccagtg ggcccagtgg gccccactcc tacagttttg cccatgggag
      721 cccctgttcc ccggcctcgt ggtcccccac cgccccctgg agatgagaac agagagatgg
      781 atgacccctc tgtgggcccc aagatccccc aggctttgga gaagatcctg cagctgaagg
      841 agagccgcca ggaagagatg aattctcagc aggaggaaga ggaaatggaa acagatgctc
      901 gctcgtccct gggccagtca gcgtcagaga ctgaggagga cacagtgtcc gtatctaaaa
      961 aggagaaaaa ccggaagcgt aggaaccgaa agaagaagaa aaagccccag cgggtgcgag
     1021 gggtgtcctc tgagagctct ggggaccggg agaaagactc aacccggtcc cgtggctctg
     1081 attccccagc agctgatgtt gagattgagt atgtgactga agaacctgaa atttacgagc
     1141 ccaactttat cttctttaag aggatctttg aggcttttaa gctcactgat gatgtgaaga
     1201 aggagaaaga gaaagagcca gagaaacttg acaaactgga gaactctgca gcccccaaga
     1261 agaagggatt tgaagaggag cacaaggaca gtgatgatga cagcagtgat gacgagcagg
     1321 aaaagaagcc agaagccccc aagctgtcca agaagaagtt gcgccgaatg aaccgcttca
     1381 ctgtggctga actcaagcag ctggtggctc ggcccgatgt cgtggagatg cacgatgtga
     1441 cagcgcagga ccctaagctc ttggttcacc tcaaggccac tcggaactct gtgcctgtgc
     1501 cacgccactg gtgttttaag cgcaaatacc tgcagggcaa acggggcatt gagaagcccc
     1561 ccttcgagct gccagacttc atcaaacgca caggcatcca ggagatgcga gaggccctgc
     1621 aggagaagga agaacagaag accatgaagt caaaaatgcg agagaaagtt cggcctaaga
     1681 tgggcaaaat tgacatcgac taccagaaac tgcatgatgc cttcttcaag tggcagacca
     1741 agccaaagct gaccatccat ggggacctgt actatgaggg gaaggagttc gagacacgac
     1801 tgaaggagaa gaagccagga gatctgtctg atgagctaag gatttccttg gggatgccag
     1861 taggaccaaa tgcccacaag gtccctcccc catggctgat tgccatgcag cgatatggac
     1921 cacccccatc gtatcccaac ctgaaaatcc ctgggctgaa ctcgcccatc cctgagagct
     1981 gttcctttgg gtaccatgct ggtggctggg gcaaacctcc agtggatgag actgggaaac
     2041 cgctctatgg ggacgtgttt ggaaccaatg ctgctgaatt tcagaccaag actgaggaag
     2101 aagagattga tcggacccct tggggggaac tggaaccatc tgatgaagaa tcctcagaag
     2161 aagaggaaga ggaagaaagt gatgaagaca aaccagatga gacaggcttt attacccctg
     2221 cagacagtgg ccttatcact cctggaggct tttcatcagt gcctgctgga atggagaccc
     2281 ctgaactcat tgagctgagg aagaagaaga ttgaggaggc gatggacgga agtgagacac
     2341 ctcagctctt cactgtgttg ccagagaaga gaacagccac tgttggaggg gccatgatgg
     2401 gatcaaccca catttatgac atgtccacgg ttatgagccg gaagggcccg gctcctgagc
     2461 tgcaaggtgt ggaagtggcg ctggcgcctg aagagttgga gctggatcct atggccatga
     2521 cccagaagta tgaggagcat gtgcgggagc agcaggctca agtagagaag gaggacttca
     2581 gtgacatggt ggctgagcac gctgccaaac agaagcaaaa aaaacggaaa gctcagcccc
     2641 aggacagccg tgggggcagc aagaaatata aggagttcaa gttttaggtc ccctcacact
     2701 agcccttttt ttggccctac gtctggatgc ctgggcttca cacaagaacc acctctcccg
     2761 cagttcccaa ggacttgtca tttcatgttc ttattttaga cctgttttgt aaataaagct
     2821 gtttcccaag gaaagagatg aaaaaaaaaa aaaaaaaaaa aaaa
//