LOCUS       BC001976                2910 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens splicing factor 3a, subunit 1, 120kDa, mRNA (cDNA
            clone MGC:5589 IMAGE:3459964), complete cds.
ACCESSION   BC001976
VERSION     BC001976.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2910)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2910)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (31-JAN-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA  94305
            Web site:       http://www-shgc.stanford.edu
            Contact:  (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
            R. M.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 4 Row: i Column: 18
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 53831993.
FEATURES             Location/Qualifiers
     source          1..2910
                     /db_xref="H-InvDB:HIT000258889"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:5589 IMAGE:3459964"
                     /tissue_type="Cervix, carcinoma"
                     /clone_lib="NIH_MGC_12"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..2910
                     /gene="SF3A1"
                     /gene_synonym="PRP21"
                     /gene_synonym="PRPF21"
                     /gene_synonym="SAP114"
                     /gene_synonym="SF3A120"
                     /db_xref="GeneID:10291"
                     /db_xref="HGNC:HGNC:10765"
                     /db_xref="MIM:605595"
     CDS             96..2477
                     /gene="SF3A1"
                     /gene_synonym="PRP21"
                     /gene_synonym="PRPF21"
                     /gene_synonym="SAP114"
                     /gene_synonym="SF3A120"
                     /codon_start=1
                     /product="splicing factor 3a, subunit 1, 120kDa"
                     /protein_id="AAH01976.1"
                     /db_xref="GeneID:10291"
                     /db_xref="HGNC:HGNC:10765"
                     /db_xref="MIM:605595"
                     /translation="MPAGPVQAVPPPPPVPTEPKQPTEEEASSKEDSAPSKPVVGIIY
                     PPPEVRNIVDKTASFVARNGPEFEARIRQNEINNPKFNFLNPNDPYHAYYRHKVSEFK
                     EGKAQEPSAAIPKVMQQQQQTTQQQLPQKVQAQVIQETIVPKEPPPEFEFIADPPSIS
                     AFDLDVVKLTAQFVARNGRQFLTQLMQKEQRNYQFDFLRPQHSLFNYFTKLVEQYTKI
                     LIPPKGLFSKLKKEAENPREVLDQVCYRVEWAKFQERERKKEEEEKEKERVAYAQIDW
                     HDFVVVETVDFQPNEQGNFPPPTTPEELGARILIQERYEKFGESEEVEMEVESDEEDD
                     KQEKAEEPPSQLDQDTQVQDMDEGSDDEEEGQKVPPPPETPMPPPLPPTPDQVIVRKD
                     YDPKASKPLPPAPAPDEYLVSPITGEKIPASKMQEHMRIGLLDPRWLEQRDRSIREKQ
                     SDDEVYAPGLDIESSLKQLAERRTDIFGVEETAIGKKIGEEEIQKPEEKVTWDGHSGS
                     MARTQQAAQANITLQEQIEAIHKAKGLVPEDDTKEKIGPSKPNEIPQQPPPPSSATNI
                     PSSAPPITSVPRPPTMPPPVRTTVVSAVPVMPRPPMASVVRLPPGSVIAPMPPIIHAP
                     RINVVPMPPSAPPIMAPRPPPMIVPTAFVPAPPVAPVPAPAPMPPVHPPPPMEDEPTS
                     KKLKTEDSLMPEEEFLRRNKGPVSIKVQVPNMQDKTEWKLNGQVLVFTLPLTDQVSVI
                     KVKIHEATGMPAGKQKLQYEGIFIKDSNSLAYYNMANGAVIHLALKERGGRKK"
BASE COUNT          744 a          860 c          741 g          565 t
ORIGIN      
        1 tgcgagctcg tcgtactgac cgagcgggga ggctgtcttg aggcggcacc gctcaccgac
       61 accgaggcgg actggcagcc ctgagcgtcg cagtcatgcc ggccggaccc gtgcaggcgg
      121 tgcccccgcc gccgcccgtg cccacggagc ccaaacagcc cacagaagaa gaagcatctt
      181 caaaggagga ttctgcacct tctaagccag ttgtggggat tatttaccct cctccagagg
      241 tcagaaatat tgttgacaag actgccagct ttgtggccag aaacgggcct gaatttgaag
      301 ctaggatccg acagaacgag atcaacaacc ccaagttcaa ctttctgaac cccaatgacc
      361 cttaccatgc ctactaccgc cacaaggtca gcgagttcaa ggaagggaag gctcaggagc
      421 cgtccgccgc catccccaag gtcatgcagc agcagcagca gaccacccag cagcagctgc
      481 cccagaaggt ccaagcccaa gtaatccaag agaccatcgt gcccaaagag cctcctcctg
      541 agtttgagtt cattgctgat cctccctcta tctcagcctt cgacttggat gtggtgaagc
      601 tgacggctca gtttgtggcc aggaatgggc gccagtttct gacccagctg atgcagaaag
      661 agcagcgcaa ctaccagttt gactttctcc gcccacagca cagcctcttc aactacttca
      721 cgaagctagt ggaacagtac accaagatct tgattccacc caaaggttta ttttcaaagc
      781 tcaagaaaga ggctgaaaac ccccgagaag ttttggatca ggtgtgttac cgagtggaat
      841 gggccaaatt ccaggaacgt gagaggaaga aggaagaaga ggagaaggag aaggagcggg
      901 tggcctatgc tcagatcgac tggcatgatt ttgtggtggt ggaaacagtg gacttccaac
      961 ccaatgagca agggaacttc cctcccccca ccacgccaga ggagctgggg gcccgaatcc
     1021 tcattcagga gcgctatgaa aagtttgggg agagtgagga agttgagatg gaggtcgagt
     1081 ctgatgagga ggatgacaaa caggagaagg cggaggagcc tccttcccag ctggaccagg
     1141 acacccaagt acaagatatg gatgagggtt cagatgatga agaagaaggg cagaaagtgc
     1201 ccccaccccc agagacaccc atgcctccac ctctgccccc aactccagac caagtcattg
     1261 tccgcaagga ttatgatccc aaagcctcca agcccttgcc tccagcccct gctccagatg
     1321 agtatcttgt gtcccccatt actggggaga agatccccgc cagcaaaatg caggaacaca
     1381 tgcgcattgg acttcttgac cctcgctggc tggagcagcg ggatcgctcc atccgtgaga
     1441 agcagagcga tgatgaggtg tacgcaccag gtctggatat tgagagcagc ttgaagcagt
     1501 tggctgagcg gcgtactgac atcttcggtg tagaggaaac agccattggt aagaagatcg
     1561 gtgaggagga gatccagaag ccagaggaaa aggtgacctg ggatggccac tcaggcagca
     1621 tggcccggac ccagcaggct gcccaggcca acatcaccct ccaggagcag attgaggcca
     1681 ttcacaaggc caaaggcctg gtgccagagg atgacactaa agagaagatt ggccccagca
     1741 agcccaatga aatccctcaa cagccaccgc caccatcttc agccaccaac atccccagct
     1801 cggctccacc catcacttca gtgccccgac cacccacaat gccacctcca gttcgtacta
     1861 cagttgtctc cgcagtaccc gtcatgcccc ggcccccaat ggcatctgtg gtccggctgc
     1921 ccccaggctc agtgatcgcc cccatgccgc ccatcatcca cgcgcccaga atcaacgtgg
     1981 tgcccatgcc tccctcggcc cctcctatta tggccccccg cccacccccc atgattgtgc
     2041 caacagcctt tgtgcctgct ccacctgtgg cacctgtccc agctccagcc ccaatgcccc
     2101 ctgtgcatcc cccacctccc atggaagatg agcccacctc caaaaaactg aagacagagg
     2161 acagcctcat gccagaggag gagttcctgc gcagaaacaa gggtccagtg tccatcaaag
     2221 tccaggtgcc caacatgcag gataagacgg aatggaaact gaatgggcag gtgctggtct
     2281 tcaccctccc actcacggac caggtctctg tcattaaggt gaagattcat gaagccacag
     2341 gcatgcctgc agggaaacag aagctacagt atgagggtat cttcatcaaa gattccaact
     2401 cactggctta ctacaacatg gccaatggcg cagtcatcca cctggccctc aaggagagag
     2461 gcgggaggaa gaagtagaca agaggaacct gctgtcaagt ccctgccatt ttgcctctcc
     2521 tgtctcccac cccctgcccc agacccagga gcccccctga ggctttgcct tgcctgcata
     2581 tttgtttcgc tcttactcag tttgggaatt caaattgtcc tgcagaggtt cattcccctg
     2641 accctttccc cacattggta agagtagctg ggttttctaa gccactctct ggaatctctt
     2701 tgtgttaggg tctcgatttg aggacattca tttcttcagc agcccattag caactgagag
     2761 cccagggatg tcctacagga tagtttcata gtgacaggtg gcacttggct aatagaatat
     2821 ggctgatatt gtcattaatc attttgtacc ttgacatggg ttgtctaata aaactcggac
     2881 ccttcttgtg aaaaaaaaaa aaaaaaaaaa
//