LOCUS BC001976 2910 bp mRNA linear HUM 15-JUL-2006 DEFINITION Homo sapiens splicing factor 3a, subunit 1, 120kDa, mRNA (cDNA clone MGC:5589 IMAGE:3459964), complete cds. ACCESSION BC001976 VERSION BC001976.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2910) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2910) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (31-JAN-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Sequencing Group at the Stanford Human Genome Center, Stanford University School of Medicine, Stanford, CA 94305 Web site: http://www-shgc.stanford.edu Contact: (Dickson, Mark) mcd@paxil.stanford.edu Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers, R. M. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 4 Row: i Column: 18 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 53831993. FEATURES Location/Qualifiers source 1..2910 /db_xref="H-InvDB:HIT000258889" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:5589 IMAGE:3459964" /tissue_type="Cervix, carcinoma" /clone_lib="NIH_MGC_12" /lab_host="DH10B" /note="Vector: pCMV-SPORT6" gene 1..2910 /gene="SF3A1" /gene_synonym="PRP21" /gene_synonym="PRPF21" /gene_synonym="SAP114" /gene_synonym="SF3A120" /db_xref="GeneID:10291" /db_xref="HGNC:HGNC:10765" /db_xref="MIM:605595" CDS 96..2477 /gene="SF3A1" /gene_synonym="PRP21" /gene_synonym="PRPF21" /gene_synonym="SAP114" /gene_synonym="SF3A120" /codon_start=1 /product="splicing factor 3a, subunit 1, 120kDa" /protein_id="AAH01976.1" /db_xref="GeneID:10291" /db_xref="HGNC:HGNC:10765" /db_xref="MIM:605595" /translation="MPAGPVQAVPPPPPVPTEPKQPTEEEASSKEDSAPSKPVVGIIY PPPEVRNIVDKTASFVARNGPEFEARIRQNEINNPKFNFLNPNDPYHAYYRHKVSEFK EGKAQEPSAAIPKVMQQQQQTTQQQLPQKVQAQVIQETIVPKEPPPEFEFIADPPSIS AFDLDVVKLTAQFVARNGRQFLTQLMQKEQRNYQFDFLRPQHSLFNYFTKLVEQYTKI LIPPKGLFSKLKKEAENPREVLDQVCYRVEWAKFQERERKKEEEEKEKERVAYAQIDW HDFVVVETVDFQPNEQGNFPPPTTPEELGARILIQERYEKFGESEEVEMEVESDEEDD KQEKAEEPPSQLDQDTQVQDMDEGSDDEEEGQKVPPPPETPMPPPLPPTPDQVIVRKD YDPKASKPLPPAPAPDEYLVSPITGEKIPASKMQEHMRIGLLDPRWLEQRDRSIREKQ SDDEVYAPGLDIESSLKQLAERRTDIFGVEETAIGKKIGEEEIQKPEEKVTWDGHSGS MARTQQAAQANITLQEQIEAIHKAKGLVPEDDTKEKIGPSKPNEIPQQPPPPSSATNI PSSAPPITSVPRPPTMPPPVRTTVVSAVPVMPRPPMASVVRLPPGSVIAPMPPIIHAP RINVVPMPPSAPPIMAPRPPPMIVPTAFVPAPPVAPVPAPAPMPPVHPPPPMEDEPTS KKLKTEDSLMPEEEFLRRNKGPVSIKVQVPNMQDKTEWKLNGQVLVFTLPLTDQVSVI KVKIHEATGMPAGKQKLQYEGIFIKDSNSLAYYNMANGAVIHLALKERGGRKK" BASE COUNT 744 a 860 c 741 g 565 t ORIGIN 1 tgcgagctcg tcgtactgac cgagcgggga ggctgtcttg aggcggcacc gctcaccgac 61 accgaggcgg actggcagcc ctgagcgtcg cagtcatgcc ggccggaccc gtgcaggcgg 121 tgcccccgcc gccgcccgtg cccacggagc ccaaacagcc cacagaagaa gaagcatctt 181 caaaggagga ttctgcacct tctaagccag ttgtggggat tatttaccct cctccagagg 241 tcagaaatat tgttgacaag actgccagct ttgtggccag aaacgggcct gaatttgaag 301 ctaggatccg acagaacgag atcaacaacc ccaagttcaa ctttctgaac cccaatgacc 361 cttaccatgc ctactaccgc cacaaggtca gcgagttcaa ggaagggaag gctcaggagc 421 cgtccgccgc catccccaag gtcatgcagc agcagcagca gaccacccag cagcagctgc 481 cccagaaggt ccaagcccaa gtaatccaag agaccatcgt gcccaaagag cctcctcctg 541 agtttgagtt cattgctgat cctccctcta tctcagcctt cgacttggat gtggtgaagc 601 tgacggctca gtttgtggcc aggaatgggc gccagtttct gacccagctg atgcagaaag 661 agcagcgcaa ctaccagttt gactttctcc gcccacagca cagcctcttc aactacttca 721 cgaagctagt ggaacagtac accaagatct tgattccacc caaaggttta ttttcaaagc 781 tcaagaaaga ggctgaaaac ccccgagaag ttttggatca ggtgtgttac cgagtggaat 841 gggccaaatt ccaggaacgt gagaggaaga aggaagaaga ggagaaggag aaggagcggg 901 tggcctatgc tcagatcgac tggcatgatt ttgtggtggt ggaaacagtg gacttccaac 961 ccaatgagca agggaacttc cctcccccca ccacgccaga ggagctgggg gcccgaatcc 1021 tcattcagga gcgctatgaa aagtttgggg agagtgagga agttgagatg gaggtcgagt 1081 ctgatgagga ggatgacaaa caggagaagg cggaggagcc tccttcccag ctggaccagg 1141 acacccaagt acaagatatg gatgagggtt cagatgatga agaagaaggg cagaaagtgc 1201 ccccaccccc agagacaccc atgcctccac ctctgccccc aactccagac caagtcattg 1261 tccgcaagga ttatgatccc aaagcctcca agcccttgcc tccagcccct gctccagatg 1321 agtatcttgt gtcccccatt actggggaga agatccccgc cagcaaaatg caggaacaca 1381 tgcgcattgg acttcttgac cctcgctggc tggagcagcg ggatcgctcc atccgtgaga 1441 agcagagcga tgatgaggtg tacgcaccag gtctggatat tgagagcagc ttgaagcagt 1501 tggctgagcg gcgtactgac atcttcggtg tagaggaaac agccattggt aagaagatcg 1561 gtgaggagga gatccagaag ccagaggaaa aggtgacctg ggatggccac tcaggcagca 1621 tggcccggac ccagcaggct gcccaggcca acatcaccct ccaggagcag attgaggcca 1681 ttcacaaggc caaaggcctg gtgccagagg atgacactaa agagaagatt ggccccagca 1741 agcccaatga aatccctcaa cagccaccgc caccatcttc agccaccaac atccccagct 1801 cggctccacc catcacttca gtgccccgac cacccacaat gccacctcca gttcgtacta 1861 cagttgtctc cgcagtaccc gtcatgcccc ggcccccaat ggcatctgtg gtccggctgc 1921 ccccaggctc agtgatcgcc cccatgccgc ccatcatcca cgcgcccaga atcaacgtgg 1981 tgcccatgcc tccctcggcc cctcctatta tggccccccg cccacccccc atgattgtgc 2041 caacagcctt tgtgcctgct ccacctgtgg cacctgtccc agctccagcc ccaatgcccc 2101 ctgtgcatcc cccacctccc atggaagatg agcccacctc caaaaaactg aagacagagg 2161 acagcctcat gccagaggag gagttcctgc gcagaaacaa gggtccagtg tccatcaaag 2221 tccaggtgcc caacatgcag gataagacgg aatggaaact gaatgggcag gtgctggtct 2281 tcaccctccc actcacggac caggtctctg tcattaaggt gaagattcat gaagccacag 2341 gcatgcctgc agggaaacag aagctacagt atgagggtat cttcatcaaa gattccaact 2401 cactggctta ctacaacatg gccaatggcg cagtcatcca cctggccctc aaggagagag 2461 gcgggaggaa gaagtagaca agaggaacct gctgtcaagt ccctgccatt ttgcctctcc 2521 tgtctcccac cccctgcccc agacccagga gcccccctga ggctttgcct tgcctgcata 2581 tttgtttcgc tcttactcag tttgggaatt caaattgtcc tgcagaggtt cattcccctg 2641 accctttccc cacattggta agagtagctg ggttttctaa gccactctct ggaatctctt 2701 tgtgttaggg tctcgatttg aggacattca tttcttcagc agcccattag caactgagag 2761 cccagggatg tcctacagga tagtttcata gtgacaggtg gcacttggct aatagaatat 2821 ggctgatatt gtcattaatc attttgtacc ttgacatggg ttgtctaata aaactcggac 2881 ccttcttgtg aaaaaaaaaa aaaaaaaaaa //