LOCUS       BC004434                1648 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens splicing factor 3a, subunit 2, 66kDa, mRNA (cDNA clone
            MGC:3975 IMAGE:2820900), complete cds.
ACCESSION   BC004434
VERSION     BC004434.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1648)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 1648)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (12-MAR-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: DCTD/DTP
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Genome Sequence Centre,
            BC Cancer Agency, Vancouver, BC, Canada
            info@bcgsc.bc.ca
            Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson
            Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen
            Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel
            Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave
            Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth
            Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy
            Liao, Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR
            Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang,
            Jacquie Schein, Asim Siddiqui, Steven Jones, Rob Holt, Marco
            Marra.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 9 Row: m Column: 21
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 54112114.
FEATURES             Location/Qualifiers
     source          1..1648
                     /db_xref="H-InvDB:HIT000031940"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:3975 IMAGE:2820900"
                     /tissue_type="Lung, small cell carcinoma"
                     /clone_lib="NIH_MGC_7"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..1648
                     /gene="SF3A2"
                     /gene_synonym="PRP11"
                     /gene_synonym="PRPF11"
                     /gene_synonym="SAP62"
                     /gene_synonym="SF3a66"
                     /db_xref="GeneID:8175"
                     /db_xref="HGNC:HGNC:10766"
                     /db_xref="MIM:600796"
     CDS             93..1487
                     /gene="SF3A2"
                     /gene_synonym="PRP11"
                     /gene_synonym="PRPF11"
                     /gene_synonym="SAP62"
                     /gene_synonym="SF3a66"
                     /codon_start=1
                     /product="splicing factor 3a, subunit 2, 66kDa"
                     /protein_id="AAH04434.1"
                     /db_xref="GeneID:8175"
                     /db_xref="HGNC:HGNC:10766"
                     /db_xref="MIM:600796"
                     /translation="MDFQHRPGGKTGSGGVASSSESNRDRRERLRQLALETIDINKDP
                     YFMKNHLGSYECKLCLTLHNNEGSYLAHTQGKKHQTNLARRAAKEAKEAPAQPAPEKV
                     KVEVKKFVKIGRPGYKVTKQRDSEMGQQSLLFQIDYPEIAEGIMPRHRFMSAYEQRIE
                     PPDRRWQYLLMAAEPYETIAFKVPSREIDKAEGKFWTHWNRETKQFFLQFHFKMEKPP
                     APPSLPAGPPGVKRPPPPLMNGLPPRPPLPESLPPPPPGGLPLPPMPPTGPAPSGPPG
                     PPQLPPPAPGVHPPAPVVHPPASGVHPPAPGVHPPAPGVHPPAPGVHPPTSGVHPPAP
                     GVHPPAPGVHPPAPGVHPPAPGVHPPAPGVHPPPSAGVHPQAPGVHPAAPAVHPQAPG
                     VHPPAPGMHPQAPGVHPQPPGVHPSAPGVHPQPPGVHPSNPGVHPPTPMPPMLRPPLP
                     SEGPGNIPPPPPTN"
BASE COUNT      331 a    649 c    434 g    234 t
ORIGIN
        1 ccaggggcag gggctggggc gaaacgagga ggagataacg cggccttggg ctctggtgtc
       61 tcccagtctg ctaaagccct aaggccatca ccatggactt ccagcatcgc cccgggggca
      121 agaccgggag cgggggcgtg gcctcctcct ccgagagcaa ccgtgaccgc agggagcgcc
      181 tccggcagct ggccctggag accatcgaca tcaacaagga cccgtacttc atgaagaacc
      241 acctgggctc ctatgaatgc aaactctgcc tgacacttca caacaatgag gggagctacc
      301 tggcacatac gcaggggaag aagcaccaga ccaacctggc ccggcgagca gccaaggagg
      361 ccaaggaggc ccctgcccag cccgcgcctg agaaggtcaa ggtggaggtg aagaagtttg
      421 tgaagatcgg ccgcccgggc tacaaagtga ccaagcagag agactcggag atgggccagc
      481 agagcctcct cttccagatt gactaccctg agatcgccga gggcatcatg ccacgtcacc
      541 gcttcatgtc tgcgtacgag cagaggatcg agcctccgga ccggcgctgg cagtacctgc
      601 tcatggccgc cgagccctac gagaccattg ccttcaaggt gccgagcaga gagatcgaca
      661 aggcggaggg caagttctgg acacactgga accgggagac caagcagttc ttcctccagt
      721 tccactttaa gatggagaag cccccggctc cacccagcct ccctgctggc ccccctgggg
      781 tgaagcggcc tccacccccg ctgatgaacg gtctgccccc tcggccaccg ctgcctgagt
      841 ctttgccacc gcccccgcca ggaggcctgc ctctgccacc catgcccccc acagggcctg
      901 cgccctcagg gcccccggga ccaccccagc tacccccgcc agctccaggg gtccaccccc
      961 cggccccagt ggtgcatccc cctgcatctg gggtccatcc cccagctcct ggcgtccacc
     1021 ccccagctcc tggcgtccat cccccagccc ctggggtcca cccaccaacc tctggggtcc
     1081 accccccagc tcctggagtc caccctccag cccccggggt tcacccacca gcccccggag
     1141 tccacccacc agcccctggg gttcacccac cagccccagg ggtccatcct cccccatcag
     1201 cgggggttca cccccaggcc ccgggggtgc acccagcagc ccccgccgtt caccctcagg
     1261 ccccaggggt gcacccacca gccccaggga tgcaccctca ggccccgggg gtccaccccc
     1321 aacctcccgg ggtccatccg tcggctcctg gggtccaccc tcagcctccg ggagttcacc
     1381 cctcaaatcc tggggtgcac cccccaactc ccatgccccc aatgctgagg cccccacttc
     1441 cctccgaagg cccagggaac atacctcccc ctcccccaac caactgagaa gctgctccct
     1501 cccccagcaa gcccagcgcc aggtgctctt gccttttccc actgagagaa ggctgctctt
     1561 ttgtactgcc ccccgctcat taaacagcct cccccagccc tgagtgcact gatgtccgca
     1621 aaaaaaaaaa aaaaaaaaaa aaaaaaaa
//