LOCUS BC001976 2910 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens splicing factor 3a, subunit 1, 120kDa, mRNA (cDNA
clone MGC:5589 IMAGE:3459964), complete cds.
ACCESSION BC001976
VERSION BC001976.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2910)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2910)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (31-JAN-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Sequencing Group at the Stanford Human Genome
Center, Stanford University School of Medicine, Stanford, CA 94305
Web site: http://www-shgc.stanford.edu
Contact: (Dickson, Mark) mcd@paxil.stanford.edu
Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
R. M.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 4 Row: i Column: 18
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 53831993.
FEATURES Location/Qualifiers
source 1..2910
/db_xref="H-InvDB:HIT000258889"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:5589 IMAGE:3459964"
/tissue_type="Cervix, carcinoma"
/clone_lib="NIH_MGC_12"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene 1..2910
/gene="SF3A1"
/gene_synonym="PRP21"
/gene_synonym="PRPF21"
/gene_synonym="SAP114"
/gene_synonym="SF3A120"
/db_xref="GeneID:10291"
/db_xref="HGNC:HGNC:10765"
/db_xref="MIM:605595"
CDS 96..2477
/gene="SF3A1"
/gene_synonym="PRP21"
/gene_synonym="PRPF21"
/gene_synonym="SAP114"
/gene_synonym="SF3A120"
/codon_start=1
/product="splicing factor 3a, subunit 1, 120kDa"
/protein_id="AAH01976.1"
/db_xref="GeneID:10291"
/db_xref="HGNC:HGNC:10765"
/db_xref="MIM:605595"
/translation="MPAGPVQAVPPPPPVPTEPKQPTEEEASSKEDSAPSKPVVGIIY
PPPEVRNIVDKTASFVARNGPEFEARIRQNEINNPKFNFLNPNDPYHAYYRHKVSEFK
EGKAQEPSAAIPKVMQQQQQTTQQQLPQKVQAQVIQETIVPKEPPPEFEFIADPPSIS
AFDLDVVKLTAQFVARNGRQFLTQLMQKEQRNYQFDFLRPQHSLFNYFTKLVEQYTKI
LIPPKGLFSKLKKEAENPREVLDQVCYRVEWAKFQERERKKEEEEKEKERVAYAQIDW
HDFVVVETVDFQPNEQGNFPPPTTPEELGARILIQERYEKFGESEEVEMEVESDEEDD
KQEKAEEPPSQLDQDTQVQDMDEGSDDEEEGQKVPPPPETPMPPPLPPTPDQVIVRKD
YDPKASKPLPPAPAPDEYLVSPITGEKIPASKMQEHMRIGLLDPRWLEQRDRSIREKQ
SDDEVYAPGLDIESSLKQLAERRTDIFGVEETAIGKKIGEEEIQKPEEKVTWDGHSGS
MARTQQAAQANITLQEQIEAIHKAKGLVPEDDTKEKIGPSKPNEIPQQPPPPSSATNI
PSSAPPITSVPRPPTMPPPVRTTVVSAVPVMPRPPMASVVRLPPGSVIAPMPPIIHAP
RINVVPMPPSAPPIMAPRPPPMIVPTAFVPAPPVAPVPAPAPMPPVHPPPPMEDEPTS
KKLKTEDSLMPEEEFLRRNKGPVSIKVQVPNMQDKTEWKLNGQVLVFTLPLTDQVSVI
KVKIHEATGMPAGKQKLQYEGIFIKDSNSLAYYNMANGAVIHLALKERGGRKK"
BASE COUNT 744 a 860 c 741 g 565 t
ORIGIN
1 tgcgagctcg tcgtactgac cgagcgggga ggctgtcttg aggcggcacc gctcaccgac
61 accgaggcgg actggcagcc ctgagcgtcg cagtcatgcc ggccggaccc gtgcaggcgg
121 tgcccccgcc gccgcccgtg cccacggagc ccaaacagcc cacagaagaa gaagcatctt
181 caaaggagga ttctgcacct tctaagccag ttgtggggat tatttaccct cctccagagg
241 tcagaaatat tgttgacaag actgccagct ttgtggccag aaacgggcct gaatttgaag
301 ctaggatccg acagaacgag atcaacaacc ccaagttcaa ctttctgaac cccaatgacc
361 cttaccatgc ctactaccgc cacaaggtca gcgagttcaa ggaagggaag gctcaggagc
421 cgtccgccgc catccccaag gtcatgcagc agcagcagca gaccacccag cagcagctgc
481 cccagaaggt ccaagcccaa gtaatccaag agaccatcgt gcccaaagag cctcctcctg
541 agtttgagtt cattgctgat cctccctcta tctcagcctt cgacttggat gtggtgaagc
601 tgacggctca gtttgtggcc aggaatgggc gccagtttct gacccagctg atgcagaaag
661 agcagcgcaa ctaccagttt gactttctcc gcccacagca cagcctcttc aactacttca
721 cgaagctagt ggaacagtac accaagatct tgattccacc caaaggttta ttttcaaagc
781 tcaagaaaga ggctgaaaac ccccgagaag ttttggatca ggtgtgttac cgagtggaat
841 gggccaaatt ccaggaacgt gagaggaaga aggaagaaga ggagaaggag aaggagcggg
901 tggcctatgc tcagatcgac tggcatgatt ttgtggtggt ggaaacagtg gacttccaac
961 ccaatgagca agggaacttc cctcccccca ccacgccaga ggagctgggg gcccgaatcc
1021 tcattcagga gcgctatgaa aagtttgggg agagtgagga agttgagatg gaggtcgagt
1081 ctgatgagga ggatgacaaa caggagaagg cggaggagcc tccttcccag ctggaccagg
1141 acacccaagt acaagatatg gatgagggtt cagatgatga agaagaaggg cagaaagtgc
1201 ccccaccccc agagacaccc atgcctccac ctctgccccc aactccagac caagtcattg
1261 tccgcaagga ttatgatccc aaagcctcca agcccttgcc tccagcccct gctccagatg
1321 agtatcttgt gtcccccatt actggggaga agatccccgc cagcaaaatg caggaacaca
1381 tgcgcattgg acttcttgac cctcgctggc tggagcagcg ggatcgctcc atccgtgaga
1441 agcagagcga tgatgaggtg tacgcaccag gtctggatat tgagagcagc ttgaagcagt
1501 tggctgagcg gcgtactgac atcttcggtg tagaggaaac agccattggt aagaagatcg
1561 gtgaggagga gatccagaag ccagaggaaa aggtgacctg ggatggccac tcaggcagca
1621 tggcccggac ccagcaggct gcccaggcca acatcaccct ccaggagcag attgaggcca
1681 ttcacaaggc caaaggcctg gtgccagagg atgacactaa agagaagatt ggccccagca
1741 agcccaatga aatccctcaa cagccaccgc caccatcttc agccaccaac atccccagct
1801 cggctccacc catcacttca gtgccccgac cacccacaat gccacctcca gttcgtacta
1861 cagttgtctc cgcagtaccc gtcatgcccc ggcccccaat ggcatctgtg gtccggctgc
1921 ccccaggctc agtgatcgcc cccatgccgc ccatcatcca cgcgcccaga atcaacgtgg
1981 tgcccatgcc tccctcggcc cctcctatta tggccccccg cccacccccc atgattgtgc
2041 caacagcctt tgtgcctgct ccacctgtgg cacctgtccc agctccagcc ccaatgcccc
2101 ctgtgcatcc cccacctccc atggaagatg agcccacctc caaaaaactg aagacagagg
2161 acagcctcat gccagaggag gagttcctgc gcagaaacaa gggtccagtg tccatcaaag
2221 tccaggtgcc caacatgcag gataagacgg aatggaaact gaatgggcag gtgctggtct
2281 tcaccctccc actcacggac caggtctctg tcattaaggt gaagattcat gaagccacag
2341 gcatgcctgc agggaaacag aagctacagt atgagggtat cttcatcaaa gattccaact
2401 cactggctta ctacaacatg gccaatggcg cagtcatcca cctggccctc aaggagagag
2461 gcgggaggaa gaagtagaca agaggaacct gctgtcaagt ccctgccatt ttgcctctcc
2521 tgtctcccac cccctgcccc agacccagga gcccccctga ggctttgcct tgcctgcata
2581 tttgtttcgc tcttactcag tttgggaatt caaattgtcc tgcagaggtt cattcccctg
2641 accctttccc cacattggta agagtagctg ggttttctaa gccactctct ggaatctctt
2701 tgtgttaggg tctcgatttg aggacattca tttcttcagc agcccattag caactgagag
2761 cccagggatg tcctacagga tagtttcata gtgacaggtg gcacttggct aatagaatat
2821 ggctgatatt gtcattaatc attttgtacc ttgacatggg ttgtctaata aaactcggac
2881 ccttcttgtg aaaaaaaaaa aaaaaaaaaa
//