LOCUS BC053577 2639 bp mRNA linear HUM 16-SEP-2003 DEFINITION Homo sapiens splicing factor 3b, subunit 2, 145kDa, mRNA (cDNA clone IMAGE:4478742), partial cds. ACCESSION BC053577 VERSION BC053577.1 KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2639) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2639) AUTHORS Strausberg,R. TITLE Direct Submission JOURNAL Submitted (09-JUN-2003) National Institutes of Health, Mammalian Gene Collection (MGC), Cancer Genomics Office, National Cancer Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 115 Row: a Column: 24. FEATURES Location/Qualifiers source 1..2639 /db_xref="H-InvDB:HIT000259295" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="IMAGE:4478742" /tissue_type="Testis, embryonal carcinoma" /clone_lib="NIH_MGC_92" /lab_host="DH10B" /note="Vector: pCMV-SPORT6" gene 1..>2639 /gene="SF3B2" /gene_synonym="SAP145" /gene_synonym="SF3b1" /gene_synonym="SF3B145" /gene_synonym="SF3b150" /db_xref="GeneID:10992" /db_xref="MIM:605591" CDS 8..>2639 /gene="SF3B2" /gene_synonym="SAP145" /gene_synonym="SF3b1" /gene_synonym="SF3B145" /gene_synonym="SF3b150" /codon_start=1 /product="SF3B2 protein" /protein_id="AAH53577.1" /db_xref="GeneID:10992" /db_xref="MIM:605591" /translation="MATEHPEPPKAELQLPPPPPPGHYGAWAAQELQAKLAEIGAPIQ GNREELVERLQSYTRQTGIVLNRPVLRGEDGDKAAPPPMSAQLPGIPMPPPPLGLPPL QPPPPPPPPPPGLGLGFPMAHPPNLGPPPPLRVGEPVALSEEERLKLAQQQAALLMQQ EERAKQQGDHSLKEHELLEQQKRAAVLLEQERQQEIAKMGTPVPRPPQDMGQIGVRTP LGPRVAAPVGPVGPTPTVLPMGAPVPRPRGPPPPPGDENREMDDPSVGPKIPQALEKI LQLKESRQEEMNSQQEEEEMETDARSSLGQSASETEEDTVSVSKKEKNRKRRNRKKKK KPQRVRGVSSESSGDREKDSTRSRGSDSPAADVEIEYVTEEPEIYEPNFIFFKRIFEA FKLTDDVKKEKEKEPEKLDKLENSAAPKKKGFEEEHKDSDDDSSDDEQEKKPEAPKLS KKKLRRMNRFTVAELKQLVARPDVVEMHDVTAQDPKLLVHLKATRNSVPVPRHWCFKR KYLQGKRGIEKPPFELPDFIKRTGIQEMREALQEKEEQKTMKSKMREKVRPKMGKIDI DYQKLHDAFFKWQTKPKLTIHGDLYYEGKEFETRLKEKKPGDLSDELRISLGMPVGPN AHKVPPPWLIAMQRYGPPPSYPNLKIPGLNSPIPESCSFGYHAGGWGKPPVDETGKPL YGDVFGTNAAEFQTKTEEEEIDRTPWGELEPSDEESSEEEEEEESDEDKPDETGFITP ADSGLITPGGFSSVPAGMETPELIELRKKKIEEAMDGSETPQLFTVLPEKRTATVGGA MMGSTHIYDMSTVMSRKGPAPELQGVEVALAPEELELDPMAMTQKYEEHVREQQAQVE KEDFSDMVAEHAAKQKQKKKK" misc_feature 77..181 /gene="SF3B2" /gene_synonym="SAP145" /gene_synonym="SF3b1" /gene_synonym="SF3B145" /gene_synonym="SF3b150" /note="SAP; Region: SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins" /db_xref="CDD:pfam02037" misc_feature 1370..2107 /gene="SF3B2" /gene_synonym="SAP145" /gene_synonym="SF3b1" /gene_synonym="SF3B145" /gene_synonym="SF3b150" /note="CUS1; Region: COG5182, CUS1, Splicing factor 3b, subunit 2 [RNA processing and modification]" /db_xref="CDD:COG5182" BASE COUNT 697 a 690 c 789 g 463 t ORIGIN 1 ggctaagatg gcgacggagc atcccgagcc tcccaaagca gaattgcagc tgccgccgcc 61 gccacctcca ggccactatg gcgcctgggc tgcccaggag cttcaggcca agttggcaga 121 gatcggagct ccgatccagg gtaatcgcga ggagctggtg gagcggctgc agagctacac 181 ccgccagact ggcatcgtgc tgaatcggcc ggttttgaga ggggaagatg gggacaaagc 241 cgctccacct cccatgtcgg cacagctccc tggaattccc atgccaccac cacctttggg 301 actcccccct ctgcagcctc ctccgccacc cccaccacct ccaccaggcc ttggccttgg 361 ctttcctatg gcccacccac caaatttggg gcccccgcct cctctccgtg tgggtgagcc 421 agtggcactg tcagaggagg agcggctgaa gttggctcag cagcaggcgg cattgctgat 481 gcagcaggag gagcgtgcca agcagcaggg agatcattcg ctgaaggaac atgagctctt 541 ggagcagcag aagcgggcag ctgtgttact ggagcaggaa cgacagcagg agattgccaa 601 gatgggcacc ccagtccctc ggcccccaca agacatgggc cagattggtg tgcgcactcc 661 tctgggtcct cgagtagctg ctccagtggg cccagtgggc cccactccta cagttttgcc 721 catgggagcc cctgttcccc ggcctcgtgg tcccccaccg ccccctggag atgagaacag 781 agagatggat gacccctctg tgggccccaa gatcccccag gctttggaga agatcctgca 841 gctgaaggag agccgccagg aagagatgaa ttctcagcag gaggaagagg aaatggaaac 901 agatgctcgc tcgtccctgg gccagtcagc gtcagagact gaggaggaca cagtgtccgt 961 atctaaaaag gagaaaaacc ggaagcgtag gaaccgaaag aagaagaaaa agccccagcg 1021 ggtgcgaggg gtgtcctctg agagctctgg ggaccgggag aaagactcaa cccggtcccg 1081 tggctctgat tccccagcag ctgatgttga gattgagtat gtgactgaag aacctgaaat 1141 ttacgagccc aactttatct tctttaagag gatctttgag gcttttaagc tcactgatga 1201 tgtgaagaag gagaaagaga aggagccaga gaaacttgac aaactggaga actctgcagc 1261 ccccaagaag aagggatttg aagaggagca caaggacagt gatgatgaca gcagtgatga 1321 cgagcaggaa aagaagccag aagcccccaa gctgtccaag aagaagttgc gccgaatgaa 1381 ccgcttcact gtggctgaac tcaagcagct ggtggctcgg cccgatgtcg tggagatgca 1441 cgatgtgaca gcgcaggacc ctaagctctt ggttcacctc aaggccactc ggaactctgt 1501 gcctgtgcca cgccactggt gttttaagcg caaatacctg cagggcaaac ggggcattga 1561 gaagcccccc ttcgagctgc cagacttcat caaacgcaca ggcatccagg agatgcgaga 1621 ggccctgcag gagaaggaag aacagaagac catgaagtca aaaatgcgag agaaagttcg 1681 gcctaagatg ggcaaaattg acatcgacta ccagaaactg catgatgcct tcttcaagtg 1741 gcagaccaag ccaaagctga ccatccatgg ggacctgtac tatgagggga aggagttcga 1801 gacacgactg aaggagaaga agccaggaga tctgtctgat gagctaagga tttccttggg 1861 gatgccagta ggaccaaatg cccacaaggt ccctccccca tggctgattg ccatgcagcg 1921 atatggacca cccccatcgt atcccaacct gaaaatccct gggctgaact cgcccatccc 1981 tgagagctgt tcctttgggt accatgctgg tggctggggc aaacctccag tggatgagac 2041 tgggaaaccg ctctatgggg acgtgtttgg aaccaatgct gctgaatttc agaccaagac 2101 tgaggaagaa gagattgatc ggaccccttg gggggaactg gaaccatctg atgaagaatc 2161 ctcagaagaa gaggaagagg aagaaagtga tgaagacaaa ccagatgaga caggctttat 2221 tacccctgca gacagtggcc ttatcactcc tggaggcttt tcatcagtgc ctgctggaat 2281 ggagacccct gaactcattg agctgaggaa gaagaagatt gaggaggcga tggacggaag 2341 tgagacacct cagctcttca ctgtgttgcc agagaagaga acagccactg ttggaggggc 2401 catgatggga tcaacccaca tttatgacat gtccacggtt atgagccgga agggcccggc 2461 tcctgagctg caaggtgtgg aagtggcgct ggcgcctgaa gagttggagc tggatcctat 2521 ggccatgacc cagaagtatg aggagcatgt gcgggagcag caggctcaag tagagaagga 2581 ggacttcagt gacatggtgg ctgagcacgc tgccaaacag aagcaaaaaa aaaaaaaaa //