LOCUS BC000401 2864 bp mRNA linear HUM 16-SEP-2003 DEFINITION Homo sapiens splicing factor 3b, subunit 2, 145kDa, mRNA (cDNA clone IMAGE:2822659), partial cds. ACCESSION BC000401 VERSION BC000401.2 KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2864) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2864) AUTHORS Strausberg,R. TITLE Direct Submission JOURNAL Submitted (15-NOV-2000) National Institutes of Health, Mammalian Gene Collection (MGC), Cancer Genomics Office, National Cancer Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Aug 19, 2003 this sequence version replaced BC000401.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: DCTD/DTP cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 1 Row: i Column: 22. FEATURES Location/Qualifiers source 1..2864 /db_xref="H-InvDB:HIT000085833" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="IMAGE:2822659" /tissue_type="Lung, small cell carcinoma" /clone_lib="NIH_MGC_7" /lab_host="DH10B-R" /note="Vector: pOTB7" gene <1..2864 /gene="SF3B2" /gene_synonym="SAP145" /gene_synonym="SF3b1" /gene_synonym="SF3B145" /gene_synonym="SF3b150" /db_xref="GeneID:10992" /db_xref="MIM:605591" CDS <1..2687 /gene="SF3B2" /gene_synonym="SAP145" /gene_synonym="SF3b1" /gene_synonym="SF3B145" /gene_synonym="SF3b150" /codon_start=3 /product="SF3B2 protein" /protein_id="AAH00401.2" /db_xref="GeneID:10992" /db_xref="MIM:605591" /translation="ATEHPEPPKAELQLPPPPPPGHYGAWAAQELQAKLAEIGAPIQG NREELVERLQSYTRQTGIVLNRPVLRGEDGDKAAPPPMSAQLPGIPMPPPPLGLPPLQ PPPPPPPPPPGLGLGFPMAHPPNLGPPPPLRVGEPVALSEEERLKLAQQQAALLMQQE ERAKQQGDHSLKEHELLEQQKRAAVLLEQERQQEIAKMGTPVPRPPQDMGQIGVRTPL GPRVAAPVGPVGPTPTVLPMGAPVPRPRGPPPPPGDENREMDDPSVGPKIPQALEKIL QLKESRQEEMNSQQEEEEMETDARSSLGQSASETEEDTVSVSKKEKNRKRRNRKKKKK PQRVRGVSSESSGDREKDSTRSRGSDSPAADVEIEYVTEEPEIYEPNFIFFKRIFEAF KLTDDVKKEKEKEPEKLDKLENSAAPKKKGFEEEHKDSDDDSSDDEQEKKPEAPKLSK KKLRRMNRFTVAELKQLVARPDVVEMHDVTAQDPKLLVHLKATRNSVPVPRHWCFKRK YLQGKRGIEKPPFELPDFIKRTGIQEMREALQEKEEQKTMKSKMREKVRPKMGKIDID YQKLHDAFFKWQTKPKLTIHGDLYYEGKEFETRLKEKKPGDLSDELRISLGMPVGPNA HKVPPPWLIAMQRYGPPPSYPNLKIPGLNSPIPESCSFGYHAGGWGKPPVDETGKPLY GDVFGTNAAEFQTKTEEEEIDRTPWGELEPSDEESSEEEEEEESDEDKPDETGFITPA DSGLITPGGFSSVPAGMETPELIELRKKKIEEAMDGSETPQLFTVLPEKRTATVGGAM MGSTHIYDMSTVMSRKGPAPELQGVEVALAPEELELDPMAMTQKYEEHVREQQAQVEK EDFSDMVAEHAAKQKQKKRKAQPQDSRGGSKKYKEFKF" misc_feature 69..173 /gene="SF3B2" /gene_synonym="SAP145" /gene_synonym="SF3b1" /gene_synonym="SF3B145" /gene_synonym="SF3b150" /note="SAP; Region: SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins" /db_xref="CDD:pfam02037" misc_feature 1362..2099 /gene="SF3B2" /gene_synonym="SAP145" /gene_synonym="SF3b1" /gene_synonym="SF3B145" /gene_synonym="SF3b150" /note="CUS1; Region: COG5182, CUS1, Splicing factor 3b, subunit 2 [RNA processing and modification]" /db_xref="CDD:COG5182" BASE COUNT 766 a 745 c 833 g 520 t ORIGIN 1 tggcgacgga gcatcccgag cctcccaaag cagaattgca gctgccgccg ccgccacctc 61 caggccacta tggcgcctgg gctgcccagg agcttcaggc caagttggca gagatcggag 121 ctccgatcca gggtaatcgc gaggagctgg tggagcggct gcagagctac acccgccaga 181 ctggcatcgt gctgaatcgg ccggttttga gaggggaaga tggggacaaa gccgctccac 241 ctcccatgtc ggcacagctc cctggaattc ccatgccacc accacctttg ggactccccc 301 ctctgcagcc tcctccgcca cccccaccac ctccaccagg ccttggcctt ggctttccta 361 tggcccaccc accaaatttg gggcccccgc ctcctctccg tgtgggtgag ccagtggcac 421 tgtcagagga ggagcggctg aagttggctc agcagcaggc ggcattgctg atgcagcagg 481 aggagcgtgc caagcagcag ggagatcatt cgctgaagga acatgagctc ttggagcagc 541 agaagcgggc agctgtgtta ctggagcagg aacgacagca ggagattgcc aagatgggca 601 ccccagtccc tcggccccca caagacatgg gccagattgg tgtgcgcact cctctgggtc 661 ctcgagtagc tgctccagtg ggcccagtgg gccccactcc tacagttttg cccatgggag 721 cccctgttcc ccggcctcgt ggtcccccac cgccccctgg agatgagaac agagagatgg 781 atgacccctc tgtgggcccc aagatccccc aggctttgga gaagatcctg cagctgaagg 841 agagccgcca ggaagagatg aattctcagc aggaggaaga ggaaatggaa acagatgctc 901 gctcgtccct gggccagtca gcgtcagaga ctgaggagga cacagtgtcc gtatctaaaa 961 aggagaaaaa ccggaagcgt aggaaccgaa agaagaagaa aaagccccag cgggtgcgag 1021 gggtgtcctc tgagagctct ggggaccggg agaaagactc aacccggtcc cgtggctctg 1081 attccccagc agctgatgtt gagattgagt atgtgactga agaacctgaa atttacgagc 1141 ccaactttat cttctttaag aggatctttg aggcttttaa gctcactgat gatgtgaaga 1201 aggagaaaga gaaagagcca gagaaacttg acaaactgga gaactctgca gcccccaaga 1261 agaagggatt tgaagaggag cacaaggaca gtgatgatga cagcagtgat gacgagcagg 1321 aaaagaagcc agaagccccc aagctgtcca agaagaagtt gcgccgaatg aaccgcttca 1381 ctgtggctga actcaagcag ctggtggctc ggcccgatgt cgtggagatg cacgatgtga 1441 cagcgcagga ccctaagctc ttggttcacc tcaaggccac tcggaactct gtgcctgtgc 1501 cacgccactg gtgttttaag cgcaaatacc tgcagggcaa acggggcatt gagaagcccc 1561 ccttcgagct gccagacttc atcaaacgca caggcatcca ggagatgcga gaggccctgc 1621 aggagaagga agaacagaag accatgaagt caaaaatgcg agagaaagtt cggcctaaga 1681 tgggcaaaat tgacatcgac taccagaaac tgcatgatgc cttcttcaag tggcagacca 1741 agccaaagct gaccatccat ggggacctgt actatgaggg gaaggagttc gagacacgac 1801 tgaaggagaa gaagccagga gatctgtctg atgagctaag gatttccttg gggatgccag 1861 taggaccaaa tgcccacaag gtccctcccc catggctgat tgccatgcag cgatatggac 1921 cacccccatc gtatcccaac ctgaaaatcc ctgggctgaa ctcgcccatc cctgagagct 1981 gttcctttgg gtaccatgct ggtggctggg gcaaacctcc agtggatgag actgggaaac 2041 cgctctatgg ggacgtgttt ggaaccaatg ctgctgaatt tcagaccaag actgaggaag 2101 aagagattga tcggacccct tggggggaac tggaaccatc tgatgaagaa tcctcagaag 2161 aagaggaaga ggaagaaagt gatgaagaca aaccagatga gacaggcttt attacccctg 2221 cagacagtgg ccttatcact cctggaggct tttcatcagt gcctgctgga atggagaccc 2281 ctgaactcat tgagctgagg aagaagaaga ttgaggaggc gatggacgga agtgagacac 2341 ctcagctctt cactgtgttg ccagagaaga gaacagccac tgttggaggg gccatgatgg 2401 gatcaaccca catttatgac atgtccacgg ttatgagccg gaagggcccg gctcctgagc 2461 tgcaaggtgt ggaagtggcg ctggcgcctg aagagttgga gctggatcct atggccatga 2521 cccagaagta tgaggagcat gtgcgggagc agcaggctca agtagagaag gaggacttca 2581 gtgacatggt ggctgagcac gctgccaaac agaagcaaaa aaaacggaaa gctcagcccc 2641 aggacagccg tgggggcagc aagaaatata aggagttcaa gttttaggtc ccctcacact 2701 agcccttttt ttggccctac gtctggatgc ctgggcttca cacaagaacc acctctcccg 2761 cagttcccaa ggacttgtca tttcatgttc ttattttaga cctgttttgt aaataaagct 2821 gtttcccaag gaaagagatg aaaaaaaaaa aaaaaaaaaa aaaa //