LOCUS BC038446 2963 bp mRNA linear HUM 18-JAN-2018 DEFINITION Homo sapiens splicing factor 1, mRNA (cDNA clone MGC:45254 IMAGE:5493676), complete cds. ACCESSION BC038446 VERSION BC038446.2 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2963) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2963) AUTHORS Strausberg,R. TITLE Direct Submission JOURNAL Submitted (04-OCT-2002) National Institutes of Health, Mammalian Gene Collection (MGC), Cancer Genomics Office, National Cancer Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov REFERENCE 3 (bases 1 to 2963) AUTHORS Strausberg,R. TITLE Direct Submission JOURNAL Submitted (18-JAN-2018) National Institutes of Health, Mammalian Gene Collection (MGC), Cancer Genomics Office, National Cancer Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, USA REMARK Sequence update by database staff to remove vector contamination COMMENT On Jan 18, 2018 this sequence version replaced BC038446.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 69 Row: e Column: 16 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 4759339. FEATURES Location/Qualifiers source 1..2963 /db_xref="H-InvDB:HIT000052036" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:45254 IMAGE:5493676" /tissue_type="Eye, retinoblastoma" /clone_lib="NIH_MGC_67" /lab_host="DH10B" /note="Vector: pCMV-SPORT6" gene 1..2963 /gene="SF1" /gene_synonym="D11S636" /gene_synonym="ZFM1" /gene_synonym="ZNF162" /db_xref="GeneID:7536" /db_xref="MIM:601516" CDS 72..2093 /gene="SF1" /gene_synonym="D11S636" /gene_synonym="ZFM1" /gene_synonym="ZNF162" /codon_start=1 /product="SF1 protein" /protein_id="AAH38446.1" /db_xref="GeneID:7536" /db_xref="MIM:601516" /translation="MATGANATPLGKLGPPGLPPLPGPKGGFEPGPPPAPGPGAGLLA PGPPPPPPVGSMGALTAAFPFAALPPPPPPPPPPPPQQPPPPPPPPSPGASYPPPQPP PPPPLYQRVSPPQPPPPQPPRKDQQPGPAGGGGDFPSKKRKRSRWNQDTMEQKTVIPG MPTVIPPGLTREQERAYIVQLQIEDLTRKLRTGDLGIPPNPEDRSPSPEPIYNSEGKR LNTREFRTRKKLEEERHNLITEMVALNPDFKPPADYKPPATRVSDKVMIPQDEYPEIN FVGLLIGPRGNTLKNIEKECNAKIMIRGKGSVKEGKVGRKDGQMLPGEDEPLHALVTA NTMENVKKAVEQIRNILKQGIETPEDQNDLRKMQLRELARLNGTLREDDNRILRPWQS SETRSITNTTVCTKCGGAGHIASDCKFQRPGDPQSAQDKARMDKEYLSLMAELGEAPV PASVGSTSGPATTPLASAPRPAAPANNPPPPSLMSTTQSRPPWMNSGPSESRPYHGMH GGGPGGPGGGPHSFPHPLPSLTGGHGGHPMQHNPNGPPPPWMQPPPPPMNQGPHPPGH HGPPPMDQYLGSTPVGSGVYRLHQGKGMMPPPPMGMMPPPPPPPSGQPPPPPSGPLPP WQQQQQQPPPPPPPSSSMASSTPLPWQQRSLPAAAMARAMRVRTFRAHW" misc_feature 495..1217 /gene="SF1" /gene_synonym="D11S636" /gene_synonym="ZFM1" /gene_synonym="ZNF162" /note="MSL5; Region: COG5176, MSL5, Splicing factor (branch point binding protein) [RNA processing and modification]" /db_xref="CDD:COG5176" BASE COUNT 634 a 943 c 774 g 612 t ORIGIN 1 cccgggaggc ttgcgaagga gaagccgccg cagaggaaaa gcaggtgccg gtgcctgtcc 61 ccgggggcgc catggcgacc ggagcgaacg ccacgccgtt gggtaagctg ggcccccccg 121 ggctgccccc gctccccggg cccaaaggag gcttcgagcc gggccctccg cctgcacccg 181 ggcctggggc ggggctgctg gctcccgggc cgccgccgcc cccgcccgtg ggctcgatgg 241 gggccctgac cgcggccttc cccttcgcgg cgctgcctcc gccgcctccg ccgccgcccc 301 ctccgcctcc ccagcagccg ccgccgcctc caccgccacc gtcccccggc gcctcgtacc 361 cgccgccgca gccgccccct ccgccgccgc tctaccagcg cgtgtcgccg ccgcagccgc 421 cgccacccca gccgccgcgt aaggaccagc agccgggccc ggccggcggc ggaggagact 481 tcccaagtaa gaagcggaag aggagccgct ggaaccaaga cacaatggaa cagaagacag 541 tgattccagg aatgcctaca gttattcccc ctggacttac tcgagaacaa gaaagagctt 601 atatagtgca actgcagata gaagacctga ctcgtaaact gcgcacagga gacctgggca 661 tcccccctaa ccctgaggac aggtcccctt cccctgagcc catctacaat agcgagggga 721 agcggcttaa cacccgagag ttccgcaccc gcaaaaagct ggaagaggag cggcacaacc 781 tcatcacaga gatggttgca ctcaatccgg atttcaagcc acctgcagat tacaaacctc 841 cagcaacacg tgtgagtgat aaagtcatga ttccacaaga tgagtaccca gaaatcaact 901 ttgtggggct gctcatcggg cccagaggga acaccctgaa gaacatagag aaggagtgca 961 atgccaagat tatgatccgg gggaaagggt ctgtgaaaga agggaaggtt gggcgcaaag 1021 atggccagat gttgccagga gaagatgagc cacttcatgc cctggttact gccaatacaa 1081 tggagaacgt caaaaaggca gtggaacaga taagaaacat cctgaagcag ggtatcgaga 1141 ctccagagga ccagaatgat ctacggaaga tgcagcttcg ggagttggct cgcttaaatg 1201 ggacccttcg ggaagacgat aacaggatct taagaccctg gcagagctca gagacccgca 1261 gcattaccaa caccacagtg tgtaccaagt gtggaggggc tggccacatt gcttcagact 1321 gtaaattcca aaggcctggt gatcctcagt cagctcagga taaagcacgg atggataaag 1381 aatatttgtc cctcatggct gaactgggtg aagcacctgt cccagcatct gtgggctcca 1441 cctctgggcc tgccaccaca cccctggcca gcgcacctcg tcctgctgct cccgccaaca 1501 acccacctcc accgtctctc atgtctacca cccagagccg cccaccctgg atgaattctg 1561 gcccttcaga gagtcggccc taccacggca tgcatggagg tggtcctggt gggcccggag 1621 gtggccccca cagcttccca cacccattac ccagcctgac aggtgggcat ggtggacatc 1681 ccatgcagca caaccccaat ggacccccac ccccttggat gcagccacca ccaccaccga 1741 tgaaccaggg cccccaccct cctgggcacc atggccctcc tccaatggat cagtacctgg 1801 gaagtacgcc tgtgggctct ggggtctatc gcctgcatca aggaaaaggt atgatgccgc 1861 caccacctat gggcatgatg ccgccgccgc cgccgcctcc cagtgggcag cccccacccc 1921 ctccctctgg tcctcttccc ccatggcaac aacagcagca gcagcctccg ccaccccctc 1981 cgcccagcag cagtatggct tccagtaccc ccttgccatg gcagcaaaga tccctccccg 2041 cggcggcgat ggcccgagcc atgagagtga ggactttccg cgcccattgg tgacccttcc 2101 aggcagacag cctcagcaac gcccctggtg gacaggatgg ttcggcaaag cagcctgagt 2161 tatttttgtg gacggaatcg gaacacgctg gctccatatc gtgaaatttt tattaatttt 2221 tttctttttc ctttgttact tctttatctt ttcctttctt cagactccgt ccaaggagat 2281 gctctccccg gtcttctgct gcaatttaga ttcctttccc ttctctccag ttctccttcc 2341 cttaccaagg agaggggagc aaatggtttt gggcaagggc tttggccatt catgtcaagc 2401 tggttgtggg tttttcaagg tgccatagcc acccccaaat atgtttgttt aaagcgtggg 2461 gttttttaat ctctgccacc cttgtcaagg gagtcttgta aagttgccga gggtaggttc 2521 atctccaggt ttcgggattc ccatccgtcc tggcgatcct gccagcagtg ggtgggcagc 2581 ctgagctccc tcgggctcgc ctgccagcct ggagttcttc ctgtgctcct tgatcacctg 2641 agctgcctca gattccattt ggtcctctcc ttcctggaag gcttcctttt atgttttgtt 2701 ttaatcccaa atgtctgaat gttttgcagt gtgtaggggt ttgagcccct tgttcattct 2761 ccttcctttt tcctcccgct tccctctcca tgaagtgatt ctgttgacaa taatgtatac 2821 tgcgcgttct cttcactggt ttatctgcag aaatttctct gggctttttt cggtgttaga 2881 ttcaacactg cgctaaagcg gggatgttcc attgaataaa agagcagtgt ggttttctga 2941 aaaaaaaaaa aaaaaaaaaa aaa //