LOCUS       BC038446                2963 bp    mRNA    linear   HUM 18-JAN-2018
DEFINITION  Homo sapiens splicing factor 1, mRNA (cDNA clone MGC:45254
            IMAGE:5493676), complete cds.
ACCESSION   BC038446
VERSION     BC038446.2
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2963)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2963)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (04-OCT-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
REFERENCE   3  (bases 1 to 2963)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (18-JAN-2018) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    Sequence update by database staff to remove vector contamination
COMMENT     On Jan 18, 2018 this sequence version replaced BC038446.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 69 Row: e Column: 16
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 4759339.
FEATURES             Location/Qualifiers
     source          1..2963
                     /db_xref="H-InvDB:HIT000052036"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:45254 IMAGE:5493676"
                     /tissue_type="Eye, retinoblastoma"
                     /clone_lib="NIH_MGC_67"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..2963
                     /gene="SF1"
                     /gene_synonym="D11S636"
                     /gene_synonym="ZFM1"
                     /gene_synonym="ZNF162"
                     /db_xref="GeneID:7536"
                     /db_xref="MIM:601516"
     CDS             72..2093
                     /gene="SF1"
                     /gene_synonym="D11S636"
                     /gene_synonym="ZFM1"
                     /gene_synonym="ZNF162"
                     /codon_start=1
                     /product="SF1 protein"
                     /protein_id="AAH38446.1"
                     /db_xref="GeneID:7536"
                     /db_xref="MIM:601516"
                     /translation="MATGANATPLGKLGPPGLPPLPGPKGGFEPGPPPAPGPGAGLLA
                     PGPPPPPPVGSMGALTAAFPFAALPPPPPPPPPPPPQQPPPPPPPPSPGASYPPPQPP
                     PPPPLYQRVSPPQPPPPQPPRKDQQPGPAGGGGDFPSKKRKRSRWNQDTMEQKTVIPG
                     MPTVIPPGLTREQERAYIVQLQIEDLTRKLRTGDLGIPPNPEDRSPSPEPIYNSEGKR
                     LNTREFRTRKKLEEERHNLITEMVALNPDFKPPADYKPPATRVSDKVMIPQDEYPEIN
                     FVGLLIGPRGNTLKNIEKECNAKIMIRGKGSVKEGKVGRKDGQMLPGEDEPLHALVTA
                     NTMENVKKAVEQIRNILKQGIETPEDQNDLRKMQLRELARLNGTLREDDNRILRPWQS
                     SETRSITNTTVCTKCGGAGHIASDCKFQRPGDPQSAQDKARMDKEYLSLMAELGEAPV
                     PASVGSTSGPATTPLASAPRPAAPANNPPPPSLMSTTQSRPPWMNSGPSESRPYHGMH
                     GGGPGGPGGGPHSFPHPLPSLTGGHGGHPMQHNPNGPPPPWMQPPPPPMNQGPHPPGH
                     HGPPPMDQYLGSTPVGSGVYRLHQGKGMMPPPPMGMMPPPPPPPSGQPPPPPSGPLPP
                     WQQQQQQPPPPPPPSSSMASSTPLPWQQRSLPAAAMARAMRVRTFRAHW"
     misc_feature    495..1217
                     /gene="SF1"
                     /gene_synonym="D11S636"
                     /gene_synonym="ZFM1"
                     /gene_synonym="ZNF162"
                     /note="MSL5; Region: COG5176, MSL5, Splicing factor
                     (branch point binding protein) [RNA processing and
                     modification]"
                     /db_xref="CDD:COG5176"
BASE COUNT          634 a          943 c          774 g          612 t
ORIGIN      
        1 cccgggaggc ttgcgaagga gaagccgccg cagaggaaaa gcaggtgccg gtgcctgtcc
       61 ccgggggcgc catggcgacc ggagcgaacg ccacgccgtt gggtaagctg ggcccccccg
      121 ggctgccccc gctccccggg cccaaaggag gcttcgagcc gggccctccg cctgcacccg
      181 ggcctggggc ggggctgctg gctcccgggc cgccgccgcc cccgcccgtg ggctcgatgg
      241 gggccctgac cgcggccttc cccttcgcgg cgctgcctcc gccgcctccg ccgccgcccc
      301 ctccgcctcc ccagcagccg ccgccgcctc caccgccacc gtcccccggc gcctcgtacc
      361 cgccgccgca gccgccccct ccgccgccgc tctaccagcg cgtgtcgccg ccgcagccgc
      421 cgccacccca gccgccgcgt aaggaccagc agccgggccc ggccggcggc ggaggagact
      481 tcccaagtaa gaagcggaag aggagccgct ggaaccaaga cacaatggaa cagaagacag
      541 tgattccagg aatgcctaca gttattcccc ctggacttac tcgagaacaa gaaagagctt
      601 atatagtgca actgcagata gaagacctga ctcgtaaact gcgcacagga gacctgggca
      661 tcccccctaa ccctgaggac aggtcccctt cccctgagcc catctacaat agcgagggga
      721 agcggcttaa cacccgagag ttccgcaccc gcaaaaagct ggaagaggag cggcacaacc
      781 tcatcacaga gatggttgca ctcaatccgg atttcaagcc acctgcagat tacaaacctc
      841 cagcaacacg tgtgagtgat aaagtcatga ttccacaaga tgagtaccca gaaatcaact
      901 ttgtggggct gctcatcggg cccagaggga acaccctgaa gaacatagag aaggagtgca
      961 atgccaagat tatgatccgg gggaaagggt ctgtgaaaga agggaaggtt gggcgcaaag
     1021 atggccagat gttgccagga gaagatgagc cacttcatgc cctggttact gccaatacaa
     1081 tggagaacgt caaaaaggca gtggaacaga taagaaacat cctgaagcag ggtatcgaga
     1141 ctccagagga ccagaatgat ctacggaaga tgcagcttcg ggagttggct cgcttaaatg
     1201 ggacccttcg ggaagacgat aacaggatct taagaccctg gcagagctca gagacccgca
     1261 gcattaccaa caccacagtg tgtaccaagt gtggaggggc tggccacatt gcttcagact
     1321 gtaaattcca aaggcctggt gatcctcagt cagctcagga taaagcacgg atggataaag
     1381 aatatttgtc cctcatggct gaactgggtg aagcacctgt cccagcatct gtgggctcca
     1441 cctctgggcc tgccaccaca cccctggcca gcgcacctcg tcctgctgct cccgccaaca
     1501 acccacctcc accgtctctc atgtctacca cccagagccg cccaccctgg atgaattctg
     1561 gcccttcaga gagtcggccc taccacggca tgcatggagg tggtcctggt gggcccggag
     1621 gtggccccca cagcttccca cacccattac ccagcctgac aggtgggcat ggtggacatc
     1681 ccatgcagca caaccccaat ggacccccac ccccttggat gcagccacca ccaccaccga
     1741 tgaaccaggg cccccaccct cctgggcacc atggccctcc tccaatggat cagtacctgg
     1801 gaagtacgcc tgtgggctct ggggtctatc gcctgcatca aggaaaaggt atgatgccgc
     1861 caccacctat gggcatgatg ccgccgccgc cgccgcctcc cagtgggcag cccccacccc
     1921 ctccctctgg tcctcttccc ccatggcaac aacagcagca gcagcctccg ccaccccctc
     1981 cgcccagcag cagtatggct tccagtaccc ccttgccatg gcagcaaaga tccctccccg
     2041 cggcggcgat ggcccgagcc atgagagtga ggactttccg cgcccattgg tgacccttcc
     2101 aggcagacag cctcagcaac gcccctggtg gacaggatgg ttcggcaaag cagcctgagt
     2161 tatttttgtg gacggaatcg gaacacgctg gctccatatc gtgaaatttt tattaatttt
     2221 tttctttttc ctttgttact tctttatctt ttcctttctt cagactccgt ccaaggagat
     2281 gctctccccg gtcttctgct gcaatttaga ttcctttccc ttctctccag ttctccttcc
     2341 cttaccaagg agaggggagc aaatggtttt gggcaagggc tttggccatt catgtcaagc
     2401 tggttgtggg tttttcaagg tgccatagcc acccccaaat atgtttgttt aaagcgtggg
     2461 gttttttaat ctctgccacc cttgtcaagg gagtcttgta aagttgccga gggtaggttc
     2521 atctccaggt ttcgggattc ccatccgtcc tggcgatcct gccagcagtg ggtgggcagc
     2581 ctgagctccc tcgggctcgc ctgccagcct ggagttcttc ctgtgctcct tgatcacctg
     2641 agctgcctca gattccattt ggtcctctcc ttcctggaag gcttcctttt atgttttgtt
     2701 ttaatcccaa atgtctgaat gttttgcagt gtgtaggggt ttgagcccct tgttcattct
     2761 ccttcctttt tcctcccgct tccctctcca tgaagtgatt ctgttgacaa taatgtatac
     2821 tgcgcgttct cttcactggt ttatctgcag aaatttctct gggctttttt cggtgttaga
     2881 ttcaacactg cgctaaagcg gggatgttcc attgaataaa agagcagtgt ggttttctga
     2941 aaaaaaaaaa aaaaaaaaaa aaa
//