LOCUS BC038446 2963 bp mRNA linear HUM 18-JAN-2018
DEFINITION Homo sapiens splicing factor 1, mRNA (cDNA clone MGC:45254
IMAGE:5493676), complete cds.
ACCESSION BC038446
VERSION BC038446.2
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2963)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2963)
AUTHORS Strausberg,R.
TITLE Direct Submission
JOURNAL Submitted (04-OCT-2002) National Institutes of Health, Mammalian
Gene Collection (MGC), Cancer Genomics Office, National Cancer
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
REFERENCE 3 (bases 1 to 2963)
AUTHORS Strausberg,R.
TITLE Direct Submission
JOURNAL Submitted (18-JAN-2018) National Institutes of Health, Mammalian
Gene Collection (MGC), Cancer Genomics Office, National Cancer
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
USA
REMARK Sequence update by database staff to remove vector contamination
COMMENT On Jan 18, 2018 this sequence version replaced BC038446.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: National Institutes of Health Intramural
Sequencing Center (NISC),
Gaithersburg, Maryland;
Web site: http://www.nisc.nih.gov/
Contact: nisc_mgc@nhgri.nih.gov
Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
Young,A., Zhang,L.-H. and Green,E.D.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 69 Row: e Column: 16
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 4759339.
FEATURES Location/Qualifiers
source 1..2963
/db_xref="H-InvDB:HIT000052036"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:45254 IMAGE:5493676"
/tissue_type="Eye, retinoblastoma"
/clone_lib="NIH_MGC_67"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene 1..2963
/gene="SF1"
/gene_synonym="D11S636"
/gene_synonym="ZFM1"
/gene_synonym="ZNF162"
/db_xref="GeneID:7536"
/db_xref="MIM:601516"
CDS 72..2093
/gene="SF1"
/gene_synonym="D11S636"
/gene_synonym="ZFM1"
/gene_synonym="ZNF162"
/codon_start=1
/product="SF1 protein"
/protein_id="AAH38446.1"
/db_xref="GeneID:7536"
/db_xref="MIM:601516"
/translation="MATGANATPLGKLGPPGLPPLPGPKGGFEPGPPPAPGPGAGLLA
PGPPPPPPVGSMGALTAAFPFAALPPPPPPPPPPPPQQPPPPPPPPSPGASYPPPQPP
PPPPLYQRVSPPQPPPPQPPRKDQQPGPAGGGGDFPSKKRKRSRWNQDTMEQKTVIPG
MPTVIPPGLTREQERAYIVQLQIEDLTRKLRTGDLGIPPNPEDRSPSPEPIYNSEGKR
LNTREFRTRKKLEEERHNLITEMVALNPDFKPPADYKPPATRVSDKVMIPQDEYPEIN
FVGLLIGPRGNTLKNIEKECNAKIMIRGKGSVKEGKVGRKDGQMLPGEDEPLHALVTA
NTMENVKKAVEQIRNILKQGIETPEDQNDLRKMQLRELARLNGTLREDDNRILRPWQS
SETRSITNTTVCTKCGGAGHIASDCKFQRPGDPQSAQDKARMDKEYLSLMAELGEAPV
PASVGSTSGPATTPLASAPRPAAPANNPPPPSLMSTTQSRPPWMNSGPSESRPYHGMH
GGGPGGPGGGPHSFPHPLPSLTGGHGGHPMQHNPNGPPPPWMQPPPPPMNQGPHPPGH
HGPPPMDQYLGSTPVGSGVYRLHQGKGMMPPPPMGMMPPPPPPPSGQPPPPPSGPLPP
WQQQQQQPPPPPPPSSSMASSTPLPWQQRSLPAAAMARAMRVRTFRAHW"
misc_feature 495..1217
/gene="SF1"
/gene_synonym="D11S636"
/gene_synonym="ZFM1"
/gene_synonym="ZNF162"
/note="MSL5; Region: COG5176, MSL5, Splicing factor
(branch point binding protein) [RNA processing and
modification]"
/db_xref="CDD:COG5176"
BASE COUNT 634 a 943 c 774 g 612 t
ORIGIN
1 cccgggaggc ttgcgaagga gaagccgccg cagaggaaaa gcaggtgccg gtgcctgtcc
61 ccgggggcgc catggcgacc ggagcgaacg ccacgccgtt gggtaagctg ggcccccccg
121 ggctgccccc gctccccggg cccaaaggag gcttcgagcc gggccctccg cctgcacccg
181 ggcctggggc ggggctgctg gctcccgggc cgccgccgcc cccgcccgtg ggctcgatgg
241 gggccctgac cgcggccttc cccttcgcgg cgctgcctcc gccgcctccg ccgccgcccc
301 ctccgcctcc ccagcagccg ccgccgcctc caccgccacc gtcccccggc gcctcgtacc
361 cgccgccgca gccgccccct ccgccgccgc tctaccagcg cgtgtcgccg ccgcagccgc
421 cgccacccca gccgccgcgt aaggaccagc agccgggccc ggccggcggc ggaggagact
481 tcccaagtaa gaagcggaag aggagccgct ggaaccaaga cacaatggaa cagaagacag
541 tgattccagg aatgcctaca gttattcccc ctggacttac tcgagaacaa gaaagagctt
601 atatagtgca actgcagata gaagacctga ctcgtaaact gcgcacagga gacctgggca
661 tcccccctaa ccctgaggac aggtcccctt cccctgagcc catctacaat agcgagggga
721 agcggcttaa cacccgagag ttccgcaccc gcaaaaagct ggaagaggag cggcacaacc
781 tcatcacaga gatggttgca ctcaatccgg atttcaagcc acctgcagat tacaaacctc
841 cagcaacacg tgtgagtgat aaagtcatga ttccacaaga tgagtaccca gaaatcaact
901 ttgtggggct gctcatcggg cccagaggga acaccctgaa gaacatagag aaggagtgca
961 atgccaagat tatgatccgg gggaaagggt ctgtgaaaga agggaaggtt gggcgcaaag
1021 atggccagat gttgccagga gaagatgagc cacttcatgc cctggttact gccaatacaa
1081 tggagaacgt caaaaaggca gtggaacaga taagaaacat cctgaagcag ggtatcgaga
1141 ctccagagga ccagaatgat ctacggaaga tgcagcttcg ggagttggct cgcttaaatg
1201 ggacccttcg ggaagacgat aacaggatct taagaccctg gcagagctca gagacccgca
1261 gcattaccaa caccacagtg tgtaccaagt gtggaggggc tggccacatt gcttcagact
1321 gtaaattcca aaggcctggt gatcctcagt cagctcagga taaagcacgg atggataaag
1381 aatatttgtc cctcatggct gaactgggtg aagcacctgt cccagcatct gtgggctcca
1441 cctctgggcc tgccaccaca cccctggcca gcgcacctcg tcctgctgct cccgccaaca
1501 acccacctcc accgtctctc atgtctacca cccagagccg cccaccctgg atgaattctg
1561 gcccttcaga gagtcggccc taccacggca tgcatggagg tggtcctggt gggcccggag
1621 gtggccccca cagcttccca cacccattac ccagcctgac aggtgggcat ggtggacatc
1681 ccatgcagca caaccccaat ggacccccac ccccttggat gcagccacca ccaccaccga
1741 tgaaccaggg cccccaccct cctgggcacc atggccctcc tccaatggat cagtacctgg
1801 gaagtacgcc tgtgggctct ggggtctatc gcctgcatca aggaaaaggt atgatgccgc
1861 caccacctat gggcatgatg ccgccgccgc cgccgcctcc cagtgggcag cccccacccc
1921 ctccctctgg tcctcttccc ccatggcaac aacagcagca gcagcctccg ccaccccctc
1981 cgcccagcag cagtatggct tccagtaccc ccttgccatg gcagcaaaga tccctccccg
2041 cggcggcgat ggcccgagcc atgagagtga ggactttccg cgcccattgg tgacccttcc
2101 aggcagacag cctcagcaac gcccctggtg gacaggatgg ttcggcaaag cagcctgagt
2161 tatttttgtg gacggaatcg gaacacgctg gctccatatc gtgaaatttt tattaatttt
2221 tttctttttc ctttgttact tctttatctt ttcctttctt cagactccgt ccaaggagat
2281 gctctccccg gtcttctgct gcaatttaga ttcctttccc ttctctccag ttctccttcc
2341 cttaccaagg agaggggagc aaatggtttt gggcaagggc tttggccatt catgtcaagc
2401 tggttgtggg tttttcaagg tgccatagcc acccccaaat atgtttgttt aaagcgtggg
2461 gttttttaat ctctgccacc cttgtcaagg gagtcttgta aagttgccga gggtaggttc
2521 atctccaggt ttcgggattc ccatccgtcc tggcgatcct gccagcagtg ggtgggcagc
2581 ctgagctccc tcgggctcgc ctgccagcct ggagttcttc ctgtgctcct tgatcacctg
2641 agctgcctca gattccattt ggtcctctcc ttcctggaag gcttcctttt atgttttgtt
2701 ttaatcccaa atgtctgaat gttttgcagt gtgtaggggt ttgagcccct tgttcattct
2761 ccttcctttt tcctcccgct tccctctcca tgaagtgatt ctgttgacaa taatgtatac
2821 tgcgcgttct cttcactggt ttatctgcag aaatttctct gggctttttt cggtgttaga
2881 ttcaacactg cgctaaagcg gggatgttcc attgaataaa agagcagtgt ggttttctga
2941 aaaaaaaaaa aaaaaaaaaa aaa
//