LOCUS BC020586 4202 bp mRNA linear HUM 07-OCT-2003 DEFINITION Homo sapiens splicing factor, arginine/serine-rich 14, mRNA (cDNA clone MGC:22011 IMAGE:4400185), complete cds. ACCESSION BC020586 VERSION BC020586.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4202) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 4202) AUTHORS Strausberg,R. TITLE Direct Submission JOURNAL Submitted (03-JAN-2002) National Institutes of Health, Mammalian Gene Collection (MGC), Cancer Genomics Office, National Cancer Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Sequencing Group at the Stanford Human Genome Center, Stanford University School of Medicine, Stanford, CA 94305 Web site: http://www-shgc.stanford.edu Contact: (Dickson, Mark) mcd@paxil.stanford.edu Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers, R. M. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 27 Row: f Column: 23 This clone was selected for full length sequencing because it passed the following selection criteria: Hexamer frequency ORF analysis, GenomeScan gene prediction. FEATURES Location/Qualifiers source 1..4202 /db_xref="H-InvDB:HIT000038837" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:22011 IMAGE:4400185" /tissue_type="Duodenum, adenocarcinoma" /clone_lib="NIH_MGC_88" /lab_host="DH10B" /note="Vector: pCMV-SPORT6" gene 1..4202 /gene="SFRS14" /gene_synonym="KIAA0365" /db_xref="GeneID:10147" /db_xref="MIM:607993" CDS 223..3189 /gene="SFRS14" /gene_synonym="KIAA0365" /codon_start=1 /product="SFRS14 protein" /protein_id="AAH20586.1" /db_xref="GeneID:10147" /db_xref="MIM:607993" /translation="MYDDVHSDGRYSLSGSVAHSRDAGREGLRSDVFPGPSFRSSNPS ISDDSYFRKECGRDLEFSHSDSRDQVIGHRKLGHFRSQDWKFALRGSWEQDFGHPVSQ ESSWSQEYSFGPSAVLGDFGSSRLIEKECLEKESRDYDVDHPGEADSVLRGSSQVQAR GRALNIVDQEGSLLGKGETQGLLTAKGGVGKLVTLRNVSTKKIPTVNRITPKTQGTNQ IQKNTPSPDVTLGTNPGTEDIQFPIQKIPLGLDLKNLRLPRRKMSFDIIDKSDVFSRF GIEIIKWAGFHTIKDDIKFSQLFQTLFELETETCAKMLASFKCSLKPEHRDFCFFTIK FLKHSALKTPRVDNEFLNMLLDKGAVKTKNCFFEIIKPFDKYIMRLQDRLLKSVTPLL MACNAYELSVKMKTLSNPLDLALALETTNSLCRKSLALLGQTFSLASSFRQEKILEAV GLQDIAPSPAAFPNFEDSTLFGREYIDHLKAWLVSSGCPLQVKKAEPEPMREEEKMIP PTKPEIQAKAPSSLSDAVPQRADHRVVGTIDQLVKRVIEGSLSPKERTLLKEDPAYWF LSDENSLEYKYYKLKLAEMQRMSENLRGADQKPTSADCAVRAMLYSRAVRNLKKKLLP WQRRGLLRAQGLRGWKARRATTGTQTLLSSGTRLKHHGRQAPGLSQAKPSLPDRNDAA KDCPPDPVGPSPQDPSLEASGPSPKPAGVDISEAPQTSSPCPSADIDMKTMETAEKLA RFVAQVGPEIEQFSIENSTDNPDLWFLHDQNSSAFKFYRKKVFELCPSICFTSSPHNL HTGGGDTTGSQESPVDLMEGEAEFEDEPPPREAELESPEASSGTCFPRKRISSKSLKV GMIPAPKRVCLIQEPKVHEPVRIAYDRPRGRPMSKKKKPKDLDFAQQKLTDKNLGFQM LQKMGWKEGHGLGSLGKGIREPVSVYAAGSLGWEWVGPQSFHLQPAAWLLHSQDGLQL AVDFCFLNRRHLQMRS" misc_feature 1828..1983 /gene="SFRS14" /gene_synonym="KIAA0365" /note="Surp; Region: Surp module. This domain is also known as the SWAP domain. SWAP stands for Suppressor-of-White-APricot. It has been suggested that these domains may be RNA binding" /db_xref="CDD:pfam01805" misc_feature 2419..2583 /gene="SFRS14" /gene_synonym="KIAA0365" /note="Surp; Region: Surp module. This domain is also known as the SWAP domain. SWAP stands for Suppressor-of-White-APricot. It has been suggested that these domains may be RNA binding" /db_xref="CDD:pfam01805" misc_feature 2941..3066 /gene="SFRS14" /gene_synonym="KIAA0365" /note="G-patch; Region: G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines" /db_xref="CDD:pfam01585" BASE COUNT 1120 a 1047 c 1047 g 988 t ORIGIN 1 ggcctcccag agactcgcgg acgctcacga taaccaagga cgaccgcgaa ccaagggcaa 61 aataacatgg cagccagacg aattacacag gagacttttg atgctgtatt acaagaaaaa 121 gccaaacgat atcacatgga tgccagtggt gaggctgtaa gcgaaactct tcagtttaaa 181 gctcaagatc tcttaagggc agtcccaaga tccagagcag agatgtatga tgacgtccac 241 agcgatggca gatactccct cagtggatct gtagctcact ctagagatgc cggaagagaa 301 ggcctgagaa gtgacgtatt tccagggcct tccttcagat caagcaaccc ttccatcagt 361 gatgacagct actttcgcaa agaatgtggc cgggatctgg aattttctca ctctgattct 421 cgggaccagg tcattggcca ccggaaattg gggcatttcc gttctcagga ctggaaattt 481 gcgctccgtg gttcttggga acaagacttt ggccatccag tttctcaaga gtcctcttgg 541 tcacaggagt atagttttgg tccctctgca gttttggggg actttggatc ttccaggctg 601 attgagaaag agtgtttgga gaaggagagt cgggattatg acgtggacca tcctggggag 661 gctgactctg tgcttagggg cagcagtcaa gtccaggcca gaggtcgagc tctaaacatc 721 gttgaccagg aaggttccct cctaggaaag ggggagactc agggcctgct cacagctaag 781 gggggtgttg ggaaacttgt cacattgaga aatgtgagca caaaaaaaat acccaccgtg 841 aatcgtatta ctcccaaaac tcagggcact aaccaaatcc agaaaaacac tccaagtcct 901 gatgtgaccc tggggacaaa cccagggaca gaagatatcc agttccccat tcagaagatc 961 cctctggggc tggatctgaa gaatcttcgg ctccccagaa gaaagatgag ctttgacatc 1021 atagataagt ctgatgtttt ttcaagattt gggatagaaa taatcaaatg ggcaggattc 1081 cacaccataa aagatgatat taaattttcc caacttttcc agactctctt tgaacttgaa 1141 acagaaacct gtgctaaaat gcttgcctca ttcaaatgtt ccttaaaacc agagcacaga 1201 gatttttgct tttttactat caaattttta aagcactctg ctttgaaaac acccagagtt 1261 gataatgagt ttttaaacat gcttttagac aaaggtgctg tgaagaccaa aaattgcttt 1321 tttgaaatca taaagccttt tgacaagtac ataatgagac ttcaagaccg gcttctgaag 1381 agtgtcacac ctttgcttat ggcctgcaat gcctacgagc taagtgtcaa gatgaagacc 1441 ctcagtaacc ccctggactt ggctcttgcc ctagaaacca ccaactctct ctgccggaag 1501 tctttggccc ttttgggaca gacattttcc ttggcctctt ctttccggca ggagaaaatc 1561 ttagaagctg tcggcctgca agatatagct ccctcacctg ctgcgtttcc aaacttcgaa 1621 gactccactt tgtttgggcg agagtacata gaccacctga aggcctggct agtcagcagc 1681 ggatgtcccc tccaggttaa gaaagccgaa ccagagccga tgcgagagga ggagaaaatg 1741 attcctccta cgaaacctga aattcaggcc aaggctccaa gtagtctgag tgatgctgtc 1801 ccccagcgag cagatcacag ggtagtgggc accatcgacc agcttgtgaa acgtgtcatc 1861 gaaggcagcc tgtctcccaa agagagaact cttctcaaag aggaccctgc ttactggttt 1921 ttgtctgatg aaaatagtct ggagtataaa tattacaagc tgaagttggc agaaatgcag 1981 cggatgagcg agaacttgcg aggagccgac cagaagccga cctcagcaga ctgtgcagtg 2041 agggccatgc tgtactcccg ggctgtccgc aacctcaaga agaaactcct tccgtggcag 2101 cggcgggggc tcctccgtgc tcaagggctc cggggctgga aggcgaggag agcgaccacc 2161 gggacccaga ccctcctatc ctcaggcacc aggctgaaac accacggccg gcaggctcca 2221 ggcctctcac aggcaaaacc atccctgcca gacagaaatg atgctgccaa ggactgcccg 2281 ccagacccag ttggaccttc tcctcaggac cccagcttag aagcctcagg cccatccccc 2341 aagccagcag gagtggacat ctctgaagca cctcagacct cttctccctg cccatctgct 2401 gacattgaca tgaagacaat ggagactgca gagaaactgg ctagatttgt tgctcaggtg 2461 ggaccagaga tcgaacaatt cagcatagaa aacagcaccg ataaccctga cctgtggttt 2521 ctacatgacc aaaatagttc tgctttcaaa ttctatcgaa agaaagtgtt tgaactatgt 2581 ccatcaattt gtttcacgtc atctccgcac aaccttcaca ctggtggtgg tgacaccacg 2641 ggttctcagg agagccccgt ggacctcatg gaaggggaag cagagtttga agacgagccc 2701 cctccgcggg aggctgagct ggagagccca gaggcctcct caggtacctg cttccctcgg 2761 aagaggatca gcagcaagtc attgaaggtt ggcatgattc cagctcccaa gagagtgtgt 2821 ctcatccagg agccaaaagt ccatgaacca gttcgaattg cctatgacag gcctcggggt 2881 cgtcccatgt ccaaaaagaa gaaacccaag gacttggact tcgcccagca gaagctgacc 2941 gataagaacc tgggcttcca gatgctgcag aagatgggct ggaaggaggg ccatggcctg 3001 ggctccctcg gaaagggcat cagggagccg gtcagcgtgt acgcagcagg cagcctgggg 3061 tgggagtggg tggggcctca gtccttccac ctgcagcctg ccgcttggct ccttcacagc 3121 caagatggct tacagctggc agttgatttt tgttttttaa acagaaggca tcttcagatg 3181 agaagctgat catttacatg tgcaggtgtt tacagggttc ctttctgtcc tggtgtagat 3241 tttttaacca gcttgttggc cctggtcatt ttggccacat ttgtgaccat cataaaagct 3301 aagtggtatt tctgtgtagt ttccgtctgg aactgctttc ccattcccgg gaacccatag 3361 ccgggccagc cagggtcccg aacacaggcc caaagtttat taaaccccga tcataacctc 3421 cagcaggcat ttcatttaat actgagctta gttcctgctg ggtaaggcat tccgaggtaa 3481 ccagggccct ctgggcaccc cctcaaaagc cagctcttcg agggtgagta ctccttgttt 3541 ctactgtgag tcgcgtcttg attttccctt tctttgatgt ctcagtgtgt gtcccaaaca 3601 cctgcatctc atggactgtt tgtgcccatg cccagttcct ggcatgccag gccctgggct 3661 caggtgcaca actgactctc tttttcactc cctaggggaa ccccctcgga aggggaaggg 3721 ttgggtgctg acgggcagga gcacaaagaa gacacattcg atgtgttccg acagaggatg 3781 atgcagatgt acagacacaa gcgggccaac aaatagatca aaaccactga tgtgaaagat 3841 aagccttgaa gcagcaattg cccttaaaac atcatccctg ccctggatcg gcctggagcc 3901 agtgcccaat tccagggtca cccccgagag gacaacaggc atctggaagt gctctctcgc 3961 cactctgggt gctttactgt ctctggcttg tttccatcac tggaaatcac ttagagaatt 4021 gtagtgtttt tggtccttga taaagcctag aagacatttg tgatgttaca aaatggaagt 4081 tttcttttgt ttctgttttt taagatctaa gaagttgtga atgttgttaa tcatttagcc 4141 gttgcaataa atgtagagga aatgcagtga aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 4201 aa //