LOCUS BC002693 2040 bp mRNA linear HUM 04-NOV-2003 DEFINITION Homo sapiens spermatid perinuclear RNA binding protein, mRNA (cDNA clone IMAGE:3606572), partial cds. ACCESSION BC002693 VERSION BC002693.2 KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2040) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2040) AUTHORS Strausberg,R. TITLE Direct Submission JOURNAL Submitted (05-FEB-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Cancer Genomics Office, National Cancer Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Oct 31, 2003 this sequence version replaced BC002693.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 12 Row: o Column: 5 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 21361744. FEATURES Location/Qualifiers source 1..2040 /db_xref="H-InvDB:HIT000031103" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="IMAGE:3606572" /tissue_type="Uterus, endometrium adenocarcinoma" /clone_lib="NIH_MGC_44" /lab_host="DH10B-R" /note="Vector: pOTB7" gene <1..2040 /gene="STRBP" /gene_synonym="DKFZp434N214" /gene_synonym="FLJ11307" /gene_synonym="FLJ14223" /gene_synonym="FLJ14984" /gene_synonym="MGC21529" /gene_synonym="MGC3405" /gene_synonym="SPNR" /db_xref="GeneID:55342" CDS <1..1848 /gene="STRBP" /gene_synonym="DKFZp434N214" /gene_synonym="FLJ11307" /gene_synonym="FLJ14223" /gene_synonym="FLJ14984" /gene_synonym="MGC21529" /gene_synonym="MGC3405" /gene_synonym="SPNR" /codon_start=1 /product="STRBP protein" /protein_id="AAH02693.2" /db_xref="GeneID:55342" /translation="TEGETEVKKDEAGENYSKDQGGRTLCGVMRIGLVAKGLLIKDDM DLELVLMCKDKPTETLLNTVKDNLPIQIQKLTEEKYQVEQCVNEASIIIRNTKEPTLT LKVILTSPLIRDELEKKDGENVSMKDPPDLLDRQKCLNALASLRHAKWFQARANGLKS CVIVLRILRDLCNRVPTWAPLKGWPLELICEKSIGTCNRPLGAGEALRRVMECLASGI LLPGGPGLHDPCERDPTDALSYMTIQQKEDITHSAQHALRLSAFGQIYKVLEMDPLPS SKPFQKYSWSVTDKEGAGSSALKRPFEDGLGDDKDPNKKMKRNLRKILDSKAIDLMNA LMRLNQIRPGLQYKLLSQSGPVHAPVFTMSVDVDGTTYEASGPSKKTAKLHVAVKVLQ AMGYPTGFDADIECMSSDEKSDNESKNETVSSNSSNNTGNSTTETSSTLEVRTQGPIL TASGKNPVMELNEKRRGLKYELISETGGSHDKRFVMEVEVDGQKFRGAGPNKKVAKAS AALAALEKLFSGPNAANNKKKKIIPQAKGVVNTAVSAAVQAVRGRGRGTLTRGAFVGA TAAPGYIAPGYGTPYGYSTAAPAYGLPKRMVLLPVMKFPTYPVPHYSFF" misc_feature 70..831 /gene="STRBP" /gene_synonym="DKFZp434N214" /gene_synonym="FLJ11307" /gene_synonym="FLJ14223" /gene_synonym="FLJ14984" /gene_synonym="MGC21529" /gene_synonym="MGC3405" /gene_synonym="SPNR" /note="DZF; Region: domain in DSRM or ZnF_C2H2 domain containing proteins" /db_xref="CDD:smart00572" misc_feature 994..1182 /gene="STRBP" /gene_synonym="DKFZp434N214" /gene_synonym="FLJ11307" /gene_synonym="FLJ14223" /gene_synonym="FLJ14984" /gene_synonym="MGC21529" /gene_synonym="MGC3405" /gene_synonym="SPNR" /note="dsrm; Region: Double-stranded RNA binding motif. Sequences gathered for seed by HMM_iterative_training Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localisation of at least five different mRNAs in the early Drosophila embryo. Also by interferon-induced protein kinase in humans, which is part of the cellular response to dsRNA" /db_xref="CDD:pfam00035" misc_feature 1363..1503 /gene="STRBP" /gene_synonym="DKFZp434N214" /gene_synonym="FLJ11307" /gene_synonym="FLJ14223" /gene_synonym="FLJ14984" /gene_synonym="MGC21529" /gene_synonym="MGC3405" /gene_synonym="SPNR" /note="DSRM; Region: Double-stranded RNA binding motif" /db_xref="CDD:smart00358" BASE COUNT 670 a 400 c 471 g 499 t ORIGIN 1 acagagggtg agacagaagt gaagaaagat gaggccggag aaaactattc caaggatcaa 61 ggtggtcgga cattgtgtgg tgtaatgagg attggcctgg ttgcaaaagg cttgctgatt 121 aaagatgata tggacttgga gctggtttta atgtgcaaag acaaacccac agagaccctg 181 ttaaatacag tcaaagataa tcttcctatt cagattcaga aactcacaga agagaaatat 241 caagtggaac aatgtgtaaa tgaggcatct attataattc ggaatacaaa agagcccacg 301 ctaactttga aggtgatact tacctcacct ctaattaggg acgaattgga gaagaaggat 361 ggagaaaatg tttcgatgaa agatcctccg gacttattgg acaggcagaa atgcctgaac 421 gccttggcgt ctcttcgaca tgccaaatgg tttcaggcaa gggcaaatgg attaaaatca 481 tgtgtaattg tcctccgcat tctgcgtgat ttgtgcaaca gagtccccac atgggcacca 541 ttgaaaggat ggccactaga acttatatgt gaaaagtcta taggtacttg taatagacct 601 ttgggcgctg gggaggcctt gagacgagta atggagtgtt tggcatctgg aatactactt 661 cctgggggtc ctggtcttca tgatccttgt gagcgagacc caacagatgc tctgagctat 721 atgaccatcc agcaaaaaga agatattacc cacagtgcac agcatgcact cagactatca 781 gcctttggcc agatttacaa agtgctggag atggaccccc ttccatctag taagcctttt 841 cagaagtatt cctggtcagt tactgataaa gaaggtgctg ggtcttcagc tctaaagagg 901 ccatttgaag atggattagg ggatgataaa gaccccaaca agaagatgaa acgaaactta 961 aggaaaattc tggatagtaa agcaatagac cttatgaatg cactaatgag gctaaatcag 1021 atcaggcctg ggcttcagta taagctccta tctcagtctg gccccgttca tgccccagtc 1081 ttcacaatgt ctgtagatgt ggatggcaca acatatgaag cctcaggacc atccaagaaa 1141 acagcaaaac ttcacgtagc ggtgaaggta ttgcaggcaa tgggatatcc aacaggcttt 1201 gatgcagata ttgaatgtat gagttccgat gaaaaatcag ataatgaaag taaaaatgaa 1261 acagtgtctt caaactcaag caataatact ggaaattcta caactgaaac ctccagtacc 1321 ttagaggtaa gaactcaggg ccctatcctc acagcaagtg gcaaaaaccc tgtaatggag 1381 ctcaatgaaa aaagaagagg tctcaagtat gaactcatct cagagactgg tggaagccat 1441 gacaagcgct ttgtaatgga ggtagaagta gatggacaga aattcagagg cgcaggtcca 1501 aataagaaag tggcaaaggc gagtgcagct ttagctgcct tggagaaact gttttctgga 1561 cccaatgcgg caaataataa gaaaaagaag attatccctc aggcaaaggg cgttgtgaat 1621 acagctgtgt ctgcagcagt ccaagctgtt cggggcagag gaagaggaac tctaacaagg 1681 ggagcttttg ttggggcgac agctgctcct ggctacatag ctccaggcta tggaacacca 1741 tatggttaca gcacagctgc ccctgcctat ggtttaccca agagaatggt tctgttaccc 1801 gttatgaaat ttccaacata tcctgttccc cactactcat tcttttagca aatgacagaa 1861 gctaattcct attgaacaac aatacagtac aacacagaat gttagagaaa aagccttttt 1921 atcctgcttt ctttgaacac atacttgatc aaaattattt gtaaagaaca tctttcctac 1981 tttttgattt taacaaatgc aaatttagtt ctctaaaact tgaaaaaaaa aaaaaaaaaa //