LOCUS       BC002422                4244 bp    mRNA    linear   HUM 04-NOV-2003
DEFINITION  Homo sapiens SON DNA binding protein, mRNA (cDNA clone
            IMAGE:3161999), partial cds.
ACCESSION   BC002422
VERSION     BC002422.2
KEYWORDS    .
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4244)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 4244)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (05-FEB-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Aug 19, 2003 this sequence version replaced BC002422.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC), Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,
            McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S.,
            Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A.,
            Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D.
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            http://image.llnl.gov
            Series: IRAL Plate: 5 Row: m Column: 1
            This clone was selected for full length sequencing because it
            passed the following selection criteria: Hexamer frequency ORF
            analysis.
FEATURES             Location/Qualifiers
     source          1..4244
                     /db_xref="H-InvDB:HIT000086129"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:3161999"
                     /tissue_type="Brain, neuroblastoma"
                     /clone_lib="NIH_MGC_19"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..>4244
                     /gene="SON"
                     /gene_synonym="BASS1"
                     /gene_synonym="C21orf50"
                     /gene_synonym="DBP-5"
                     /gene_synonym="FLJ21099"
                     /gene_synonym="KIAA1019"
                     /gene_synonym="NREBP"
                     /gene_synonym="SON3"
                     /db_xref="GeneID:6651"
                     /db_xref="MIM:182465"
     CDS             149..>4244
                     /gene="SON"
                     /gene_synonym="BASS1"
                     /gene_synonym="C21orf50"
                     /gene_synonym="DBP-5"
                     /gene_synonym="FLJ21099"
                     /gene_synonym="KIAA1019"
                     /gene_synonym="NREBP"
                     /gene_synonym="SON3"
                     /codon_start=1
                     /product="SON protein"
                     /protein_id="AAH02422.1"
                     /db_xref="GeneID:6651"
                     /db_xref="MIM:182465"
                     /translation="MVLELPGQPVATTALELPGQPSVTGVPELPGLPSATRALELSGQ
                     PVATGALELPGPLMAAGALEFSGQSGAAGALELLGQPLATGVLELPGQPGAPELPGQP
                     VATVALEISVQSVVTTSELSTMTVSQSLEVPSTTALESYNTVAQELPTTLVGETSVTV
                     GVDPLMAPESHILASNTMETHILASNTMDSQMLASNTMDSQMLASNTMDSQMLASSTM
                     DSQMLATSSMDSQMLATSSMDSQMLATSTMDSQMLATSSMDSQMLATSSMDSQMLATS
                     SMDSQMLATSSMDSQMLATSTMDSQMLATSTMDSQMLATSSMDSQMLASGTMDSQMLA
                     SGTMDAQMLASGTMDAQMLASSTQDSAMLGSKSPDPYRLAQDPYRLAQDPYRLGHDPY
                     RLGHDAYRLGQDPYRLGHDPYRLTPDPYRMSPRPYRIAPRSYRIAPRPYRLAPRPLML
                     ASRRSMMMSYAAERSMMSSYERSMMSYERSMMSPMAERSMMSAYERSMMSAYERSMMS
                     PMAERSMMSAYERSMMSAYERSMMSPMADRSMMSMGADRSMMSSYSAADRSMMSSYSA
                     ADRSMMSSYTADRSMMSMAADSYTDSYTDTYTEAYMVPPLPPEEPPTMPPLPPEEPPM
                     TPPLPPEEPPEGPALPTEQSALTAENTWPTEVPSLPSEESVSQPEPPVSQSEISEPSA
                     VPTDYSVSASDPSVLVSEAAVTVPEPPPEPESSITLTPVESAVVAEEHEVVPERPVTC
                     MVSETPAMSAEPTVLASEPPVMSETAETFDSMRASGHVASEVSTSLLVPAVTTPVLAE
                     SILEPPAMAAPESSAMAVLESSAVTVLESSTVTVLESSTVTVLEPSVVTVPEPPVVAE
                     PDYVTIPVPVVSALEPSVPVLEPAVSVLQPSMIVSEPSVSVQESTVTVSEPAVTVSEQ
                     TQVIPTEVAIESTPMILESSIMSSHVMKGINLSSGDQNLAPEIGMQEIALHSGEEPHA
                     EEHLKGDFYESEHGINIDLNINNHLIAKEMEHNTVCAAGTSPVGEIGEEKILPTSETK
                     QRTVLDTYPGVSEADAGETLSSTGPFALEPDATGTSKGIEFTTASTLSLVNKYDVDLS
                     LTTQDTEHDMVISTSPSGGSEADIEGPLPAKDIHLDLPSNNNLVSKDTEEPLPVKESD
                     QTLAALLSPKESSGGEKEVPPPPKETLPDSGFSANIEDINEADLVRPLLPKDMERLTS
                     LRAGIEGPLLASDVGRDRSAASPVVSSMPERASESSSEEKDDYEIFVKVKDTHEKSKK
                     NKNRDKGEKEKKRDSSLRSRSKRSKSSEHKSRKRTSESRSRARKRSSKSKSHRSQTRS
                     RSRSRRRRRSSRSRSKSRGRRSVSKEKRKRSPKHRSKSREKKKKK"
BASE COUNT     1139 a   1019 c   1035 g   1051 t
ORIGIN
        1 gcctgcggta acagtagcaa tggagttgac cgaacaacct gtgacgacga cagagttgga
       61 gcagcctgtg gggatgacaa cggtggaaca tcctgggcat cctgaggtga caacggcaac
      121 agggttgctg gggcagcctg aggcaacgat ggtgctggag ttgccaggac agccagtggc
      181 aacgacagcg ctggagttgc cggggcagcc ttcggtgact ggggtgccag agttgccagg
      241 gctgccttcg gcaactaggg cactggagtt gtcggggcag cctgtggcaa ctggggcact
      301 ggagttgcct gggccgctca tggcagctgg ggcactggag ttctcggggc agtctggggc
      361 agctggagca ctggagcttt tggggcagcc tctggcaaca ggggtgctgg agttgccagg
      421 gcagcctggg gcgccagagt tgcctgggca gcctgtggca actgtggcgc tggagatctc
      481 tgttcagtct gtggtgacaa catcggagct gtcaacgatg accgtgtcgc agtccctgga
      541 ggtgccctcg acgacagcgc tggaatccta taatacggta gcacaggagc tgcctactac
      601 attagtgggg gagacttctg taacagtagg agtggatccc ttgatggccc cagaatccca
      661 tatattagct tctaacacca tggagaccca tatattagca tccaacacca tggactccca
      721 aatgctagcg tccaacacca tggactccca gatgctagca tccaacacca tggactccca
      781 gatgttagcg tctagcacca tggactccca gatgttagca actagctcca tggactccca
      841 gatgttagca actagctcca tggactccca gatgttagca actagcacta tggactccca
      901 gatgttagca accagttcca tggactccca gatgttagca accagctcca tggactccca
      961 gatgttagca accagctcca tggactccca gatgttagca accagctcca tggactccca
     1021 gatgttagca accagcacca tggattctca gatgttagca accagcacca tggactccca
     1081 gatgttagca actagctcaa tggattccca gatgttagca tctggcacta tggactctca
     1141 aatgttagct tctggcacca tggatgctca gatgttagcg tctggtacca tggatgccca
     1201 gatgttagcg tctagtaccc aagattctgc tatgttgggt tcaaaatctc ctgatcccta
     1261 taggttagct caggatcctt acaggttagc tcaggatccc tataggttgg gccatgaccc
     1321 ctatagatta ggtcatgatg cttacaggtt aggacaagac ccttatagat taggccatga
     1381 tccctacaga ctaactcctg atccctatag gatgtcacct agaccctaca ggatagcacc
     1441 caggtcctat agaatagcac ccaggccata taggttagca cctagacccc tgatgttagc
     1501 atctagacgt tctatgatga tgtcctatgc tgcagaacgt tccatgatgt catcttacga
     1561 acgctctatg atgtcttatg agcggtctat gatgtcccct atggctgaac gctctatgat
     1621 gtcagcctac gagcgctcta tgatgtcagc ctacgagcgc tctatgatgt cccctatggc
     1681 tgagcgctct atgatgtcag cttatgaacg ctccatgatg tcagcttatg aacgctccat
     1741 gatgtcccca atggctgatc gatctatgat gtccatgggt gctgaccggt ctatgatgtc
     1801 gtcatactct gctgctgacc ggtctatgat gtcatcgtac tctgcagctg accgatctat
     1861 gatgtcatct tatactgctg atcgttcaat gatgtctatg gctgctgatt cttacaccga
     1921 ttcttacact gacacatata cagaggcata tatggtgcca cctttgcctc ctgaagagcc
     1981 cccaacaatg ccaccgttgc cacctgagga gccaccaatg acaccaccat tgcctcctga
     2041 ggaaccacca gagggtccag cattgcccac tgagcagtca gcattaacag ctgaaaatac
     2101 ttggcctaca gaggtgccat cattaccatc tgaagagtct gtatcgcagc ctgagcctcc
     2161 tgtgagtcaa agtgagattt cggagccttc agcagtgcct actgattatt cagtgtcagc
     2221 atcagatccc tcagttttag tatcagaggc tgctgtgact gttccagaac caccaccaga
     2281 gccagaatct tcaattacgt taacacctgt agagtctgca gtagtagcag aagaacatga
     2341 agttgttcca gagagaccag tgacttgtat ggtatctgaa actcccgcca tgtcagctga
     2401 accaactgtg ttagcatcag agcctcctgt tatgtcagag acagcagaaa catttgattc
     2461 catgagagcc tcaggacatg ttgcctcaga agtatctaca tccttgttgg ttccagcagt
     2521 aactactcca gtgctggcag agagcattct ggagccgcca gccatggctg ccccagagtc
     2581 ttcagctatg gctgtcctgg agtcttcggc tgtgaccgtc ctggagtctt cgactgtgac
     2641 tgtcctggag tcttcgactg taactgtcct ggagccttcg gttgtgactg tcccggagcc
     2701 tcctgttgtg gctgagccag actatgttac cattcctgtg ccagttgttt ctgcgctgga
     2761 gccttctgtg cctgttctgg aaccagcggt gtcagtcctt caaccttcta tgattgtttc
     2821 agaaccatct gtttctgtcc aggaatcgac tgtgacagtt tcagagcctg ctgtcacagt
     2881 ctcagagcag actcaagtaa taccaactga ggtggctata gagtccacac caatgatact
     2941 ggaatctagt atcatgtcat cacatgttat gaaaggaatt aatctatcct ctggtgatca
     3001 aaatcttgct ccagagattg gcatgcagga gattgcattg cattcaggtg aagaaccaca
     3061 tgctgaggaa cacctgaaag gtgactttta cgaaagtgaa catggtataa atatagacct
     3121 taatataaat aatcatttaa ttgctaaaga gatggaacat aatacagtgt gtgctgctgg
     3181 tactagtcct gttggggaaa ttggtgaaga gaaaattttg cccaccagtg agactaaaca
     3241 gcgcacagta ttggatacct accctggtgt tagtgaagct gatgcaggag aaactctatc
     3301 ttctactggt ccttttgctc tggaacctga tgcaacagga actagtaagg gtattgaatt
     3361 taccacagca tctactctca gtttagttaa taaatatgat gttgatttat ctttaactac
     3421 tcaagatact gaacatgaca tggtaatttc caccagtcct agtggtggta gtgaagctga
     3481 cattgaaggg cctttgcctg ctaaagatat tcatcttgat ttaccatcta ataataacct
     3541 tgttagtaag gatacagaag aaccattacc tgtaaaagag agtgaccaga cattagcagc
     3601 tctgctcagc cctaaagaaa gtagtggagg agaaaaagaa gtacctcccc ctcctaaaga
     3661 gacactgcct gattcaggat tttctgccaa tattgaggat attaatgaag cagatttagt
     3721 gagaccgtta cttcctaagg acatggaacg tcttacaagc cttagagctg gcattgaagg
     3781 acctttactt gcaagtgatg ttggacgtga cagatctgct gccagcccgg ttgtaagtag
     3841 tatgccagaa agagcttcag agtcttcttc agaggaaaaa gatgattatg aaatttttgt
     3901 aaaagttaag gacactcacg aaaaaagcaa gaaaaataag aaccgtgata agggggagaa
     3961 agagaagaaa agagactctt cattaagatc tcgaagtaag cgttccaaat cttctgaaca
     4021 caaatcacgc aagcgtacca gtgaatctcg ttctagggca agaaagagat catctaagtc
     4081 caagtctcat cgctctcaga cacgttcacg gtcacgttca agacgcagga ggagaagcag
     4141 cagatcaaga tcaaagtcta gaggaagaag atctgtatca aaagagaagc gcaaaagatc
     4201 tccaaagcac agatccaagt ctagggaaaa aaaaaaaaaa aaaa
//