LOCUS       BC002422                4244 bp    mRNA    linear   HUM 04-NOV-2003
DEFINITION  Homo sapiens SON DNA binding protein, mRNA (cDNA clone
            IMAGE:3161999), partial cds.
ACCESSION   BC002422
VERSION     BC002422.2
KEYWORDS    .
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4244)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 4244)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (05-FEB-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Aug 19, 2003 this sequence version replaced BC002422.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 5 Row: m Column: 1
            This clone was selected for full length sequencing because it
            passed the following selection criteria: Hexamer frequency ORF
            analysis.
FEATURES             Location/Qualifiers
     source          1..4244
                     /db_xref="H-InvDB:HIT000086129"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:3161999"
                     /tissue_type="Brain, neuroblastoma"
                     /clone_lib="NIH_MGC_19"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..>4244
                     /gene="SON"
                     /gene_synonym="BASS1"
                     /gene_synonym="C21orf50"
                     /gene_synonym="DBP-5"
                     /gene_synonym="FLJ21099"
                     /gene_synonym="KIAA1019"
                     /gene_synonym="NREBP"
                     /gene_synonym="SON3"
                     /db_xref="GeneID:6651"
                     /db_xref="MIM:182465"
     CDS             149..>4244
                     /gene="SON"
                     /gene_synonym="BASS1"
                     /gene_synonym="C21orf50"
                     /gene_synonym="DBP-5"
                     /gene_synonym="FLJ21099"
                     /gene_synonym="KIAA1019"
                     /gene_synonym="NREBP"
                     /gene_synonym="SON3"
                     /codon_start=1
                     /product="SON protein"
                     /protein_id="AAH02422.1"
                     /db_xref="GeneID:6651"
                     /db_xref="MIM:182465"
                     /translation="MVLELPGQPVATTALELPGQPSVTGVPELPGLPSATRALELSGQ
                     PVATGALELPGPLMAAGALEFSGQSGAAGALELLGQPLATGVLELPGQPGAPELPGQP
                     VATVALEISVQSVVTTSELSTMTVSQSLEVPSTTALESYNTVAQELPTTLVGETSVTV
                     GVDPLMAPESHILASNTMETHILASNTMDSQMLASNTMDSQMLASNTMDSQMLASSTM
                     DSQMLATSSMDSQMLATSSMDSQMLATSTMDSQMLATSSMDSQMLATSSMDSQMLATS
                     SMDSQMLATSSMDSQMLATSTMDSQMLATSTMDSQMLATSSMDSQMLASGTMDSQMLA
                     SGTMDAQMLASGTMDAQMLASSTQDSAMLGSKSPDPYRLAQDPYRLAQDPYRLGHDPY
                     RLGHDAYRLGQDPYRLGHDPYRLTPDPYRMSPRPYRIAPRSYRIAPRPYRLAPRPLML
                     ASRRSMMMSYAAERSMMSSYERSMMSYERSMMSPMAERSMMSAYERSMMSAYERSMMS
                     PMAERSMMSAYERSMMSAYERSMMSPMADRSMMSMGADRSMMSSYSAADRSMMSSYSA
                     ADRSMMSSYTADRSMMSMAADSYTDSYTDTYTEAYMVPPLPPEEPPTMPPLPPEEPPM
                     TPPLPPEEPPEGPALPTEQSALTAENTWPTEVPSLPSEESVSQPEPPVSQSEISEPSA
                     VPTDYSVSASDPSVLVSEAAVTVPEPPPEPESSITLTPVESAVVAEEHEVVPERPVTC
                     MVSETPAMSAEPTVLASEPPVMSETAETFDSMRASGHVASEVSTSLLVPAVTTPVLAE
                     SILEPPAMAAPESSAMAVLESSAVTVLESSTVTVLESSTVTVLEPSVVTVPEPPVVAE
                     PDYVTIPVPVVSALEPSVPVLEPAVSVLQPSMIVSEPSVSVQESTVTVSEPAVTVSEQ
                     TQVIPTEVAIESTPMILESSIMSSHVMKGINLSSGDQNLAPEIGMQEIALHSGEEPHA
                     EEHLKGDFYESEHGINIDLNINNHLIAKEMEHNTVCAAGTSPVGEIGEEKILPTSETK
                     QRTVLDTYPGVSEADAGETLSSTGPFALEPDATGTSKGIEFTTASTLSLVNKYDVDLS
                     LTTQDTEHDMVISTSPSGGSEADIEGPLPAKDIHLDLPSNNNLVSKDTEEPLPVKESD
                     QTLAALLSPKESSGGEKEVPPPPKETLPDSGFSANIEDINEADLVRPLLPKDMERLTS
                     LRAGIEGPLLASDVGRDRSAASPVVSSMPERASESSSEEKDDYEIFVKVKDTHEKSKK
                     NKNRDKGEKEKKRDSSLRSRSKRSKSSEHKSRKRTSESRSRARKRSSKSKSHRSQTRS
                     RSRSRRRRRSSRSRSKSRGRRSVSKEKRKRSPKHRSKSREKKKKK"
BASE COUNT         1139 a         1019 c         1035 g         1051 t
ORIGIN      
        1 gcctgcggta acagtagcaa tggagttgac cgaacaacct gtgacgacga cagagttgga
       61 gcagcctgtg gggatgacaa cggtggaaca tcctgggcat cctgaggtga caacggcaac
      121 agggttgctg gggcagcctg aggcaacgat ggtgctggag ttgccaggac agccagtggc
      181 aacgacagcg ctggagttgc cggggcagcc ttcggtgact ggggtgccag agttgccagg
      241 gctgccttcg gcaactaggg cactggagtt gtcggggcag cctgtggcaa ctggggcact
      301 ggagttgcct gggccgctca tggcagctgg ggcactggag ttctcggggc agtctggggc
      361 agctggagca ctggagcttt tggggcagcc tctggcaaca ggggtgctgg agttgccagg
      421 gcagcctggg gcgccagagt tgcctgggca gcctgtggca actgtggcgc tggagatctc
      481 tgttcagtct gtggtgacaa catcggagct gtcaacgatg accgtgtcgc agtccctgga
      541 ggtgccctcg acgacagcgc tggaatccta taatacggta gcacaggagc tgcctactac
      601 attagtgggg gagacttctg taacagtagg agtggatccc ttgatggccc cagaatccca
      661 tatattagct tctaacacca tggagaccca tatattagca tccaacacca tggactccca
      721 aatgctagcg tccaacacca tggactccca gatgctagca tccaacacca tggactccca
      781 gatgttagcg tctagcacca tggactccca gatgttagca actagctcca tggactccca
      841 gatgttagca actagctcca tggactccca gatgttagca actagcacta tggactccca
      901 gatgttagca accagttcca tggactccca gatgttagca accagctcca tggactccca
      961 gatgttagca accagctcca tggactccca gatgttagca accagctcca tggactccca
     1021 gatgttagca accagcacca tggattctca gatgttagca accagcacca tggactccca
     1081 gatgttagca actagctcaa tggattccca gatgttagca tctggcacta tggactctca
     1141 aatgttagct tctggcacca tggatgctca gatgttagcg tctggtacca tggatgccca
     1201 gatgttagcg tctagtaccc aagattctgc tatgttgggt tcaaaatctc ctgatcccta
     1261 taggttagct caggatcctt acaggttagc tcaggatccc tataggttgg gccatgaccc
     1321 ctatagatta ggtcatgatg cttacaggtt aggacaagac ccttatagat taggccatga
     1381 tccctacaga ctaactcctg atccctatag gatgtcacct agaccctaca ggatagcacc
     1441 caggtcctat agaatagcac ccaggccata taggttagca cctagacccc tgatgttagc
     1501 atctagacgt tctatgatga tgtcctatgc tgcagaacgt tccatgatgt catcttacga
     1561 acgctctatg atgtcttatg agcggtctat gatgtcccct atggctgaac gctctatgat
     1621 gtcagcctac gagcgctcta tgatgtcagc ctacgagcgc tctatgatgt cccctatggc
     1681 tgagcgctct atgatgtcag cttatgaacg ctccatgatg tcagcttatg aacgctccat
     1741 gatgtcccca atggctgatc gatctatgat gtccatgggt gctgaccggt ctatgatgtc
     1801 gtcatactct gctgctgacc ggtctatgat gtcatcgtac tctgcagctg accgatctat
     1861 gatgtcatct tatactgctg atcgttcaat gatgtctatg gctgctgatt cttacaccga
     1921 ttcttacact gacacatata cagaggcata tatggtgcca cctttgcctc ctgaagagcc
     1981 cccaacaatg ccaccgttgc cacctgagga gccaccaatg acaccaccat tgcctcctga
     2041 ggaaccacca gagggtccag cattgcccac tgagcagtca gcattaacag ctgaaaatac
     2101 ttggcctaca gaggtgccat cattaccatc tgaagagtct gtatcgcagc ctgagcctcc
     2161 tgtgagtcaa agtgagattt cggagccttc agcagtgcct actgattatt cagtgtcagc
     2221 atcagatccc tcagttttag tatcagaggc tgctgtgact gttccagaac caccaccaga
     2281 gccagaatct tcaattacgt taacacctgt agagtctgca gtagtagcag aagaacatga
     2341 agttgttcca gagagaccag tgacttgtat ggtatctgaa actcccgcca tgtcagctga
     2401 accaactgtg ttagcatcag agcctcctgt tatgtcagag acagcagaaa catttgattc
     2461 catgagagcc tcaggacatg ttgcctcaga agtatctaca tccttgttgg ttccagcagt
     2521 aactactcca gtgctggcag agagcattct ggagccgcca gccatggctg ccccagagtc
     2581 ttcagctatg gctgtcctgg agtcttcggc tgtgaccgtc ctggagtctt cgactgtgac
     2641 tgtcctggag tcttcgactg taactgtcct ggagccttcg gttgtgactg tcccggagcc
     2701 tcctgttgtg gctgagccag actatgttac cattcctgtg ccagttgttt ctgcgctgga
     2761 gccttctgtg cctgttctgg aaccagcggt gtcagtcctt caaccttcta tgattgtttc
     2821 agaaccatct gtttctgtcc aggaatcgac tgtgacagtt tcagagcctg ctgtcacagt
     2881 ctcagagcag actcaagtaa taccaactga ggtggctata gagtccacac caatgatact
     2941 ggaatctagt atcatgtcat cacatgttat gaaaggaatt aatctatcct ctggtgatca
     3001 aaatcttgct ccagagattg gcatgcagga gattgcattg cattcaggtg aagaaccaca
     3061 tgctgaggaa cacctgaaag gtgactttta cgaaagtgaa catggtataa atatagacct
     3121 taatataaat aatcatttaa ttgctaaaga gatggaacat aatacagtgt gtgctgctgg
     3181 tactagtcct gttggggaaa ttggtgaaga gaaaattttg cccaccagtg agactaaaca
     3241 gcgcacagta ttggatacct accctggtgt tagtgaagct gatgcaggag aaactctatc
     3301 ttctactggt ccttttgctc tggaacctga tgcaacagga actagtaagg gtattgaatt
     3361 taccacagca tctactctca gtttagttaa taaatatgat gttgatttat ctttaactac
     3421 tcaagatact gaacatgaca tggtaatttc caccagtcct agtggtggta gtgaagctga
     3481 cattgaaggg cctttgcctg ctaaagatat tcatcttgat ttaccatcta ataataacct
     3541 tgttagtaag gatacagaag aaccattacc tgtaaaagag agtgaccaga cattagcagc
     3601 tctgctcagc cctaaagaaa gtagtggagg agaaaaagaa gtacctcccc ctcctaaaga
     3661 gacactgcct gattcaggat tttctgccaa tattgaggat attaatgaag cagatttagt
     3721 gagaccgtta cttcctaagg acatggaacg tcttacaagc cttagagctg gcattgaagg
     3781 acctttactt gcaagtgatg ttggacgtga cagatctgct gccagcccgg ttgtaagtag
     3841 tatgccagaa agagcttcag agtcttcttc agaggaaaaa gatgattatg aaatttttgt
     3901 aaaagttaag gacactcacg aaaaaagcaa gaaaaataag aaccgtgata agggggagaa
     3961 agagaagaaa agagactctt cattaagatc tcgaagtaag cgttccaaat cttctgaaca
     4021 caaatcacgc aagcgtacca gtgaatctcg ttctagggca agaaagagat catctaagtc
     4081 caagtctcat cgctctcaga cacgttcacg gtcacgttca agacgcagga ggagaagcag
     4141 cagatcaaga tcaaagtcta gaggaagaag atctgtatca aaagagaagc gcaaaagatc
     4201 tccaaagcac agatccaagt ctagggaaaa aaaaaaaaaa aaaa
//