LOCUS AF163151 9944 bp DNA linear HUM 15-NOV-2000 DEFINITION Homo sapiens dentin sialophosphoprotein precursor (DSPP) gene, complete cds. ACCESSION AF163151 VERSION AF163151.2 KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 9944) AUTHORS Gu,K., Chang,S., Ritchie,H.H., Clarkson,B.H. and Rutherford,R.B. TITLE Molecular cloning of a human dentin sialophosphoprotein gene JOURNAL Eur. J. Oral Sci. 108 (1), 35-42 (2000) PUBMED 10706475 REFERENCE 2 (bases 1 to 9944) AUTHORS Gu,K., Racenis,P., Clarkson,B.H. and Rutherford,R.B. TITLE Cloning and functional analysis of cis-acting element regulating the expression of human dentin sialophosphoprotein JOURNAL Unpublished REFERENCE 3 (bases 1 to 9944) AUTHORS Gu,K. TITLE Direct Submission JOURNAL Submitted (24-JUN-1999) Cariology, Restorative Sciences and Endodontics, University of Michigan School of Dentistry, 1011 N. University Ave., Ann Arbor, MI 48109, USA REFERENCE 4 (bases 1 to 9944) AUTHORS Gu,K. and Rutherford,R.B. TITLE Direct Submission JOURNAL Submitted (09-MAR-2000) Cariology, Restorative Sciences and Endodontics, University of Michigan School of Dentistry, 1011 N. University Ave., Ann Arbor, MI 48109, USA REMARK Sequence update by submitter COMMENT On Nov 15, 2000 this sequence version replaced AF163151.1. FEATURES Location/Qualifiers source 1..9944 /organism="Homo sapiens" /mol_type="genomic DNA" /db_xref="taxon:9606" gene 1..9944 /gene="DSPP" regulatory 1..1749 /regulatory_class="promoter" /gene="DSPP" mRNA join(1750..1841,4102..4180,5320..5403,5537..6523, 7000..9944) /gene="DSPP" /product="dentin sialophosphoprotein" exon 1750..1841 /gene="DSPP" /number=1 exon 4102..4180 /gene="DSPP" /number=2 CDS join(4130..4180,5320..5403,5537..6523,7000..9639) /gene="DSPP" /codon_start=1 /product="dentin sialophosphoprotein precursor" /protein_id="AAF42472.1" /translation="MKIITYFCIWAVAWAIPVPQSKPLERHVEKSMNLHLLARSNVSV QDELNASGTIKESGVLVHEGDRGRQENTQDGHKGEGNGSKWAEVGGKSFSTYSTLANE EGNIEGWNGDTGKAETYGHDGIHGKEENITANGIQGQVSIIDNAGATNRSNTNGNTDK NTQNGDVGDAGHNEDVAVVQEDGPQVAGSNNSTDNEDEIIENSCRNEGNTSEITPQIN SKRNGTKEAEVTPGTGEDAGLDNSDGSPSGNGADEDEDEGSGDDEDEEAGNGKDSSNN SKGQEGQDHGKEDDHDSSIGQNSDSKEYYDPEGKEDPHNEVDGDKTSKSEENSAGIPE DNGSQRIEDTQKLNHRESKRVENRITKESETHAVGKSQDKGIEIKGPSSGNRNITKEV GKGNEGKEDKGQHGMILGKGNVKTQGEVVNIEGPGQKSEPGNKVGHSNTGSDSNSDGY DSYDFDDKSMQGDDPNSSDESNGNDDANSESDNNSSSRGDASYNSDESKDNGNGSDSK GAEDDDSDSTSDTNNSDSNGNGNNGNDDNDKSDSGKGKSDSSDSDSSDSSNSSDSSDS SDSDSSDSNSSSDSDSSDSDSSDSSDSDSSDSSNSSDSSDSSDSSDSSDSSDSSDSKS DSSKSESDSSDSDSKSDSSDSNSSDSSDNSDSSDSSNSSNSSDSSDSSDSSDSSSSSD SSSSSDSSNSSDSSDSSDSSNSSESSDSSDSSDSDSSDSSDSSNSNSSDSDSSNSSDS SDSSDSSDSSNSSDSSDSSDSSNSSDSSDSSDSSDSSDSSNSSDSNDSSNSSDSSDSS NSSDSSNSSDSSDSSDSSDSDSSNSSDSSNSSDSSDSSNSSDSSDSSDSSDSSDSDSS NRSDSSNSSDSSDSSDSSNSSDSSDSSDSSDSNESSNSSDSSDSSNSSDSDSSDSSNS SDSSDSSNSSDSSESSNSSDNSNSSDSSNSSDSSDSSDSSNSSDSSNSGDSSNSSDSS DSNSSDSSDSSNSSDSSDSSDSSDSSDSSDSSNSSDSSDSSDSSDSSNSSDSSNSSDS SDSSDSSDSSDSSDSSNSSDSSDSSDSSDSSDSSGSSDSSDSSDSSDSSDSSDSSDSS DSSDSSESSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSNSSDSSDSSDSSDSSD SSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSNESSDSSDSSDSSDSSNSSDS SDSSDSSDSTSDSNDESDSQSKSGNGNNNGSDSDSDSEGSDSNHSTSDD" sig_peptide 4130..4174 /gene="DSPP" mat_peptide join(4175..4180,5320..5403,5537..6523,7000..9636) /gene="DSPP" /product="dentin sialophosphoprotein" mat_peptide join(4175..4180,5320..5403,5537..6523,7000..7263) /gene="DSPP" /product="dentin sialoprotein" mat_peptide 7264..9636 /gene="DSPP" /product="phosphophoryn" misc_feature 7339..7347 /gene="DSPP" /note="Region: cell binding domain" exon 5320..5403 /gene="DSPP" /number=3 exon 5537..6523 /gene="DSPP" /number=4 exon 7000..9944 /gene="DSPP" /number=5 regulatory 9731..9736 /regulatory_class="polyA_signal_sequence" /gene="DSPP" regulatory 9914..9919 /regulatory_class="polyA_signal_sequence" /gene="DSPP" BASE COUNT 3326 a 1761 c 2180 g 2677 t ORIGIN 1 atgaggtcat tgtttcaaaa cttcttctct tcatatatgg ttatttagtg ctataatttc 61 atcctaaata actgctttag tgacatccca cagattttga tatgctatgc tttcattttc 121 agtcatccca tatattttga atttcccttt tggtttcttc tttcacccat tgactattta 181 tgagtgttcc acttagcctt aaaatacttg cggttttcca gaattctttt gttattgatt 241 tctgattgcc ttccattgtg gtcagtaaac ataatttgta cgacttgaat tcttttaaat 301 ttattgagat tcattttggg gctcagaata cgttctatct tggtaaatgg tgcaggtaca 361 cttgaaaaga atgtttattc ttctgttgtt ggatgaagtc gtctgtaaat gtcaattaga 421 tcaagttgga tgatacttct gtttaaaggt tgtatatcca ttatattttt ctttcaatta 481 ttaatagatg gctattgaaa tctctaacca taattcatgt taattcaaat taaggaaaag 541 aaaggcaggt ttagttcacc aacctccact gatgaatgtg tgtgtgggta agggtgtcac 601 tcccataagt gactcgagcc ttgttaaaag ttcttaggcc ctttaatctg cccaagtaaa 661 catgctatat gaataaaaac aaagcattca ctatgaaaac agcttcaagg caaatttcct 721 tcccattcaa ttcacttcct tttaacctta ctttttccct acatataaag ctttatttgt 781 atataaatac catatatttt tgtatatata gtttctggag ataccactgt tgcatagtga 841 tatagcatta gtggtcttat gtgtttcacc ttctttaata ttgtaacaca aaaattggca 901 tattcctgaa aattctaaac aagtatatag agaaagacgg tcaagttttt gaatgcatga 961 attgaagaaa tagtttaggg attacctgca ttatagtaca tctagggttc aagagaatgg 1021 tgcactcatg ttgcaaaact tgttagacat aaagtctagt tttctacttt gcattctgca 1081 gctaacacag ataattgaga gataatggac agctaaatct gttgtcttta tgcacctttg 1141 gacatcttgc tacttttgac attttcaatg tctgtttttg aaatgtagtt tgaatactca 1201 attgatagat ttttaagctt ttatgtcact gcttatttca gttaaatcag agagatttca 1261 ggggagttac tgtttataga ttgttaaaga tccctcaggc aacgtcaagg ctgctgcatg 1321 tggacagttt cttgaattat agcacagaaa ctttaatacc tacctcaaga atgatagggg 1381 tcttaaatag gcagtcatat ttactaaaga aacctagagt tttcttagat tgccaacctt 1441 ggtaagacaa gaaatgtcag ggtgacagag tttaagtggc tcttttcagg tatcttacac 1501 tgattctcta tatttaagat gtgaaacagc ctcataggag ctgtttaatt aagtgaaagt 1561 aagtctagtc cttttggaac cgaacggttc agtgagcaaa tacatatcac caagtgaagg 1621 aagtggggaa gttattatgt gcttcataga ctcgatgaca ctttataacc ccatatcagg 1681 gatcctaaac agtgattggt tgagaaaatt atcaaactga gtttaaattt cggcaagtac 1741 aaaattgtca tgcaaaagtc caggacagtg ggccactttc agtcttcaaa gagaaagata 1801 agaaattctg gattttcaaa atccttttga agccttttaa ggtaagatga aatatccttt 1861 ttactcagaa ccaactgatt catttagaaa gaactttgaa tttcaaagat gaagccagtt 1921 tgattttaag aagcgagtac cccttaatga ttagattgta tgcttccttt ttgacttgtc 1981 atattgatag tatgtataaa agataacgga cgattacgac ctaaggaaga gatagattgg 2041 gaagaagaaa gacctcgtac tgaaaaattg gccaactgag gtggaaattt gacaattaac 2101 tatctgggca ctttgattag ttttgataaa aaatgagata actcagattt caaaaatcca 2161 ccttgggctt tcaaacaagg cttcaattag gctttgcttt ttagtatttt attacttact 2221 attacttatt atttattgtc ccacatgaaa tgaaatttag caatcactaa tgatgccaaa 2281 tctaattgct aaatgaaatg aagctaaatc tcatttcatt agtaacaata aatgaaataa 2341 tctgatggag cttcacaaat tctgaagtct ttgtttcatg ctgaggtcac ctgggccatt 2401 tttattgtag tcttcgaagt cattcacctg ccttggaaac ggtgataacc atcatggaat 2461 tgttcaggag tggagctgaa agagagatgt agtggtcaga tttctgaact gtagctcaga 2521 aactggacac gtatcactct ggccttggct gcaggtacct ttccagtatg ctgaggctct 2581 tccaaatcac agtgcagacg ggccttctgc agagctatgt aatgattagg cttgggactg 2641 caaagtacag gataactgtg gcttagtaaa cagctggcct tcaacatctg tgccccagag 2701 ctctgcatga tacttgtcct ggtgtcacct cagcctcact tgaatctatg gcatttcaga 2761 aggagctcta gctgttcttg gctttctgtt gaacagctat aagaatgagc acttttttcc 2821 ctctcagtag ctctggaact gtgtcatctc tcctgtgaga aaacgccagt aattctcatg 2881 acagttgata ttcagtgaag ttttattata ttttcactac caccattaaa ttcaatcaaa 2941 gccattttat gacatgcagc attataatct atacatctgg tgggagttca tgaaatagga 3001 gtaaaactct cctttctatc attacttcaa gaaatccaac ttgcaatata aattaatttt 3061 tttactcaca cagattataa aatgtctatt ccaacttatc agaaacatgt tttagaccat 3121 ttctgaattt gaattctaac agggatgaag aatcatgatt ttagaagtcc cataaaataa 3181 ttgctatcat ttattcaaaa attgcaaagt gcctgaagca atgctagata ttgctgatag 3241 tcataaatat ttatcaacaa cattcagaaa acgttttttt ctgtgctttg cattggaata 3301 caataatcac caagacactc tcctgggcct caggagctta caggaaatca gggcaacaca 3361 taagtaacta ggcaatttta aacagtgcaa tgcgttacca gtgagacgtg caaacttcct 3421 tggtataaaa aggaaagaga taccaaatac cctttgaagt ggcgtcagag agggcgtctc 3481 agagataatt ctaccaaact tcaggataat cctgaggtgc aggtgttgtt attattccag 3541 gtggagggat aataaaccta cttaaatttc tcaagcttac acagcaagta gcaggggtaa 3601 catttgaacc caggtctctg aatacaaacc ccgtattctt tccactagcg taggctccct 3661 catgttagta atttctttct cttaaagtct ggtatagctc aattctatag atttggagta 3721 aggatgacaa gtgttttacc tttgaagcac aatttcagca gaattagtta gtacttgatt 3781 aaagctattc agaagagaaa tagatgtttt tacacccaag aattgcagaa gaacaaagtt 3841 acagctatgc cctttgtacc tattatggtg ttttccttca ttggcacagg cagaaaaaaa 3901 tctaggaagc tacattagtg ctgagcctgg tgatgtcccc ataaccacac caggtatgtt 3961 ctggaccatc gtatgtcttc tcgtgttaga tacatgcttc ttgtccagga aaagggcaaa 4021 tgcttacaca tcaaaataat atagtactat gattttccct ttactttata agtaattttg 4081 tgctgttcct tttttataca gccattgatt attattattc ctaaagaaaa tgaagataat 4141 tacatatttt tgcatttggg cagtagcatg ggccattcca gtaagtatgc ctttcttaga 4201 aaacctcttc actttgttat cttttttaac ctaacattaa tacaaaatgt agtgtgtgtg 4261 tgtgtgtgtg tgtgtgtgtg tgtgtgcatg tacatgtgtg tatatatgtg tgtgtgtata 4321 tatgtttcct taattttttt taacaggctg agtctaaaca tttagatttg cactaagggc 4381 tttatgtgat atctgtgagg tttcaacaaa accactccaa ttcatcgtct cattcctcta 4441 tagaaactca tatctcgtct gaaggattat tattatttaa aacatttatt cagattaatt 4501 tacacttaat gcccagaagt catggagact ttgtccatct ttgcttcata ctctgtgaat 4561 ttcattctaa tacgaacaaa gtctgtgctg tttaggaagt ttccaagaaa gaataataag 4621 aaaaagtaga ttttttttca acatatagga gactaatttt tcactcagag ttattattta 4681 tgtgctcact gtggaaaatt tggaatatat gacgaaaacc aataaaaaat tgagaaaatt 4741 caaccattta taattttact agccagccat catgtttaac attttcatat gctttcataa 4801 taccaaacat ttggtattta tgtagttgaa aatgttctca agtatttcaa atgtgctctt 4861 gcagagcaca gaagtatact agcgtaatac ttgattttgc ttctgtgcag gctctggtca 4921 cgcctcctgt tctcttaaga gttttcatca ggattacact tagagcgggt ttgtgctagt 4981 gcaagaggct ttttgtagag aaacaccaga ggtctatccc ctcgtctttc tacaagactc 5041 tttccttcta cagttgagat aagtgggctg atctaacacg tccataaaat tggtaatacc 5101 acagtgaaaa atatccatgt acccagttta aattctacac aagccctgta agaagccact 5161 tctcttttct atctgattag atcatacttt ggcctttgtg ttaaaccttt cttcttcatg 5221 gagggaagaa tatttgtgtg tgtgtgtgtg tgtgtgcacg ctcacacaca tattcacaaa 5281 taagaacctt ttcaatagcc agtattttct acttggcagg ttcctcaaag caaaccactg 5341 gagagacatg tcgaaaaatc catgaatttg catctcctag caagatcaaa tgtgtcagta 5401 caggtatagg atgtaatata tttcatttta tttcctattt ctgagttgct acattccatt 5461 aacttctcca agattgcaat ttgctttcct tcaagatcat tgacactcat aattgattga 5521 attgtttctt tttcaggatg agttaaatgc cagtggaacc atcaaagaaa gtggtgtcct 5581 ggtgcatgaa ggtgatagag gaaggcaaga gaatacccaa gatggtcaca agggagaagg 5641 gaatggctct aagtgggcag aagtaggagg gaagagtttt tctacatatt ccacattagc 5701 aaacgaagag gggaatattg agggctggaa tggggacaca ggaaaagcag aaacatatgg 5761 tcatgatgga atacatggga aagaagaaaa catcacagca aatggcatcc agggacaagt 5821 aagcatcatt gacaatgctg gagccacaaa cagaagcaac actaatggaa atactgataa 5881 gaatacccaa aatggggatg ttggcgatgc aggtcacaat gaggatgtcg ctgttgtcca 5941 agaagatgga cctcaagtag ctggaagcaa taacagtaca gacaatgagg atgaaataat 6001 tgagaattcc tgtagaaacg agggtaatac aagtgaaata acacctcaga tcaacagcaa 6061 gagaaatggg actaaggaag ctgaggtaac accaggcact ggagaagatg ctggcctgga 6121 taattccgat gggagtccta gtgggaatgg agcagatgag gatgaagacg agggttctgg 6181 tgatgatgaa gatgaagaag cagggaatgg aaaagacagt agtaataaca gcaagggcca 6241 ggagggccag gaccatggga aagaagatga tcatgatagt agcataggtc aaaattcgga 6301 tagtaaagaa tattatgacc ctgaaggcaa agaagatccc cataatgaag ttgatggaga 6361 caagacctcc aagagtgagg agaattctgc tggtattcca gaagacaatg gcagccaaag 6421 aatagaggac acccagaagc tcaaccatag agaaagcaaa cgcgtagaaa atagaatcac 6481 caaagaatca gagacacatg ctgttgggaa gagccaagat aaggttagtt tgtaaagctg 6541 atttctttca atggcagttt aaattcttcc cctccatcta ttgatgctag cacaaaaata 6601 aaccatgaca agcatccatg tatttttgta tccatattac ttgactattt aaggaaatct 6661 agagtcctta ctagacttcg agatagaaca actttaaaca tcttacattt ctgataactt 6721 agttataatt ctagaaaagt cttatgtgaa atcatggatc cccatgtaat tgtttacaaa 6781 agttcctact gggtaggaat gtggatgaat ttttaaggaa tctaagcacc aggatgcttt 6841 caattacaga ataaagcaca ttttcacaaa taactgtgaa gtactagaaa tgtaactcct 6901 atccctatgg caacttttcc cagttattct tcctcagatc aatgcaattt tgcagcaaat 6961 attcactagt taatcattct ttcctccatc cttccatagg gaatagaaat caagggtccc 7021 agcagtggca acagaaatat taccaaagaa gttgggaaag gcaacgaagg taaagaggat 7081 aaaggacaac atggaatgat cttgggcaaa ggcaatgtca agacacaagg agaggttgtc 7141 aacatagaag gacctggcca aaaatcagaa ccaggaaata aagttggaca cagcaataca 7201 ggtagtgaca gcaatagtga tggatatgac agttatgatt ttgatgataa gtccatgcaa 7261 ggagatgatc ccaatagcag tgatgaatct aatggcaatg atgatgctaa ttcagaaagt 7321 gacaataaca gcagtagccg aggagatgct tcttataact ctgatgaatc aaaagataat 7381 ggcaatggca gtgactcaaa aggagcagaa gatgatgaca gtgatagcac atcagacact 7441 aataatagtg acagtaatgg caatggtaac aatgggaatg atgacaatga caaatcagac 7501 agtggcaaag gtaaatcaga tagcagtgac agtgatagta gtgatagcag caatagcagt 7561 gatagtagtg acagcagtga cagtgacagc agtgatagca acagtagcag tgatagtgac 7621 agcagtgaca gtgacagcag tgatagcagt gacagtgata gtagtgatag cagcaatagc 7681 agtgacagta gtgacagcag tgatagcagt gacagtagtg atagtagtga cagcagtgac 7741 agcaagtcag acagcagcaa atcagagagc gacagcagtg atagtgacag taagtcagac 7801 agcagtgaca gcaacagcag tgacagtagt gacaacagtg atagcagcga cagcagcaat 7861 agcagtaaca gcagtgatag tagtgacagc agtgatagca gtgacagcag cagtagcagt 7921 gacagcagca gtagcagtga cagcagcaac agcagtgata gtagtgacag tagtgacagc 7981 agcaatagca gtgagagcag tgatagtagt gacagcagtg atagtgacag cagtgatagt 8041 agtgacagca gtaatagtaa cagcagcgat agtgacagca gcaacagcag cgatagcagt 8101 gacagcagtg atagcagtga cagcagcaac agcagtgaca gtagcgatag cagtgacagc 8161 agcaacagca gtgacagcag tgatagcagt gacagcagtg atagtagtga cagcagcaac 8221 agcagtgata gcaacgacag cagcaatagc agtgacagca gtgatagcag caacagcagt 8281 gatagcagca acagcagtga tagcagtgat agcagtgaca gcagtgatag cgacagcagc 8341 aatagcagtg acagcagtaa tagtagtgac agcagcgata gcagcaacag cagtgatagc 8401 agcgacagca gcgatagcag tgacagcagt gatagcgaca gcagcaatag aagtgacagt 8461 agtaatagta gtgacagcag cgatagcagt gacagcagca acagcagtga cagcagtgat 8521 agtagtgaca gcagtgacag caacgaaagc agcaatagca gtgacagcag tgatagcagc 8581 aacagcagtg atagtgacag cagtgatagc agcaacagca gtgacagcag tgatagcagc 8641 aacagcagtg atagcagtga aagcagtaat agtagtgaca acagcaatag cagtgacagc 8701 agcaacagca gtgacagcag tgatagcagt gacagcagta atagtagtga cagcagcaat 8761 agcggtgaca gcagcaacag cagtgacagc agtgatagca atagcagcga cagcagtgac 8821 agcagcaaca gcagcgatag cagtgacagc agtgatagca gtgacagcag tgacagcagt 8881 gatagcagca acagcagtga tagcagtgac agcagtgaca gcagtgatag cagtaatagt 8941 agtgacagca gcaacagcag tgacagcagc gatagcagtg acagcagcga tagcagtgac 9001 agcagtgaca gcagcaatag cagtgacagc agtgacagca gcgacagcag tgatagcagt 9061 gacagcagtg gcagcagcga cagcagtgat agcagtgaca gcagtgatag cagcgatagc 9121 agtgacagca gcgacagcag tgacagcagt gacagcagtg aaagcagcga cagcagcgat 9181 agcagcgaca gcagtgacag cagcgacagc agtgacagca gcgatagcag cgacagcagc 9241 gacagcagcg atagcagtga cagcagcaat agcagtgata gcagcgacag cagtgatagc 9301 agtgacagca gcgacagcag cgatagcagc gacagcagtg atagtagtga tagcagtgac 9361 agcagtgaca gcagcgacag cagtgacagc agcgacagca gtgacagcag cgacagcagt 9421 gacagcaatg aaagcagcga cagcagtgac agcagcgata gcagtgacag cagcaacagc 9481 agtgacagca gcgacagcag tgatagcagt gacagcacat ctgacagcaa tgatgagagt 9541 gacagccaga gcaagtctgg taacggtaac aacaatggaa gtgacagtga cagtgacagt 9601 gaaggcagtg acagtaacca ctcaaccagt gatgattaga acaaaagaaa aacccataag 9661 attccttttg tgaaaagttt ggtaatggga taggaaaaaa agatttccaa gaaagtaaag 9721 aaaggggaga aataaacata agacgtatgt aaacaaaaac aactggggga atcaaatcaa 9781 acagttggat tcagaaccaa gacctaactc ctgcagagac agactctgaa tgcatgacct 9841 ttggtacatg cctgttaata ttcatgttct gaaaatattt tgttaaaagt gtaaatctaa 9901 acataaaaga acaattaaaa tattctttaa tacttcacac agaa //