LOCUS HUMINCP 3716 bp DNA linear HUM 27-APR-1993 DEFINITION Human cysteine-proteinase inhibitor (CST1) gene, complete cds. ACCESSION M19169 VERSION M19169.1 KEYWORDS cysteine proteinase inhibitor; salivary cystatine. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3716) AUTHORS Saitoh,E., Kim,H.S., Smithies,O. and Maeda,N. TITLE Human cysteine-proteinase inhibitors: nucleotide sequence analysis of three members of the cystatin gene family JOURNAL Gene 61 (3), 329-338 (1987) PUBMED 3446578 COMMENT Draft entry and computer-readable sequence for [1] kindly provided by N.Maeda, 23-MAY-1988. FEATURES Location/Qualifiers source 1..3716 /organism="Homo sapiens" /mol_type="genomic DNA" /db_xref="taxon:9606" prim_transcript 12..>3628 /note="CPI mRNA and introns" CDS join(341..568,2079..2192,3311..3394) /note="cysteine-proteinase inhibitor" /codon_start=1 /protein_id="AAA36115.1" /translation="MAQHLSTLLLLLATLAVALAWSPKEEDRIIPGGIYNADLNDEWV QRALHFAISEYNKATKDDYYRRPLRVLRARQQTVGGVNYFFDVEVGRTICTKSQPNLD TCAFHEQPELQKKQLCSFEIYEVPWENRRSLVKSRCQES" exon <341..568 /note="cysteine-proteinase inhibitor" /number=1 intron 569..2078 /note="CPI intron A" exon 2079..2192 /number=1 intron 2193..3310 /note="CPI intron B" exon 3311..>3394 /note="cysteine-proteinase inhibitor" /number=1 BASE COUNT 775 a 1005 c 1093 g 843 t ORIGIN 1 bp upstream of HindIII site. 1 aagcttggga gcactgggga agagaggcat ggctcgggga ggtcgcagtg aggactggag 61 tggggaggag ggggagatgg aggaggaggc ttgggagggg cagggggaac ttaggcagga 121 aaggagcttg tagtagcggg ggagtgaaaa gagagatgga gaaagagggg atgggaagaa 181 agagggagaa agggagtcag gggtggggca tggaggtggg tggggctggg ctgccaaagc 241 aggataaatg cacagctgcc tgctggtctg ggctccctgc ctcaggctct caccctcctc 301 tcctgcagct ccagctttgt gctctgcctc tgaggagacc atggcccagc atctgagtac 361 cctgctgctc ctgctggcca ccctagctgt ggccctggcc tggagcccca aggaggagga 421 taggataatc ccgggtggca tctataacgc agacctcaat gatgagtggg tacagcgtgc 481 ccttcacttc gccatcagcg agtataacaa ggccaccaaa gatgactact acagacgtcc 541 gctgcgggta ctaagagcca ggcaacaggt aggtgctccc tccaccccag gggtcctggg 601 tcccagcctg gtttgttccc caacccccaa gagcattccc agcaaatcaa cactgataca 661 ttcatgatct aatgctcaga ttcattcagc tttccctggc tctccgctga tgcccttcat 721 gcctaagcac gctccccggc cgtgcacaaa ctcagcttcc tttaacctgc agcagccact 781 gtgtctgtac catgactgtg gcatttccca gggtccagca ggtgtggatg gagactgtgc 841 ttactctggg tgggcttgat gctgctcagg atgagatcca ggccatgagg ttcatactcc 901 tccctgagtc ctctctgcag gggccacaca ggaacctggc tcactgttct gcagagccct 961 gcttccccaa gtcacgcccc tgggcacagc cccttatggc tagcggcctt caccctcagg 1021 cccggctgac aaactcccac agcctagggc gctgagtccc tgctggggtg gagcatgcct 1081 gaccctgcct ctaccagctg atgcagttag acctcagcca gatgaggaca gtggtcaccc 1141 agcagagcag aggaggggtc aggtcgggag ggagcttcag cagggcaact gggcccagct 1201 tgacctgcat cccatggcac agcagcaaat agtgacacag tctttagagc tcctccacct 1261 tctcctgaaa ttcaaaggaa tccccaccag ccccgtttct cctcttgcag ctgtcagctg 1321 gggctctctc cctgcatacg agatacactc cctggtgccg tggtccccgc tggcctgcat 1381 ctccctttca agcatgacag taacttggag tgaagcacag ggcattgcag accatcaggc 1441 ccagaagcct attttagaca tgggtaaact gacactcgag ggatctcagc agttcctcct 1501 ggttccaaag agtccctcat cccaggtttc tccacagctc tgccacattg tgtctgggaa 1561 aggccctatg cagggaaagg gttcaattct aatctgcaac tgtaagacac gcaggtgtgc 1621 tgctgacttg agaaatgtat cttgaatctc acacttgaaa tggtggcatc cggacggccc 1681 cattgatcca aaatatctgt gtgtgtgaag catctcattt cctactctga gtgaagtaat 1741 aaatctatgt taaatggagg gaataagatt ttcagaagtt aggtgaaatt ttgtcatcag 1801 acagaacttc ctagaaaaga gtcagtgttc cctcgcccct gagccacaga cagcagaatt 1861 caatgaatcc ttttacccag cacagagaaa gcaatgttta agagcgggta tgaggctcag 1921 caccctgcca gttgacagga agagggggct tgtgtgcctt gtgttgacat gtgggcagct 1981 cacgaagccc ccaagcaagt ccagtgactc agccacagtg aagtgcctgt gagtgcatga 2041 actgatgggg gcgctgtcct gttttctcct gtgtgcagac cgttgggggg gtgaattact 2101 tcttcgacgt agaggtgggc cgaaccatat gtaccaagtc ccagcccaac ttggacacct 2161 gtgccttcca tgaacagcca gaactgcaga aggtacgttc ctgatgcagg tcccgggcca 2221 gtcatgcact gcagaggggt gcgtatgtgt cagcctctgc cctacacatg tttggagggt 2281 gtgtgtgtgt gcaggtgggt atgtggggag tcatgtatgc atggatgtgt acatgttcat 2341 gtacttgtgg aggggtgtgc ctgtaggtgt gcatgtggaa aggtacacgt gtgtacacac 2401 ctgtgccagt gtgtgcaggg aggtggatgg gagcatgtgt gcctgtgcat ggatgtgtgg 2461 ggggtgtatg gggctttgta catagatcca tggggatgag gggtccaagt gagtttacgt 2521 agttgtccat gtatgtgcag atggggtggt gagggaggag ggtgatgtgt ttgttttgct 2581 aggaaggctt taggttggga atggttacta taaggtcaat tctgcctgct ttggagtgtt 2641 gcctgttgga caggaagaag cagctgtgcg gctgtgtgct gggcagggag aaggggctct 2701 gtctaatccc aggctcaggc acctgcatgc agccacagcc acagtgatca gattagtggg 2761 acctagaggc ctgttagctg ggaagccctg gacctgcccg gctcacccaa caccagcctc 2821 tccaaggacc tgctggttct tgtgaggtct ccactcgggg aagagcctga gcactcccct 2881 tgttgccctt gccccatacc ccagctcttt gagggggagt tgccctgccc tggttcttcc 2941 ctctggcccc tcttagtgct ggcctggtgc tggaagtgga aggagctggg ggaactgagc 3001 cgcctcccca tgccctgcac ccttggggct cccgaggcct gcccaggcta ctcctcacag 3061 ggctgtgctg ggacaggaca ctgcaggctg gggtggggtc ccaatgccac ctggtgactt 3121 ggagccttgg gaggggcaat ggaacagtca ctattcattc tagttcagca ctctgggact 3181 cagtaggggt gggtgagggc ccagtgtctc acctccatcc tcctcaccca ggctctgaca 3241 tctcatgcct gggcatcttc ccctttaact gtaacccaca ctgattggcc ctctctcttc 3301 cctttcacag aaacagttgt gctctttcga gatctacgaa gttccctggg agaacagaag 3361 gtccctggtg aaatccaggt gtcaagaatc ctagggatct gtgccaggcc attcgcacca 3421 gccaccaccc actcccaccc cctgtagtgc tcccacccct ggactggtgg cccccaccct 3481 gcgggaggcc tccccatgtg cctgtgccaa gagacagaca gagaaggctg caggagtcct 3541 ttgttgctca gcagggcgct ccgccctccc tccttccttc tcgcttctaa tagcctaggt 3601 acacacaccc ccacctcccg caattaaaca gtagcatcgc ctccctctga gttcttgagt 3661 tcttggctgt ctggggatgt gcacgcaggc agggtttctg cagttccttt atgaag //