LOCUS       HUMCTSB                 2002 bp    mRNA    linear   HUM 01-NOV-1994
DEFINITION  Human cathepsin B proteinase mRNA, complete cds.
ACCESSION   M14221
VERSION     M14221.1
KEYWORDS    cathepsin B; proteinase.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2002)
  AUTHORS   Chan,S.J., San Segundo,B., McCormick,M.B. and Steiner,D.F.
  TITLE     Nucleotide and predicted amino acid sequences of cloned human and
            mouse preprocathepsin B cDNAs
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 83 (20), 7721-7725 (1986)
   PUBMED   3463996
COMMENT     Original source text: Human kidney, cDNA to mRNA, clone
            lambda-hCB79.
FEATURES             Location/Qualifiers
     source          1..2002
                     /db_xref="H-InvDB:HIT000194314"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /map="8p22"
     gene            1..2002
                     /gene="CTSB"
     mRNA            195..2002
                     /gene="CTSB"
                     /product="preprocathepsin B mRNA"
     CDS             195..1214
                     /gene="CTSB"
                     /note="preprocathepsin B"
                     /codon_start=1
                     /protein_id="AAA52129.1"
                     /db_xref="GDB:G00-119-087"
                     /translation="MWQLWASLCCLLVLANARSRPSFHPVSDELVNYVNKRNTTWQAG
                     HNFYNVDMSYLKRLCGTFLGGPKPPQRVMFTEDLKLPASFDAREQWPQCPTIKEIRDQ
                     GSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWN
                     FWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPT
                     YKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMG
                     GHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQ
                     YWEKI"
BASE COUNT          460 a          553 c          550 g          439 t
ORIGIN      Chromosome 8p22; EcoRI site.
        1 aattccgcgg caaccgctcc ggcaacgcca accgctccgc tgcgcgcagg ctgggctgca
       61 ggctctcggc tgcagcgctg ggctggtgtg cagtggtgcg accacggctc acggcagcct
      121 cagccaccca gatgtaagcg atctggttcc cacctcagcc ttccgagtag tggatctagg
      181 atctggcttc caacatgtgg cagctctggg cctccctctg ctgcctgctg gtgttggcca
      241 atgcccggag caggccctct ttccatcccg tgtcggatga gctggtcaac tatgtcaaca
      301 aacggaatac cacgtggcag gccgggcaca acttctacaa cgtggacatg agctacttga
      361 agaggctatg tggtaccttc ctgggtgggc ccaagccacc ccagagagtt atgtttaccg
      421 aggacctgaa gctgcctgca agcttcgatg cacgggaaca atggccacag tgtcccacca
      481 tcaaagagat cagagaccag ggctcctgtg gctcctgctg ggccttcggg gctgtggaag
      541 ccatctctga ccgcatctgc atccacacca atgcgcacgt cagcgtggag gtgtcggcgg
      601 aggacctgct cacctgctgt ggcagcatgt gtggggacgg ctgtaatggt ggctatcctg
      661 ctgaagcttg gaacttctgg acaagaaaag gcctggtttc tggtggcctc tatgaatccc
      721 atgtagggtg cagaccgtac tccatccctc cctgtgagca ccacgtcaac ggctcccggc
      781 ccccatgcac gggggaggga gataccccca agtgtagcaa gatctgtgag cctggctaca
      841 gcccgaccta caaacaggac aagcactacg gatacaattc ctacagcgtc tccaatagcg
      901 agaaggacat catggccgag atctacaaaa acggccccgt ggagggagct ttctctgtgt
      961 attcggactt cctgctctac aagtcaggag tgtaccaaca cgtcaccgga gagatgatgg
     1021 gtggccatgc catccgcatc ctgggctggg gagtggagaa tggcacaccc tactggctgg
     1081 ttgccaactc ctggaacact gactggggtg acaatggctt ctttaaaata ctcagaggac
     1141 aggatcactg cggaatcgaa tcagaagtgg tggctggaat tccacgcacc gatcagtact
     1201 gggaaaagat ctaatctgcc gtgggcctgt cgtgccagtc ctgggggcga gatcggggta
     1261 gaaagtcatt ttattcttta agttcacgta agatacaagt ttcaggcagg gtctgaagga
     1321 ctggattggc caaagtcctc caaggagacc aagtcctggc tacatcccag cctgtggtta
     1381 cagtgcagac aggccatgtg agccaccgct gccagcacag agcgtccttc cccctgtaga
     1441 ctagtgccgt gggagtacct gctgcccagc tgctgtggcc ccctccgtga tccatccatc
     1501 tccagggagc aagacagaga cgcaggatgg aaagcggagt tcctaacagg atgaaagttc
     1561 ccccatcagt tcccccagta cctccaagca agtagctttc cacatttgtc acagaaatca
     1621 gaggagagat ggtgttggga gccctttgga gaacgccagt ctccaggtcc ccctgcatct
     1681 atcgagtttg caatgtcaca acctctctga tcttgtgctc agcatgattc tttaatagaa
     1741 gttttatttt tcgtgcactc tgctaatcat gtgggtgagc cagtggaaca gcgggagcct
     1801 gtgctggttt gcagattgcc tcctaatgac gcggctcaaa aggaaaccaa gtggtcagga
     1861 gttgtttctg acccactgat ctctactacc acaaggaaaa tagtttagga gaaaccagct
     1921 tttactgttt ttgaaaaatt acagcttcac cctgtcaagt taacaaggaa tgcctgtgcc
     1981 aataaaaggt ttctccaact tg
//