LOCUS       BC095408                3743 bp    mRNA    linear   HUM 17-JUL-2006
DEFINITION  Homo sapiens cathepsin B, mRNA (cDNA clone MGC:110901
            IMAGE:30334082), complete cds.
ACCESSION   BC095408
VERSION     BC095408.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3743)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 3743)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (06-MAY-2005) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Dr. Stefan Hansson
            cDNA Library Preparation: Michael Brownstein /  Ted Usdin
            Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA  94305
            Web site:       http://www-shgc.stanford.edu
            Contact:  (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriguez, A., and Myers,
            R. M.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 198 Row: g Column: 5
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 66346646.
FEATURES             Location/Qualifiers
     source          1..3743
                     /db_xref="H-InvDB:HIT000335201"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:110901 IMAGE:30334082"
                     /tissue_type="Placenta, pre-eclamptic"
                     /clone_lib="NIH_MGC_148"
                     /lab_host="DH10B"
                     /note="Vector: pBluescriptR"
     gene            1..3743
                     /gene="CTSB"
                     /gene_synonym="APPS"
                     /gene_synonym="CPSB"
                     /db_xref="GeneID:1508"
                     /db_xref="HGNC:HGNC:2527"
                     /db_xref="MIM:116810"
     CDS             113..1132
                     /gene="CTSB"
                     /gene_synonym="APPS"
                     /gene_synonym="CPSB"
                     /codon_start=1
                     /product="cathepsin B"
                     /protein_id="AAH95408.1"
                     /db_xref="GeneID:1508"
                     /db_xref="HGNC:HGNC:2527"
                     /db_xref="MIM:116810"
                     /translation="MWQLWASLCCLLVLANARSRPSFHPLSDELVNYVNKRNTTWQAG
                     HNFYNVDMSYLKRLCGTFLGGPKPPQRVMFTEDLKLPASFDAREQWPQCPTIKEIRDQ
                     GSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWN
                     FWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPT
                     YKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMG
                     GHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQ
                     YWEKI"
BASE COUNT          919 a          981 c          967 g          876 t
ORIGIN      
        1 ggctgcaggg ctccggcaac cgctccggca acgccaaccg ctccgctgcg cgcaggctgg
       61 gctgcaggct ctcggctgca gcgctgggtg gatctaggat ccggcttcca acatgtggca
      121 gctctgggcc tccctctgct gcctgctggt gttggccaat gcccggagca ggccctcttt
      181 ccatcccctg tcggatgagc tggtcaacta tgtcaacaaa cggaatacca cgtggcaggc
      241 cgggcacaac ttctacaacg tggacatgag ctacttgaag aggctatgtg gtaccttcct
      301 gggtgggccc aagccacccc agagagttat gtttaccgag gacctgaagc tgcctgcaag
      361 cttcgatgca cgggaacaat ggccacagtg tcccaccatc aaagagatca gagaccaggg
      421 ctcctgtggc tcctgctggg ccttcggggc tgtggaagcc atctctgacc ggatctgcat
      481 ccacaccaat gcgcacgtca gcgtggaggt gtcggcggag gacctgctca catgctgtgg
      541 cagcatgtgt ggggacggct gtaatggtgg ctatcctgct gaagcttgga acttctggac
      601 aagaaaaggc ctggtttctg gtggcctcta tgaatcccat gtagggtgca gaccgtactc
      661 catccctccc tgtgagcacc acgtcaacgg ctcccggccc ccatgcacgg gggagggaga
      721 tacccccaag tgtagcaaga tctgtgagcc tggctacagc ccgacctaca aacaggacaa
      781 gcactacgga tacaattcct acagcgtctc caatagcgag aaggacatca tggccgagat
      841 ctacaaaaac ggccccgtgg agggagcttt ctctgtgtat tcggacttcc tgctctacaa
      901 gtcaggagtg taccaacacg tcaccggaga gatgatgggt ggccatgcca tccgcatcct
      961 gggctgggga gtggagaatg gcacacccta ctggctggtt gccaactcct ggaacactga
     1021 ctggggtgac aatggcttct ttaaaatact cagaggacag gatcactgtg gaatcgaatc
     1081 agaagtggtg gctggaattc cacgcaccga tcagtactgg gaaaagatct aatctgccgt
     1141 gggcctgtcg tgccagtcct gggggcgaga tcggggtaga aatgcatttt attctttaag
     1201 ttcacgtaag atacaagttt caggcagggt ctgaaggact ggattggcca aacatcagac
     1261 ctgtcttcca aggagaccaa gtcctggcta catcccagcc tgtggttaca gtgcagacag
     1321 gccatgtgag ccaccgctgc cagcacagag cgtccttccc cctgtagact agtgccgtag
     1381 ggagtacctg ctgccccagc tgactgtggc cccctccgtg atccatccat ctccagggag
     1441 caagacagag acgcaggaat gggaagcgga gttcctaaca ggatgaaagt tcccccatca
     1501 gttcccccag tacctccaag caagtagctt tccacatttg tcacagaaat cagaggagag
     1561 atggtgttgg gagccctttg gagaacgcca gtctcccagg ccccctgcat ctatcgagtt
     1621 tgcaatgtca caacctctct gatgttgtgc tcagcatgat tctttaatag aagttttatt
     1681 ttttcgtgca ctctgctaat catgtgggtg agccagtgga acagcgggag acctgtgcta
     1741 gttttacaga ttgcctccta atgacgcggc tcaaaaggaa accaagtggt caggagttgt
     1801 ttctgaccca ctgatctcta ctaccacaag gaaaatagtt taggagaaac cagcttttac
     1861 tgtttttgaa aaattacagc ttcaccctgt caagttaaca aggaatgcct gtgccaataa
     1921 aaggtttctc caacttgaag tctactctga tgggatctca gatcctttgt cactgcctat
     1981 agacttgtag ctgctgtctc tctttgtccc tgcagagaat cacgtcctgg aactgcatgt
     2041 tcttgcgact cttgggactt catcttaact tctcgctgcc ccagccatgt tttcaaccat
     2101 ggcatccctc ccccaattag ttccctgtca tcctcgtcaa ccttcactgt aagtgcctgg
     2161 taagcttgcc cttgcttaag aactcaaaac atagctgtgc tctatttttt tgttgttgtt
     2221 gtgactgaca gagtgagatt ccgtctccca ggctggagtg cagtggcgcc ttctcggctc
     2281 actgcaacct gcagcctcct agattcaagc gattctcctg cttcagcctt ccgagtagct
     2341 gggatgacag gcactcacca atatgcctgg gtagtttttg tgtttttaag tacatacagg
     2401 atttcaccat gttggccagg ctagtttcga actcctggcc tcaggtggtc tgcctgcctc
     2461 ggcctcccaa ggtgttggga ttacaggcgt gagccactgg gccctgcctg tattttttat
     2521 cagccacaaa tccagcaaca agctgaggat tcagctcata aaacaggctt ggtgtcttgg
     2581 tgatctcaca taaccaagat gctaccccgt ggggaaccac atccccctgg atgccctcca
     2641 gccttggttt gggctggagt cagggcctgg atacagtatt ttgaatttgt atgccactgg
     2701 tttgcattgc tggtcaggaa ctctagtgct ttgcatagcc ctggtttaga aacatgttat
     2761 agcagttctt ggtatagagc aaactagaag aaccagcaat cattccactg tcctgccaag
     2821 gtacacctca gtactcccct tcccaactga agtggtatga ggctagctct ttccaaaagc
     2881 attcaagttt ggcttctgat gtgactcaga atttaggaac cagatgctag atcaaataag
     2941 ctctgaaaat ctgaggaaca ttgtaggaaa ggtttgttaa gcatctctta agtgccatga
     3001 tgagcataac agccggccgt ggtggctcac gcctgtaagc ccagcacttt gggaggccga
     3061 ggtgggaaga tgacaaggtc aggagttcgg gaccagcctg gccaacatgc tgaaacctca
     3121 cctctactga agatacaaga attggctggg catggtggca catgcctgtg atcccagcta
     3181 cttgggaggc tgaggcagga gaatcgcttg agcccgggag gcggaggttg cagtgagccg
     3241 agacagtgcc agtgcactcc agcctcggtg acagcgcaag gctccgtctc aataattaaa
     3301 aaaaaaaaaa aaaaagaggc cgggcgcagt ggctcaagcc tgtaatccca gcactttggg
     3361 aggctgaggc gggcagatca cctgaggtca ggagttttga gatcagcctt ggcaacacgg
     3421 tgaaacccca tctctactaa aaatacaaaa ttagccaagc atgctggcac atgcctgtaa
     3481 tcccagctac tcgtgaggct gaggtacgag aatcgcttga acctgggagg cagaggatgc
     3541 agtgagccga gatcacgcca ttgcactcca gcctggggga caagagtgaa tctgtgtctc
     3601 accaaaaaaa aaaagaaaaa gaaagatgct taacaaaggt taccataagc cacaaattca
     3661 tgaccactta tccttccagt ttcaagtaga atatattcat aacctcaata aagttctccc
     3721 tgctcccaaa aaaaaaaaaa aaa
//