LOCUS       BC002642                1743 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens cathepsin S, mRNA (cDNA clone MGC:3886 IMAGE:3610589),
            complete cds.
ACCESSION   BC002642
VERSION     BC002642.2
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1743)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 1743)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (05-FEB-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Aug 20, 2003 this sequence version replaced BC002642.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 12 Row: c Column: 21
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 23110961.
FEATURES             Location/Qualifiers
     source          1..1743
                     /db_xref="H-InvDB:HIT000031060"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:3886 IMAGE:3610589"
                     /tissue_type="Pancreas, adenocarcinoma"
                     /clone_lib="NIH_MGC_39"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..1743
                     /gene="CTSS"
                     /gene_synonym="MGC3886"
                     /db_xref="GeneID:1520"
                     /db_xref="HGNC:HGNC:2545"
                     /db_xref="MIM:116845"
     CDS             63..1058
                     /gene="CTSS"
                     /gene_synonym="MGC3886"
                     /codon_start=1
                     /product="cathepsin S"
                     /protein_id="AAH02642.1"
                     /db_xref="GeneID:1520"
                     /db_xref="HGNC:HGNC:2545"
                     /db_xref="MIM:116845"
                     /translation="MKRLVCVLLVCSSAVAQLHKDPTLDHHWHLWKKTYGKQYKEKNE
                     EAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQ
                     RNITYKSNPNWILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLV
                     SLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSK
                     YRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNV
                     NHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYPEI"
BASE COUNT          543 a          337 c          390 g          473 t
ORIGIN      
        1 ttagaagaga gcccactaat tcaaggactc ttaccgtggg agcaactgct ggttctatca
       61 caatgaaacg gctggtttgt gtgctcttgg tgtgctcctc tgcagtggca cagttgcata
      121 aagatcctac cctggatcac cactggcatc tctggaagaa aacctatggc aaacaataca
      181 aggaaaagaa tgaagaagca gtacgacgtc tcatctggga aaagaatcta aagtttgtga
      241 tgcttcacaa cctggagcat tcaatgggaa tgcactcata cgatctgggc atgaaccacc
      301 tgggagacat gaccagtgaa gaagtgatgt ctttgatgag ttccctgaga gttcccagcc
      361 agtggcagag aaatatcaca tataagtcaa accctaattg gatattgcct gattctgtgg
      421 actggagaga gaaagggtgt gttactgaag tgaaatatca aggttcttgt ggtgcttgct
      481 gggctttcag tgctgtgggg gccctggaag cacagctgaa gctgaaaaca ggaaagctgg
      541 tgtctctcag tgcccagaac ctggtggatt gctcaactga aaaatatgga aacaaaggct
      601 gcaatggtgg cttcatgaca acggctttcc agtacatcat tgataacaag ggcatcgact
      661 cagacgcttc ctatccctac aaagccatgg atcagaaatg tcaatatgac tcaaaatatc
      721 gtgctgccac atgttcaaag tacactgaac ttccttatgg cagagaagat gtcctgaaag
      781 aagctgtggc caataaaggc ccagtgtctg ttggtgtaga tgcgcgtcat ccttctttct
      841 tcctctacag aagtggtgtc tactatgaac catcctgtac tcagaatgtg aatcatggtg
      901 tacttgtggt tggctatggt gatcttaatg ggaaagaata ctggcttgtg aaaaacagct
      961 ggggccacaa ctttggtgaa gaaggatata ttcggatggc aagaaataaa ggaaatcatt
     1021 gtgggattgc tagctttccc tcttacccag aaatctagag gatctctcct ttttataaca
     1081 aatcaagaaa tatgaagcac tttctcttaa cttaattttt cctgctgtat ccagaagaaa
     1141 taattgtgtc atgattaatg tgtatttact gtactaatta gaaaatatag tttgaggccg
     1201 ggcacggtgg ctcacgcctg taatcccagt acttgggagg ccaaggcagg catatcaact
     1261 tgaggccagg agttaaagag cagcctggct aacatggtga aaccccatct ctactaaaaa
     1321 tacaaaaaat tagccgagca cggtggtgca tgcctgtaat cccagctact tgggaggctg
     1381 aggcacgaga ttccttgaac ccaagaggtt gaggctatgt tgagctgaga tcacaccact
     1441 gtactccagc ctggatgaca gagtggagac tctgtttcaa aaaaacagaa aagaaaatat
     1501 agtttgattc ttcatttttt taaatttgca aatctcagga taaagtttgc taagtaaatt
     1561 agtaatgtac tatagatata actgtacaaa aattgttcaa cctaaaacaa tctgtaattg
     1621 cttattgttt tattgtatac tctttgtctt ttaagacccc taatagcctt ttgtaacttg
     1681 atggcttaaa aatacttaat aaatctgcca tttcaaattt caaaaaaaaa aaaaaaaaaa
     1741 aaa
//