LOCUS BC002642 1743 bp mRNA linear HUM 15-JUL-2006 DEFINITION Homo sapiens cathepsin S, mRNA (cDNA clone MGC:3886 IMAGE:3610589), complete cds. ACCESSION BC002642 VERSION BC002642.2 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1743) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 1743) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (05-FEB-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Aug 20, 2003 this sequence version replaced BC002642.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 12 Row: c Column: 21 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 23110961. FEATURES Location/Qualifiers source 1..1743 /db_xref="H-InvDB:HIT000031060" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:3886 IMAGE:3610589" /tissue_type="Pancreas, adenocarcinoma" /clone_lib="NIH_MGC_39" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..1743 /gene="CTSS" /gene_synonym="MGC3886" /db_xref="GeneID:1520" /db_xref="HGNC:HGNC:2545" /db_xref="MIM:116845" CDS 63..1058 /gene="CTSS" /gene_synonym="MGC3886" /codon_start=1 /product="cathepsin S" /protein_id="AAH02642.1" /db_xref="GeneID:1520" /db_xref="HGNC:HGNC:2545" /db_xref="MIM:116845" /translation="MKRLVCVLLVCSSAVAQLHKDPTLDHHWHLWKKTYGKQYKEKNE EAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQ RNITYKSNPNWILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLV SLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSK YRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNV NHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYPEI" BASE COUNT 543 a 337 c 390 g 473 t ORIGIN 1 ttagaagaga gcccactaat tcaaggactc ttaccgtggg agcaactgct ggttctatca 61 caatgaaacg gctggtttgt gtgctcttgg tgtgctcctc tgcagtggca cagttgcata 121 aagatcctac cctggatcac cactggcatc tctggaagaa aacctatggc aaacaataca 181 aggaaaagaa tgaagaagca gtacgacgtc tcatctggga aaagaatcta aagtttgtga 241 tgcttcacaa cctggagcat tcaatgggaa tgcactcata cgatctgggc atgaaccacc 301 tgggagacat gaccagtgaa gaagtgatgt ctttgatgag ttccctgaga gttcccagcc 361 agtggcagag aaatatcaca tataagtcaa accctaattg gatattgcct gattctgtgg 421 actggagaga gaaagggtgt gttactgaag tgaaatatca aggttcttgt ggtgcttgct 481 gggctttcag tgctgtgggg gccctggaag cacagctgaa gctgaaaaca ggaaagctgg 541 tgtctctcag tgcccagaac ctggtggatt gctcaactga aaaatatgga aacaaaggct 601 gcaatggtgg cttcatgaca acggctttcc agtacatcat tgataacaag ggcatcgact 661 cagacgcttc ctatccctac aaagccatgg atcagaaatg tcaatatgac tcaaaatatc 721 gtgctgccac atgttcaaag tacactgaac ttccttatgg cagagaagat gtcctgaaag 781 aagctgtggc caataaaggc ccagtgtctg ttggtgtaga tgcgcgtcat ccttctttct 841 tcctctacag aagtggtgtc tactatgaac catcctgtac tcagaatgtg aatcatggtg 901 tacttgtggt tggctatggt gatcttaatg ggaaagaata ctggcttgtg aaaaacagct 961 ggggccacaa ctttggtgaa gaaggatata ttcggatggc aagaaataaa ggaaatcatt 1021 gtgggattgc tagctttccc tcttacccag aaatctagag gatctctcct ttttataaca 1081 aatcaagaaa tatgaagcac tttctcttaa cttaattttt cctgctgtat ccagaagaaa 1141 taattgtgtc atgattaatg tgtatttact gtactaatta gaaaatatag tttgaggccg 1201 ggcacggtgg ctcacgcctg taatcccagt acttgggagg ccaaggcagg catatcaact 1261 tgaggccagg agttaaagag cagcctggct aacatggtga aaccccatct ctactaaaaa 1321 tacaaaaaat tagccgagca cggtggtgca tgcctgtaat cccagctact tgggaggctg 1381 aggcacgaga ttccttgaac ccaagaggtt gaggctatgt tgagctgaga tcacaccact 1441 gtactccagc ctggatgaca gagtggagac tctgtttcaa aaaaacagaa aagaaaatat 1501 agtttgattc ttcatttttt taaatttgca aatctcagga taaagtttgc taagtaaatt 1561 agtaatgtac tatagatata actgtacaaa aattgttcaa cctaaaacaa tctgtaattg 1621 cttattgttt tattgtatac tctttgtctt ttaagacccc taatagcctt ttgtaacttg 1681 atggcttaaa aatacttaat aaatctgcca tttcaaattt caaaaaaaaa aaaaaaaaaa 1741 aaa //