LOCUS BC002642 1743 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens cathepsin S, mRNA (cDNA clone MGC:3886 IMAGE:3610589),
complete cds.
ACCESSION BC002642
VERSION BC002642.2
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 1743)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 1743)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (05-FEB-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Aug 20, 2003 this sequence version replaced BC002642.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: National Institutes of Health Intramural
Sequencing Center (NISC),
Gaithersburg, Maryland;
Web site: http://www.nisc.nih.gov/
Contact: nisc_mgc@nhgri.nih.gov
Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
Young,A., Zhang,L.-H. and Green,E.D.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 12 Row: c Column: 21
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 23110961.
FEATURES Location/Qualifiers
source 1..1743
/db_xref="H-InvDB:HIT000031060"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:3886 IMAGE:3610589"
/tissue_type="Pancreas, adenocarcinoma"
/clone_lib="NIH_MGC_39"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..1743
/gene="CTSS"
/gene_synonym="MGC3886"
/db_xref="GeneID:1520"
/db_xref="HGNC:HGNC:2545"
/db_xref="MIM:116845"
CDS 63..1058
/gene="CTSS"
/gene_synonym="MGC3886"
/codon_start=1
/product="cathepsin S"
/protein_id="AAH02642.1"
/db_xref="GeneID:1520"
/db_xref="HGNC:HGNC:2545"
/db_xref="MIM:116845"
/translation="MKRLVCVLLVCSSAVAQLHKDPTLDHHWHLWKKTYGKQYKEKNE
EAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQ
RNITYKSNPNWILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLV
SLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSK
YRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNV
NHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYPEI"
BASE COUNT 543 a 337 c 390 g 473 t
ORIGIN
1 ttagaagaga gcccactaat tcaaggactc ttaccgtggg agcaactgct ggttctatca
61 caatgaaacg gctggtttgt gtgctcttgg tgtgctcctc tgcagtggca cagttgcata
121 aagatcctac cctggatcac cactggcatc tctggaagaa aacctatggc aaacaataca
181 aggaaaagaa tgaagaagca gtacgacgtc tcatctggga aaagaatcta aagtttgtga
241 tgcttcacaa cctggagcat tcaatgggaa tgcactcata cgatctgggc atgaaccacc
301 tgggagacat gaccagtgaa gaagtgatgt ctttgatgag ttccctgaga gttcccagcc
361 agtggcagag aaatatcaca tataagtcaa accctaattg gatattgcct gattctgtgg
421 actggagaga gaaagggtgt gttactgaag tgaaatatca aggttcttgt ggtgcttgct
481 gggctttcag tgctgtgggg gccctggaag cacagctgaa gctgaaaaca ggaaagctgg
541 tgtctctcag tgcccagaac ctggtggatt gctcaactga aaaatatgga aacaaaggct
601 gcaatggtgg cttcatgaca acggctttcc agtacatcat tgataacaag ggcatcgact
661 cagacgcttc ctatccctac aaagccatgg atcagaaatg tcaatatgac tcaaaatatc
721 gtgctgccac atgttcaaag tacactgaac ttccttatgg cagagaagat gtcctgaaag
781 aagctgtggc caataaaggc ccagtgtctg ttggtgtaga tgcgcgtcat ccttctttct
841 tcctctacag aagtggtgtc tactatgaac catcctgtac tcagaatgtg aatcatggtg
901 tacttgtggt tggctatggt gatcttaatg ggaaagaata ctggcttgtg aaaaacagct
961 ggggccacaa ctttggtgaa gaaggatata ttcggatggc aagaaataaa ggaaatcatt
1021 gtgggattgc tagctttccc tcttacccag aaatctagag gatctctcct ttttataaca
1081 aatcaagaaa tatgaagcac tttctcttaa cttaattttt cctgctgtat ccagaagaaa
1141 taattgtgtc atgattaatg tgtatttact gtactaatta gaaaatatag tttgaggccg
1201 ggcacggtgg ctcacgcctg taatcccagt acttgggagg ccaaggcagg catatcaact
1261 tgaggccagg agttaaagag cagcctggct aacatggtga aaccccatct ctactaaaaa
1321 tacaaaaaat tagccgagca cggtggtgca tgcctgtaat cccagctact tgggaggctg
1381 aggcacgaga ttccttgaac ccaagaggtt gaggctatgt tgagctgaga tcacaccact
1441 gtactccagc ctggatgaca gagtggagac tctgtttcaa aaaaacagaa aagaaaatat
1501 agtttgattc ttcatttttt taaatttgca aatctcagga taaagtttgc taagtaaatt
1561 agtaatgtac tatagatata actgtacaaa aattgttcaa cctaaaacaa tctgtaattg
1621 cttattgttt tattgtatac tctttgtctt ttaagacccc taatagcctt ttgtaacttg
1681 atggcttaaa aatacttaat aaatctgcca tttcaaattt caaaaaaaaa aaaaaaaaaa
1741 aaa
//