LOCUS       BC002479                1470 bp    mRNA    linear   HUM 11-AUG-2006
DEFINITION  Homo sapiens cathepsin H, mRNA (cDNA clone MGC:1519 IMAGE:3349910),
            complete cds.
ACCESSION   BC002479
VERSION     BC002479.2
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1470)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 1470)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (05-FEB-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Aug 19, 2003 this sequence version replaced BC002479.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 5 Row: l Column: 22
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 23110954.
FEATURES             Location/Qualifiers
     source          1..1470
                     /db_xref="H-InvDB:HIT000030926"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:1519 IMAGE:3349910"
                     /tissue_type="Eye, retinoblastoma"
                     /clone_lib="NIH_MGC_16"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..1470
                     /gene="CTSH"
                     /gene_synonym="MGC1519"
                     /gene_synonym="minichain"
                     /db_xref="GeneID:1512"
                     /db_xref="HGNC:HGNC:2535"
                     /db_xref="MIM:116820"
     CDS             62..1069
                     /gene="CTSH"
                     /gene_synonym="MGC1519"
                     /gene_synonym="minichain"
                     /codon_start=1
                     /product="cathepsin H"
                     /protein_id="AAH02479.1"
                     /db_xref="GeneID:1512"
                     /db_xref="HGNC:HGNC:2535"
                     /db_xref="MIM:116820"
                     /translation="MWATLPLLCAGAWLLGVPVCGAAELSVNSLEKFHFKSWMSKHRK
                     TYSTEEYHHRLQTFASNWRKINAHNNGNHTFKMALNQFSDMSFAEIKHKYLWSEPQNC
                     SATKSNYLRGTGPYPPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGK
                     MLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKDGYCKFQP
                     GKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQDFMMYRTGIYSSTSCHKTP
                     DKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYPIPL
                     V"
BASE COUNT          391 a          399 c          377 g          303 t
ORIGIN      
        1 gacgctctgg gccgccacct ccgcggaccc tgagcgcaag agccaagccg ccagcgctgc
       61 tatgtgggcc acgctgccgc tgctctgcgc cggggcctgg ctcctgggag tccccgtctg
      121 cggtgccgcc gaactgtccg tgaactcctt agagaagttt cacttcaagt catggatgtc
      181 taagcaccgt aagacctaca gtacggagga gtaccaccac aggctgcaga cgtttgccag
      241 caactggagg aagataaacg cccacaacaa tgggaaccac acatttaaaa tggcactgaa
      301 ccaattttca gacatgagct ttgctgaaat aaaacacaag tatctctggt cagagcctca
      361 gaattgctca gccaccaaaa gtaactacct tcgaggtact ggtccctacc caccttccgt
      421 ggactggcgg aaaaaaggaa attttgtctc acctgtgaaa aatcagggtg cctgcggcag
      481 ttgctggact ttctccacca ctggggccct ggagtctgcg atcgccatcg caaccggaaa
      541 gatgctgtcc ttggcggaac agcagctggt ggactgcgcc caggacttca ataatcacgg
      601 ctgccaaggg ggtctcccca gccaggcttt cgagtatatc ctgtacaaca aggggatcat
      661 gggtgaagac acctacccct accagggcaa ggatggttat tgcaagttcc aacctggaaa
      721 ggccatcggc tttgtcaagg atgtagccaa catcacaatc tatgacgagg aagcgatggt
      781 ggaggctgtg gccctctaca accctgtgag ctttgccttt gaggtgactc aggacttcat
      841 gatgtataga acgggcatct actccagtac ttcctgccat aaaactccag ataaagtaaa
      901 ccatgcagta cttgctgttg ggtatggaga aaaaaatggg atcccttact ggatcgtgaa
      961 aaactcttgg ggtccccagt ggggaatgaa cgggtacttc ctcatcgagc gcggaaagaa
     1021 catgtgtggc ctggctgcct gcgcctccta ccccatccct ctggtgtgag ccgtggcagc
     1081 cgcagcgcag actggcggag aaggagagga acgggcagcc tgggcctggg tggaaatcct
     1141 gccctggaag aagttgtggg gagatccact gggaccccca acattctgcc ctcacctctg
     1201 tgcccagcct ggaaacctac agacaaggag gagttccacc atgagctcac ccgtgtctat
     1261 gacgcaaaga tcaccagcca tgtgccttag tgtccttctt aacagactca aaccacatgg
     1321 accacgaata ttctttctgt ccagaagggc tactttccac atatagagct ccagggactg
     1381 tcttttctgt attcgctgtt caataaacat tgagtgagca cctccccaga tggaaaaaaa
     1441 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
//