LOCUS       BT020155                1239 bp    mRNA    linear   HUM 28-OCT-2004
DEFINITION  Homo sapiens cathepsin D (lysosomal aspartyl protease) mRNA,
            complete cds.
ACCESSION   BT020155
VERSION     BT020155.1
KEYWORDS    FLI_CDNA.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1239)
  AUTHORS   Kalnine,N., Chen,X., Rolfs,A., Halleck,A., Hines,L., Eisenstein,S.,
            Koundinya,M., Raphael,J., Moreira,D., Kelley,T., LaBaer,J., Lin,Y.,
            Phelan,M. and Farmer,A.
  TITLE     Cloning of human full-length CDSs in BD Creator(TM) System Donor
            vector
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 1239)
  AUTHORS   Kalnine,N., Chen,X., Rolfs,A., Halleck,A., Hines,L., Eisenstein,S.,
            Koundinya,M., Raphael,J., Moreira,D., Kelley,T., LaBaer,J., Lin,Y.,
            Phelan,M. and Farmer,A.
  TITLE     Direct Submission
  JOURNAL   Submitted (28-OCT-2004) BD Biosciences Clontech, 1020 East Meadow
            circle, Palo Alto, California 94303, USA
COMMENT     This CDS clone is a part of a collection of human full length
            expression clones generated by BD Biosciences Clontech and the
            Harvard Institute of Proteomics. Each CDS has been cloned in two
            forms: with and without stop-codon (to allow fusion with C-terminal
            tag). The CDS has been  directionally cloned using BD In-Fusion(TM)
            cloning system between the SalI  and HindIII sites of the pDNR-DUAL
            vector. Additional sequences in the clone:  'ACC' after SalI site
            and before 'ATG' to provide Kozak consensus sequence; 'GG' after
            last codon and before HindIII site to maintain reading frame.
            Clone distribution: http://bioinfo.clontech.com/orfclones.
FEATURES             Location/Qualifiers
     source          1..1239
                     /db_xref="H-InvDB:HIT000266838"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="GH00426X1.1"
                     /clone_lib="BD Creator(TM) CDS Library derived from MGC
                     collection"
                     /lab_host="DH5alpha T1 resistant"
                     /note="Vector: pDNR-Dual"
     CDS             1..1239
                     /codon_start=1
                     /product="cathepsin D (lysosomal aspartyl protease)"
                     /protein_id="AAV38957.1"
                     /translation="MQPSSLLPLALCLLAAPASALVRIPLHKFTSIRRTMSEVGGSVE
                     DLIAKGPVSKYSQAVPAVTEGPIPEVLKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSN
                     LWVPSIHCKLLDIACWIHHKYNSDKSSTYVKNGTSFDIHYGSGSLSGYLSQDTVSVPC
                     QSASSASALGGVKVERQVFGEATKQPGITFIAAKFDGILGMAYPRISVNNVLPVFDNL
                     MQQKLVDQNIFSFYLSRDPDAQPGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHLDQ
                     VEVASGLTLCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVPLIQGEYMIPCEKVST
                     LPAITLKLGGKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDIPPPSGPLWILGDVFIGR
                     YYTVFDRDNNRVGFAEAARL"
BASE COUNT          236 a          404 c          367 g          232 t
ORIGIN      
        1 atgcagccct ccagccttct gccgctcgcc ctctgcctgc tggctgcacc cgcctccgcg
       61 ctcgtcagga tcccgctgca caagttcacg tccatccgcc ggaccatgtc ggaggttggg
      121 ggctctgtgg aggacctgat tgccaaaggc cccgtctcaa agtactccca ggcggtgcca
      181 gccgtgaccg aggggcccat tcccgaggtg ctcaagaact acatggacgc ccagtactac
      241 ggggagattg gcatcgggac gcccccccag tgcttcacag tcgtcttcga cacgggctcc
      301 tccaacctgt gggtcccctc catccactgc aaactgctgg acatcgcttg ctggatccac
      361 cacaagtaca acagcgacaa gtccagcacc tacgtgaaga atggtacctc gtttgacatc
      421 cactatggct cgggcagcct ctccgggtac ctgagccagg acactgtgtc ggtgccctgc
      481 cagtcagcgt cgtcagcctc tgccctgggc ggtgtcaaag tggagaggca ggtctttggg
      541 gaggccacca agcagccagg catcaccttc atcgcagcca agttcgatgg catcctgggc
      601 atggcctacc cccgcatctc cgtcaacaac gtgctgcccg tcttcgacaa cctgatgcag
      661 cagaagctgg tggaccagaa catcttctcc ttctacctga gcagggaccc agatgcgcag
      721 cctgggggtg agctgatgct gggtggcaca gactccaagt attacaaggg ttctctgtcc
      781 tacctgaatg tcacccgcaa ggcctactgg caggtccacc tggaccaggt ggaggtggcc
      841 agcgggctga ccctgtgcaa ggagggctgt gaggccattg tggacacagg cacttccctc
      901 atggtgggcc cggtggatga ggtgcgcgag ctgcagaagg ccatcggggc cgtgccgctg
      961 attcagggcg agtacatgat cccctgtgag aaggtgtcca ccctgcccgc gatcacactg
     1021 aagctgggag gcaaaggcta caagctgtcc ccagaggact acacgctcaa ggtgtcgcag
     1081 gccgggaaga ccctctgcct gagcggcttc atgggcatgg acatcccgcc acccagcggg
     1141 ccactctgga tcctgggcga cgtcttcatc ggccgctact acactgtgtt tgaccgtgac
     1201 aacaacaggg tgggcttcgc cgaggctgcc cgcctctag
//