CR456947

LOCUS       CR456947                1239 bp    mRNA    linear   HUM 16-OCT-2008
DEFINITION  Homo sapiens full open reading frame cDNA clone RZPDo834E1218D for
            gene CTSD, cathepsin D (lysosomal aspartyl protease); complete cds,
            incl. stopcodon.
ACCESSION   CR456947
VERSION     CR456947.1
KEYWORDS    Full ORF shuttle clone, Gateway(TM), complete cds.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1239)
  AUTHORS   Ebert L., Schick M., Neubert P., Schatten R., Henze S., Korn B.
  TITLE     Cloning of human full open reading frames in Gateway(TM) system
            entry vector (pDONR201)
  JOURNAL   Unpublished.
REFERENCE   2  (bases 1 to 1239)
  AUTHORS   Ebert L., Schick M., Neubert P., Schatten R., Henze S., Korn B.
  JOURNAL   Submitted (03-JUN-2004) to the INSDC. RZPD Deutsches
            Ressourcenzentrum fuer Genomforschung GmbH, Im Neuenheimer Feld
            580, D-69120 Heidelberg, Germany
COMMENT     RZPD; RZPDo834E1218D, ORFNo 1349
            www.rzpd.de/cgi-bin/products/cl.cgi?CloneID=RZPDo834E1218D
            RZPDLIB; Human Full ORF Clones Gateway(TM) - RZPD (kan-resist.)
            RZPD LIB No. 834
            www.rzpd.de/cgi-bin/products/showLib.pl.cgi/response?libNo=834
            www.rzpd.de/products/orfclones/
            Contact: Inge Arlart
            RZPD Deutsches Ressourcenzentrum fuer Genomforschung GmbH,
            Heubnerweg 6, D-14059 Berlin, Germany
            Tel: +49 30 32639 100
            Fax: +49 30 32639 111
            www.rzpd.de
            This clone is available from RZPD;
            contact RZPD (customer.service@rzpd.de) for further information.
            
            This CDS clone is a part of a collection of human full length
            expression clones generated by RZPD.
            This CDS has been cloned incl. stopcodon.
            The CDS has been inserted into pDONR201 via a BP Clonase(TM)
            reaction.
            Additional sequence has been added in front of the start codon:
            att..AAAAAA GCA GGC (ATG).
            The last base of the last coding triplett has been changed to T,
            which might lead to an amino acid change at the C terminus of the
            polypeptide.
            The stop codon has been set to TAA followed by
            TTAACCCAGCTTTCTT..att.
            Compared to the reference sequence NM_001909 we did not find any
            amino acid exchanges.
            Clone distribution: http://www.rzpd.de/products/orfclones/
FEATURES             Location/Qualifiers
     source          1..1239
                     /db_xref="H-InvDB:HIT000267797"
                     /organism="Homo sapiens"
                     /lab_host="DH10B"
                     /mol_type="mRNA"
                     /clone_lib="Human Full ORF Clones Gateway(TM) - RZPD"
                     /clone="RZPDo834E1218D"
                     /note="Vector: pDONR201, Site_1: attP1; Site_2: attP2"
                     /db_xref="taxon:9606"
     CDS             1..1239
                     /codon_start=1
                     /gene="CTSD"
                     /db_xref="GOA:P07339"
                     /db_xref="H-InvDB:HIT000267797.12"
                     /db_xref="HGNC:HGNC:2529"
                     /db_xref="InterPro:IPR001461"
                     /db_xref="InterPro:IPR001969"
                     /db_xref="InterPro:IPR012848"
                     /db_xref="InterPro:IPR021109"
                     /db_xref="InterPro:IPR033121"
                     /db_xref="InterPro:IPR033144"
                     /db_xref="PDB:1LYA"
                     /db_xref="PDB:1LYB"
                     /db_xref="PDB:1LYW"
                     /db_xref="PDB:4OBZ"
                     /db_xref="PDB:4OC6"
                     /db_xref="PDB:4OD9"
                     /db_xref="UniProtKB/Swiss-Prot:P07339"
                     /protein_id="CAG33228.1"
                     /translation="MQPSSLLPLALCLLAAPASALVRIPLHKFTSIRRTMSEVGGSVE
                     DLIAKGPVSKYSQAVPAVTEGPIPEVLKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSN
                     LWVPSIHCKLLDIACWIHHKYNSDKSSTYVKNGTSFDIHYGSGSLSGYLSQDTVSVPC
                     QSASSASALGGVKVERQVFGEATKQPGITFIAAKFDGILGMAYPRISVNNVLPVFDNL
                     MQQKLVDQNIFSFYLSRDPDAQPGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHLDQ
                     VEVASGLTLCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVPLIQGEYMIPCEKVST
                     LPAITLKLGGKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDIPPPSGPLWILGDVFIGR
                     YYTVFDRDNNRVGFAEAARL"
BASE COUNT          237 a          403 c          366 g          233 t
ORIGIN      
        1 atgcagccct ccagccttct gccgctcgcc ctctgcctgc tggctgcacc cgcctccgcg
       61 ctcgtcagga tcccgctgca caagttcacg tccatccgcc ggaccatgtc ggaggttggg
      121 ggctctgtgg aggacctgat tgccaaaggc cccgtctcaa agtactccca ggcggtgcca
      181 gccgtgaccg aggggcccat tcccgaggtg ctcaagaact acatggacgc ccagtactac
      241 ggggagattg gcatcgggac gcccccccag tgcttcacag tcgtcttcga cacgggctcc
      301 tccaacctgt gggtcccctc catccactgc aaactgctgg acatcgcttg ctggatccac
      361 cacaagtaca acagcgacaa gtccagcacc tacgtgaaga atggtacctc gtttgacatc
      421 cactatggct cgggcagcct ctccgggtac ctgagccagg acactgtgtc ggtgccctgc
      481 cagtcagcgt cgtcagcctc tgccctgggc ggtgtcaaag tggagaggca ggtctttggg
      541 gaggccacca agcagccagg catcaccttc atcgcagcca agttcgatgg catcctgggc
      601 atggcctacc cccgcatctc cgtcaacaac gtgctgcccg tcttcgacaa cctgatgcag
      661 cagaagctgg tggaccagaa catcttctcc ttctacctga gcagggaccc agatgcgcag
      721 cctgggggtg agctgatgct gggtggcaca gactccaagt attacaaggg ttctctgtcc
      781 tacctgaatg tcacccgcaa ggcctactgg caggtccacc tggaccaggt ggaggtggcc
      841 agcgggctga ccctgtgcaa ggagggctgt gaggccattg tggacacagg cacttccctc
      901 atggtgggcc cggtggatga ggtgcgcgag ctgcagaagg ccatcggggc cgtgccgctg
      961 attcagggcg agtacatgat cccctgtgag aaggtgtcca ccctgcccgc gatcacactg
     1021 aagctgggag gcaaaggcta caagctgtcc ccagaggact acacgctcaa ggtgtcgcag
     1081 gccgggaaga ccctctgcct gagcggcttc atgggcatgg acatcccgcc acccagcggg
     1141 ccactctgga tcctgggcga cgtcttcatc ggccgctact acactgtgtt tgaccgtgac
     1201 aacaacaggg tgggcttcgc cgaggctgcc cgcctttaa
//