LOCUS BT020155 1239 bp mRNA linear HUM 28-OCT-2004 DEFINITION Homo sapiens cathepsin D (lysosomal aspartyl protease) mRNA, complete cds. ACCESSION BT020155 VERSION BT020155.1 KEYWORDS FLI_CDNA. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1239) AUTHORS Kalnine,N., Chen,X., Rolfs,A., Halleck,A., Hines,L., Eisenstein,S., Koundinya,M., Raphael,J., Moreira,D., Kelley,T., LaBaer,J., Lin,Y., Phelan,M. and Farmer,A. TITLE Cloning of human full-length CDSs in BD Creator(TM) System Donor vector JOURNAL Unpublished REFERENCE 2 (bases 1 to 1239) AUTHORS Kalnine,N., Chen,X., Rolfs,A., Halleck,A., Hines,L., Eisenstein,S., Koundinya,M., Raphael,J., Moreira,D., Kelley,T., LaBaer,J., Lin,Y., Phelan,M. and Farmer,A. TITLE Direct Submission JOURNAL Submitted (28-OCT-2004) BD Biosciences Clontech, 1020 East Meadow circle, Palo Alto, California 94303, USA COMMENT This CDS clone is a part of a collection of human full length expression clones generated by BD Biosciences Clontech and the Harvard Institute of Proteomics. Each CDS has been cloned in two forms: with and without stop-codon (to allow fusion with C-terminal tag). The CDS has been directionally cloned using BD In-Fusion(TM) cloning system between the SalI and HindIII sites of the pDNR-DUAL vector. Additional sequences in the clone: 'ACC' after SalI site and before 'ATG' to provide Kozak consensus sequence; 'GG' after last codon and before HindIII site to maintain reading frame. Clone distribution: http://bioinfo.clontech.com/orfclones. FEATURES Location/Qualifiers source 1..1239 /db_xref="H-InvDB:HIT000266838" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="GH00426X1.1" /clone_lib="BD Creator(TM) CDS Library derived from MGC collection" /lab_host="DH5alpha T1 resistant" /note="Vector: pDNR-Dual" CDS 1..1239 /codon_start=1 /product="cathepsin D (lysosomal aspartyl protease)" /protein_id="AAV38957.1" /translation="MQPSSLLPLALCLLAAPASALVRIPLHKFTSIRRTMSEVGGSVE DLIAKGPVSKYSQAVPAVTEGPIPEVLKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSN LWVPSIHCKLLDIACWIHHKYNSDKSSTYVKNGTSFDIHYGSGSLSGYLSQDTVSVPC QSASSASALGGVKVERQVFGEATKQPGITFIAAKFDGILGMAYPRISVNNVLPVFDNL MQQKLVDQNIFSFYLSRDPDAQPGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHLDQ VEVASGLTLCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVPLIQGEYMIPCEKVST LPAITLKLGGKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDIPPPSGPLWILGDVFIGR YYTVFDRDNNRVGFAEAARL" BASE COUNT 236 a 404 c 367 g 232 t ORIGIN 1 atgcagccct ccagccttct gccgctcgcc ctctgcctgc tggctgcacc cgcctccgcg 61 ctcgtcagga tcccgctgca caagttcacg tccatccgcc ggaccatgtc ggaggttggg 121 ggctctgtgg aggacctgat tgccaaaggc cccgtctcaa agtactccca ggcggtgcca 181 gccgtgaccg aggggcccat tcccgaggtg ctcaagaact acatggacgc ccagtactac 241 ggggagattg gcatcgggac gcccccccag tgcttcacag tcgtcttcga cacgggctcc 301 tccaacctgt gggtcccctc catccactgc aaactgctgg acatcgcttg ctggatccac 361 cacaagtaca acagcgacaa gtccagcacc tacgtgaaga atggtacctc gtttgacatc 421 cactatggct cgggcagcct ctccgggtac ctgagccagg acactgtgtc ggtgccctgc 481 cagtcagcgt cgtcagcctc tgccctgggc ggtgtcaaag tggagaggca ggtctttggg 541 gaggccacca agcagccagg catcaccttc atcgcagcca agttcgatgg catcctgggc 601 atggcctacc cccgcatctc cgtcaacaac gtgctgcccg tcttcgacaa cctgatgcag 661 cagaagctgg tggaccagaa catcttctcc ttctacctga gcagggaccc agatgcgcag 721 cctgggggtg agctgatgct gggtggcaca gactccaagt attacaaggg ttctctgtcc 781 tacctgaatg tcacccgcaa ggcctactgg caggtccacc tggaccaggt ggaggtggcc 841 agcgggctga ccctgtgcaa ggagggctgt gaggccattg tggacacagg cacttccctc 901 atggtgggcc cggtggatga ggtgcgcgag ctgcagaagg ccatcggggc cgtgccgctg 961 attcagggcg agtacatgat cccctgtgag aaggtgtcca ccctgcccgc gatcacactg 1021 aagctgggag gcaaaggcta caagctgtcc ccagaggact acacgctcaa ggtgtcgcag 1081 gccgggaaga ccctctgcct gagcggcttc atgggcatgg acatcccgcc acccagcggg 1141 ccactctgga tcctgggcga cgtcttcatc ggccgctact acactgtgtt tgaccgtgac 1201 aacaacaggg tgggcttcgc cgaggctgcc cgcctctag //