LOCUS BC016320 2117 bp mRNA linear HUM 08-DEC-2006 DEFINITION Homo sapiens cathepsin D, mRNA (cDNA clone MGC:2311 IMAGE:3506977), complete cds. ACCESSION BC016320 VERSION BC016320.2 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2117) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2117) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (29-OCT-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Oct 8, 2003 this sequence version replaced BC016320.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: DCTD/DTP cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Institute for Systems Biology http://www.systemsbiology.org contact: amadan@systemsbiology.org Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 7 Row: o Column: 14 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 23110949. FEATURES Location/Qualifiers source 1..2117 /db_xref="H-InvDB:HIT000037529" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:2311 IMAGE:3506977" /tissue_type="Kidney, renal cell adenocarcinoma" /clone_lib="NIH_MGC_14" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..2117 /gene="CTSD" /gene_synonym="CLN10" /gene_synonym="MGC2311" /db_xref="GeneID:1509" /db_xref="HGNC:HGNC:2529" /db_xref="MIM:116840" CDS 46..1284 /gene="CTSD" /gene_synonym="CLN10" /gene_synonym="MGC2311" /codon_start=1 /product="cathepsin D" /protein_id="AAH16320.1" /db_xref="GeneID:1509" /db_xref="HGNC:HGNC:2529" /db_xref="MIM:116840" /translation="MQPSSLLPLALCLLAAPASALVRIPLHKFTSIRRTMSEVGGSVE DLIAKGPVSKYSQAVPAVTEGPIPEVLKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSN LWVPSIHCKLLDIACWIHHKYNSDKSSTYVKNGTSFDIHYGSGSLSGYLSQDTVSVPC QSASSASALGGVKVERQVFGEATKQPGITFIAAKFDGILGMAYPRISVNNVLPVFDNL MQQKLVDQNIFSFYLSRDPDAQPGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHLDQ VEVASGLTLCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVPLIQGEYMIPCEKVST LPAITLKLGGKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDIPPPSGPLWILGDVFIGR YYTVFDRDNNRVGFAEAARL" BASE COUNT 440 a 688 c 605 g 384 t ORIGIN 1 aagcgcacgg cctcggcgac cctctccgac ccggccgccg ccgccatgca gccctccagc 61 cttctgccgc tcgccctctg cctgctggct gcacccgcct ccgcgctcgt caggatcccg 121 ctgcacaagt tcacgtccat ccgccggacc atgtcggagg ttgggggctc tgtggaggac 181 ctgattgcca aaggccccgt ctcaaagtac tcccaggcgg tgccagccgt gaccgagggg 241 cccattcccg aggtgctcaa gaactacatg gacgcccagt actacgggga gattggcatc 301 gggacgcccc cccagtgctt cacagtcgtc ttcgacacgg gctcctccaa cctgtgggtc 361 ccctccatcc actgcaaact gctggacatc gcttgctgga tccaccacaa gtacaacagc 421 gacaagtcca gcacctacgt gaagaatggt acctcgtttg acatccacta tggctcgggc 481 agcctctccg ggtacctgag ccaggacact gtgtcggtgc cctgccagtc agcgtcgtca 541 gcctctgccc tgggcggtgt caaagtggag aggcaggtct ttggggaggc caccaagcag 601 ccaggcatca ccttcatcgc agccaagttc gatggcatcc tgggcatggc ctacccccgc 661 atctccgtca acaacgtgct gcccgtcttc gacaacctga tgcagcagaa gctggtggac 721 cagaacatct tctccttcta cctgagcagg gacccagatg cgcagcctgg gggtgagctg 781 atgctgggtg gcacagactc caagtattac aagggttctc tgtcctacct gaatgtcacc 841 cgcaaggcct actggcaggt ccacctggac caggtggagg tggccagcgg gctgaccctg 901 tgcaaggagg gctgtgaggc cattgtggac acaggcactt ccctcatggt gggcccggtg 961 gatgaggtgc gcgagctgca gaaggccatc ggggccgtgc cgctgattca gggcgagtac 1021 atgatcccct gtgagaaggt gtccaccctg cccgcgatca cactgaagct gggaggcaaa 1081 ggctacaagc tgtccccaga ggactacacg ctcaaggtgt cgcaggccgg gaagaccctc 1141 tgcctgagcg gcttcatggg catggacatc ccgccaccca gcgggccact ctggatcctg 1201 ggcgacgtct tcatcggccg ctactacact gtgtttgacc gtgacaacaa cagggtgggc 1261 ttcgccgagg ctgcccgcct ctagttccca aggcgtccgc gcgccagcac agaaacagag 1321 gagagtccca gagcaggagg cccctggccc agcggcccct cccacacaca cccacacact 1381 cgcccgccca ctgtcctggg cgccctggaa gccggcggcc caagcccgac ttgctgtttt 1441 gttctgtggt tttcccctcc ctgggttcag aaatgctgcc tgcctgtctg tctctccatc 1501 tgtttggtgg gggtagagct gatccagagc acagatctgt ttcgtgcatt ggaagacccc 1561 acccaagctt ggcagccgag ctcgtgtatc ctggggctcc cttcatctcc agggagtccc 1621 ctccccggcc ctaccagcgc ccgctgggct gagcccctac cccacaccag gccgtcctcc 1681 cgggccctcc cttggaaacc tgccctgcct gagggcccct ctgcccagct tgggcccagc 1741 tgggctctgc caccctacct gttcagtgtc ccgggcccgt tgaggatgag gccgctagag 1801 gcctgaggat gagctggaag gagtgagagg ggacaaaacc caccttgttg gagcctgcag 1861 ggtggtgctg ggactgagcc agtcccaggg gcatgtattg gcctggaggt ggggttggga 1921 ttgggggctg gtgccagcct tcctctgcag ctgacctctg ttgtcctccc cttgggcggc 1981 tgagagcccc agctgacatg gaaatacagt tgttggcctc cggcctcccc tcaaaaaaaa 2041 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2101 aaaaaaaaaa aaaaaaa //