LOCUS BC095408 3743 bp mRNA linear HUM 17-JUL-2006 DEFINITION Homo sapiens cathepsin B, mRNA (cDNA clone MGC:110901 IMAGE:30334082), complete cds. ACCESSION BC095408 VERSION BC095408.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3743) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 3743) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (06-MAY-2005) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Dr. Stefan Hansson cDNA Library Preparation: Michael Brownstein / Ted Usdin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Sequencing Group at the Stanford Human Genome Center, Stanford University School of Medicine, Stanford, CA 94305 Web site: http://www-shgc.stanford.edu Contact: (Dickson, Mark) mcd@paxil.stanford.edu Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers, R. M. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 198 Row: g Column: 5 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 66346646. FEATURES Location/Qualifiers source 1..3743 /db_xref="H-InvDB:HIT000335201" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:110901 IMAGE:30334082" /tissue_type="Placenta, pre-eclamptic" /clone_lib="NIH_MGC_148" /lab_host="DH10B" /note="Vector: pBluescriptR" gene 1..3743 /gene="CTSB" /gene_synonym="APPS" /gene_synonym="CPSB" /db_xref="GeneID:1508" /db_xref="HGNC:HGNC:2527" /db_xref="MIM:116810" CDS 113..1132 /gene="CTSB" /gene_synonym="APPS" /gene_synonym="CPSB" /codon_start=1 /product="cathepsin B" /protein_id="AAH95408.1" /db_xref="GeneID:1508" /db_xref="HGNC:HGNC:2527" /db_xref="MIM:116810" /translation="MWQLWASLCCLLVLANARSRPSFHPLSDELVNYVNKRNTTWQAG HNFYNVDMSYLKRLCGTFLGGPKPPQRVMFTEDLKLPASFDAREQWPQCPTIKEIRDQ GSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWN FWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPT YKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMG GHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQ YWEKI" BASE COUNT 919 a 981 c 967 g 876 t ORIGIN 1 ggctgcaggg ctccggcaac cgctccggca acgccaaccg ctccgctgcg cgcaggctgg 61 gctgcaggct ctcggctgca gcgctgggtg gatctaggat ccggcttcca acatgtggca 121 gctctgggcc tccctctgct gcctgctggt gttggccaat gcccggagca ggccctcttt 181 ccatcccctg tcggatgagc tggtcaacta tgtcaacaaa cggaatacca cgtggcaggc 241 cgggcacaac ttctacaacg tggacatgag ctacttgaag aggctatgtg gtaccttcct 301 gggtgggccc aagccacccc agagagttat gtttaccgag gacctgaagc tgcctgcaag 361 cttcgatgca cgggaacaat ggccacagtg tcccaccatc aaagagatca gagaccaggg 421 ctcctgtggc tcctgctggg ccttcggggc tgtggaagcc atctctgacc ggatctgcat 481 ccacaccaat gcgcacgtca gcgtggaggt gtcggcggag gacctgctca catgctgtgg 541 cagcatgtgt ggggacggct gtaatggtgg ctatcctgct gaagcttgga acttctggac 601 aagaaaaggc ctggtttctg gtggcctcta tgaatcccat gtagggtgca gaccgtactc 661 catccctccc tgtgagcacc acgtcaacgg ctcccggccc ccatgcacgg gggagggaga 721 tacccccaag tgtagcaaga tctgtgagcc tggctacagc ccgacctaca aacaggacaa 781 gcactacgga tacaattcct acagcgtctc caatagcgag aaggacatca tggccgagat 841 ctacaaaaac ggccccgtgg agggagcttt ctctgtgtat tcggacttcc tgctctacaa 901 gtcaggagtg taccaacacg tcaccggaga gatgatgggt ggccatgcca tccgcatcct 961 gggctgggga gtggagaatg gcacacccta ctggctggtt gccaactcct ggaacactga 1021 ctggggtgac aatggcttct ttaaaatact cagaggacag gatcactgtg gaatcgaatc 1081 agaagtggtg gctggaattc cacgcaccga tcagtactgg gaaaagatct aatctgccgt 1141 gggcctgtcg tgccagtcct gggggcgaga tcggggtaga aatgcatttt attctttaag 1201 ttcacgtaag atacaagttt caggcagggt ctgaaggact ggattggcca aacatcagac 1261 ctgtcttcca aggagaccaa gtcctggcta catcccagcc tgtggttaca gtgcagacag 1321 gccatgtgag ccaccgctgc cagcacagag cgtccttccc cctgtagact agtgccgtag 1381 ggagtacctg ctgccccagc tgactgtggc cccctccgtg atccatccat ctccagggag 1441 caagacagag acgcaggaat gggaagcgga gttcctaaca ggatgaaagt tcccccatca 1501 gttcccccag tacctccaag caagtagctt tccacatttg tcacagaaat cagaggagag 1561 atggtgttgg gagccctttg gagaacgcca gtctcccagg ccccctgcat ctatcgagtt 1621 tgcaatgtca caacctctct gatgttgtgc tcagcatgat tctttaatag aagttttatt 1681 ttttcgtgca ctctgctaat catgtgggtg agccagtgga acagcgggag acctgtgcta 1741 gttttacaga ttgcctccta atgacgcggc tcaaaaggaa accaagtggt caggagttgt 1801 ttctgaccca ctgatctcta ctaccacaag gaaaatagtt taggagaaac cagcttttac 1861 tgtttttgaa aaattacagc ttcaccctgt caagttaaca aggaatgcct gtgccaataa 1921 aaggtttctc caacttgaag tctactctga tgggatctca gatcctttgt cactgcctat 1981 agacttgtag ctgctgtctc tctttgtccc tgcagagaat cacgtcctgg aactgcatgt 2041 tcttgcgact cttgggactt catcttaact tctcgctgcc ccagccatgt tttcaaccat 2101 ggcatccctc ccccaattag ttccctgtca tcctcgtcaa ccttcactgt aagtgcctgg 2161 taagcttgcc cttgcttaag aactcaaaac atagctgtgc tctatttttt tgttgttgtt 2221 gtgactgaca gagtgagatt ccgtctccca ggctggagtg cagtggcgcc ttctcggctc 2281 actgcaacct gcagcctcct agattcaagc gattctcctg cttcagcctt ccgagtagct 2341 gggatgacag gcactcacca atatgcctgg gtagtttttg tgtttttaag tacatacagg 2401 atttcaccat gttggccagg ctagtttcga actcctggcc tcaggtggtc tgcctgcctc 2461 ggcctcccaa ggtgttggga ttacaggcgt gagccactgg gccctgcctg tattttttat 2521 cagccacaaa tccagcaaca agctgaggat tcagctcata aaacaggctt ggtgtcttgg 2581 tgatctcaca taaccaagat gctaccccgt ggggaaccac atccccctgg atgccctcca 2641 gccttggttt gggctggagt cagggcctgg atacagtatt ttgaatttgt atgccactgg 2701 tttgcattgc tggtcaggaa ctctagtgct ttgcatagcc ctggtttaga aacatgttat 2761 agcagttctt ggtatagagc aaactagaag aaccagcaat cattccactg tcctgccaag 2821 gtacacctca gtactcccct tcccaactga agtggtatga ggctagctct ttccaaaagc 2881 attcaagttt ggcttctgat gtgactcaga atttaggaac cagatgctag atcaaataag 2941 ctctgaaaat ctgaggaaca ttgtaggaaa ggtttgttaa gcatctctta agtgccatga 3001 tgagcataac agccggccgt ggtggctcac gcctgtaagc ccagcacttt gggaggccga 3061 ggtgggaaga tgacaaggtc aggagttcgg gaccagcctg gccaacatgc tgaaacctca 3121 cctctactga agatacaaga attggctggg catggtggca catgcctgtg atcccagcta 3181 cttgggaggc tgaggcagga gaatcgcttg agcccgggag gcggaggttg cagtgagccg 3241 agacagtgcc agtgcactcc agcctcggtg acagcgcaag gctccgtctc aataattaaa 3301 aaaaaaaaaa aaaaagaggc cgggcgcagt ggctcaagcc tgtaatccca gcactttggg 3361 aggctgaggc gggcagatca cctgaggtca ggagttttga gatcagcctt ggcaacacgg 3421 tgaaacccca tctctactaa aaatacaaaa ttagccaagc atgctggcac atgcctgtaa 3481 tcccagctac tcgtgaggct gaggtacgag aatcgcttga acctgggagg cagaggatgc 3541 agtgagccga gatcacgcca ttgcactcca gcctggggga caagagtgaa tctgtgtctc 3601 accaaaaaaa aaaagaaaaa gaaagatgct taacaaaggt taccataagc cacaaattca 3661 tgaccactta tccttccagt ttcaagtaga atatattcat aacctcaata aagttctccc 3721 tgctcccaaa aaaaaaaaaa aaa //