LOCUS       HUMCTSB                 2002 bp    mRNA    linear   HUM 01-NOV-1994
DEFINITION  Human cathepsin B proteinase mRNA, complete cds.
ACCESSION   M14221
VERSION     M14221.1
KEYWORDS    cathepsin B; proteinase.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2002)
  AUTHORS   Chan,S.J., San Segundo,B., McCormick,M.B. and Steiner,D.F.
  TITLE     Nucleotide and predicted amino acid sequences of cloned human and
            mouse preprocathepsin B cDNAs
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 83 (20), 7721-7725 (1986)
   PUBMED   3463996
COMMENT     Original source text: Human kidney, cDNA to mRNA, clone
            lambda-hCB79.
FEATURES             Location/Qualifiers
     source          1..2002
                     /db_xref="H-InvDB:HIT000194314"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /map="8p22"
     gene            1..2002
                     /gene="CTSB"
     mRNA            195..2002
                     /gene="CTSB"
                     /product="preprocathepsin B mRNA"
     CDS             195..1214
                     /gene="CTSB"
                     /note="preprocathepsin B"
                     /codon_start=1
                     /protein_id="AAA52129.1"
                     /db_xref="GDB:G00-119-087"
                     /translation="MWQLWASLCCLLVLANARSRPSFHPVSDELVNYVNKRNTTWQAG
                     HNFYNVDMSYLKRLCGTFLGGPKPPQRVMFTEDLKLPASFDAREQWPQCPTIKEIRDQ
                     GSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWN
                     FWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPT
                     YKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMG
                     GHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQ
                     YWEKI"
BASE COUNT      460 a    553 c    550 g    439 t
ORIGIN      Chromosome 8p22; EcoRI site.
        1 aattccgcgg caaccgctcc ggcaacgcca accgctccgc tgcgcgcagg ctgggctgca
       61 ggctctcggc tgcagcgctg ggctggtgtg cagtggtgcg accacggctc acggcagcct
      121 cagccaccca gatgtaagcg atctggttcc cacctcagcc ttccgagtag tggatctagg
      181 atctggcttc caacatgtgg cagctctggg cctccctctg ctgcctgctg gtgttggcca
      241 atgcccggag caggccctct ttccatcccg tgtcggatga gctggtcaac tatgtcaaca
      301 aacggaatac cacgtggcag gccgggcaca acttctacaa cgtggacatg agctacttga
      361 agaggctatg tggtaccttc ctgggtgggc ccaagccacc ccagagagtt atgtttaccg
      421 aggacctgaa gctgcctgca agcttcgatg cacgggaaca atggccacag tgtcccacca
      481 tcaaagagat cagagaccag ggctcctgtg gctcctgctg ggccttcggg gctgtggaag
      541 ccatctctga ccgcatctgc atccacacca atgcgcacgt cagcgtggag gtgtcggcgg
      601 aggacctgct cacctgctgt ggcagcatgt gtggggacgg ctgtaatggt ggctatcctg
      661 ctgaagcttg gaacttctgg acaagaaaag gcctggtttc tggtggcctc tatgaatccc
      721 atgtagggtg cagaccgtac tccatccctc cctgtgagca ccacgtcaac ggctcccggc
      781 ccccatgcac gggggaggga gataccccca agtgtagcaa gatctgtgag cctggctaca
      841 gcccgaccta caaacaggac aagcactacg gatacaattc ctacagcgtc tccaatagcg
      901 agaaggacat catggccgag atctacaaaa acggccccgt ggagggagct ttctctgtgt
      961 attcggactt cctgctctac aagtcaggag tgtaccaaca cgtcaccgga gagatgatgg
     1021 gtggccatgc catccgcatc ctgggctggg gagtggagaa tggcacaccc tactggctgg
     1081 ttgccaactc ctggaacact gactggggtg acaatggctt ctttaaaata ctcagaggac
     1141 aggatcactg cggaatcgaa tcagaagtgg tggctggaat tccacgcacc gatcagtact
     1201 gggaaaagat ctaatctgcc gtgggcctgt cgtgccagtc ctgggggcga gatcggggta
     1261 gaaagtcatt ttattcttta agttcacgta agatacaagt ttcaggcagg gtctgaagga
     1321 ctggattggc caaagtcctc caaggagacc aagtcctggc tacatcccag cctgtggtta
     1381 cagtgcagac aggccatgtg agccaccgct gccagcacag agcgtccttc cccctgtaga
     1441 ctagtgccgt gggagtacct gctgcccagc tgctgtggcc ccctccgtga tccatccatc
     1501 tccagggagc aagacagaga cgcaggatgg aaagcggagt tcctaacagg atgaaagttc
     1561 ccccatcagt tcccccagta cctccaagca agtagctttc cacatttgtc acagaaatca
     1621 gaggagagat ggtgttggga gccctttgga gaacgccagt ctccaggtcc ccctgcatct
     1681 atcgagtttg caatgtcaca acctctctga tcttgtgctc agcatgattc tttaatagaa
     1741 gttttatttt tcgtgcactc tgctaatcat gtgggtgagc cagtggaaca gcgggagcct
     1801 gtgctggttt gcagattgcc tcctaatgac gcggctcaaa aggaaaccaa gtggtcagga
     1861 gttgtttctg acccactgat ctctactacc acaaggaaaa tagtttagga gaaaccagct
     1921 tttactgttt ttgaaaaatt acagcttcac cctgtcaagt taacaaggaa tgcctgtgcc
     1981 aataaaaggt ttctccaact tg
//