LOCUS HUMCTSE 2158 bp mRNA linear HUM 01-NOV-1994 DEFINITION Human cathepsin E mRNA, complete cds. ACCESSION J05036 VERSION J05036.1 KEYWORDS aspartic proteinase; cathepsin. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2158) AUTHORS Azuma,T., Pals,G., Mohandas,T.K., Couvreur,J.M. and Taggart,R.T. TITLE Human gastric cathepsin E. Predicted sequence, localization to chromosome 1, and sequence homology with other aspartic proteinases JOURNAL J. Biol. Chem. 264 (28), 16748-16753 (1989) PUBMED 2674141 COMMENT Original source text: Human stomach, cDNA to mRNA. FEATURES Location/Qualifiers source 1..2158 /db_xref="H-InvDB:HIT000191304" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /map="1q31" gene 1..2158 /gene="CTSE" mRNA <1..2158 /gene="CTSE" /product="CTSE mRNA" CDS 50..1240 /gene="CTSE" /note="cathepsin E precursor" /codon_start=1 /protein_id="AAA52130.1" /db_xref="GDB:G00-119-821" /translation="MKTLLLLLLVLLELGEAQGSLHRVPLRRHPSLKKKLRARSQLSE FWKSHNLDMIQFTESCSMDQSAKEPLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNL WVPSVYCTSPACKTHSRFQPSQSSTYSQPGQSFSIQYGTGSLSGIIGADQVSVEGLTV VGQQFGESVTEPGQTFVDAEFDGILGLGYPSLAVGGVTPVFDNMMAQNLVDLPMFSVY MSSNPEGGAGSELIFGGYDHSHFSGSLNWVPVTKQAYWQIALDNIQVGGTVMFCSEGC QAIVDTGTSLITGPSDKIKQLQNAIGAAPVDGEYAVECANLNVMPDVTFTINGVPYTL SPTAYTLLDFVDGMQFCSSGFQGLDIHPPAGPLWILGDVFIRQFYSVFDRGNNRVGLA PAVP" sig_peptide 50..100 /gene="CTSE" /note="cathepsin E signal peptide" mat_peptide 101..1237 /gene="CTSE" /product="cathepsin E" BASE COUNT 530 a 552 c 500 g 576 t ORIGIN 1 ggagagaaga aaggaggggg caagggagaa gctgctggtc ggactcacaa tgaaaacgct 61 ccttcttttg ctgctggtgc tcctggagct gggagaggcc caaggatccc ttcacagggt 121 gcccctcagg aggcatccgt ccctcaagaa gaagctgcgg gcacggagcc agctctctga 181 gttctggaaa tcccataatt tggacatgat ccagttcacc gagtcctgct caatggacca 241 gagtgccaag gaacccctca tcaactactt ggatatggaa tacttcggca ctatctccat 301 tggctcccca ccacagaact tcactgtcat cttcgacact ggctcctcca acctctgggt 361 cccctctgtg tactgcacta gcccagcctg caagacgcac agcaggttcc agccttccca 421 gtccagcaca tacagccagc caggtcaatc tttctccatt cagtatggaa ccgggagctt 481 gtccgggatc attggagccg accaagtctc tgtggaagga ctaaccgtgg ttggccagca 541 gtttggagaa agtgtcacag agccaggcca gacctttgtg gatgcagagt ttgatggaat 601 tctgggcctg ggatacccct ccttggctgt gggaggagtg actccagtat ttgacaacat 661 gatggctcag aacctggtgg acttgccgat gttttctgtc tacatgagca gtaacccaga 721 aggtggtgcg gggagcgagc tgatttttgg aggctacgac cactcccatt tctctgggag 781 cctgaattgg gtcccagtca ccaagcaagc ttactggcag attgcactgg ataacatcca 841 ggtgggaggc actgttatgt tctgctccga gggctgccag gccattgtgg acacagggac 901 ttccctcatc actggccctt ccgacaagat taagcagctg caaaacgcca ttggggcagc 961 ccccgtggat ggagaatatg ctgtggagtg tgccaacctt aacgtcatgc cggatgtcac 1021 cttcaccatt aacggagtcc cctataccct cagcccaact gcctacaccc tactggactt 1081 cgtggatgga atgcagttct gcagcagtgg ctttcaagga cttgacatcc accctccagc 1141 tgggcccctc tggatcctgg gggatgtctt cattcgacag ttttactcag tctttgaccg 1201 tgggaataac cgtgtgggac tggccccagc agtcccctaa ggaggggcct tgtgtctgtg 1261 cctgcctgtc tgacagacct tgaatatgtt aggctggggc attctttaca cctacaaaaa 1321 gttattttcc agagaatgta gctgtttcca gggttgcaac ttgaattaag accaaacaga 1381 acatgagaat acacacacac acacacatat acacacacac acacttcaca catacacacc 1441 actcccacca ccgtcatgat ggaggaatta cgttatacat tcatattttg tattgatttt 1501 tgattatgaa aatcaaaaat tttcacattt gattatgaaa atctccaaac atatgcacaa 1561 gcagagatca tggtataata aatccctttg caactccact cagccctgac aacccatcca 1621 cacacggcca ggcctgttta tctacactgc tgcccactcc tctctccagc tccacatgct 1681 gtacctggat cattctgaag caaattccga gcattacatc attttgtcca taaatatttc 1741 taacatcctt aaatatacaa tcggaattca agcatctccc attgtcccac aaatgtttgg 1801 ctgtttttgt agttggattg tttgtattag gattcaagca aggcccatat attgcattta 1861 tttgaaatgt ctgtaagtct ctttccatct acagagttta gcacatttga acgttgctgg 1921 ttgaaatccc gaggtgtcat ttgacatggt tctctgaact tatctttcct ataaaatggt 1981 agttagatct ggaggtctga ttttgtggca aaaatacttc ctaggtggtg ctgggtactt 2041 cttgttgcat cctgtcagga ggcagataat gctggtgcct ctctattggt aatgttaaga 2101 ctgctgggtg ggtttggagt tcttggcttt aatcattcat tacaaagttc agcatttt //