LOCUS       HUMCTSE                 2158 bp    mRNA    linear   HUM 01-NOV-1994
DEFINITION  Human cathepsin E mRNA, complete cds.
ACCESSION   J05036
VERSION     J05036.1
KEYWORDS    aspartic proteinase; cathepsin.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2158)
  AUTHORS   Azuma,T., Pals,G., Mohandas,T.K., Couvreur,J.M. and Taggart,R.T.
  TITLE     Human gastric cathepsin E. Predicted sequence, localization to
            chromosome 1, and sequence homology with other aspartic proteinases
  JOURNAL   J. Biol. Chem. 264 (28), 16748-16753 (1989)
   PUBMED   2674141
COMMENT     Original source text: Human stomach, cDNA to mRNA.
FEATURES             Location/Qualifiers
     source          1..2158
                     /db_xref="H-InvDB:HIT000191304"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /map="1q31"
     gene            1..2158
                     /gene="CTSE"
     mRNA            <1..2158
                     /gene="CTSE"
                     /product="CTSE mRNA"
     CDS             50..1240
                     /gene="CTSE"
                     /note="cathepsin E precursor"
                     /codon_start=1
                     /protein_id="AAA52130.1"
                     /db_xref="GDB:G00-119-821"
                     /translation="MKTLLLLLLVLLELGEAQGSLHRVPLRRHPSLKKKLRARSQLSE
                     FWKSHNLDMIQFTESCSMDQSAKEPLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNL
                     WVPSVYCTSPACKTHSRFQPSQSSTYSQPGQSFSIQYGTGSLSGIIGADQVSVEGLTV
                     VGQQFGESVTEPGQTFVDAEFDGILGLGYPSLAVGGVTPVFDNMMAQNLVDLPMFSVY
                     MSSNPEGGAGSELIFGGYDHSHFSGSLNWVPVTKQAYWQIALDNIQVGGTVMFCSEGC
                     QAIVDTGTSLITGPSDKIKQLQNAIGAAPVDGEYAVECANLNVMPDVTFTINGVPYTL
                     SPTAYTLLDFVDGMQFCSSGFQGLDIHPPAGPLWILGDVFIRQFYSVFDRGNNRVGLA
                     PAVP"
     sig_peptide     50..100
                     /gene="CTSE"
                     /note="cathepsin E signal peptide"
     mat_peptide     101..1237
                     /gene="CTSE"
                     /product="cathepsin E"
BASE COUNT          530 a          552 c          500 g          576 t
ORIGIN      
        1 ggagagaaga aaggaggggg caagggagaa gctgctggtc ggactcacaa tgaaaacgct
       61 ccttcttttg ctgctggtgc tcctggagct gggagaggcc caaggatccc ttcacagggt
      121 gcccctcagg aggcatccgt ccctcaagaa gaagctgcgg gcacggagcc agctctctga
      181 gttctggaaa tcccataatt tggacatgat ccagttcacc gagtcctgct caatggacca
      241 gagtgccaag gaacccctca tcaactactt ggatatggaa tacttcggca ctatctccat
      301 tggctcccca ccacagaact tcactgtcat cttcgacact ggctcctcca acctctgggt
      361 cccctctgtg tactgcacta gcccagcctg caagacgcac agcaggttcc agccttccca
      421 gtccagcaca tacagccagc caggtcaatc tttctccatt cagtatggaa ccgggagctt
      481 gtccgggatc attggagccg accaagtctc tgtggaagga ctaaccgtgg ttggccagca
      541 gtttggagaa agtgtcacag agccaggcca gacctttgtg gatgcagagt ttgatggaat
      601 tctgggcctg ggatacccct ccttggctgt gggaggagtg actccagtat ttgacaacat
      661 gatggctcag aacctggtgg acttgccgat gttttctgtc tacatgagca gtaacccaga
      721 aggtggtgcg gggagcgagc tgatttttgg aggctacgac cactcccatt tctctgggag
      781 cctgaattgg gtcccagtca ccaagcaagc ttactggcag attgcactgg ataacatcca
      841 ggtgggaggc actgttatgt tctgctccga gggctgccag gccattgtgg acacagggac
      901 ttccctcatc actggccctt ccgacaagat taagcagctg caaaacgcca ttggggcagc
      961 ccccgtggat ggagaatatg ctgtggagtg tgccaacctt aacgtcatgc cggatgtcac
     1021 cttcaccatt aacggagtcc cctataccct cagcccaact gcctacaccc tactggactt
     1081 cgtggatgga atgcagttct gcagcagtgg ctttcaagga cttgacatcc accctccagc
     1141 tgggcccctc tggatcctgg gggatgtctt cattcgacag ttttactcag tctttgaccg
     1201 tgggaataac cgtgtgggac tggccccagc agtcccctaa ggaggggcct tgtgtctgtg
     1261 cctgcctgtc tgacagacct tgaatatgtt aggctggggc attctttaca cctacaaaaa
     1321 gttattttcc agagaatgta gctgtttcca gggttgcaac ttgaattaag accaaacaga
     1381 acatgagaat acacacacac acacacatat acacacacac acacttcaca catacacacc
     1441 actcccacca ccgtcatgat ggaggaatta cgttatacat tcatattttg tattgatttt
     1501 tgattatgaa aatcaaaaat tttcacattt gattatgaaa atctccaaac atatgcacaa
     1561 gcagagatca tggtataata aatccctttg caactccact cagccctgac aacccatcca
     1621 cacacggcca ggcctgttta tctacactgc tgcccactcc tctctccagc tccacatgct
     1681 gtacctggat cattctgaag caaattccga gcattacatc attttgtcca taaatatttc
     1741 taacatcctt aaatatacaa tcggaattca agcatctccc attgtcccac aaatgtttgg
     1801 ctgtttttgt agttggattg tttgtattag gattcaagca aggcccatat attgcattta
     1861 tttgaaatgt ctgtaagtct ctttccatct acagagttta gcacatttga acgttgctgg
     1921 ttgaaatccc gaggtgtcat ttgacatggt tctctgaact tatctttcct ataaaatggt
     1981 agttagatct ggaggtctga ttttgtggca aaaatacttc ctaggtggtg ctgggtactt
     2041 cttgttgcat cctgtcagga ggcagataat gctggtgcct ctctattggt aatgttaaga
     2101 ctgctgggtg ggtttggagt tcttggcttt aatcattcat tacaaagttc agcatttt
//