LOCUS       BC042537                2289 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens cathepsin E, mRNA (cDNA clone MGC:34609
            IMAGE:5174814), complete cds.
ACCESSION   BC042537
VERSION     BC042537.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2289)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2289)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (02-JAN-2003) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Life Technologies, Inc.
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Baylor College of Medicine Human Genome
            Sequencing Center
            Center code: BCM-HGSC
            Web site: http://www.hgsc.bcm.tmc.edu/cdna/
            Contact: amg@bcm.tmc.edu
            Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
            Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
            A.N., Gibbs, R.A.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 51 Row: k Column: 9
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 23110950.
FEATURES             Location/Qualifiers
     source          1..2289
                     /db_xref="H-InvDB:HIT000052706"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:34609 IMAGE:5174814"
                     /tissue_type="Brain, Lung, Testis, adult, pooled whole"
                     /clone_lib="NIH_MGC_115"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..2289
                     /gene="CTSE"
                     /db_xref="GeneID:1510"
                     /db_xref="HGNC:2530"
                     /db_xref="MIM:116890"
     CDS             86..1276
                     /gene="CTSE"
                     /codon_start=1
                     /product="cathepsin E"
                     /protein_id="AAH42537.1"
                     /db_xref="GI:27502799"
                     /db_xref="GeneID:1510"
                     /db_xref="HGNC:2530"
                     /db_xref="MIM:116890"
                     /translation="MKTLLLLLLVLLELGEAQGSLHRVPLRRHPSLKKKLRARSQLSE
                     FWKSHNLDMIQFTESCSMDQSAKEPLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNL
                     WVPSVYCTSPACKTHSRFQPSQSSTYSQPGQSFSIQYGTGSLSGIIGADQVSVEGLTV
                     VGQQFGESVTEPGQTFVDAEFDGILGLGYPSLAVGGVTPVFDNMMAQNLVDLPMFSVY
                     MSSNPEGGAGSELIFGGYDHSHFSGSLNWVPVTKQAYWQIALDNIQVGGTVMFCSEGC
                     QAIVDTGTSLITGPSDKIKQLQNAIGAAPVDGEYAVECANLNVMPDVTFTINGVPYTL
                     SPTAYTLLDFVDGMQFCSSGFQGLDIHPPAGPLWILGDVFIRQFYSVFDRGNNRVGLA
                     PAVP"
BASE COUNT          632 a          562 c          511 g          584 t
ORIGIN      
        1 tgagagttag ggaaagtccg ttcccactgc cctcggggag agaagaaagg agggggcaag
       61 ggagaagctg ctggtcggac tcacaatgaa aacgctcctt cttttgctgc tggtgctcct
      121 ggagctggga gaggcccaag gatcccttca cagggtgccc ctcaggaggc atccgtccct
      181 caagaagaag ctgcgggcac ggagccagct ctctgagttc tggaaatccc ataatttgga
      241 catgatccag ttcaccgagt cctgctcaat ggaccagagt gccaaggaac ccctcatcaa
      301 ctacttggat atggaatact tcggcactat ctccattggc tccccaccac agaacttcac
      361 tgtcatcttc gacactggct cctccaacct ctgggtcccc tctgtgtact gcactagccc
      421 agcctgcaag acgcacagca ggttccagcc ttcccagtcc agcacataca gccagccagg
      481 tcaatctttc tccattcagt atggaaccgg gagcttgtcc gggatcattg gagccgacca
      541 agtctctgtg gaaggactaa ccgtggttgg ccagcagttt ggagaaagtg tcacagagcc
      601 aggccagacc tttgtggatg cagagtttga tggaattctg ggcctgggat acccctcctt
      661 ggctgtggga ggagtgactc cagtatttga caacatgatg gctcagaacc tggtggactt
      721 gccgatgttt tctgtctaca tgagcagtaa cccagaaggt ggtgcgggga gcgagctgat
      781 ttttggaggc tacgaccact cccatttctc tgggagcctg aattgggtcc cagtcaccaa
      841 gcaagcttac tggcagattg cactggataa catccaggtg ggaggcactg ttatgttctg
      901 ctccgagggc tgccaggcca ttgtggacac agggacttcc ctcatcactg gcccttccga
      961 caagattaag cagctgcaaa acgccattgg ggcagccccc gtggatggag aatatgctgt
     1021 ggagtgtgcc aaccttaacg tcatgccgga tgtcaccttc accattaacg gagtccccta
     1081 taccctcagc ccaactgcct acaccctact ggacttcgtg gatggaatgc agttctgcag
     1141 cagtggcttt caaggacttg acatccaccc tccagctggg cccctctgga tcctggggga
     1201 tgtcttcatt cgacagtttt actcagtctt tgaccgtggg aataaccgtg tgggactggc
     1261 cccagcagtc ccctaaggag gggccttgtg tctgtgcctg cctgtctgac agaccttgaa
     1321 tatgttaggc tggggcattc tttacaccta caaaaagtta ttttccagag aatgtagctg
     1381 tttccagggt tgcaacttga attaagacca aacagaacat gagaatacac acacacacac
     1441 acatatacac acacacacac ttcacacata cacaccactc ccaccaccgt catgatggag
     1501 gaattacgtt atacattcat attttgtatt gatttttgat tatgaaaatc aaaaattttc
     1561 acatttgatt atgaaaatct ccaaacatat gcacaagcag agatcatggt ataataaatc
     1621 cctttgcaac tccactcagc cctgacaacc catccacaca cggccaggcc tgtttatcta
     1681 cactgctgcc cactcctctc tccagctcca catgctgtac ctggatcatt ctgaagcaaa
     1741 ttccgagcat tacatcattt tgtccataaa tatttctaac atccttaaat atacaatcgg
     1801 aattcaagca tctcccattg tcccacaaat gtttggctgt ttttgtagtt ggattgtttg
     1861 tattaggatt caagcaaggc ccatatattg catttatttg aaatgtctgt aagtctcttt
     1921 ccatctacag agtttagcac atttgaacgt tgctggttga aatcccgagg tgtcatttga
     1981 catggttctc tgaacttatc tttcctataa aatggtagtt agatctggag gtctgatttt
     2041 gtggcaaaaa tacttcctag gtggtgctgg gtacttcttg ttgcatcctg tcaggaggca
     2101 gataatgctg gtgcctctct attggtaatg ttaagactgc tgggtgggtt tggagttctt
     2161 ggctttaatc attcattaca aagttcagca ttttaaaaaa aaaaaaaaaa aaaaaaaaaa
     2221 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
     2281 aaaaaaaaa
//