LOCUS BC042537 2289 bp mRNA linear HUM 15-JUL-2006 DEFINITION Homo sapiens cathepsin E, mRNA (cDNA clone MGC:34609 IMAGE:5174814), complete cds. ACCESSION BC042537 VERSION BC042537.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2289) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2289) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (02-JAN-2003) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Life Technologies, Inc. cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Baylor College of Medicine Human Genome Sequencing Center Center code: BCM-HGSC Web site: http://www.hgsc.bcm.tmc.edu/cdna/ Contact: amg@bcm.tmc.edu Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H., Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati, A.N., Gibbs, R.A. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 51 Row: k Column: 9 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 23110950. FEATURES Location/Qualifiers source 1..2289 /db_xref="H-InvDB:HIT000052706" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:34609 IMAGE:5174814" /tissue_type="Brain, Lung, Testis, adult, pooled whole" /clone_lib="NIH_MGC_115" /lab_host="DH10B" /note="Vector: pCMV-SPORT6" gene 1..2289 /gene="CTSE" /db_xref="GeneID:1510" /db_xref="HGNC:HGNC:2530" /db_xref="MIM:116890" CDS 86..1276 /gene="CTSE" /codon_start=1 /product="cathepsin E" /protein_id="AAH42537.1" /db_xref="GeneID:1510" /db_xref="HGNC:HGNC:2530" /db_xref="MIM:116890" /translation="MKTLLLLLLVLLELGEAQGSLHRVPLRRHPSLKKKLRARSQLSE FWKSHNLDMIQFTESCSMDQSAKEPLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNL WVPSVYCTSPACKTHSRFQPSQSSTYSQPGQSFSIQYGTGSLSGIIGADQVSVEGLTV VGQQFGESVTEPGQTFVDAEFDGILGLGYPSLAVGGVTPVFDNMMAQNLVDLPMFSVY MSSNPEGGAGSELIFGGYDHSHFSGSLNWVPVTKQAYWQIALDNIQVGGTVMFCSEGC QAIVDTGTSLITGPSDKIKQLQNAIGAAPVDGEYAVECANLNVMPDVTFTINGVPYTL SPTAYTLLDFVDGMQFCSSGFQGLDIHPPAGPLWILGDVFIRQFYSVFDRGNNRVGLA PAVP" BASE COUNT 632 a 562 c 511 g 584 t ORIGIN 1 tgagagttag ggaaagtccg ttcccactgc cctcggggag agaagaaagg agggggcaag 61 ggagaagctg ctggtcggac tcacaatgaa aacgctcctt cttttgctgc tggtgctcct 121 ggagctggga gaggcccaag gatcccttca cagggtgccc ctcaggaggc atccgtccct 181 caagaagaag ctgcgggcac ggagccagct ctctgagttc tggaaatccc ataatttgga 241 catgatccag ttcaccgagt cctgctcaat ggaccagagt gccaaggaac ccctcatcaa 301 ctacttggat atggaatact tcggcactat ctccattggc tccccaccac agaacttcac 361 tgtcatcttc gacactggct cctccaacct ctgggtcccc tctgtgtact gcactagccc 421 agcctgcaag acgcacagca ggttccagcc ttcccagtcc agcacataca gccagccagg 481 tcaatctttc tccattcagt atggaaccgg gagcttgtcc gggatcattg gagccgacca 541 agtctctgtg gaaggactaa ccgtggttgg ccagcagttt ggagaaagtg tcacagagcc 601 aggccagacc tttgtggatg cagagtttga tggaattctg ggcctgggat acccctcctt 661 ggctgtggga ggagtgactc cagtatttga caacatgatg gctcagaacc tggtggactt 721 gccgatgttt tctgtctaca tgagcagtaa cccagaaggt ggtgcgggga gcgagctgat 781 ttttggaggc tacgaccact cccatttctc tgggagcctg aattgggtcc cagtcaccaa 841 gcaagcttac tggcagattg cactggataa catccaggtg ggaggcactg ttatgttctg 901 ctccgagggc tgccaggcca ttgtggacac agggacttcc ctcatcactg gcccttccga 961 caagattaag cagctgcaaa acgccattgg ggcagccccc gtggatggag aatatgctgt 1021 ggagtgtgcc aaccttaacg tcatgccgga tgtcaccttc accattaacg gagtccccta 1081 taccctcagc ccaactgcct acaccctact ggacttcgtg gatggaatgc agttctgcag 1141 cagtggcttt caaggacttg acatccaccc tccagctggg cccctctgga tcctggggga 1201 tgtcttcatt cgacagtttt actcagtctt tgaccgtggg aataaccgtg tgggactggc 1261 cccagcagtc ccctaaggag gggccttgtg tctgtgcctg cctgtctgac agaccttgaa 1321 tatgttaggc tggggcattc tttacaccta caaaaagtta ttttccagag aatgtagctg 1381 tttccagggt tgcaacttga attaagacca aacagaacat gagaatacac acacacacac 1441 acatatacac acacacacac ttcacacata cacaccactc ccaccaccgt catgatggag 1501 gaattacgtt atacattcat attttgtatt gatttttgat tatgaaaatc aaaaattttc 1561 acatttgatt atgaaaatct ccaaacatat gcacaagcag agatcatggt ataataaatc 1621 cctttgcaac tccactcagc cctgacaacc catccacaca cggccaggcc tgtttatcta 1681 cactgctgcc cactcctctc tccagctcca catgctgtac ctggatcatt ctgaagcaaa 1741 ttccgagcat tacatcattt tgtccataaa tatttctaac atccttaaat atacaatcgg 1801 aattcaagca tctcccattg tcccacaaat gtttggctgt ttttgtagtt ggattgtttg 1861 tattaggatt caagcaaggc ccatatattg catttatttg aaatgtctgt aagtctcttt 1921 ccatctacag agtttagcac atttgaacgt tgctggttga aatcccgagg tgtcatttga 1981 catggttctc tgaacttatc tttcctataa aatggtagtt agatctggag gtctgatttt 2041 gtggcaaaaa tacttcctag gtggtgctgg gtacttcttg ttgcatcctg tcaggaggca 2101 gataatgctg gtgcctctct attggtaatg ttaagactgc tgggtgggtt tggagttctt 2161 ggctttaatc attcattaca aagttcagca ttttaaaaaa aaaaaaaaaa aaaaaaaaaa 2221 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2281 aaaaaaaaa //