LOCUS       BC048255                1339 bp    mRNA    linear   HUM 22-DEC-2006
DEFINITION  Homo sapiens cathepsin W, mRNA (cDNA clone MGC:51946
            IMAGE:5221520), complete cds.
ACCESSION   BC048255
VERSION     BC048255.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1339)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M.,
            Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H.,
            Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T.,
            Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K.,
            Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B.,
            Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J.,
            Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S.,
            Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J.,
            Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A.,
            Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M.,
            Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J.,
            Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A.,
            Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C.,
            Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W.,
            Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J.,
            Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I.,
            Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J.
            and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 1339)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (07-MAR-2003) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Life Technologies, Inc.
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC), Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,
            McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S.,
            Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A.,
            Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D.
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            http://image.llnl.gov
            Series: IRAK Plate: 93 Row: k Column: 12
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi:
            23110963.
FEATURES             Location/Qualifiers
     source          1..1339
                     /db_xref="H-InvDB:HIT000053310"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:51946 IMAGE:5221520"
                     /tissue_type="Pancreas, Spleen, adult pooled"
                     /clone_lib="NIH_MGC_120"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..1339
                     /gene="CTSW"
                     /gene_synonym="LYPN"
                     /db_xref="GeneID:1521"
                     /db_xref="HGNC:HGNC:2546"
                     /db_xref="MIM:602364"
     CDS             43..1173
                     /gene="CTSW"
                     /gene_synonym="LYPN"
                     /codon_start=1
                     /product="cathepsin W"
                     /protein_id="AAH48255.1"
                     /db_xref="GeneID:1521"
                     /db_xref="HGNC:HGNC:2546"
                     /db_xref="MIM:602364"
                     /translation="MALTAHPSCLLALLVAGLAQGIRGPLRAQDLGPQPLELKEAFKL
                     FQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQL
                     YGYRRAAGGVPSMGREIRSEEPEESVPFSCDWRKVAGAISPIKDQKNCNCCWAMAAAG
                     NIETLWRISFWDFVDVSVQELLDCGRCGDGCHGGFVWDAFITVLNNSGLASEKDYPFQ
                     GKVRAHRCHPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPITVTINMKPLQLYRKGV
                     IKATPTTCDPQLVDHSVLLVGFGSVKSEEGIWAETVSSQSQPQPPHPTPYWILKNSWG
                     AQWGEKGYFRLHRGSNTCGITKFPLTARVQKPDMKPRVSCPP"
BASE COUNT      338 a    408 c    353 g    240 t
ORIGIN
        1 tgcgcggctt cctgcctcca tgccactcca gactgcaccg gcatggcact gactgcccac
       61 ccctcctgcc tcctggccct gttggtggca ggcctagccc aaggcatcag aggccccctt
      121 agggcccagg acctaggtcc ccagccgcta gagctgaaag aggccttcaa gttgttccag
      181 atccagttca accggagtta cctgagccca gaagagcatg ctcaccgcct ggacatcttt
      241 gcccacaacc tggcccaggc tcagaggctg caggaggagg acttgggcac agctgagttt
      301 ggggtgactc cattcagtga cctcacagag gaggagtttg gccagctcta tggctatcgg
      361 agggcagctg gaggggtccc cagcatgggc agagaaataa ggtctgaaga gccagaggag
      421 tcagtacctt tcagctgtga ctggcggaag gtggccggcg ccatctcacc catcaaggac
      481 cagaaaaact gcaactgctg ctgggccatg gcagcggcag gcaacataga gaccctgtgg
      541 cgcatcagtt tctgggattt tgtggacgtc tccgtgcagg aactgctgga ctgtggccgc
      601 tgtggggatg gctgccacgg tggcttcgtc tgggacgcgt tcataactgt cctcaacaac
      661 agcggcctgg ccagtgaaaa ggactacccg ttccagggca aagtcagagc ccacaggtgc
      721 caccccaaga agtaccagaa ggtggcctgg atccaggact tcatcatgct gcagaacaac
      781 gagcacagaa ttgcgcagta cctggccact tatggcccca tcaccgtgac catcaacatg
      841 aagccccttc agctataccg gaaaggtgtg atcaaggcca cacccaccac ctgtgacccc
      901 cagcttgtgg accactctgt cctgctggtg ggttttggca gcgtcaagtc agaggagggg
      961 atatgggcag agacagtctc atcgcagtct cagcctcagc ctccacaccc caccccatac
     1021 tggatcctga agaactcctg gggggcccaa tggggagaga agggctattt ccggctgcac
     1081 cgagggagca atacctgtgg catcaccaag ttcccgctca ctgcccgtgt gcagaaaccg
     1141 gatatgaagc cccgagtctc ctgccctccc tgaacccacc tggccccctc agctctgtcc
     1201 tgttaggcca actgcctcct tgccagcccc acccccaggt ttttgcccat cctcccaatc
     1261 tcaatacagc ctgaataaac caaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
     1321 aaaaaaaaaa aaaaaaaaa
//