LOCUS BC048255 1339 bp mRNA linear HUM 22-DEC-2006
DEFINITION Homo sapiens cathepsin W, mRNA (cDNA clone MGC:51946
IMAGE:5221520), complete cds.
ACCESSION BC048255
VERSION BC048255.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 1339)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 1339)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (07-MAR-2003) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: Life Technologies, Inc.
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: National Institutes of Health Intramural
Sequencing Center (NISC),
Gaithersburg, Maryland;
Web site: http://www.nisc.nih.gov/
Contact: nisc_mgc@nhgri.nih.gov
Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D., McCloskey,J.C.,
McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
Young,A., Zhang,L.-H. and Green,E.D.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 93 Row: k Column: 12
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 23110963.
FEATURES Location/Qualifiers
source 1..1339
/db_xref="H-InvDB:HIT000053310"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:51946 IMAGE:5221520"
/tissue_type="Pancreas, Spleen, adult pooled"
/clone_lib="NIH_MGC_120"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene 1..1339
/gene="CTSW"
/gene_synonym="LYPN"
/db_xref="GeneID:1521"
/db_xref="HGNC:HGNC:2546"
/db_xref="MIM:602364"
CDS 43..1173
/gene="CTSW"
/gene_synonym="LYPN"
/codon_start=1
/product="cathepsin W"
/protein_id="AAH48255.1"
/db_xref="GeneID:1521"
/db_xref="HGNC:HGNC:2546"
/db_xref="MIM:602364"
/translation="MALTAHPSCLLALLVAGLAQGIRGPLRAQDLGPQPLELKEAFKL
FQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQL
YGYRRAAGGVPSMGREIRSEEPEESVPFSCDWRKVAGAISPIKDQKNCNCCWAMAAAG
NIETLWRISFWDFVDVSVQELLDCGRCGDGCHGGFVWDAFITVLNNSGLASEKDYPFQ
GKVRAHRCHPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPITVTINMKPLQLYRKGV
IKATPTTCDPQLVDHSVLLVGFGSVKSEEGIWAETVSSQSQPQPPHPTPYWILKNSWG
AQWGEKGYFRLHRGSNTCGITKFPLTARVQKPDMKPRVSCPP"
BASE COUNT 338 a 408 c 353 g 240 t
ORIGIN
1 tgcgcggctt cctgcctcca tgccactcca gactgcaccg gcatggcact gactgcccac
61 ccctcctgcc tcctggccct gttggtggca ggcctagccc aaggcatcag aggccccctt
121 agggcccagg acctaggtcc ccagccgcta gagctgaaag aggccttcaa gttgttccag
181 atccagttca accggagtta cctgagccca gaagagcatg ctcaccgcct ggacatcttt
241 gcccacaacc tggcccaggc tcagaggctg caggaggagg acttgggcac agctgagttt
301 ggggtgactc cattcagtga cctcacagag gaggagtttg gccagctcta tggctatcgg
361 agggcagctg gaggggtccc cagcatgggc agagaaataa ggtctgaaga gccagaggag
421 tcagtacctt tcagctgtga ctggcggaag gtggccggcg ccatctcacc catcaaggac
481 cagaaaaact gcaactgctg ctgggccatg gcagcggcag gcaacataga gaccctgtgg
541 cgcatcagtt tctgggattt tgtggacgtc tccgtgcagg aactgctgga ctgtggccgc
601 tgtggggatg gctgccacgg tggcttcgtc tgggacgcgt tcataactgt cctcaacaac
661 agcggcctgg ccagtgaaaa ggactacccg ttccagggca aagtcagagc ccacaggtgc
721 caccccaaga agtaccagaa ggtggcctgg atccaggact tcatcatgct gcagaacaac
781 gagcacagaa ttgcgcagta cctggccact tatggcccca tcaccgtgac catcaacatg
841 aagccccttc agctataccg gaaaggtgtg atcaaggcca cacccaccac ctgtgacccc
901 cagcttgtgg accactctgt cctgctggtg ggttttggca gcgtcaagtc agaggagggg
961 atatgggcag agacagtctc atcgcagtct cagcctcagc ctccacaccc caccccatac
1021 tggatcctga agaactcctg gggggcccaa tggggagaga agggctattt ccggctgcac
1081 cgagggagca atacctgtgg catcaccaag ttcccgctca ctgcccgtgt gcagaaaccg
1141 gatatgaagc cccgagtctc ctgccctccc tgaacccacc tggccccctc agctctgtcc
1201 tgttaggcca actgcctcct tgccagcccc acccccaggt ttttgcccat cctcccaatc
1261 tcaatacagc ctgaataaac caaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
1321 aaaaaaaaaa aaaaaaaaa
//