LOCUS       BC011682                2052 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens cathepsin F, mRNA (cDNA clone MGC:19716 IMAGE:3535532),
            complete cds.
ACCESSION   BC011682
VERSION     BC011682.2
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2052)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J.,
            Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S.,
            Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M.,
            Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M.,
            Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A.,
            Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W.,
            Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C.,
            Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S.,
            Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E.,
            Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2052)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (30-JUL-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Aug 19, 2003 this sequence version replaced BC011682.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: DCTD/DTP
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC), Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 27 Row: e Column: 1
            This clone was selected for full length sequencing because it
            passed the following selection criteria: Similarity but not
            identity to protein.
FEATURES             Location/Qualifiers
     source          1..2052
                     /db_xref="H-InvDB:HIT000035371"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:19716 IMAGE:3535532"
                     /tissue_type="Lung, small cell carcinoma"
                     /clone_lib="NIH_MGC_7"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..2052
                     /gene="CTSF"
                     /gene_synonym="CATSF"
                     /db_xref="GeneID:8722"
                     /db_xref="HGNC:HGNC:2531"
                     /db_xref="MIM:603539"
     CDS             85..1539
                     /gene="CTSF"
                     /gene_synonym="CATSF"
                     /codon_start=1
                     /product="cathepsin F"
                     /protein_id="AAH11682.1"
                     /db_xref="GeneID:8722"
                     /db_xref="HGNC:HGNC:2531"
                     /db_xref="MIM:603539"
                     /translation="MAPWLQLLSLLGLLPGAVAAPAQPRAASFQAWGPPSPELLAPTR
                     FALEMFNRGRAAGTRAVLGLVRGRVRRAGQGSLYSLEATLEEPPCNDPMVCRLPVSKK
                     TLLCSFQVLDELGRHVLLRKDCGPVDTKVPGAGEPKSAFTQGSAMISSLSQNHPDNRN
                     ETFSSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRA
                     QKIQALDRGTAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWD
                     WRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACM
                     GGLPSNAYSAIKNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVELSQNEQKLAA
                     WLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDVPFWAIK
                     NSWGTDWGEKGYYYLHRGSGACGVNTMASSAVVD"
BASE COUNT      433 a    597 c    607 g    415 t
ORIGIN
        1 ctcaggcccc gctggccgcg ggctcggtac ccggtgggtc ggtggagcgt ctgttgggtc
       61 cgggccgccg gcttcgccct cgccatggcg ccctggctgc agctcctgtc gctgctgggg
      121 ctgctcccgg gcgcagtggc cgcccccgcc cagccccgag ccgccagctt tcaggcctgg
      181 gggccgccgt ccccggagct gctggcgccc acccgcttcg cgctggagat gttcaaccgc
      241 ggccgggctg cggggacgcg ggccgtgctg ggccttgtgc gcggccgcgt ccgccgggcg
      301 ggccaggggt cgctgtactc cctggaggcc accctggagg agccaccctg caacgacccc
      361 atggtgtgcc ggctccccgt gtccaagaaa accctgctct gcagcttcca agtcctggat
      421 gagctcggaa gacacgtgct gctgcggaag gactgtggcc cagtggacac caaggttcca
      481 ggtgctgggg agcccaagtc agccttcact cagggctcag ccatgatttc ttctctgtcc
      541 caaaaccatc cagacaacag aaacgagact ttcagctcag tcatttccct gttgaatgag
      601 gatcccctgt cccaggactt gcctgtgaag atggcttcaa tcttcaagaa ctttgtcatt
      661 acctataacc ggacatatga gtcaaaggaa gaagcccggt ggcgcctgtc cgtctttgtc
      721 aataacatgg tgcgagcaca gaagatccag gccctggacc gtggcacagc tcagtatgga
      781 gtcaccaagt tcagtgatct cacagaggag gagttccgca ctatctacct gaatactctc
      841 ctgagaaaag agcctggcaa caagatgaag caagccaagt ctgtgggtga cctcgcccca
      901 cctgaatggg actggaggag taagggggct gtcacaaaag tcaaagacca gggcatgtgt
      961 ggctcctgct gggccttctc agtcacaggc aatgtggagg gccagtggtt tctcaaccag
     1021 gggaccctgc tctccctctc tgaacaggag ctcttggact gtgacaagat ggacaaggcc
     1081 tgcatgggcg gcttgccctc caatgcctac tcggccataa agaatttggg agggctggag
     1141 acagaggatg actacagcta ccagggtcac atgcagtcct gcaacttctc agcagagaag
     1201 gccaaggtct acatcaatga ctccgtggag ctgagccaga acgagcagaa gctggcagcc
     1261 tggctggcca agagaggccc aatctccgtg gccatcaatg cctttggcat gcagttttac
     1321 cgccacggga tctcccgccc tctccggccc ctctgcagcc cttggctcat tgaccatgcg
     1381 gtgttgcttg tgggctacgg caaccgctct gacgttccct tttgggccat caagaacagc
     1441 tggggcactg actggggtga gaagggttac tactacttgc atcgcgggtc cggggcctgt
     1501 ggcgtgaaca ccatggccag ctcggcggtg gtggactgaa gaggggcccc cagctcggga
     1561 cctggtgctg atcagagtgg ctgctgcccc agcctgacat gtgtccaggc ccctccccgg
     1621 gaggtacagc tggcagaggg aaaggcactg ggtacctcag ggtgagcaga gggcactggg
     1681 ctggggcaca gcccctgctt ccctgcaccc cattcccacc ctgaagttct gcacctgcac
     1741 ctttgttgaa ttgtggtagc ttaggaggat gtcagggtga agggtggtat cttggcagtt
     1801 gaagctgggg caagaactct gggcttgggt aatgagcagg aagaaaattt tctgatctta
     1861 agcccagctc tgttctgccc ccgctttcct ctgtttgata ctataaattt tctggttccc
     1921 ttggatttag ggatagtgtc cccctccatg tccaggaaac ttgtaaccac ccttttctaa
     1981 cagcaataaa gaggtgtcct tgtcccgaga aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
     2041 aaaaaaaaaa aa
//