LOCUS       BC036451                2000 bp    mRNA    linear   HUM 17-JUL-2006
DEFINITION  Homo sapiens cathepsin F, mRNA (cDNA clone MGC:33063
            IMAGE:4830274), complete cds.
ACCESSION   BC036451
VERSION     BC036451.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2000)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2000)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (09-AUG-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Miklos Palkovits, M.D., Ph.D.
            cDNA Library Preparation: Michael J. Brownstein (NHGRI) &  Shiraki
            Toshiyuki and Piero Carninci (RIKEN)
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
            Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 47 Row: c Column: 21
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 6042195.
FEATURES             Location/Qualifiers
     source          1..2000
                     /db_xref="H-InvDB:HIT000051703"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:33063 IMAGE:4830274"
                     /tissue_type="Testis"
                     /clone_lib="NIH_MGC_97"
                     /lab_host="DH10B"
                     /note="Vector: pBluescriptR"
     gene            1..2000
                     /gene="CTSF"
                     /gene_synonym="CATSF"
                     /db_xref="GeneID:8722"
                     /db_xref="HGNC:HGNC:2531"
                     /db_xref="MIM:603539"
     CDS             66..1520
                     /gene="CTSF"
                     /gene_synonym="CATSF"
                     /codon_start=1
                     /product="cathepsin F"
                     /protein_id="AAH36451.1"
                     /db_xref="GeneID:8722"
                     /db_xref="HGNC:HGNC:2531"
                     /db_xref="MIM:603539"
                     /translation="MAPWLQLLSLLGLLPGAVAAPAQPRAASFQAWGPPSPELLAPTR
                     FALEMFNRGRAAGTRAVLGLVRGRVRRAGQGSLYSLEATLEEPPCNDPMVCRLPVSKK
                     TLLCSFQVLDELGRHVLLRKDCGPVDTKVPGAGEPKSAFTQGSAMISSLSQNHPDNRN
                     ETFSSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRA
                     QKIQALDRGTAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWD
                     WRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACM
                     GGLPSNAYSAIKNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVELSQNEQKLAA
                     WLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDVPFWAIK
                     NSWGTDWGEKGYYYLHRGSGACGVNTMASSAVVD"
BASE COUNT          405 a          583 c          599 g          413 t
ORIGIN      
        1 agcgtgggta cccggtgggt cggtggagcg tctgttgggt ccgggccgcc ggcttcgccc
       61 tcgccatggc gccctggctg cagctcctgt cgctgctggg gctgctcccg ggcgcagtgg
      121 ccgcccccgc ccagccccga gccgccagct ttcaggcctg ggggccgccg tccccggagc
      181 tgctggcgcc cacccgcttc gcgctggaga tgttcaaccg cggccgggct gcggggacgc
      241 gggccgtgct gggccttgtg cgcggccgcg tccgccgggc gggccagggg tcgctgtact
      301 ccctggaggc caccctggag gagccaccct gcaacgaccc catggtgtgc cggctccccg
      361 tgtccaagaa aaccctgctc tgcagcttcc aagtcctgga tgagctcgga agacacgtgc
      421 tgctgcggaa ggactgtggc ccagtggaca ccaaggttcc aggtgctggg gagcccaagt
      481 cagccttcac tcagggctca gccatgattt cttctctgtc ccaaaaccat ccagacaaca
      541 gaaacgagac tttcagctca gtcatttccc tgttgaatga ggatcccctg tcccaggact
      601 tgcctgtgaa gatggcttca atcttcaaga actttgtcat tacctataac cggacatatg
      661 agtcaaagga agaagcccgg tggcgcctgt ccgtctttgt caataacatg gtgcgagcac
      721 agaagatcca ggccctggac cgtggcacag ctcagtatgg agtcaccaag ttcagtgatc
      781 tcacagagga ggagttccgc actatctacc tgaatactct cctgagaaaa gagcctggca
      841 acaagatgaa gcaagccaag tctgtgggtg acctcgcccc acctgaatgg gactggagga
      901 gtaagggggc tgtcacaaaa gtcaaagacc agggcatgtg tggctcctgc tgggccttct
      961 cagtcacagg caatgtggag ggccagtggt ttctcaacca ggggaccctg ctctccctct
     1021 ctgaacagga gctcttggac tgtgacaaga tggacaaggc ctgcatgggc ggcttgccct
     1081 ccaatgccta ctcggccata aagaatttgg gagggctgga gacagaggat gactacagct
     1141 accagggtca catgcagtcc tgcaacttct cagcagagaa ggccaaggtc tacatcaatg
     1201 actccgtgga gctgagccag aacgagcaga agctggcagc ctggctggcc aagagaggcc
     1261 caatctccgt ggccatcaat gcctttggca tgcagtttta ccgccacggg atctcccgcc
     1321 ctctccggcc cctctgcagc ccttggctca ttgaccatgc ggtgttgctt gtgggctacg
     1381 gcaaccgctc tgacgttccc ttttgggcca tcaagaacag ctggggcact gactggggtg
     1441 agaagggtta ctactacttg catcgcgggt ccggggcctg tggcgtgaac accatggcca
     1501 gctcggcggt ggtggactga agaggggccc ccagctcggg acctggtgct gatcagagtg
     1561 gctgctgccc cagcctgaca tgtgtccagg cccctccccg ggaggtacag ctggcagagg
     1621 gaaaggcact gggtacctca gggtgagcag agggcactgg gctggggcac agcccctgct
     1681 tccctgcacc ccattcccac cctgaagttc tgcacctgca cctttgttga attgtggtag
     1741 cttaggagga tgtcagggtg aagggtggta tcttggcagt tgaagctggg gcaagaactc
     1801 tgggcttggg taatgagcag gaagaaaatt ttctgatctt aagcccagct ctgttctgcc
     1861 cccgctttcc tctgtttgat actataaatt ttctggttcc cttggattta gggatagtgt
     1921 ccccctccat gtccaggaaa cttgtaacca cccttttcta acagcaataa agaggtgtcc
     1981 ttgtaaaaaa aaaaaaaaaa
//