LOCUS BC036451 2000 bp mRNA linear HUM 17-JUL-2006 DEFINITION Homo sapiens cathepsin F, mRNA (cDNA clone MGC:33063 IMAGE:4830274), complete cds. ACCESSION BC036451 VERSION BC036451.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2000) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2000) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (09-AUG-2002) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Miklos Palkovits, M.D., Ph.D. cDNA Library Preparation: Michael J. Brownstein (NHGRI) & Shiraki Toshiyuki and Piero Carninci (RIKEN) cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Institute for Systems Biology http://www.systemsbiology.org contact: amadan@systemsbiology.org Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 47 Row: c Column: 21 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 6042195. FEATURES Location/Qualifiers source 1..2000 /db_xref="H-InvDB:HIT000051703" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:33063 IMAGE:4830274" /tissue_type="Testis" /clone_lib="NIH_MGC_97" /lab_host="DH10B" /note="Vector: pBluescriptR" gene 1..2000 /gene="CTSF" /gene_synonym="CATSF" /db_xref="GeneID:8722" /db_xref="HGNC:HGNC:2531" /db_xref="MIM:603539" CDS 66..1520 /gene="CTSF" /gene_synonym="CATSF" /codon_start=1 /product="cathepsin F" /protein_id="AAH36451.1" /db_xref="GeneID:8722" /db_xref="HGNC:HGNC:2531" /db_xref="MIM:603539" /translation="MAPWLQLLSLLGLLPGAVAAPAQPRAASFQAWGPPSPELLAPTR FALEMFNRGRAAGTRAVLGLVRGRVRRAGQGSLYSLEATLEEPPCNDPMVCRLPVSKK TLLCSFQVLDELGRHVLLRKDCGPVDTKVPGAGEPKSAFTQGSAMISSLSQNHPDNRN ETFSSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRA QKIQALDRGTAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWD WRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACM GGLPSNAYSAIKNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVELSQNEQKLAA WLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDVPFWAIK NSWGTDWGEKGYYYLHRGSGACGVNTMASSAVVD" BASE COUNT 405 a 583 c 599 g 413 t ORIGIN 1 agcgtgggta cccggtgggt cggtggagcg tctgttgggt ccgggccgcc ggcttcgccc 61 tcgccatggc gccctggctg cagctcctgt cgctgctggg gctgctcccg ggcgcagtgg 121 ccgcccccgc ccagccccga gccgccagct ttcaggcctg ggggccgccg tccccggagc 181 tgctggcgcc cacccgcttc gcgctggaga tgttcaaccg cggccgggct gcggggacgc 241 gggccgtgct gggccttgtg cgcggccgcg tccgccgggc gggccagggg tcgctgtact 301 ccctggaggc caccctggag gagccaccct gcaacgaccc catggtgtgc cggctccccg 361 tgtccaagaa aaccctgctc tgcagcttcc aagtcctgga tgagctcgga agacacgtgc 421 tgctgcggaa ggactgtggc ccagtggaca ccaaggttcc aggtgctggg gagcccaagt 481 cagccttcac tcagggctca gccatgattt cttctctgtc ccaaaaccat ccagacaaca 541 gaaacgagac tttcagctca gtcatttccc tgttgaatga ggatcccctg tcccaggact 601 tgcctgtgaa gatggcttca atcttcaaga actttgtcat tacctataac cggacatatg 661 agtcaaagga agaagcccgg tggcgcctgt ccgtctttgt caataacatg gtgcgagcac 721 agaagatcca ggccctggac cgtggcacag ctcagtatgg agtcaccaag ttcagtgatc 781 tcacagagga ggagttccgc actatctacc tgaatactct cctgagaaaa gagcctggca 841 acaagatgaa gcaagccaag tctgtgggtg acctcgcccc acctgaatgg gactggagga 901 gtaagggggc tgtcacaaaa gtcaaagacc agggcatgtg tggctcctgc tgggccttct 961 cagtcacagg caatgtggag ggccagtggt ttctcaacca ggggaccctg ctctccctct 1021 ctgaacagga gctcttggac tgtgacaaga tggacaaggc ctgcatgggc ggcttgccct 1081 ccaatgccta ctcggccata aagaatttgg gagggctgga gacagaggat gactacagct 1141 accagggtca catgcagtcc tgcaacttct cagcagagaa ggccaaggtc tacatcaatg 1201 actccgtgga gctgagccag aacgagcaga agctggcagc ctggctggcc aagagaggcc 1261 caatctccgt ggccatcaat gcctttggca tgcagtttta ccgccacggg atctcccgcc 1321 ctctccggcc cctctgcagc ccttggctca ttgaccatgc ggtgttgctt gtgggctacg 1381 gcaaccgctc tgacgttccc ttttgggcca tcaagaacag ctggggcact gactggggtg 1441 agaagggtta ctactacttg catcgcgggt ccggggcctg tggcgtgaac accatggcca 1501 gctcggcggt ggtggactga agaggggccc ccagctcggg acctggtgct gatcagagtg 1561 gctgctgccc cagcctgaca tgtgtccagg cccctccccg ggaggtacag ctggcagagg 1621 gaaaggcact gggtacctca gggtgagcag agggcactgg gctggggcac agcccctgct 1681 tccctgcacc ccattcccac cctgaagttc tgcacctgca cctttgttga attgtggtag 1741 cttaggagga tgtcagggtg aagggtggta tcttggcagt tgaagctggg gcaagaactc 1801 tgggcttggg taatgagcag gaagaaaatt ttctgatctt aagcccagct ctgttctgcc 1861 cccgctttcc tctgtttgat actataaatt ttctggttcc cttggattta gggatagtgt 1921 ccccctccat gtccaggaaa cttgtaacca cccttttcta acagcaataa agaggtgtcc 1981 ttgtaaaaaa aaaaaaaaaa //