LOCUS       BC049206                2926 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens cathepsin O, mRNA (cDNA clone MGC:51921
            IMAGE:5189657), complete cds.
ACCESSION   BC049206
VERSION     BC049206.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2926)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2926)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (21-MAR-2003) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Life Technologies, Inc.
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 93 Row: h Column: 11.
FEATURES             Location/Qualifiers
     source          1..2926
                     /db_xref="H-InvDB:HIT000053360"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:51921 IMAGE:5189657"
                     /tissue_type="Colon, Kidney, Stomach, adult, whole pooled"
                     /clone_lib="NIH_MGC_116"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..2926
                     /gene="CTSO"
                     /db_xref="GeneID:1519"
                     /db_xref="HGNC:HGNC:2542"
                     /db_xref="MIM:600550"
     CDS             3..968
                     /gene="CTSO"
                     /codon_start=1
                     /product="cathepsin O"
                     /protein_id="AAH49206.1"
                     /db_xref="GeneID:1519"
                     /db_xref="HGNC:HGNC:2542"
                     /db_xref="MIM:600550"
                     /translation="MDVRALPWLPWLLWLLCRGGGDADSRAPFTPTWPRSREREAAAF
                     RESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPRYSAEVHMS
                     IPNVSLPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQV
                     IDCSYNNYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHYFSGSHSGFSIKG
                     YSAYDFSDQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHCSSGEANHAVLITGF
                     DKTGSTPYWIVRNSWGSSWGVDGYAHVKMGSNVCGIADSVSSIFV"
BASE COUNT          874 a          574 c          561 g          917 t
ORIGIN      
        1 caatggacgt gcgggcgctg ccgtggctgc cgtggctgct gtggctgctg tgccggggcg
       61 gcggcgatgc ggactcccgc gcccccttca ccccgacctg gccgcggagc cgcgagcgtg
      121 aagccgccgc cttccgggaa agtcttaata gacatcgata cttgaattct ttatttccca
      181 gtgaaaactc caccgccttc tatggaataa atcagttttc ctatttgttt cctgaagagt
      241 ttaaagccat ttatttaaga agcaaacctt ccaagtttcc cagatactca gcagaagtac
      301 atatgtccat ccccaatgtg tctttgccgt taagatttga ctggagggac aagcaggttg
      361 tgacacaagt gagaaaccag cagatgtgtg gaggatgctg ggccttcagc gtggtggggg
      421 cagtggaatc tgcttatgca ataaagggga agcccctgga agacctaagt gtccagcagg
      481 tcattgactg ttcgtataat aattatggct gcaatggagg ctctactctc aatgctttga
      541 actggttaaa caagatgcaa gtaaaactgg tgaaagattc agaatatcct tttaaagcac
      601 aaaatggtct gtgccattac ttttctggtt cacattctgg attttcaatc aaaggttatt
      661 ctgcatatga cttcagtgac caagaagatg aaatggcaaa agcacttctt acctttggcc
      721 ctttggtagt catagtagat gcagtgagct ggcaagatta tctgggaggc attatacagc
      781 atcactgctc tagtggagaa gcaaatcatg cagttctcat aactgggttt gataaaacag
      841 gaagcactcc atattggatt gtgcggaatt cctggggaag ttcttgggga gtagatggtt
      901 atgcccatgt caaaatggga agtaatgttt gtggtattgc agattccgtt tcttctatat
      961 ttgtgtgaca tgttgggcag atcaagagac agctacaaaa atgaaggttt tcataatgca
     1021 atgtaacata gtacttcaaa gtattattca acttcaagtt tcagcaacta cctacaaaag
     1081 attctaaggc ctagtagtat ttaaactaag tttcagaatg ttcccttctt gtagagagat
     1141 ggacaaccaa agtcagtggg acaaactcca gcacagaagc ctgcgaggaa gcctatggaa
     1201 tagtttcctg tcctgagacg aaattcagat taggagatat tttaggcccc tgcaactggg
     1261 gaaggctact gtttgttttt gtttgcttat tatttatttg tttgtttatt gtgagatatt
     1321 tcaggtggga tcaaagaggt cataagaatt tattttcttt tgtggggtgt aactactagc
     1381 tttagattac ccctatacac aagaatggcc aacctaaaat tatgtgtgtc ttgtacagtt
     1441 agttatatta gcagccctct gagatggcgt atctatcgga aggatttcaa acaccaattg
     1501 ctttacctga acaaatggtg cttacccttt gaacagcaga gtgaccacgt agaaggaagg
     1561 aaaagggcaa aatcgcttca gttaaactga aattaaatga acaataaggc aactatataa
     1621 gtaacttcta gtagcattgc ctgagagaca aattattgtt tgataatttt cattgtgaat
     1681 aggaatccaa tagatcatat tgcttacttt gttcttttta tactatagaa taatattttg
     1741 ttctctagta tatcaaaata ccaaaatatt atctcatatt ttctccctct ttctcttact
     1801 cttaccaagt tttcctggtg gcttggcttc cctgactaaa gaattaagtc tcatttttac
     1861 tttccatttc tattttctta ccacttggtt ggctcccttt gtctctgtac ctttaccaac
     1921 attaggatct cacctctttc ttcctccctt aattcataag caccactcct atcaaagtcc
     1981 catctcttaa ccctgggtat caaacaaact gtgagttttc cagaatctgt ttcccagttt
     2041 tcccctcagc tttcctggtc tcccatctga actgcttctt tgtgcacctc ttgttctttc
     2101 tcttggctcc cagtcttgat tcctgtgatc actcttgcat cactaattgc acaagtgatt
     2161 tcaggtgcaa ttctgattag cctgcgtcca cacagtgatc gatgatccta tgtgcctaga
     2221 aaggacactg tgtgctgctc atgacctgca acaggaaaaa agccacttct tgttagcagt
     2281 gtaagaacct tagagcaaag gagttgacct tctgattgaa tataagcaca accatattaa
     2341 atgaatcaat acaagaaaat tatttctgat actatgtatg tacatatttc ttctctaaaa
     2401 tgtatcattc ttttctaatg tatatgatct aacaaaaatg aaacatgaaa tgcagtagca
     2461 accactaaaa aaaaaaattc aaggacatct aactttttct ctcacttttg ccctttgttt
     2521 atccttccct gtgattagat aaacaaaata aaaaacaaaa tgctgtattt ctcttcttac
     2581 gccagtcaga accaatccga aaagaatgtg tgttgactca ggtttggagt tatttcagga
     2641 agacagatat tgacctttta attgataaat attcttatta tcctggaatg ccagaaaaga
     2701 actatgtcct gcttgtctag tttgtattcg ctgactttct atgtgatata gatgcatttg
     2761 taatactctt tttcaagtgc taaaggattt ctaaaatttc aaactgatta atatgtttct
     2821 gctgttctgg attttgatga catttacaat aaaacaacct acatttgaaa aaaaaaaaaa
     2881 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa
//