LOCUS BC049206 2926 bp mRNA linear HUM 15-JUL-2006 DEFINITION Homo sapiens cathepsin O, mRNA (cDNA clone MGC:51921 IMAGE:5189657), complete cds. ACCESSION BC049206 VERSION BC049206.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2926) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2926) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (21-MAR-2003) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Life Technologies, Inc. cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 93 Row: h Column: 11. FEATURES Location/Qualifiers source 1..2926 /db_xref="H-InvDB:HIT000053360" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:51921 IMAGE:5189657" /tissue_type="Colon, Kidney, Stomach, adult, whole pooled" /clone_lib="NIH_MGC_116" /lab_host="DH10B" /note="Vector: pCMV-SPORT6" gene 1..2926 /gene="CTSO" /db_xref="GeneID:1519" /db_xref="HGNC:HGNC:2542" /db_xref="MIM:600550" CDS 3..968 /gene="CTSO" /codon_start=1 /product="cathepsin O" /protein_id="AAH49206.1" /db_xref="GeneID:1519" /db_xref="HGNC:HGNC:2542" /db_xref="MIM:600550" /translation="MDVRALPWLPWLLWLLCRGGGDADSRAPFTPTWPRSREREAAAF RESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPRYSAEVHMS IPNVSLPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQV IDCSYNNYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHYFSGSHSGFSIKG YSAYDFSDQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHCSSGEANHAVLITGF DKTGSTPYWIVRNSWGSSWGVDGYAHVKMGSNVCGIADSVSSIFV" BASE COUNT 874 a 574 c 561 g 917 t ORIGIN 1 caatggacgt gcgggcgctg ccgtggctgc cgtggctgct gtggctgctg tgccggggcg 61 gcggcgatgc ggactcccgc gcccccttca ccccgacctg gccgcggagc cgcgagcgtg 121 aagccgccgc cttccgggaa agtcttaata gacatcgata cttgaattct ttatttccca 181 gtgaaaactc caccgccttc tatggaataa atcagttttc ctatttgttt cctgaagagt 241 ttaaagccat ttatttaaga agcaaacctt ccaagtttcc cagatactca gcagaagtac 301 atatgtccat ccccaatgtg tctttgccgt taagatttga ctggagggac aagcaggttg 361 tgacacaagt gagaaaccag cagatgtgtg gaggatgctg ggccttcagc gtggtggggg 421 cagtggaatc tgcttatgca ataaagggga agcccctgga agacctaagt gtccagcagg 481 tcattgactg ttcgtataat aattatggct gcaatggagg ctctactctc aatgctttga 541 actggttaaa caagatgcaa gtaaaactgg tgaaagattc agaatatcct tttaaagcac 601 aaaatggtct gtgccattac ttttctggtt cacattctgg attttcaatc aaaggttatt 661 ctgcatatga cttcagtgac caagaagatg aaatggcaaa agcacttctt acctttggcc 721 ctttggtagt catagtagat gcagtgagct ggcaagatta tctgggaggc attatacagc 781 atcactgctc tagtggagaa gcaaatcatg cagttctcat aactgggttt gataaaacag 841 gaagcactcc atattggatt gtgcggaatt cctggggaag ttcttgggga gtagatggtt 901 atgcccatgt caaaatggga agtaatgttt gtggtattgc agattccgtt tcttctatat 961 ttgtgtgaca tgttgggcag atcaagagac agctacaaaa atgaaggttt tcataatgca 1021 atgtaacata gtacttcaaa gtattattca acttcaagtt tcagcaacta cctacaaaag 1081 attctaaggc ctagtagtat ttaaactaag tttcagaatg ttcccttctt gtagagagat 1141 ggacaaccaa agtcagtggg acaaactcca gcacagaagc ctgcgaggaa gcctatggaa 1201 tagtttcctg tcctgagacg aaattcagat taggagatat tttaggcccc tgcaactggg 1261 gaaggctact gtttgttttt gtttgcttat tatttatttg tttgtttatt gtgagatatt 1321 tcaggtggga tcaaagaggt cataagaatt tattttcttt tgtggggtgt aactactagc 1381 tttagattac ccctatacac aagaatggcc aacctaaaat tatgtgtgtc ttgtacagtt 1441 agttatatta gcagccctct gagatggcgt atctatcgga aggatttcaa acaccaattg 1501 ctttacctga acaaatggtg cttacccttt gaacagcaga gtgaccacgt agaaggaagg 1561 aaaagggcaa aatcgcttca gttaaactga aattaaatga acaataaggc aactatataa 1621 gtaacttcta gtagcattgc ctgagagaca aattattgtt tgataatttt cattgtgaat 1681 aggaatccaa tagatcatat tgcttacttt gttcttttta tactatagaa taatattttg 1741 ttctctagta tatcaaaata ccaaaatatt atctcatatt ttctccctct ttctcttact 1801 cttaccaagt tttcctggtg gcttggcttc cctgactaaa gaattaagtc tcatttttac 1861 tttccatttc tattttctta ccacttggtt ggctcccttt gtctctgtac ctttaccaac 1921 attaggatct cacctctttc ttcctccctt aattcataag caccactcct atcaaagtcc 1981 catctcttaa ccctgggtat caaacaaact gtgagttttc cagaatctgt ttcccagttt 2041 tcccctcagc tttcctggtc tcccatctga actgcttctt tgtgcacctc ttgttctttc 2101 tcttggctcc cagtcttgat tcctgtgatc actcttgcat cactaattgc acaagtgatt 2161 tcaggtgcaa ttctgattag cctgcgtcca cacagtgatc gatgatccta tgtgcctaga 2221 aaggacactg tgtgctgctc atgacctgca acaggaaaaa agccacttct tgttagcagt 2281 gtaagaacct tagagcaaag gagttgacct tctgattgaa tataagcaca accatattaa 2341 atgaatcaat acaagaaaat tatttctgat actatgtatg tacatatttc ttctctaaaa 2401 tgtatcattc ttttctaatg tatatgatct aacaaaaatg aaacatgaaa tgcagtagca 2461 accactaaaa aaaaaaattc aaggacatct aactttttct ctcacttttg ccctttgttt 2521 atccttccct gtgattagat aaacaaaata aaaaacaaaa tgctgtattt ctcttcttac 2581 gccagtcaga accaatccga aaagaatgtg tgttgactca ggtttggagt tatttcagga 2641 agacagatat tgacctttta attgataaat attcttatta tcctggaatg ccagaaaaga 2701 actatgtcct gcttgtctag tttgtattcg ctgactttct atgtgatata gatgcatttg 2761 taatactctt tttcaagtgc taaaggattt ctaaaatttc aaactgatta atatgtttct 2821 gctgttctgg attttgatga catttacaat aaaacaacct acatttgaaa aaaaaaaaaa 2881 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa //