LOCUS       BC032310                1437 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens O-sialoglycoprotein endopeptidase, mRNA (cDNA clone
            MGC:40286 IMAGE:5211591), complete cds.
ACCESSION   BC032310
VERSION     BC032310.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1437)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 1437)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (06-JUN-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Life Technologies, Inc.
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D., McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 64 Row: k Column: 3
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 50726965.
FEATURES             Location/Qualifiers
     source          1..1437
                     /db_xref="H-InvDB:HIT000041480"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:40286 IMAGE:5211591"
                     /tissue_type="Blood, adult leukocytes"
                     /clone_lib="NIH_MGC_118"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..1437
                     /gene="OSGEP"
                     /gene_synonym="FLJ20411"
                     /gene_synonym="GCPL1"
                     /gene_synonym="KAE1"
                     /gene_synonym="OSGEP1"
                     /gene_synonym="PRSMG1"
                     /db_xref="GeneID:55644"
                     /db_xref="HGNC:HGNC:18028"
                     /db_xref="MIM:610107"
     CDS             159..1166
                     /gene="OSGEP"
                     /gene_synonym="FLJ20411"
                     /gene_synonym="GCPL1"
                     /gene_synonym="KAE1"
                     /gene_synonym="OSGEP1"
                     /gene_synonym="PRSMG1"
                     /codon_start=1
                     /product="O-sialoglycoprotein endopeptidase"
                     /protein_id="AAH32310.1"
                     /db_xref="GeneID:55644"
                     /db_xref="HGNC:HGNC:18028"
                     /db_xref="MIM:610107"
                     /translation="MPAVLGFEGSANKIGVGVVRDGKVLANPRRTYVTPPGTGFLPGD
                     TARHHRAVILDLLQEALTESGLTSQDIDCIAYTKGPGMGAPLVSVAVVARTVAQLWNK
                     PLVGVNHCIGHIEMGRLITGATSPTVLYVSGGNTQVIAYSEHRYRIFGETIDIAVGNC
                     LDRFARVLKISNDPSPGYNIEQMAKRGKKLVELPYTVKGMDVSFSGILSFIEDVAHRM
                     LATGECTPEDLCFSLQETVFAMLVEITERAMAHCGSQEALIVGGVGCNVRLQEMMATM
                     CQERGARLFATDERFCIDNGAMIAQAGWEMFRAGHRTPLSDSGVTQRYRTDEVEVTWR
                     D"
BASE COUNT          394 a          324 c          400 g          319 t
ORIGIN      
        1 gctcgcggga gttccgcggt gtctttccca gtgctatctg caggctggcc agcttcctct
       61 gcgctccgga aagctgcggc ccagcgcgga ctagtgagga cctccacagc tcctgacatt
      121 gccaggagtc ctgtcggcgt tttctcccag cctccgccat gccggcggtg ctgggttttg
      181 aaggcagcgc caataagatt ggcgtgggcg tggtgcggga tggcaaggtg ctggcgaacc
      241 cgcggcggac ttacgtcacg cctcctggca caggattcct tccaggtgat acagccaggc
      301 atcaccgagc tgttatccta gacctgctgc aggaggcact aacagagtct ggattaacct
      361 cccaggatat cgactgcatt gcatacacca agggccctgg catgggtgcc ccactggttt
      421 ctgtggctgt tgtggcccgt actgtggccc aactgtggaa taagccattg gtgggtgtga
      481 accactgtat aggccacatt gagatgggcc gcctcatcac tggagccacc agcccaaccg
      541 tgttgtatgt gagtggagga aatacgcagg tgattgcata ctcggaacat cgttaccgta
      601 tctttgggga aaccatcgat attgcagtgg gtaattgtct ggatcgtttt gctcgagtgc
      661 tgaagatttc taacgaccca agtccaggat acaacattga acagatggca aagcgaggca
      721 agaagctagt tgagctgcca tacactgtaa aggggatgga cgtctcattc tcagggatcc
      781 tgtctttcat tgaggatgta gcccatcgga tgctggccac aggcgagtgt actcctgagg
      841 atctgtgttt ctccctgcag gaaactgtgt ttgcaatgct ggtagagatc acagagcgag
      901 ccatggcaca ttgtggctcc caggaggccc tcattgtggg aggagtgggg tgtaatgtga
      961 ggctacagga gatgatggca acaatgtgcc aggaacgtgg agcccggctt tttgctacag
     1021 atgagagatt ctgtattgac aatggagcga tgatagccca ggctggctgg gagatgtttc
     1081 gggctggaca caggacccca ctcagtgatt ctggggttac acagaggtat cggacagatg
     1141 aagtagaggt gacctggagg gactaataag atcaacagaa tcagagtaga tagttcctta
     1201 atcggaaccc aaaggacccc gtgcctcaat ctctatcctg atgtcatggg agtcctagca
     1261 aagctataga ctccaagcaa ggcttggggt cctttatgga accccaggat gactcagcaa
     1321 taaaatattt ttggtttttt ggttttgtaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
     1381 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaa
//