LOCUS       CR457232                1008 bp    mRNA    linear   HUM 16-OCT-2008
DEFINITION  Homo sapiens full open reading frame cDNA clone RZPDo834E0712D for
            gene OSGEP, O-sialoglycoprotein endopeptidase; complete cds, incl.
            stopcodon.
ACCESSION   CR457232
VERSION     CR457232.1
KEYWORDS    Full ORF shuttle clone, Gateway(TM), complete cds.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1008)
  AUTHORS   Ebert L., Schick M., Neubert P., Schatten R., Henze S., Korn B.
  TITLE     Cloning of human full open reading frames in Gateway(TM) system
            entry vector (pDONR201)
  JOURNAL   Unpublished.
REFERENCE   2  (bases 1 to 1008)
  AUTHORS   Ebert L., Schick M., Neubert P., Schatten R., Henze S., Korn B.
  JOURNAL   Submitted (03-JUN-2004) to the INSDC. RZPD Deutsches
            Ressourcenzentrum fuer Genomforschung GmbH, Im Neuenheimer Feld
            580, D-69120 Heidelberg, Germany
COMMENT     RZPD; RZPDo834E0712D, ORFNo 2153
            www.rzpd.de/cgi-bin/products/cl.cgi?CloneID=RZPDo834E0712D
            RZPDLIB; Human Full ORF Clones Gateway(TM) - RZPD (kan-resist.)
            RZPD LIB No. 834
            www.rzpd.de/cgi-bin/products/showLib.pl.cgi/response?libNo=834
            www.rzpd.de/products/orfclones/
            Contact: Inge Arlart
            RZPD Deutsches Ressourcenzentrum fuer Genomforschung GmbH,
            Heubnerweg 6, D-14059 Berlin, Germany
            Tel: +49 30 32639 100
            Fax: +49 30 32639 111
            www.rzpd.de
            This clone is available from RZPD;
            contact RZPD (customer.service@rzpd.de) for further information.
            
            This CDS clone is a part of a collection of human full length
            expression clones generated by RZPD.
            This CDS has been cloned incl. stopcodon.
            The CDS has been inserted into pDONR201 via a BP Clonase(TM)
            reaction.
            Additional sequence has been added in front of the start codon:
            att..AAAAAA GCA GGC (ATG).
            The last base of the last coding triplett has been changed to T,
            which might lead to an amino acid change at the C terminus of the
            polypeptide.
            The stop codon has been set to TAA followed by
            TTAACCCAGCTTTCTT..att.
            Compared to the reference sequence NM_017807 we did not find any
            amino acid exchanges.
            Clone distribution: http://www.rzpd.de/products/orfclones/
FEATURES             Location/Qualifiers
     source          1..1008
                     /db_xref="H-InvDB:HIT000268082"
                     /organism="Homo sapiens"
                     /lab_host="DH10B"
                     /mol_type="mRNA"
                     /clone_lib="Human Full ORF Clones Gateway(TM) - RZPD"
                     /clone="RZPDo834E0712D"
                     /note="Vector: pDONR201, Site_1: attP1; Site_2: attP2"
                     /db_xref="taxon:9606"
     CDS             1..1008
                     /codon_start=1
                     /gene="OSGEP"
                     /db_xref="GOA:Q9NPF4"
                     /db_xref="H-InvDB:HIT000268082.11"
                     /db_xref="HGNC:HGNC:18028"
                     /db_xref="InterPro:IPR000905"
                     /db_xref="InterPro:IPR017860"
                     /db_xref="InterPro:IPR017861"
                     /db_xref="InterPro:IPR034680"
                     /db_xref="PDB:6GWJ"
                     /db_xref="UniProtKB/Swiss-Prot:Q9NPF4"
                     /protein_id="CAG33513.1"
                     /translation="MPAVLGFEGSANKIGVGVVRDGKVLANPRRTYVTPPGTGFLPGD
                     TARHHRAVILDLLQEALTESGLTSQDIDCIAYTKGPGMGAPLVSVAVVARTVAQLWNK
                     PLVGVNHCIGHIEMGRLITGATSPTVLYVSGGNTQVIAYSEHRYRIFGETIDIAVGNC
                     LDRFARVLKISNDPSPGYNIEQMAKRGKKLVELPYTVKGMDVSFSGILSFIEDVAHRM
                     LATGECTPEDLCFSLQETVFAMLVEITERAMAHCGSQEALIVGGVGCNVRLQEMMATM
                     CQERGARLFATDERFCIDNGAMIAQAGWEMFRAGHRTPLSDSGVTQRYRTDEVEVTWR
                     D"
BASE COUNT          233 a          227 c          315 g          233 t
ORIGIN      
        1 atgccggcgg tgctgggttt tgaaggcagc gccaataaga ttggcgtggg cgtggtgcgg
       61 gatggcaagg tgctggcgaa cccgcggcgg acttacgtca cgcctcctgg cacaggattc
      121 cttccaggtg atacagccag gcatcaccga gctgttatcc tagacctgct gcaggaggca
      181 ctaacagagt ctggattaac ctcccaggat atcgactgca ttgcatacac caagggccct
      241 ggcatgggtg ccccactggt ttctgtggct gttgtggccc gtactgtggc ccaactgtgg
      301 aataagccat tggtgggtgt gaaccactgt ataggccaca ttgagatggg ccgcctcatc
      361 actggagcca ccagcccaac cgtgttgtat gtgagtggag gaaatacgca ggtgattgca
      421 tactcggaac atcgttaccg tatctttggg gaaaccatcg atattgcagt gggtaattgt
      481 ctggatcgtt ttgctcgagt gctgaagatt tctaacgacc caagtccagg atacaacatt
      541 gaacagatgg caaagcgagg caagaagcta gttgagctgc catacactgt aaaggggatg
      601 gacgtctcat tctcagggat cctgtctttc attgaggatg tagcccatcg gatgctggcc
      661 acaggcgagt gtactcctga ggatctgtgt ttctccctgc aggaaactgt gtttgcaatg
      721 ctggtagaga tcacagagcg agccatggca cattgtggct cccaggaggc cctcattgtg
      781 ggaggagtgg ggtgtaatgt gaggctacag gagatgatgg caacaatgtg ccaggaacgt
      841 ggagcccggc tttttgctac agatgagaga ttctgtattg acaatggagc gatgatagcc
      901 caggctggct gggagatgtt tcgggctgga cacaggaccc cactcagtga ttctggggtt
      961 acacagaggt atcggacaga tgaagtagag gtgacctgga gggattaa
//