LOCUS       BC011904                1908 bp    mRNA    linear   HUM 03-NOV-2006
DEFINITION  Homo sapiens O-sialoglycoprotein endopeptidase-like 1, mRNA (cDNA
            clone MGC:20293 IMAGE:4121450), complete cds.
ACCESSION   BC011904
VERSION     BC011904.2
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1908)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 1908)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (30-JUL-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Dec 19, 2003 this sequence version replaced BC011904.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 28 Row: i Column: 22
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 11641264.
FEATURES             Location/Qualifiers
     source          1..1908
                     /db_xref="H-InvDB:HIT000035530"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:20293 IMAGE:4121450"
                     /tissue_type="Muscle, rhabdomyosarcoma"
                     /clone_lib="NIH_MGC_17"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..1908
                     /gene="OSGEPL1"
                     /db_xref="GeneID:64172"
                     /db_xref="HGNC:HGNC:23075"
     CDS             104..1348
                     /gene="OSGEPL1"
                     /codon_start=1
                     /product="O-sialoglycoprotein endopeptidase-like 1"
                     /protein_id="AAH11904.1"
                     /db_xref="GeneID:64172"
                     /db_xref="HGNC:HGNC:23075"
                     /translation="MLILTKTAGVFFKPSKRKVYEFLRSFNFHPGTLFLHKIVLGIET
                     SCDDTAAAVVDETGNVLGEAIHSQTEVHLKTGGIVPPAAQQLHRENIQRIVQEALSAS
                     GVSPSDLSAIATTIKPGLALSLGVGLSFSLQLVGQLKKPFIPIHHMEAHALTIRLTNK
                     VEFPFLVLLISGGHCLLALVQGVSDFLLLGKSLDIAPGDMLDKVARRLSLIKHPECST
                     MSGGKAIEHLAKQGNRFHFDIKPPLHHAKNCDFSFTGLQHVTDKIIMKKEKEEGIEKG
                     QILSSAADIAATVQHTMACHLVKRTHRAILFCKQRDLLPQNNAVLVASGGVASNFYIR
                     RALEILTNATQCTLLCPPPRLCTDNGIMIAWNGIERLRAGLGILHDIEGIRYEPKCPL
                     GVDISKEVGEASIKVPQLKMEI"
BASE COUNT          620 a          349 c          375 g          564 t
ORIGIN      
        1 ctttctcctg cagcgataag ggcagtcgac agtctttagt agggaaagga gacaagtgct
       61 agctactgcc gcccaagtgg aaggaattat ctatagagta agtatgctaa tcttgactaa
      121 gactgcagga gtttttttta aaccatcaaa aaggaaagtt tatgaatttt taagaagttt
      181 taattttcat cctggaacac tatttcttca taaaatagta ttgggaattg aaactagttg
      241 tgatgataca gcagctgctg tggtggatga aactggaaat gtgttgggag aagcaataca
      301 ttcccaaact gaagttcatt taaaaacagg tgggattgtt cctccagcag ctcaacagct
      361 tcacagagaa aatattcaac gaatagtaca agaagctctt tctgccagtg gagtctctcc
      421 aagtgacctc tcagcaattg caactaccat aaaaccagga cttgctttaa gcctgggagt
      481 gggcttatca tttagcttac agctggtagg acagttaaaa aagccattca ttcccattca
      541 tcatatggag gctcatgcac ttactattag gttgaccaat aaagtagaat ttcctttttt
      601 agttcttttg atttctggag gtcactgtct gttggcatta gttcaaggag tttcagattt
      661 tctgcttctt ggaaagtctt tggacatagc accaggtgac atgcttgaca aggtggcaag
      721 aagactttct ttaataaaac atccagagtg ctccaccatg agtggtggga aagccataga
      781 acatttggcc aaacaaggaa atagatttca ttttgacatc aaacctccct tgcatcatgc
      841 taaaaattgt gatttttctt ttactggact tcaacacgtt actgataaaa taataatgaa
      901 aaaggaaaaa gaggaaggta ttgagaaggg gcaaatcctg tcttcagcag cagacattgc
      961 tgccacagta cagcacacaa tggcatgtca tcttgtgaaa agaacacatc gggctattct
     1021 gttttgtaag cagagagact tgttacctca aaataatgca gtactggttg catctggtgg
     1081 tgtcgcaagt aacttctata tccgcagagc tctggaaatt ttaacaaacg caacacagtg
     1141 cactttgttg tgtcctcctc ccagactatg cactgataat ggcattatga ttgcatggaa
     1201 tggtattgaa agactacgtg ctggcttggg cattttacat gacatagaag gcatccgcta
     1261 tgaaccaaaa tgtcctcttg gagtagacat atcaaaagaa gttggagaag cttccataaa
     1321 agtaccacaa ttaaaaatgg agatatgatt tctgctgttc aaaaaagtcc ctaaagagac
     1381 aggctcttgc catgttgccc gggctggtcc tgaactgctg aattcaagtg atcctcccac
     1441 cttggcctcc agaagtgctg ggattatggg tgtgagccac catgcctagc caaaatgttt
     1501 cttaaggtat acattttggg tcttagaaga cttatacatt tgtaatattt attactaaat
     1561 atctcaaagt attacaataa atgttaccat gtgagctact ttgaatcagg cttcttgcac
     1621 accaatttaa aaatgttaac tcttgatata tacactagtt ataccactca tgtcagtcaa
     1681 taaattttaa ggtttaagtg caggcctttg tttacagaaa tcctaatttt ttgaaaccat
     1741 aactctgacc tgacactaaa ttcctgtaga catgctaagg aaaatctgct tagtatcgag
     1801 atcaagaact tccattcaaa aagattattc agttatgtta tttgcatatt accattgtta
     1861 aaaataaaaa aatttttaaa agatgaaaaa aaaaaaaaaa aaaaaaaa
//