LOCUS BC032310 1437 bp mRNA linear HUM 15-JUL-2006 DEFINITION Homo sapiens O-sialoglycoprotein endopeptidase, mRNA (cDNA clone MGC:40286 IMAGE:5211591), complete cds. ACCESSION BC032310 VERSION BC032310.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1437) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 1437) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (06-JUN-2002) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Life Technologies, Inc. cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 64 Row: k Column: 3 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 50726965. FEATURES Location/Qualifiers source 1..1437 /db_xref="H-InvDB:HIT000041480" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:40286 IMAGE:5211591" /tissue_type="Blood, adult leukocytes" /clone_lib="NIH_MGC_118" /lab_host="DH10B" /note="Vector: pCMV-SPORT6" gene 1..1437 /gene="OSGEP" /gene_synonym="FLJ20411" /gene_synonym="GCPL1" /gene_synonym="KAE1" /gene_synonym="OSGEP1" /gene_synonym="PRSMG1" /db_xref="GeneID:55644" /db_xref="HGNC:HGNC:18028" /db_xref="MIM:610107" CDS 159..1166 /gene="OSGEP" /gene_synonym="FLJ20411" /gene_synonym="GCPL1" /gene_synonym="KAE1" /gene_synonym="OSGEP1" /gene_synonym="PRSMG1" /codon_start=1 /product="O-sialoglycoprotein endopeptidase" /protein_id="AAH32310.1" /db_xref="GeneID:55644" /db_xref="HGNC:HGNC:18028" /db_xref="MIM:610107" /translation="MPAVLGFEGSANKIGVGVVRDGKVLANPRRTYVTPPGTGFLPGD TARHHRAVILDLLQEALTESGLTSQDIDCIAYTKGPGMGAPLVSVAVVARTVAQLWNK PLVGVNHCIGHIEMGRLITGATSPTVLYVSGGNTQVIAYSEHRYRIFGETIDIAVGNC LDRFARVLKISNDPSPGYNIEQMAKRGKKLVELPYTVKGMDVSFSGILSFIEDVAHRM LATGECTPEDLCFSLQETVFAMLVEITERAMAHCGSQEALIVGGVGCNVRLQEMMATM CQERGARLFATDERFCIDNGAMIAQAGWEMFRAGHRTPLSDSGVTQRYRTDEVEVTWR D" BASE COUNT 394 a 324 c 400 g 319 t ORIGIN 1 gctcgcggga gttccgcggt gtctttccca gtgctatctg caggctggcc agcttcctct 61 gcgctccgga aagctgcggc ccagcgcgga ctagtgagga cctccacagc tcctgacatt 121 gccaggagtc ctgtcggcgt tttctcccag cctccgccat gccggcggtg ctgggttttg 181 aaggcagcgc caataagatt ggcgtgggcg tggtgcggga tggcaaggtg ctggcgaacc 241 cgcggcggac ttacgtcacg cctcctggca caggattcct tccaggtgat acagccaggc 301 atcaccgagc tgttatccta gacctgctgc aggaggcact aacagagtct ggattaacct 361 cccaggatat cgactgcatt gcatacacca agggccctgg catgggtgcc ccactggttt 421 ctgtggctgt tgtggcccgt actgtggccc aactgtggaa taagccattg gtgggtgtga 481 accactgtat aggccacatt gagatgggcc gcctcatcac tggagccacc agcccaaccg 541 tgttgtatgt gagtggagga aatacgcagg tgattgcata ctcggaacat cgttaccgta 601 tctttgggga aaccatcgat attgcagtgg gtaattgtct ggatcgtttt gctcgagtgc 661 tgaagatttc taacgaccca agtccaggat acaacattga acagatggca aagcgaggca 721 agaagctagt tgagctgcca tacactgtaa aggggatgga cgtctcattc tcagggatcc 781 tgtctttcat tgaggatgta gcccatcgga tgctggccac aggcgagtgt actcctgagg 841 atctgtgttt ctccctgcag gaaactgtgt ttgcaatgct ggtagagatc acagagcgag 901 ccatggcaca ttgtggctcc caggaggccc tcattgtggg aggagtgggg tgtaatgtga 961 ggctacagga gatgatggca acaatgtgcc aggaacgtgg agcccggctt tttgctacag 1021 atgagagatt ctgtattgac aatggagcga tgatagccca ggctggctgg gagatgtttc 1081 gggctggaca caggacccca ctcagtgatt ctggggttac acagaggtat cggacagatg 1141 aagtagaggt gacctggagg gactaataag atcaacagaa tcagagtaga tagttcctta 1201 atcggaaccc aaaggacccc gtgcctcaat ctctatcctg atgtcatggg agtcctagca 1261 aagctataga ctccaagcaa ggcttggggt cctttatgga accccaggat gactcagcaa 1321 taaaatattt ttggtttttt ggttttgtaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1381 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaa //