LOCUS BC039170 2194 bp mRNA linear HUM 23-NOV-2007 DEFINITION Homo sapiens peptidase M20 domain containing 1, mRNA (cDNA clone MGC:21660 IMAGE:4746689), complete cds. ACCESSION BC039170 VERSION BC039170.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2194) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2194) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (01-NOV-2002) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: James Cleaver, M.D. cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Baylor College of Medicine Human Genome Sequencing Center Center code: BCM-HGSC Web site: http://www.hgsc.bcm.tmc.edu/cdna/ Contact: amg@bcm.tmc.edu Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H., Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati, A.N., Gibbs, R.A. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 28 Row: m Column: 12 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 47458826. FEATURES Location/Qualifiers source 1..2194 /db_xref="H-InvDB:HIT000095549" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:21660 IMAGE:4746689" /tissue_type="Skin, normal" /clone_lib="NCI_CGAP_Skn3" /lab_host="DH10B" /note="Vector: pCMV-SPORT6.1" gene 1..2194 /gene="PM20D1" /gene_synonym="Cps1" /db_xref="GeneID:148811" /db_xref="HGNC:HGNC:26518" CDS 46..1554 /gene="PM20D1" /gene_synonym="Cps1" /codon_start=1 /product="peptidase M20 domain containing 1" /protein_id="AAH39170.1" /db_xref="GeneID:148811" /db_xref="HGNC:HGNC:26518" /translation="MAQRCVCVLALVAMLLLVFPTVSRSMGPRSGEYQRASRIPSQFS KEERVAMKEALKGAIQIPTVTFSSEKSNTTALAEFGKYIRKVFPTVVSTSFIQHEVVE EYSHLFTIQGSDPSLQPYLLMAHFDVVPAPEEGWEVPPFSGLERDGVIYGRGTLDDKN SVMALLQALELLLIRKYIPRRSFFISLGHDEESSGTGAQRISALLQSRGVQLAFIVDE GGFILDDFIPNFKKPIALTAVSEKGSMNLMLQVNMTSGHSSAPPKETSIGILAAAVSR LEQTPMPIIFGSGTVVTVLQQLANEFPFPVNIILSNPWLFEPLISRFMERNPLTNAII RTTTALTIFKARVKFNVIPPVAQATVNFRIHPGQTVQEVLELTKNTVADNRVQFHVLS AFDPLPVSPSDDKALGYQLLRQTVQSVFPEVNITAPVTSIGNTDSRFFTNLTTGIYRF YPIYIQPEDFKRIHGVNEKISVQAYETQVKFIFELIQNADTDQEPVSHLHKL" BASE COUNT 570 a 596 c 478 g 550 t ORIGIN 1 gtcagaacta ccccggtagc ctgacagcag gagctcgaga gaagcatggc tcagcggtgc 61 gtttgcgtgc tggccctggt ggctatgctg ctcctagttt tccctaccgt ctccagatcg 121 atgggcccga ggagcgggga gtatcaaagg gcgtcgcgaa tcccttctca gttcagcaaa 181 gaggaacgcg tcgcgatgaa agaggcactg aaaggtgcca tccagattcc aacagtgact 241 tttagctctg agaagtccaa tactacagcc ctggctgagt tcggaaaata cattcgtaaa 301 gtctttccta cagtggtcag caccagcttt atccagcatg aagtcgtgga agagtatagc 361 cacctgttca ctatccaagg ctcggacccc agcttgcagc cctacctgct gatggctcac 421 tttgatgtgg tgcctgcccc tgaagaaggc tgggaggtgc ccccattctc tgggttggag 481 cgtgatggcg tcatctatgg tcggggcaca ctggacgaca agaactctgt gatggcatta 541 ctgcaggcct tggagctcct gctgatcagg aagtacatcc cccgaagatc tttcttcatt 601 tctctgggcc atgatgagga gtcatcaggg acaggggctc agaggatctc agccctgcta 661 cagtcaaggg gcgtccagct agccttcatt gtggacgagg ggggcttcat cttggatgat 721 ttcattccta acttcaagaa gcccatcgcc ttgactgcag tctcagagaa gggttccatg 781 aacctcatgc tgcaagtaaa catgacttca ggccactctt cagctcctcc aaaggagaca 841 agcattggca tccttgcagc tgctgtcagc cgattggagc agacaccaat gcctatcata 901 tttggaagcg ggacagtggt gactgtattg cagcaactgg caaatgagtt tcccttccct 961 gtcaatataa tcctgagcaa cccatggcta tttgaaccac ttataagcag gtttatggag 1021 agaaatccct taaccaatgc aataatcagg accaccacgg cactcaccat attcaaagca 1081 agggtcaagt tcaatgtcat ccccccagtg gcccaggcca cagtcaactt ccggattcac 1141 cctggacaga cagtccaaga ggtcctagaa ctcacgaaga acactgtggc tgataacaga 1201 gtccagttcc atgtgttgag tgcctttgac cccctccccg tcagcccttc tgatgacaag 1261 gccttgggct accagctgct ccgccagacc gtacagtccg tcttcccgga agtcaatatt 1321 actgccccag ttacttctat tggcaacaca gacagccgat tctttacaaa cctcaccact 1381 ggcatctaca ggttctaccc catctacata cagcctgaag acttcaaacg catccatgga 1441 gtcaacgaga aaatctcagt ccaagcctat gagacccaag tgaaattcat ctttgagttg 1501 attcagaatg ctgacacaga ccaggagcca gtttctcacc tgcacaaact gtgaggtcaa 1561 ggggcctgct gggttaggca tgcccgaccc cgggacagga ctaacccaag ggggaaagct 1621 agtgttgatg aaacttttga tcaaaaccac attgtaaaac attgcccatc tgtcttgctc 1681 actcttaaac tctcccaaga acaaggccgg ggtaaggtaa agtcagcaga aatctggctt 1741 ctcccttcct cccgacatct gcatcccttg atccactggc atttgctgcc ctcttgtccc 1801 ttatctgtct tatgctggtt atttcactgc ttcaccttcc aggcttgact taacaaatgt 1861 agatttgaga aatctcaacc agttgttaac tgataggagt ctttaattta gggcactctt 1921 gctgggatgc tttctccaga gcttatatat ttcttcttac tagaactttc ttcccccttt 1981 tattcccctc tcttcttgga ctcatgagct gtctcttcat ctctcctctc tctcctgcat 2041 ctctcccctt actcttcaat ttattctact tctggacctg gacttaccca aactgtgata 2101 ctaccataat tgtcaccata atcagtcaaa taaagtgatc tgtgcatcaa aaaaaaaaaa 2161 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaa //