LOCUS       BC013875                2048 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens matrix metallopeptidase 1 (interstitial collagenase),
            mRNA (cDNA clone MGC:10479 IMAGE:3834572), complete cds.
ACCESSION   BC013875
VERSION     BC013875.2
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2048)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2048)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (07-SEP-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Nov 6, 2003 this sequence version replaced BC013875.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: DCTD/DTP
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
            Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 14 Row: h Column: 19
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 13027798.
FEATURES             Location/Qualifiers
     source          1..2048
                     /db_xref="H-InvDB:HIT000036382"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:10479 IMAGE:3834572"
                     /tissue_type="Ovary, adenocarcinoma"
                     /clone_lib="NIH_MGC_9"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..2048
                     /gene="MMP1"
                     /gene_synonym="CLGN"
                     /db_xref="GeneID:4312"
                     /db_xref="HGNC:HGNC:7155"
                     /db_xref="MIM:120353"
     CDS             43..1452
                     /gene="MMP1"
                     /gene_synonym="CLGN"
                     /codon_start=1
                     /product="matrix metallopeptidase 1 (interstitial
                     collagenase)"
                     /protein_id="AAH13875.1"
                     /db_xref="GeneID:4312"
                     /db_xref="HGNC:HGNC:7155"
                     /db_xref="MIM:120353"
                     /translation="MHSFPPLLLLLFWGVVSHSFPATLETQEQDVDLVQKYLEKYYNL
                     KNDGRQVEKRRNSGPVVEKLKQMQEFFGLKVTGKPDAETLKVMKQPRCGVPDVAQFVL
                     TEGNPRWEQTHLTYRIENYTPDLPRADVDHAIEKAFQLWSNVTPLTFTKVSEGQADIM
                     ISFVRGDHRDNSPFDGPGGNLAHAFQPGPGIGGDAHFDEDERWTNNFREYNLHRVAAH
                     ELGHSLGLSHSTDIGALMYPSYTFSGDVQLAQDDIDGIQAIYGRSQNPVQPIGPQTPK
                     ACDSKLTFDAITTIRGEVMFFKDRFYMRTNPFYPEVELNFISVFWPQLPNGLEAAYEF
                     ADRDEVRFFKGNKYWAVQGQNVLHGYPKDIYSSFGFPRTVKHIDAALSEENTGKTYFF
                     VANKYWRYDEYKRSMDPGYPKMIAHDFPGIGHKVDAVFMKDGFFYFFHGTRQYKFDPK
                     TKRILTLQKANSWFNCRKN"
BASE COUNT      678 a    408 c    434 g    528 t
ORIGIN
        1 gccatcactt accttgcact gagaaagaag acaaaggcca gtatgcacag ctttcctcca
       61 ctgctgctgc tgctgttctg gggtgtggtg tctcacagct tcccagcgac tctagaaaca
      121 caagagcaag atgtggactt agtccagaaa tacctggaaa aatactacaa cctgaagaat
      181 gatgggaggc aagttgaaaa gcggagaaat agtggcccag tggttgaaaa attgaagcaa
      241 atgcaggaat tctttgggct gaaagtgact gggaaaccag atgctgaaac cctgaaggtg
      301 atgaagcagc ccagatgtgg agtgcctgat gtggctcagt ttgtcctcac tgaggggaac
      361 cctcgctggg agcaaacaca tctgacctac aggattgaaa attacacgcc agatttgcca
      421 agagcagatg tggaccatgc cattgagaaa gccttccaac tctggagtaa tgtcacacct
      481 ctgacattca ccaaggtctc tgagggtcaa gcagacatca tgatatcttt tgtcagggga
      541 gatcatcggg acaactctcc ttttgatgga cctggaggaa atcttgctca tgcttttcaa
      601 ccaggcccag gtattggagg ggatgctcat tttgatgaag atgaaaggtg gaccaacaat
      661 ttcagagagt acaacttaca tcgtgttgcg gctcatgaac tcggccattc tcttggactc
      721 tcccattcta ctgatatcgg ggctttgatg taccctagct acaccttcag tggtgatgtt
      781 cagctagctc aggatgacat tgatggcatc caagccatat atggacgttc ccaaaatcct
      841 gtccagccca tcggcccaca aaccccaaaa gcgtgtgaca gtaagctaac ctttgatgct
      901 ataactacga ttcggggaga agtgatgttc tttaaagaca gattctacat gcgcacaaat
      961 cccttctacc cggaagttga gctcaatttc atttctgttt tctggccaca actgccaaat
     1021 gggcttgaag ctgcttacga atttgccgac agagatgaag tccggttttt caaagggaat
     1081 aagtactggg ctgttcaggg acagaatgtg ctacacggat accccaagga catctacagc
     1141 tcctttggct tccctagaac tgtgaagcat atcgatgctg ctctttctga ggaaaacact
     1201 ggaaaaacct acttctttgt tgctaacaaa tactggaggt atgatgaata taaacgatct
     1261 atggatccag gttatcccaa aatgatagca catgactttc ctggaattgg ccacaaagtt
     1321 gatgcagttt tcatgaaaga tggatttttc tatttctttc atggaacaag acaatacaaa
     1381 tttgatccta aaacgaagag aattttgact ctccagaaag ctaatagctg gttcaactgc
     1441 aggaaaaatt gaacattact aatttgaatg gaaaacacat ggtgtgagtc caaagaaggt
     1501 gttttcctga agaattgtct attttctcag tcatttttaa cctctagagt cactgataca
     1561 cagaatataa tcttatttat acctcagttt gcatattttt ttactattta gaatgtagcc
     1621 ctttttgtac tgatataatt tagttccaca aatggtgggt acaaaaagtc aagtttgtgg
     1681 cttatggatt catataggcc agagttgcaa agatcttttc cagagtatgc aactctgacg
     1741 ttgatcccag agagcagctt cagtgacaaa catatccttt caagacagaa agagacagga
     1801 gacatgagtc tttgccggag gaaaagcagc tcaagaacac atgtgcagtc actggtgtca
     1861 ccctggatag gcaagggata actcttctaa cacaaaataa gtgttttatg tttggaataa
     1921 agtcaacctt gtttctactg ttttatacaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
     1981 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
     2041 aaaaaaaa
//