LOCUS       BC011817                2594 bp    mRNA    linear   HUM 06-JAN-2004
DEFINITION  Homo sapiens mutS homolog 3 (E. coli), mRNA (cDNA clone
            IMAGE:3532592), partial cds.
ACCESSION   BC011817
VERSION     BC011817.2
KEYWORDS    .
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2594)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J.,
            Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S.,
            Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M.,
            Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M.,
            Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A.,
            Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W.,
            Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C.,
            Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S.,
            Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E.,
            Jones,S.J. and Marra,M.A.
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2594)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (30-JUL-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Dec 19, 2003 this sequence version replaced BC011817.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC), Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D., McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 28 Row: c Column: 1
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 4505248.
FEATURES             Location/Qualifiers
     source          1..2594
                     /db_xref="H-InvDB:HIT000087714"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:3532592"
                     /tissue_type="Muscle, rhabdomyosarcoma"
                     /clone_lib="NIH_MGC_17"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..>2594
                     /gene="MSH3"
                     /db_xref="GeneID:4437"
                     /db_xref="MIM:600887"
     CDS             49..>2594
                     /gene="MSH3"
                     /codon_start=1
                     /product="MSH3 protein"
                     /protein_id="AAH11817.1"
                     /db_xref="GeneID:4437"
                     /db_xref="MIM:600887"
                     /translation="MSRRKPASGGLAASSSAPARQAVLSRFFQSTGSLKSTSSSTGAA
                     DQVDPGAAAAAAPPAPAFPPQLPPHVATEIDRRKKRPLENDGPVKKKVKKVQQKEGGS
                     DLGMSGNSEPKKCLRTRNVSKSLEKLKEFCCDSALPQSRVQTESLQERFAVLPKCTDF
                     DDISLLHAKNAVSSEDSKRQINQKDTTLFDLSQFGSSNTSHENLQKTASKSANKRSKS
                     IYTPLELQYIEMKQQHKDAVLCVECGYKYRFFGEDAEIAARELNIYCHLDHNFMTASI
                     PTHRLFVHVRRLVAKGYKVGVVKQTETAALKAIGDNRSSLFSRKLTALYTKSTLIGED
                     VNPLIKLDDAVNVDEIMTDTSTSYLLCISENKENVRDKKKGNIFIGIVGVQPATGEVV
                     FDSFQDSASRSELETRMSSLQPVELLLPSALSEQTEALIHRATSVSVQDDRIRVERMD
                     NIYFEYSHAFQAVTEFYAKDTVDIKGSQIISGIVNLEKPVICSLAAIIKYLKEFNLEK
                     MLSKPENFKQLSSKMEFMTINGTTLRNLEILQNQTDMKTKGSLLWVLDHTKTSFGRRK
                     LKKWVTQPLLKLREINARLDAVSEVLHSESSVFGQIENHLRKLPDIERGLCSIYHKKC
                     STQEFFLIVKTLYHLKSEFQAIIPAVNSHIQSDLLRTVILEIPELLSPVEHYLKILNE
                     QAAKVGDKTELFKDLSDFPLIKKRKDEIQGVIDEIRMHLQEIRKILKNPSAQYVTVSG
                     QEFMIEIKNSAVSCIPTDWVKVGSTKAVSRFHSPFIVENYRHLNQLREQLVLDCSAEW
                     LDFLEKFSEHYHSLCKAVHHLATVDCIFSLAKVAKQGDYCRPTVQKKKKK"
     misc_feature    709..1050
                     /gene="MSH3"
                     /note="MutS_I; Region: MutS domain I. This domain is found
                     in proteins of the MutS family (DNA mismatch repair
                     proteins) and is found associated with pfam00488,
                     pfam05188, pfam05192 and pfam05190. The MutS family of
                     proteins is named after the Salmonella typhimurium MutS
                     protein involved in mismatch repair"
                     /db_xref="CDD:pfam01624"
     misc_feature    1117..1590
                     /gene="MSH3"
                     /note="MutS_II; Region: MutS domain II. This domain is
                     found in proteins of the MutS family (DNA mismatch repair
                     proteins) and is found associated with pfam00488,
                     pfam01624, pfam05192 and pfam05190.
                     The MutS family of proteins is named after the Salmonella
                     typhimurium MutS protein involved in mismatch repair"
                     /db_xref="CDD:pfam05188"
     misc_feature    1684..2589
                     /gene="MSH3"
                     /note="MUTSd; Region: DNA-binding domain of DNA mismatch
                     repair MUTS family"
                     /db_xref="CDD:smart00533"
BASE COUNT      817 a    552 c    562 g    663 t
ORIGIN
        1 cgcgctcctc gccgggccct gccgccgggc tgccatcctt gccctgccat gtctcgccgg
       61 aagcctgcgt cgggcggcct cgctgcctcc agctcagccc ctgcgaggca agcggttttg
      121 agccgattct tccagtctac gggaagcctg aaatccacct cctcctccac aggtgcagcc
      181 gaccaggtgg accctggcgc tgcagcggcc gcagcgcccc cagcgcccgc cttcccgccc
      241 cagctgccgc cgcacgtagc tacagaaatt gacagaagaa agaagagacc attggaaaat
      301 gatgggcctg ttaaaaagaa agtaaagaaa gtccaacaaa aggaaggagg aagtgatctg
      361 ggaatgtctg gcaactctga gccaaagaaa tgtctgagga ccaggaatgt ttcaaagtct
      421 ctggaaaaat tgaaagaatt ctgctgcgat tctgcccttc ctcaaagtag agtccagaca
      481 gaatctctgc aggagagatt tgcagttctg ccaaaatgta ctgattttga tgatatcagt
      541 cttctacacg caaagaatgc agtttcttct gaagattcga aacgtcaaat taatcaaaag
      601 gacacaacac tttttgatct cagtcagttt ggatcatcaa atacaagtca tgaaaattta
      661 cagaaaactg cttccaaatc agctaacaaa cggtccaaaa gcatctatac gccgctagaa
      721 ttacaataca tagaaatgaa gcagcagcac aaagatgcag ttttgtgtgt ggaatgtgga
      781 tataagtata gattctttgg ggaagatgca gagattgcag cccgagagct caatatttat
      841 tgccatttag atcacaactt tatgacagca agtataccta ctcacagact gtttgttcat
      901 gtacgccgcc tggtggcaaa aggatataag gtgggagttg tgaagcaaac tgaaactgca
      961 gcattaaagg ccattggaga caacagaagt tcactctttt cccggaaatt gactgccctt
     1021 tatacaaaat ctacacttat tggagaagat gtgaatcccc taatcaagct ggatgatgct
     1081 gtaaatgttg atgagataat gactgatact tctaccagct atcttctgtg catctctgaa
     1141 aataaggaaa atgttaggga caaaaaaaag ggcaacattt ttattggcat tgtgggagtg
     1201 cagcctgcca caggcgaggt tgtgtttgat agtttccagg actctgcttc tcgttcagag
     1261 ctagaaaccc ggatgtcaag cctgcagcca gtagagctgc tgcttccttc ggccttgtcc
     1321 gagcaaacag aggcgctcat ccacagagcc acatctgtta gtgtgcagga tgacagaatt
     1381 cgagtcgaaa ggatggataa catttatttt gaatacagcc atgctttcca ggcagttaca
     1441 gagttttatg caaaagatac agttgacatc aaaggttctc aaattatttc tggcattgtt
     1501 aacttagaga agcctgtgat ttgctctttg gctgccatca taaaatacct caaagaattc
     1561 aacttggaaa agatgctctc caaacctgag aattttaaac agctatcaag taaaatggaa
     1621 tttatgacaa ttaatggaac aacattaagg aatctggaaa tcctacagaa tcagactgat
     1681 atgaaaacca aaggaagttt gctgtgggtt ttagaccaca ctaaaacttc atttgggaga
     1741 cggaagttaa agaagtgggt gacccagcca ctccttaaat taagggaaat aaatgcccgg
     1801 cttgatgctg tatcggaagt tctccattca gaatctagtg tgtttggtca gatagaaaat
     1861 catctacgta aattgcccga catagagagg ggactctgta gcatttatca caaaaaatgt
     1921 tctacccaag agttcttctt gattgtcaaa actttatatc acctaaagtc agaatttcaa
     1981 gcaataatac ctgctgttaa ttcccacatt cagtcagact tgctccggac cgttatttta
     2041 gaaattcctg aactcctcag tccagtggag cattacttaa agatactcaa tgaacaagct
     2101 gccaaagttg gggataaaac tgaattattt aaagaccttt ctgacttccc tttaataaaa
     2161 aagaggaagg atgaaattca aggtgttatt gacgagatcc gaatgcattt gcaagaaata
     2221 cgaaaaatac taaaaaatcc ttctgcacaa tatgtgacag tatcaggaca ggagtttatg
     2281 atagaaataa agaactctgc tgtatcttgt ataccaactg attgggtaaa ggttggaagc
     2341 acaaaagctg tgagccgctt tcactctcct tttattgtag aaaattacag acatctgaat
     2401 cagctccggg agcagctagt ccttgactgc agtgctgaat ggcttgattt tctagagaaa
     2461 ttcagtgaac attatcactc cttgtgtaaa gcagtgcatc acctagcaac tgttgactgc
     2521 attttctccc tggccaaggt cgctaagcaa ggagattact gcagaccaac tgtacaaaaa
     2581 aaaaaaaaaa aaaa
//