LOCUS       BC011817                2594 bp    mRNA    linear   HUM 06-JAN-2004
DEFINITION  Homo sapiens mutS homolog 3 (E. coli), mRNA (cDNA clone
            IMAGE:3532592), partial cds.
ACCESSION   BC011817
VERSION     BC011817.2
KEYWORDS    .
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2594)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2594)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (30-JUL-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Dec 19, 2003 this sequence version replaced BC011817.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 28 Row: c Column: 1
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 4505248.
FEATURES             Location/Qualifiers
     source          1..2594
                     /db_xref="H-InvDB:HIT000087714"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:3532592"
                     /tissue_type="Muscle, rhabdomyosarcoma"
                     /clone_lib="NIH_MGC_17"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..>2594
                     /gene="MSH3"
                     /db_xref="GeneID:4437"
                     /db_xref="MIM:600887"
     CDS             49..>2594
                     /gene="MSH3"
                     /codon_start=1
                     /product="MSH3 protein"
                     /protein_id="AAH11817.1"
                     /db_xref="GeneID:4437"
                     /db_xref="MIM:600887"
                     /translation="MSRRKPASGGLAASSSAPARQAVLSRFFQSTGSLKSTSSSTGAA
                     DQVDPGAAAAAAPPAPAFPPQLPPHVATEIDRRKKRPLENDGPVKKKVKKVQQKEGGS
                     DLGMSGNSEPKKCLRTRNVSKSLEKLKEFCCDSALPQSRVQTESLQERFAVLPKCTDF
                     DDISLLHAKNAVSSEDSKRQINQKDTTLFDLSQFGSSNTSHENLQKTASKSANKRSKS
                     IYTPLELQYIEMKQQHKDAVLCVECGYKYRFFGEDAEIAARELNIYCHLDHNFMTASI
                     PTHRLFVHVRRLVAKGYKVGVVKQTETAALKAIGDNRSSLFSRKLTALYTKSTLIGED
                     VNPLIKLDDAVNVDEIMTDTSTSYLLCISENKENVRDKKKGNIFIGIVGVQPATGEVV
                     FDSFQDSASRSELETRMSSLQPVELLLPSALSEQTEALIHRATSVSVQDDRIRVERMD
                     NIYFEYSHAFQAVTEFYAKDTVDIKGSQIISGIVNLEKPVICSLAAIIKYLKEFNLEK
                     MLSKPENFKQLSSKMEFMTINGTTLRNLEILQNQTDMKTKGSLLWVLDHTKTSFGRRK
                     LKKWVTQPLLKLREINARLDAVSEVLHSESSVFGQIENHLRKLPDIERGLCSIYHKKC
                     STQEFFLIVKTLYHLKSEFQAIIPAVNSHIQSDLLRTVILEIPELLSPVEHYLKILNE
                     QAAKVGDKTELFKDLSDFPLIKKRKDEIQGVIDEIRMHLQEIRKILKNPSAQYVTVSG
                     QEFMIEIKNSAVSCIPTDWVKVGSTKAVSRFHSPFIVENYRHLNQLREQLVLDCSAEW
                     LDFLEKFSEHYHSLCKAVHHLATVDCIFSLAKVAKQGDYCRPTVQKKKKK"
     misc_feature    709..1050
                     /gene="MSH3"
                     /note="MutS_I; Region: MutS domain I. This domain is found
                     in proteins of the MutS family (DNA mismatch repair
                     proteins) and is found associated with pfam00488,
                     pfam05188, pfam05192 and pfam05190. The MutS family of
                     proteins is named after the Salmonella typhimurium MutS
                     protein involved in mismatch repair"
                     /db_xref="CDD:pfam01624"
     misc_feature    1117..1590
                     /gene="MSH3"
                     /note="MutS_II; Region: MutS domain II. This domain is
                     found in proteins of the MutS family (DNA mismatch repair
                     proteins) and is found associated with pfam00488,
                     pfam01624, pfam05192 and pfam05190. The MutS family of
                     proteins is named after the Salmonella typhimurium MutS
                     protein involved in mismatch repair"
                     /db_xref="CDD:pfam05188"
     misc_feature    1684..2589
                     /gene="MSH3"
                     /note="MUTSd; Region: DNA-binding domain of DNA mismatch
                     repair MUTS family"
                     /db_xref="CDD:smart00533"
BASE COUNT          817 a          552 c          562 g          663 t
ORIGIN      
        1 cgcgctcctc gccgggccct gccgccgggc tgccatcctt gccctgccat gtctcgccgg
       61 aagcctgcgt cgggcggcct cgctgcctcc agctcagccc ctgcgaggca agcggttttg
      121 agccgattct tccagtctac gggaagcctg aaatccacct cctcctccac aggtgcagcc
      181 gaccaggtgg accctggcgc tgcagcggcc gcagcgcccc cagcgcccgc cttcccgccc
      241 cagctgccgc cgcacgtagc tacagaaatt gacagaagaa agaagagacc attggaaaat
      301 gatgggcctg ttaaaaagaa agtaaagaaa gtccaacaaa aggaaggagg aagtgatctg
      361 ggaatgtctg gcaactctga gccaaagaaa tgtctgagga ccaggaatgt ttcaaagtct
      421 ctggaaaaat tgaaagaatt ctgctgcgat tctgcccttc ctcaaagtag agtccagaca
      481 gaatctctgc aggagagatt tgcagttctg ccaaaatgta ctgattttga tgatatcagt
      541 cttctacacg caaagaatgc agtttcttct gaagattcga aacgtcaaat taatcaaaag
      601 gacacaacac tttttgatct cagtcagttt ggatcatcaa atacaagtca tgaaaattta
      661 cagaaaactg cttccaaatc agctaacaaa cggtccaaaa gcatctatac gccgctagaa
      721 ttacaataca tagaaatgaa gcagcagcac aaagatgcag ttttgtgtgt ggaatgtgga
      781 tataagtata gattctttgg ggaagatgca gagattgcag cccgagagct caatatttat
      841 tgccatttag atcacaactt tatgacagca agtataccta ctcacagact gtttgttcat
      901 gtacgccgcc tggtggcaaa aggatataag gtgggagttg tgaagcaaac tgaaactgca
      961 gcattaaagg ccattggaga caacagaagt tcactctttt cccggaaatt gactgccctt
     1021 tatacaaaat ctacacttat tggagaagat gtgaatcccc taatcaagct ggatgatgct
     1081 gtaaatgttg atgagataat gactgatact tctaccagct atcttctgtg catctctgaa
     1141 aataaggaaa atgttaggga caaaaaaaag ggcaacattt ttattggcat tgtgggagtg
     1201 cagcctgcca caggcgaggt tgtgtttgat agtttccagg actctgcttc tcgttcagag
     1261 ctagaaaccc ggatgtcaag cctgcagcca gtagagctgc tgcttccttc ggccttgtcc
     1321 gagcaaacag aggcgctcat ccacagagcc acatctgtta gtgtgcagga tgacagaatt
     1381 cgagtcgaaa ggatggataa catttatttt gaatacagcc atgctttcca ggcagttaca
     1441 gagttttatg caaaagatac agttgacatc aaaggttctc aaattatttc tggcattgtt
     1501 aacttagaga agcctgtgat ttgctctttg gctgccatca taaaatacct caaagaattc
     1561 aacttggaaa agatgctctc caaacctgag aattttaaac agctatcaag taaaatggaa
     1621 tttatgacaa ttaatggaac aacattaagg aatctggaaa tcctacagaa tcagactgat
     1681 atgaaaacca aaggaagttt gctgtgggtt ttagaccaca ctaaaacttc atttgggaga
     1741 cggaagttaa agaagtgggt gacccagcca ctccttaaat taagggaaat aaatgcccgg
     1801 cttgatgctg tatcggaagt tctccattca gaatctagtg tgtttggtca gatagaaaat
     1861 catctacgta aattgcccga catagagagg ggactctgta gcatttatca caaaaaatgt
     1921 tctacccaag agttcttctt gattgtcaaa actttatatc acctaaagtc agaatttcaa
     1981 gcaataatac ctgctgttaa ttcccacatt cagtcagact tgctccggac cgttatttta
     2041 gaaattcctg aactcctcag tccagtggag cattacttaa agatactcaa tgaacaagct
     2101 gccaaagttg gggataaaac tgaattattt aaagaccttt ctgacttccc tttaataaaa
     2161 aagaggaagg atgaaattca aggtgttatt gacgagatcc gaatgcattt gcaagaaata
     2221 cgaaaaatac taaaaaatcc ttctgcacaa tatgtgacag tatcaggaca ggagtttatg
     2281 atagaaataa agaactctgc tgtatcttgt ataccaactg attgggtaaa ggttggaagc
     2341 acaaaagctg tgagccgctt tcactctcct tttattgtag aaaattacag acatctgaat
     2401 cagctccggg agcagctagt ccttgactgc agtgctgaat ggcttgattt tctagagaaa
     2461 ttcagtgaac attatcactc cttgtgtaaa gcagtgcatc acctagcaac tgttgactgc
     2521 attttctccc tggccaaggt cgctaagcaa ggagattact gcagaccaac tgtacaaaaa
     2581 aaaaaaaaaa aaaa
//