LOCUS       BC004246                4249 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens mutS homolog 6 (E. coli), mRNA (cDNA clone MGC:10498
            IMAGE:3629489), complete cds.
ACCESSION   BC004246
VERSION     BC004246.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4249)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 4249)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (01-MAR-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Genome Sequence Centre,
            BC Cancer Agency, Vancouver, BC, Canada
            info@bcgsc.bc.ca
            Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson
            Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen
            Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel
            Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave
            Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth
            Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao,
            Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR
            Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang,
            Jacquie Schein, Asim Siddiqui, Steven Jones, Rob Holt, Marco Marra.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 13 Row: m Column: 14
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 4504190.
FEATURES             Location/Qualifiers
     source          1..4249
                     /db_xref="H-InvDB:HIT000031813"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:10498 IMAGE:3629489"
                     /tissue_type="Placenta, choriocarcinoma"
                     /clone_lib="NIH_MGC_21"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..4249
                     /gene="MSH6"
                     /gene_synonym="HNPCC5"
                     /gene_synonym="HSAP"
                     /db_xref="GeneID:2956"
                     /db_xref="HGNC:HGNC:7329"
                     /db_xref="MIM:600678"
     CDS             53..4135
                     /gene="MSH6"
                     /gene_synonym="HNPCC5"
                     /gene_synonym="HSAP"
                     /codon_start=1
                     /product="mutS homolog 6 (E. coli)"
                     /protein_id="AAH04246.1"
                     /db_xref="GeneID:2956"
                     /db_xref="HGNC:HGNC:7329"
                     /db_xref="MIM:600678"
                     /translation="MSRQSTLYSFFPKSPALSDANKASARASREGGRAAAAPGASPSP
                     GGDAAWSEAGPGPRPLARSASPPKAKNLNGGLRRSVAPAAPTSCDFSPGDLVWAKMEG
                     YPWWPCLVYNHPFDGTFIREKGKSVRVHVQFFDDSPTRGWVSKRLLKPYTGSKSKEAQ
                     KGGHFYSAKPEILRAMQRADEALNKDKIKRLELAVCDEPSEPEEEEEMEVGTTYVTDK
                     SEEDNEIESEEEVQPKTQGSRRSSRQIKKRRVISDSESDIGGSDVEFKPDTKEEGSSD
                     EISSGVGDSESEGLNSPVKVARKRKRMVTGNGSLKRKSSRKETPSATKQATSISSETK
                     NTLRAFSAPQNSESQAHVSGGGDDSSRPTVWYHETLEWLKEEKRRDEHRRRPDHPDFD
                     ASTLYVPEDFLNSCTPGMRKWWQIKSQNFDLVICYKVGKFYELYHMDALIGVSELGLV
                     FMKGNWAHSGFPEIAFGRYSDSLVQKGYKVARVEQTETPEMMEARCRKMAHISKYDRV
                     VRREICRIITKGTQTYSVLEGDPSENYSKYLLSLKEKEEDSSGHTRAYGVCFVDTSLG
                     KFFIGQFSDDRHCSRFRTLVAHYPPVQVLFEKGNLSKETKTILKSSLSCSLQEGLIPG
                     SQFWDASKTLRTLLEEEYFREKLSDGIGVMLPQVLKGMTSESDSIGLTPGEKSELALS
                     ALGGCVFYLKKCLIDQELLSMANFEEYIPLDSDTVSTTRSGAIFTKAYQRMVLDAVTL
                     NNLEIFLNGTNGSTEGTLLERVDTCHTPFGKRLLKQWLCAPLCNHYAINDRLDAIEDL
                     MVVPDKISEVVELLKKLPDLERLLSKIHNVGSPLKSQNHPDSRAIMYEETTYSKKKII
                     DFLSALEGFKVMCKIIGIMEEVADGFKSKILKQVISLQTKNPEGRFPDLTVELNRWDT
                     AFDHEKARKTGLITPKAGFDSDYDQALADIRENEQSLLEYLEKQRNRIGCRTIVYWGI
                     GRNRYQLEIPENFTTRNLPEEYELKSTKKGCKRYWTKTIEKKLANLINAEERRDVSLK
                     DCMRRLFYNFDKNYKDWQSAVECIAVLDVLLCLANYSRGGDGPMCRPVILLPEDTPPF
                     LELKGSRHPCITKTFFGDDFIPNDILIGCEEEEQENGKAYCVLVTGPNMGGKSTLMRQ
                     AGLLAVMAQMGCYVPAEVCRLTPIDRVFTRLGASDRIMSGESTFFVELSETASILMHA
                     TAHSLVLVDELGRGTATFDGTAIANAVVKELAETIKCRTLFSTHYHSLVEDYSQNVAV
                     RLGHMACMVENECEDPSQETITFLYKFIKGACPKSYGFNAARLANLPEEVIQKGHRKA
                     REFEKMNQSLRLFREVCLASERSTVDAEAVHKLLTLIKEL"
BASE COUNT         1264 a          829 c         1064 g         1092 t
ORIGIN      
        1 tgcttttagg agctccgtcc gacagaacgg ttgggccttg ccggctgtcg gtatgtcgcg
       61 acagagcacc ctgtacagct tcttccccaa gtctccggcg ctgagtgatg ccaacaaggc
      121 ctcggccagg gcctcacgcg aaggcggccg tgccgccgct gcccccgggg cctctccttc
      181 cccaggcggg gatgcggcct ggagcgaggc tgggcctggg cccaggccct tggcgcgatc
      241 cgcgtcaccg cccaaggcga agaacctcaa cggagggctg cggagatcgg tagcgcctgc
      301 tgcccccacc agttgtgact tctcaccggg agatttggtt tgggccaaga tggagggtta
      361 cccctggtgg ccttgtctgg tttacaacca cccctttgat ggaacattca tccgcgagaa
      421 agggaaatca gtccgtgttc atgtacagtt ttttgatgac agcccaacaa ggggctgggt
      481 tagcaaaagg cttttaaagc catatacagg ttcaaaatca aaggaagccc agaagggagg
      541 tcatttttac agtgcaaagc ctgaaatact gagagcaatg caacgtgcag acgaagcctt
      601 aaataaagac aagattaaga ggcttgaatt ggcagtttgt gatgagccct cagagccaga
      661 agaggaagaa gagatggagg taggcacaac ttacgtaaca gataagagtg aagaagataa
      721 tgaaattgag agtgaagagg aagtacagcc taagacacaa ggatctaggc gaagtagccg
      781 ccaaataaaa aaacgaaggg tcatatcaga ttctgagagt gacattggtg gctctgatgt
      841 ggaatttaag ccagacacta aggaggaagg aagcagtgat gaaataagca gtggagtggg
      901 ggatagtgag agtgaaggcc tgaacagccc tgtcaaagtt gctcgaaagc ggaagagaat
      961 ggtgactgga aatggctctc ttaaaaggaa aagctctagg aaggaaacgc cctcagccac
     1021 caaacaagca actagcattt catcagaaac caagaatact ttgagagctt tctctgcccc
     1081 tcaaaattct gaatcccaag cccacgttag tggaggtggt gatgacagta gtcgccctac
     1141 tgtttggtat catgaaactt tagaatggct taaggaggaa aagagaagag atgagcacag
     1201 gaggaggcct gatcaccccg attttgatgc atctacactc tatgtgcctg aggatttcct
     1261 caattcttgt actcctggga tgaggaagtg gtggcagatt aagtctcaga actttgatct
     1321 tgtcatctgt tacaaggtgg ggaaatttta tgagctgtac cacatggatg ctcttattgg
     1381 agtcagtgaa ctggggctgg tattcatgaa aggcaactgg gcccattctg gctttcctga
     1441 aattgcattt ggccgttatt cagattccct ggtgcagaag ggctataaag tagcacgagt
     1501 ggaacagact gagactccag aaatgatgga ggcacgatgt agaaagatgg cacatatatc
     1561 caagtatgat agagtggtga ggagggagat ctgtaggatc attaccaagg gtacacagac
     1621 ttacagtgtg ctggaaggtg atccctctga gaactacagt aagtatcttc ttagcctcaa
     1681 agaaaaagag gaagattctt ctggccatac tcgtgcatat ggtgtgtgct ttgttgatac
     1741 ttcactggga aagtttttca taggtcagtt ttcagatgat cgccattgtt cgagatttag
     1801 gactctagtg gcacactatc ccccagtaca agttttattt gaaaaaggaa atctctcaaa
     1861 ggaaactaaa acaattctaa agagttcatt gtcctgttct cttcaggaag gtctgatacc
     1921 cggctcccag ttttgggatg catccaaaac tttgagaact ctccttgagg aagaatattt
     1981 tagggaaaag ctaagtgatg gcattggggt gatgttaccc caggtgctta aaggtatgac
     2041 ttcagagtct gattccattg ggttgacacc aggagagaaa agtgaattgg ccctctctgc
     2101 tctaggtggt tgtgtcttct acctcaaaaa atgccttatt gatcaggagc ttttatcaat
     2161 ggctaatttt gaagaatata ttcccttgga ttctgacaca gtcagcacta caagatctgg
     2221 tgctatcttc accaaagcct atcaacgaat ggtgctagat gcagtgacat taaacaactt
     2281 ggagattttt ctgaatggaa caaatggttc tactgaagga accctactag agagggttga
     2341 tacttgccat actccttttg gtaagcggct cctaaagcaa tggctttgtg ccccactctg
     2401 taaccattat gctattaatg atcgtctaga tgccatagaa gacctcatgg ttgtgcctga
     2461 caaaatctcc gaagttgtag agcttctaaa gaagcttcca gatcttgaga ggctactcag
     2521 taaaattcat aatgttgggt ctcccctgaa gagtcagaac cacccagaca gcagggctat
     2581 aatgtatgaa gaaactacat acagcaagaa gaagattatt gattttcttt ctgctctgga
     2641 aggattcaaa gtaatgtgta aaattatagg gatcatggaa gaagttgctg atggttttaa
     2701 gtctaaaatc cttaagcagg tcatctctct gcagacaaaa aatcctgaag gtcgttttcc
     2761 tgatttgact gtagaattga accgatggga tacagccttt gaccatgaaa aggctcgaaa
     2821 gactggactt attactccca aagcaggctt tgactctgat tatgaccaag ctcttgctga
     2881 cataagagaa aatgaacaga gcctcctgga atacctagag aaacagcgca acagaattgg
     2941 ctgtaggacc atagtctatt gggggattgg taggaaccgt taccagctgg aaattcctga
     3001 gaatttcacc actcgcaatt tgccagaaga atacgagttg aaatctacca agaagggctg
     3061 taaacgatac tggaccaaaa ctattgaaaa gaagttggct aatctcataa atgctgaaga
     3121 acggagggat gtatcattga aggactgcat gcggcgactg ttctataact ttgataaaaa
     3181 ttacaaggac tggcagtctg ctgtagagtg tatcgcagtg ttggatgttt tactgtgcct
     3241 ggctaactat agtcgagggg gtgatggtcc tatgtgtcgc ccagtaattc tgttgccgga
     3301 agataccccc cccttcttag agcttaaagg atcacgccat ccttgcatta cgaagacttt
     3361 ttttggagat gattttattc ctaatgacat tctaataggc tgtgaggaag aggagcagga
     3421 aaatggcaaa gcctattgtg tgcttgttac tggaccaaat atggggggca agtctacgct
     3481 tatgagacag gctggcttat tagctgtaat ggcccagatg ggttgttacg tccctgctga
     3541 agtgtgcagg ctcacaccaa ttgatagagt gtttactaga cttggtgcct cagacagaat
     3601 aatgtcaggt gaaagtacat tttttgttga attaagtgaa actgccagca tactcatgca
     3661 tgcaacagca cattctctgg tgcttgtgga tgaattagga agaggtactg caacatttga
     3721 tgggacggca atagcaaatg cagttgttaa agaacttgct gagactataa aatgtcgtac
     3781 attattttca actcactacc attcattagt agaagattat tctcaaaatg ttgctgtgcg
     3841 cctaggacat atggcatgca tggtagaaaa tgaatgtgaa gaccccagcc aggagactat
     3901 tacgttcctc tataaattca ttaagggagc ttgtcctaaa agctatggct ttaatgcagc
     3961 aaggcttgct aatctcccag aggaagttat tcaaaaggga catagaaaag caagagaatt
     4021 tgagaagatg aatcagtcac tacgattatt tcgggaagtt tgcctggcta gtgaaaggtc
     4081 aactgtagat gctgaagctg tccataaatt gctgactttg attaaggaat tatagactga
     4141 ctacattgga agctttgagt tgacttctga caaaggtggt aaattcagac aacattatga
     4201 tctaataaac tttattttta aaaaatgaaa aaaaaaaaaa aaaaaaaaa
//