LOCUS       HSU28946                4264 bp    mRNA    linear   HUM 04-MAY-1996
DEFINITION  Human G/T mismatch binding protein (GTBP) mRNA, complete cds.
ACCESSION   U28946
VERSION     U28946.1
KEYWORDS    .
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4264)
  AUTHORS   Palombo,F., Gallinari,P., Iaccarino,I., Lettieri,T., Hughes,M.,
            D'Arrigo,A., Truong,O., Hsuan,J.J. and Jiricny,J.
  TITLE     GTBP, a 160-kilodalton protein essential for mismatch-binding
            activity in human cells
  JOURNAL   Science 268 (5219), 1912-1914 (1995)
   PUBMED   7604265
REFERENCE   2  (bases 1 to 4264)
  AUTHORS   Nicolaides,N.C., Palombo,F., Kinzler,K.W., Vogelstein,B. and
            Jiricny,J.
  TITLE     Molecular cloning of the N-terminus of GTBP
  JOURNAL   Genomics 31 (3), 395-397 (1996)
   PUBMED   8838326
REFERENCE   3  (bases 1 to 4264)
  AUTHORS   Jiricny,J.
  TITLE     Direct Submission
  JOURNAL   Submitted (12-JUN-1995) Josef Jiricny, Genetics Department,
            Istituto di Ricerche di Biol. Molecolare P. Angeletti (IRBM), Via
            Pontina Km 30.600, Pomezia, 00040, Italy
COMMENT     On May 4, 1996 this sequence version replaced gi:902495.
FEATURES             Location/Qualifiers
     source          1..4264
                     /db_xref="H-InvDB:HIT000218941"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /chromosome="2"
                     /map="2p16"
                     /clone="C1"
                     /cell_line="HeLa"
                     /clone_lib="HeLa S3 cDNA in lambda Uni-ZAP XR
                     (Stratagene)"
     gene            1..4264
                     /gene="GTBP"
     CDS             88..4170
                     /gene="GTBP"
                     /note="homolog of bacterial MutS proteins; binds to G/T
                     mismatches through heterodimerization with hMSH2; similar
                     to ORF YD8557.04c, probable DNA repair protein, from S.
                     cerevisiae chromosome IV cosmid 8557, PIR Accession Number
                     S51246; somatic mutations found in colon cancer"
                     /codon_start=1
                     /product="G/T mismatch binding protein"
                     /protein_id="AAC50461.1"
                     /translation="MSRQSTLYSFFPKSPALSDANKASARASREGGRAAAAPGASPSP
                     GGDAAWSEAGPGPRPLARSASPPKAKNLNGGLRRSVAPAAPTSCDFSPGDLVWAKMEG
                     YPWWPCLVYNHPFDGTFIREKGKSVRVHVQFFDDSPTRGWVSKRLLKPYTGSKSKEAQ
                     KGGHFYSAKPEILRAMQRADEALNKDKIKRLELAVCDEPSEPEEEEEMEVGTTYVTDK
                     SEEDNEIESEEEVQPKTQGSRRSSRQIKKRRVISDSESDIGGSDVEFKPDTKEEGSSD
                     EISSGVGDSESEGLNSPVKVARKRKRMVTGNGSLKRKSSRKETPSATKQATSISSETK
                     NTLRAFSAPQNSESQAHVSGGGDDSSRPTVWYHETLEWLKEEKRRDEHRRRPDHPDFD
                     ASTLYVPEDFLNSCTPGMRKWWQIKSQNFDLVICYKVGKFYELYHMDALIGVSELGLV
                     FMKGNWAHSGFPEIAFGRYSDSLVQKGYKVARVEQTETPEMMEARCRKMAHISKYDRV
                     VRREICRIITKGTQTYSVLEGDPSENYSKYLLSLKEKEEDSSGHTRAYGVCFVDTSLG
                     KFFIGQFSDDRHCSRFRTLVAHYPPVQVLFEKGNLSKETKTILKSSLSCSLQEGLIPG
                     SQFWDASKTLRTLLEEEYFREKLSDGIGVMLPQVLKGMTSESDSIGLTPGEKSELALS
                     ALGGCVFYLKKCLIDQELLSMANFEEYIPLDSDTVSTTRSGAIFTKAYQRMVLDAVTL
                     NNLEIFLNGTNGSTEGTLLERVDTCHTPFGKRLLKQWLCAPLCNHYAINDRLDAIEDL
                     MVVPDKISEVVELLKKLPDLERLLSKIHNVGSPLKSQNHPDSRAIMYEETTYSKKKII
                     DFLSALEGFKVMCKIIGIMEEVADGFKSKILKQVISLQTKNPEGRFPDLTVELNRWDT
                     AFDHEKARKTGLITPKAGFDSDYDQALADIRENEQSLLEYLEKQRNRIGCRTIVYWGI
                     GRNRYQLEIPENFTTRNLPEEYELKSTKKGCKRYWTKTIEKKLANLINAEERRDVSLK
                     DCMRRLFYNFDKNYKDWQSAVECIAVLDVLLCLANYSRGGDGPMCRPVILLPEDTPPF
                     LELKGSRHPCITKTFFGDDFIPNDILIGCEEEEQENGKAYCVLVTGPNMGGKSTLMRQ
                     AGLLAVMAQMGCYVPAEVCRLTPIDRVFTRLGASDRIMSGESTFFVELSETASILMHA
                     TAHSLVLVDELGRGTATFDGTAIANAVVKELAETIKCRTLFSTHYHSLVEDYSQNVAV
                     RLGHMACMVENECEDPSQETITFLYKFIKGACPKSYGFNAARLANLPEEVIQKGHRKA
                     REFEKMNQSLRLFREVCLASERSTVDAEAVHKLLTLIKEL"
BASE COUNT         1249 a          840 c         1076 g         1099 t
ORIGIN      
        1 atttcccgcc agcaggagcc gcgcggtaga tgcggtgctt ttaggagctc cgtccgacag
       61 aacggttggg ccttgccggc tgtcggtatg tcgcgacaga gcaccctgta cagcttcttc
      121 cccaagtctc cggcgctgag tgatgccaac aaggcctcgg ccagggcctc acgcgaaggc
      181 ggccgtgccg ccgctgcccc cggggcctct ccttccccag gcggggatgc ggcctggagc
      241 gaggctgggc ctgggcccag gcccttggcg cgatccgcgt caccgcccaa ggcgaagaac
      301 ctcaacggag ggctgcggag atcggtagcg cctgctgccc ccaccagttg tgacttctca
      361 ccaggagatt tggtttgggc caagatggag ggttacccct ggtggccttg tctggtttac
      421 aaccacccct ttgatggaac attcatccgc gagaaaggga aatcagtccg tgttcatgta
      481 cagttttttg atgacagccc aacaaggggc tgggttagca aaaggctttt aaagccatat
      541 acaggttcaa aatcaaagga agcccagaag ggaggtcatt tttacagtgc aaagcctgaa
      601 atactgagag caatgcaacg tgcagatgaa gccttaaata aagacaagat taagaggctt
      661 gaattggcag tttgtgatga gccctcagag ccagaagagg aagaagagat ggaggtaggc
      721 acaacttacg taacagataa gagtgaagaa gataatgaaa ttgagagtga agaggaagta
      781 cagcctaaga cacaaggatc taggcgaagt agccgccaaa taaaaaaacg aagggtcata
      841 tcagattctg agagtgacat tggtggctct gatgtggaat ttaagccaga cactaaggag
      901 gaaggaagca gtgatgaaat aagcagtgga gtgggggata gtgagagtga aggcctgaac
      961 agccctgtca aagttgctcg aaagcggaag agaatggtga ctggaaatgg ctctcttaaa
     1021 aggaaaagct ctaggaagga aacgccctca gccaccaaac aagcaactag catttcatca
     1081 gaaaccaaga atactttgag agctttctct gcccctcaaa attctgaatc ccaagcccac
     1141 gttagtggag gtggtgatga cagtagtcgc cctactgttt ggtatcatga aactttagaa
     1201 tggcttaagg aggaaaagag aagagatgag cacaggagga ggcctgatca ccccgatttt
     1261 gatgcatcta cactctatgt gcctgaggat ttcctcaatt cttgtactcc tgggatgagg
     1321 aagtggtggc agattaagtc tcagaacttt gatcttgtca tctgttacaa ggtggggaaa
     1381 ttttatgagc tgtaccacat ggatgctctt attggagtca gtgaactggg gctggtattc
     1441 atgaaaggca actgggccca ttctggcttt cctgaaattg catttggccg ttattcagat
     1501 tccctggtgc agaagggcta taaagtagca cgagtggaac agactgagac tccagaaatg
     1561 atggaggcac gatgtagaaa gatggcacat atatccaagt atgatagagt ggtgaggagg
     1621 gagatctgta ggatcattac caagggtaca cagacttaca gtgtgctgga aggtgatccc
     1681 tctgagaact acagtaagta tcttcttagc ctcaaagaaa aagaggaaga ttcttctggc
     1741 catactcgtg catatggtgt gtgctttgtt gatacttcac tgggaaagtt tttcataggt
     1801 cagttttcag atgatcgcca ttgttcgaga tttaggactc tagtggcaca ctatccccca
     1861 gtacaagttt tatttgaaaa aggaaatctc tcaaaggaaa ctaaaacaat tctaaagagt
     1921 tcattgtcct gttctcttca ggaaggtctg atacccggct cccagttttg ggatgcatcc
     1981 aaaactttga gaactctcct tgaggaagaa tattttaggg aaaagctaag tgatggcatt
     2041 ggggtgatgt taccccaggt gcttaaaggt atgacttcag agtctgattc cattgggttg
     2101 acaccaggag agaaaagtga attggccctc tctgctctag gtggttgtgt cttctacctc
     2161 aaaaaatgcc ttattgatca ggagctttta tcaatggcta attttgaaga atatattccc
     2221 ttggattctg acacagtcag cactacaaga tctggtgcta tcttcaccaa agcctatcaa
     2281 cgaatggtgc tagatgcagt gacattaaac aacttggaga tttttctgaa tggaacaaat
     2341 ggttctactg aaggaaccct actagagagg gttgatactt gccatactcc ttttggtaag
     2401 cggctcctaa agcaatggct ttgtgcccca ctctgtaacc attatgctat taatgatcgt
     2461 ctagatgcca tagaagacct catggttgtg cctgacaaaa tctccgaagt tgtagagctt
     2521 ctaaagaagc ttccagatct tgagaggcta ctcagtaaaa ttcataatgt tgggtctccc
     2581 ctgaagagtc agaaccaccc agacagcagg gctataatgt atgaagaaac tacatacagc
     2641 aagaagaaga ttattgattt tctttctgct ctggaaggat tcaaagtaat gtgtaaaatt
     2701 atagggatca tggaagaagt tgctgatggt tttaagtcta aaatccttaa gcaggtcatc
     2761 tctctgcaga caaaaaatcc tgaaggtcgt tttcctgatt tgactgtaga attgaaccga
     2821 tgggatacag cctttgacca tgaaaaggct cgaaagactg gacttattac tcccaaagca
     2881 ggctttgact ctgattatga ccaagctctt gctgacataa gagaaaatga acagagcctc
     2941 ctggaatacc tagagaaaca gcgcaacaga attggctgta ggaccatagt ctattggggg
     3001 attggtagga accgttacca gctggaaatt cctgagaatt tcaccactcg caatttgcca
     3061 gaagaatacg agttgaaatc taccaagaag ggctgtaaac gatactggac caaaactatt
     3121 gaaaagaagt tggctaatct cataaatgct gaagaacgga gggatgtatc attgaaggac
     3181 tgcatgcggc gactgttcta taactttgat aaaaattaca aggactggca gtctgctgta
     3241 gagtgtatcg cagtgttgga tgttttactg tgcctggcta actatagtcg agggggtgat
     3301 ggtcctatgt gtcgcccagt aattctgttg ccggaagata cccccccctt cttagagctt
     3361 aaaggatcac gccatccttg cattacgaag actttttttg gagatgattt tattcctaat
     3421 gacattctaa taggctgtga ggaagaggag caggaaaatg gcaaagccta ttgtgtgctt
     3481 gttactggac caaatatggg gggcaagtct acgcttatga gacaggctgg cttattagct
     3541 gtaatggccc agatgggttg ttacgtccct gctgaagtgt gcaggctcac accaattgat
     3601 agagtgttta ctagacttgg tgcctcagac agaataatgt caggtgaaag tacatttttt
     3661 gttgaattaa gtgaaactgc cagcatactc atgcatgcaa cagcacattc tctggtgctt
     3721 gtggatgaat taggaagagg tactgcaaca tttgatggga cggcaatagc aaatgcagtt
     3781 gttaaagaac ttgctgagac tataaaatgt cgtacattat tttcaactca ctaccattca
     3841 ttagtagaag attattctca aaatgttgct gtgcgcctag gacatatggc atgcatggta
     3901 gaaaatgaat gtgaagaccc cagccaggag actattacgt tcctctataa attcattaag
     3961 ggagcttgtc ctaaaagcta tggctttaat gcagcaaggc ttgctaatct cccagaggaa
     4021 gttattcaaa agggacatag aaaagcaaga gaatttgaga agatgaatca gtcactacga
     4081 ttatttcggg aagtttgcct ggctagtgaa aggtcaactg tagatgctga agctgtccat
     4141 aaattgctga ctttgattaa ggaattatag actgactaca ttggaagctt tgagttgact
     4201 tctgaccaaa ggtggtaaat tcagacaaca ttatgatcta ataaacttta ttttttaaaa
     4261 atga
//