LOCUS HSU28946 4264 bp mRNA linear HUM 04-MAY-1996 DEFINITION Human G/T mismatch binding protein (GTBP) mRNA, complete cds. ACCESSION U28946 VERSION U28946.1 KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4264) AUTHORS Palombo,F., Gallinari,P., Iaccarino,I., Lettieri,T., Hughes,M., D'Arrigo,A., Truong,O., Hsuan,J.J. and Jiricny,J. TITLE GTBP, a 160-kilodalton protein essential for mismatch-binding activity in human cells JOURNAL Science 268 (5219), 1912-1914 (1995) PUBMED 7604265 REFERENCE 2 (bases 1 to 4264) AUTHORS Nicolaides,N.C., Palombo,F., Kinzler,K.W., Vogelstein,B. and Jiricny,J. TITLE Molecular cloning of the N-terminus of GTBP JOURNAL Genomics 31 (3), 395-397 (1996) PUBMED 8838326 REFERENCE 3 (bases 1 to 4264) AUTHORS Jiricny,J. TITLE Direct Submission JOURNAL Submitted (12-JUN-1995) Josef Jiricny, Genetics Department, Istituto di Ricerche di Biol. Molecolare P. Angeletti (IRBM), Via Pontina Km 30.600, Pomezia, 00040, Italy COMMENT On May 4, 1996 this sequence version replaced gi:902495. FEATURES Location/Qualifiers source 1..4264 /db_xref="H-InvDB:HIT000218941" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /chromosome="2" /map="2p16" /clone="C1" /cell_line="HeLa" /clone_lib="HeLa S3 cDNA in lambda Uni-ZAP XR (Stratagene)" gene 1..4264 /gene="GTBP" CDS 88..4170 /gene="GTBP" /note="homolog of bacterial MutS proteins; binds to G/T mismatches through heterodimerization with hMSH2; similar to ORF YD8557.04c, probable DNA repair protein, from S. cerevisiae chromosome IV cosmid 8557, PIR Accession Number S51246; somatic mutations found in colon cancer" /codon_start=1 /product="G/T mismatch binding protein" /protein_id="AAC50461.1" /translation="MSRQSTLYSFFPKSPALSDANKASARASREGGRAAAAPGASPSP GGDAAWSEAGPGPRPLARSASPPKAKNLNGGLRRSVAPAAPTSCDFSPGDLVWAKMEG YPWWPCLVYNHPFDGTFIREKGKSVRVHVQFFDDSPTRGWVSKRLLKPYTGSKSKEAQ KGGHFYSAKPEILRAMQRADEALNKDKIKRLELAVCDEPSEPEEEEEMEVGTTYVTDK SEEDNEIESEEEVQPKTQGSRRSSRQIKKRRVISDSESDIGGSDVEFKPDTKEEGSSD EISSGVGDSESEGLNSPVKVARKRKRMVTGNGSLKRKSSRKETPSATKQATSISSETK NTLRAFSAPQNSESQAHVSGGGDDSSRPTVWYHETLEWLKEEKRRDEHRRRPDHPDFD ASTLYVPEDFLNSCTPGMRKWWQIKSQNFDLVICYKVGKFYELYHMDALIGVSELGLV FMKGNWAHSGFPEIAFGRYSDSLVQKGYKVARVEQTETPEMMEARCRKMAHISKYDRV VRREICRIITKGTQTYSVLEGDPSENYSKYLLSLKEKEEDSSGHTRAYGVCFVDTSLG KFFIGQFSDDRHCSRFRTLVAHYPPVQVLFEKGNLSKETKTILKSSLSCSLQEGLIPG SQFWDASKTLRTLLEEEYFREKLSDGIGVMLPQVLKGMTSESDSIGLTPGEKSELALS ALGGCVFYLKKCLIDQELLSMANFEEYIPLDSDTVSTTRSGAIFTKAYQRMVLDAVTL NNLEIFLNGTNGSTEGTLLERVDTCHTPFGKRLLKQWLCAPLCNHYAINDRLDAIEDL MVVPDKISEVVELLKKLPDLERLLSKIHNVGSPLKSQNHPDSRAIMYEETTYSKKKII DFLSALEGFKVMCKIIGIMEEVADGFKSKILKQVISLQTKNPEGRFPDLTVELNRWDT AFDHEKARKTGLITPKAGFDSDYDQALADIRENEQSLLEYLEKQRNRIGCRTIVYWGI GRNRYQLEIPENFTTRNLPEEYELKSTKKGCKRYWTKTIEKKLANLINAEERRDVSLK DCMRRLFYNFDKNYKDWQSAVECIAVLDVLLCLANYSRGGDGPMCRPVILLPEDTPPF LELKGSRHPCITKTFFGDDFIPNDILIGCEEEEQENGKAYCVLVTGPNMGGKSTLMRQ AGLLAVMAQMGCYVPAEVCRLTPIDRVFTRLGASDRIMSGESTFFVELSETASILMHA TAHSLVLVDELGRGTATFDGTAIANAVVKELAETIKCRTLFSTHYHSLVEDYSQNVAV RLGHMACMVENECEDPSQETITFLYKFIKGACPKSYGFNAARLANLPEEVIQKGHRKA REFEKMNQSLRLFREVCLASERSTVDAEAVHKLLTLIKEL" BASE COUNT 1249 a 840 c 1076 g 1099 t ORIGIN 1 atttcccgcc agcaggagcc gcgcggtaga tgcggtgctt ttaggagctc cgtccgacag 61 aacggttggg ccttgccggc tgtcggtatg tcgcgacaga gcaccctgta cagcttcttc 121 cccaagtctc cggcgctgag tgatgccaac aaggcctcgg ccagggcctc acgcgaaggc 181 ggccgtgccg ccgctgcccc cggggcctct ccttccccag gcggggatgc ggcctggagc 241 gaggctgggc ctgggcccag gcccttggcg cgatccgcgt caccgcccaa ggcgaagaac 301 ctcaacggag ggctgcggag atcggtagcg cctgctgccc ccaccagttg tgacttctca 361 ccaggagatt tggtttgggc caagatggag ggttacccct ggtggccttg tctggtttac 421 aaccacccct ttgatggaac attcatccgc gagaaaggga aatcagtccg tgttcatgta 481 cagttttttg atgacagccc aacaaggggc tgggttagca aaaggctttt aaagccatat 541 acaggttcaa aatcaaagga agcccagaag ggaggtcatt tttacagtgc aaagcctgaa 601 atactgagag caatgcaacg tgcagatgaa gccttaaata aagacaagat taagaggctt 661 gaattggcag tttgtgatga gccctcagag ccagaagagg aagaagagat ggaggtaggc 721 acaacttacg taacagataa gagtgaagaa gataatgaaa ttgagagtga agaggaagta 781 cagcctaaga cacaaggatc taggcgaagt agccgccaaa taaaaaaacg aagggtcata 841 tcagattctg agagtgacat tggtggctct gatgtggaat ttaagccaga cactaaggag 901 gaaggaagca gtgatgaaat aagcagtgga gtgggggata gtgagagtga aggcctgaac 961 agccctgtca aagttgctcg aaagcggaag agaatggtga ctggaaatgg ctctcttaaa 1021 aggaaaagct ctaggaagga aacgccctca gccaccaaac aagcaactag catttcatca 1081 gaaaccaaga atactttgag agctttctct gcccctcaaa attctgaatc ccaagcccac 1141 gttagtggag gtggtgatga cagtagtcgc cctactgttt ggtatcatga aactttagaa 1201 tggcttaagg aggaaaagag aagagatgag cacaggagga ggcctgatca ccccgatttt 1261 gatgcatcta cactctatgt gcctgaggat ttcctcaatt cttgtactcc tgggatgagg 1321 aagtggtggc agattaagtc tcagaacttt gatcttgtca tctgttacaa ggtggggaaa 1381 ttttatgagc tgtaccacat ggatgctctt attggagtca gtgaactggg gctggtattc 1441 atgaaaggca actgggccca ttctggcttt cctgaaattg catttggccg ttattcagat 1501 tccctggtgc agaagggcta taaagtagca cgagtggaac agactgagac tccagaaatg 1561 atggaggcac gatgtagaaa gatggcacat atatccaagt atgatagagt ggtgaggagg 1621 gagatctgta ggatcattac caagggtaca cagacttaca gtgtgctgga aggtgatccc 1681 tctgagaact acagtaagta tcttcttagc ctcaaagaaa aagaggaaga ttcttctggc 1741 catactcgtg catatggtgt gtgctttgtt gatacttcac tgggaaagtt tttcataggt 1801 cagttttcag atgatcgcca ttgttcgaga tttaggactc tagtggcaca ctatccccca 1861 gtacaagttt tatttgaaaa aggaaatctc tcaaaggaaa ctaaaacaat tctaaagagt 1921 tcattgtcct gttctcttca ggaaggtctg atacccggct cccagttttg ggatgcatcc 1981 aaaactttga gaactctcct tgaggaagaa tattttaggg aaaagctaag tgatggcatt 2041 ggggtgatgt taccccaggt gcttaaaggt atgacttcag agtctgattc cattgggttg 2101 acaccaggag agaaaagtga attggccctc tctgctctag gtggttgtgt cttctacctc 2161 aaaaaatgcc ttattgatca ggagctttta tcaatggcta attttgaaga atatattccc 2221 ttggattctg acacagtcag cactacaaga tctggtgcta tcttcaccaa agcctatcaa 2281 cgaatggtgc tagatgcagt gacattaaac aacttggaga tttttctgaa tggaacaaat 2341 ggttctactg aaggaaccct actagagagg gttgatactt gccatactcc ttttggtaag 2401 cggctcctaa agcaatggct ttgtgcccca ctctgtaacc attatgctat taatgatcgt 2461 ctagatgcca tagaagacct catggttgtg cctgacaaaa tctccgaagt tgtagagctt 2521 ctaaagaagc ttccagatct tgagaggcta ctcagtaaaa ttcataatgt tgggtctccc 2581 ctgaagagtc agaaccaccc agacagcagg gctataatgt atgaagaaac tacatacagc 2641 aagaagaaga ttattgattt tctttctgct ctggaaggat tcaaagtaat gtgtaaaatt 2701 atagggatca tggaagaagt tgctgatggt tttaagtcta aaatccttaa gcaggtcatc 2761 tctctgcaga caaaaaatcc tgaaggtcgt tttcctgatt tgactgtaga attgaaccga 2821 tgggatacag cctttgacca tgaaaaggct cgaaagactg gacttattac tcccaaagca 2881 ggctttgact ctgattatga ccaagctctt gctgacataa gagaaaatga acagagcctc 2941 ctggaatacc tagagaaaca gcgcaacaga attggctgta ggaccatagt ctattggggg 3001 attggtagga accgttacca gctggaaatt cctgagaatt tcaccactcg caatttgcca 3061 gaagaatacg agttgaaatc taccaagaag ggctgtaaac gatactggac caaaactatt 3121 gaaaagaagt tggctaatct cataaatgct gaagaacgga gggatgtatc attgaaggac 3181 tgcatgcggc gactgttcta taactttgat aaaaattaca aggactggca gtctgctgta 3241 gagtgtatcg cagtgttgga tgttttactg tgcctggcta actatagtcg agggggtgat 3301 ggtcctatgt gtcgcccagt aattctgttg ccggaagata cccccccctt cttagagctt 3361 aaaggatcac gccatccttg cattacgaag actttttttg gagatgattt tattcctaat 3421 gacattctaa taggctgtga ggaagaggag caggaaaatg gcaaagccta ttgtgtgctt 3481 gttactggac caaatatggg gggcaagtct acgcttatga gacaggctgg cttattagct 3541 gtaatggccc agatgggttg ttacgtccct gctgaagtgt gcaggctcac accaattgat 3601 agagtgttta ctagacttgg tgcctcagac agaataatgt caggtgaaag tacatttttt 3661 gttgaattaa gtgaaactgc cagcatactc atgcatgcaa cagcacattc tctggtgctt 3721 gtggatgaat taggaagagg tactgcaaca tttgatggga cggcaatagc aaatgcagtt 3781 gttaaagaac ttgctgagac tataaaatgt cgtacattat tttcaactca ctaccattca 3841 ttagtagaag attattctca aaatgttgct gtgcgcctag gacatatggc atgcatggta 3901 gaaaatgaat gtgaagaccc cagccaggag actattacgt tcctctataa attcattaag 3961 ggagcttgtc ctaaaagcta tggctttaat gcagcaaggc ttgctaatct cccagaggaa 4021 gttattcaaa agggacatag aaaagcaaga gaatttgaga agatgaatca gtcactacga 4081 ttatttcggg aagtttgcct ggctagtgaa aggtcaactg tagatgctga agctgtccat 4141 aaattgctga ctttgattaa ggaattatag actgactaca ttggaagctt tgagttgact 4201 tctgaccaaa ggtggtaaat tcagacaaca ttatgatcta ataaacttta ttttttaaaa 4261 atga //