LOCUS BC011817 2594 bp mRNA linear HUM 06-JAN-2004
DEFINITION Homo sapiens mutS homolog 3 (E. coli), mRNA (cDNA clone
IMAGE:3532592), partial cds.
ACCESSION BC011817
VERSION BC011817.2
KEYWORDS .
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2594)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2594)
AUTHORS Strausberg,R.
TITLE Direct Submission
JOURNAL Submitted (30-JUL-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Cancer Genomics Office, National Cancer
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Dec 19, 2003 this sequence version replaced BC011817.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: National Institutes of Health Intramural
Sequencing Center (NISC),
Gaithersburg, Maryland;
Web site: http://www.nisc.nih.gov/
Contact: nisc_mgc@nhgri.nih.gov
Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D., McCloskey,J.C.,
McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
Young,A., Zhang,L.-H. and Green,E.D.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 28 Row: c Column: 1
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 4505248.
FEATURES Location/Qualifiers
source 1..2594
/db_xref="H-InvDB:HIT000087714"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="IMAGE:3532592"
/tissue_type="Muscle, rhabdomyosarcoma"
/clone_lib="NIH_MGC_17"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..>2594
/gene="MSH3"
/db_xref="GeneID:4437"
/db_xref="MIM:600887"
CDS 49..>2594
/gene="MSH3"
/codon_start=1
/product="MSH3 protein"
/protein_id="AAH11817.1"
/db_xref="GeneID:4437"
/db_xref="MIM:600887"
/translation="MSRRKPASGGLAASSSAPARQAVLSRFFQSTGSLKSTSSSTGAA
DQVDPGAAAAAAPPAPAFPPQLPPHVATEIDRRKKRPLENDGPVKKKVKKVQQKEGGS
DLGMSGNSEPKKCLRTRNVSKSLEKLKEFCCDSALPQSRVQTESLQERFAVLPKCTDF
DDISLLHAKNAVSSEDSKRQINQKDTTLFDLSQFGSSNTSHENLQKTASKSANKRSKS
IYTPLELQYIEMKQQHKDAVLCVECGYKYRFFGEDAEIAARELNIYCHLDHNFMTASI
PTHRLFVHVRRLVAKGYKVGVVKQTETAALKAIGDNRSSLFSRKLTALYTKSTLIGED
VNPLIKLDDAVNVDEIMTDTSTSYLLCISENKENVRDKKKGNIFIGIVGVQPATGEVV
FDSFQDSASRSELETRMSSLQPVELLLPSALSEQTEALIHRATSVSVQDDRIRVERMD
NIYFEYSHAFQAVTEFYAKDTVDIKGSQIISGIVNLEKPVICSLAAIIKYLKEFNLEK
MLSKPENFKQLSSKMEFMTINGTTLRNLEILQNQTDMKTKGSLLWVLDHTKTSFGRRK
LKKWVTQPLLKLREINARLDAVSEVLHSESSVFGQIENHLRKLPDIERGLCSIYHKKC
STQEFFLIVKTLYHLKSEFQAIIPAVNSHIQSDLLRTVILEIPELLSPVEHYLKILNE
QAAKVGDKTELFKDLSDFPLIKKRKDEIQGVIDEIRMHLQEIRKILKNPSAQYVTVSG
QEFMIEIKNSAVSCIPTDWVKVGSTKAVSRFHSPFIVENYRHLNQLREQLVLDCSAEW
LDFLEKFSEHYHSLCKAVHHLATVDCIFSLAKVAKQGDYCRPTVQKKKKK"
misc_feature 709..1050
/gene="MSH3"
/note="MutS_I; Region: MutS domain I. This domain is found
in proteins of the MutS family (DNA mismatch repair
proteins) and is found associated with pfam00488,
pfam05188, pfam05192 and pfam05190. The MutS family of
proteins is named after the Salmonella typhimurium MutS
protein involved in mismatch repair"
/db_xref="CDD:pfam01624"
misc_feature 1117..1590
/gene="MSH3"
/note="MutS_II; Region: MutS domain II. This domain is
found in proteins of the MutS family (DNA mismatch repair
proteins) and is found associated with pfam00488,
pfam01624, pfam05192 and pfam05190. The MutS family of
proteins is named after the Salmonella typhimurium MutS
protein involved in mismatch repair"
/db_xref="CDD:pfam05188"
misc_feature 1684..2589
/gene="MSH3"
/note="MUTSd; Region: DNA-binding domain of DNA mismatch
repair MUTS family"
/db_xref="CDD:smart00533"
BASE COUNT 817 a 552 c 562 g 663 t
ORIGIN
1 cgcgctcctc gccgggccct gccgccgggc tgccatcctt gccctgccat gtctcgccgg
61 aagcctgcgt cgggcggcct cgctgcctcc agctcagccc ctgcgaggca agcggttttg
121 agccgattct tccagtctac gggaagcctg aaatccacct cctcctccac aggtgcagcc
181 gaccaggtgg accctggcgc tgcagcggcc gcagcgcccc cagcgcccgc cttcccgccc
241 cagctgccgc cgcacgtagc tacagaaatt gacagaagaa agaagagacc attggaaaat
301 gatgggcctg ttaaaaagaa agtaaagaaa gtccaacaaa aggaaggagg aagtgatctg
361 ggaatgtctg gcaactctga gccaaagaaa tgtctgagga ccaggaatgt ttcaaagtct
421 ctggaaaaat tgaaagaatt ctgctgcgat tctgcccttc ctcaaagtag agtccagaca
481 gaatctctgc aggagagatt tgcagttctg ccaaaatgta ctgattttga tgatatcagt
541 cttctacacg caaagaatgc agtttcttct gaagattcga aacgtcaaat taatcaaaag
601 gacacaacac tttttgatct cagtcagttt ggatcatcaa atacaagtca tgaaaattta
661 cagaaaactg cttccaaatc agctaacaaa cggtccaaaa gcatctatac gccgctagaa
721 ttacaataca tagaaatgaa gcagcagcac aaagatgcag ttttgtgtgt ggaatgtgga
781 tataagtata gattctttgg ggaagatgca gagattgcag cccgagagct caatatttat
841 tgccatttag atcacaactt tatgacagca agtataccta ctcacagact gtttgttcat
901 gtacgccgcc tggtggcaaa aggatataag gtgggagttg tgaagcaaac tgaaactgca
961 gcattaaagg ccattggaga caacagaagt tcactctttt cccggaaatt gactgccctt
1021 tatacaaaat ctacacttat tggagaagat gtgaatcccc taatcaagct ggatgatgct
1081 gtaaatgttg atgagataat gactgatact tctaccagct atcttctgtg catctctgaa
1141 aataaggaaa atgttaggga caaaaaaaag ggcaacattt ttattggcat tgtgggagtg
1201 cagcctgcca caggcgaggt tgtgtttgat agtttccagg actctgcttc tcgttcagag
1261 ctagaaaccc ggatgtcaag cctgcagcca gtagagctgc tgcttccttc ggccttgtcc
1321 gagcaaacag aggcgctcat ccacagagcc acatctgtta gtgtgcagga tgacagaatt
1381 cgagtcgaaa ggatggataa catttatttt gaatacagcc atgctttcca ggcagttaca
1441 gagttttatg caaaagatac agttgacatc aaaggttctc aaattatttc tggcattgtt
1501 aacttagaga agcctgtgat ttgctctttg gctgccatca taaaatacct caaagaattc
1561 aacttggaaa agatgctctc caaacctgag aattttaaac agctatcaag taaaatggaa
1621 tttatgacaa ttaatggaac aacattaagg aatctggaaa tcctacagaa tcagactgat
1681 atgaaaacca aaggaagttt gctgtgggtt ttagaccaca ctaaaacttc atttgggaga
1741 cggaagttaa agaagtgggt gacccagcca ctccttaaat taagggaaat aaatgcccgg
1801 cttgatgctg tatcggaagt tctccattca gaatctagtg tgtttggtca gatagaaaat
1861 catctacgta aattgcccga catagagagg ggactctgta gcatttatca caaaaaatgt
1921 tctacccaag agttcttctt gattgtcaaa actttatatc acctaaagtc agaatttcaa
1981 gcaataatac ctgctgttaa ttcccacatt cagtcagact tgctccggac cgttatttta
2041 gaaattcctg aactcctcag tccagtggag cattacttaa agatactcaa tgaacaagct
2101 gccaaagttg gggataaaac tgaattattt aaagaccttt ctgacttccc tttaataaaa
2161 aagaggaagg atgaaattca aggtgttatt gacgagatcc gaatgcattt gcaagaaata
2221 cgaaaaatac taaaaaatcc ttctgcacaa tatgtgacag tatcaggaca ggagtttatg
2281 atagaaataa agaactctgc tgtatcttgt ataccaactg attgggtaaa ggttggaagc
2341 acaaaagctg tgagccgctt tcactctcct tttattgtag aaaattacag acatctgaat
2401 cagctccggg agcagctagt ccttgactgc agtgctgaat ggcttgattt tctagagaaa
2461 ttcagtgaac attatcactc cttgtgtaaa gcagtgcatc acctagcaac tgttgactgc
2521 attttctccc tggccaaggt cgctaagcaa ggagattact gcagaccaac tgtacaaaaa
2581 aaaaaaaaaa aaaa
//