LOCUS BC021566 3116 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens mutS homolog 2, colon cancer, nonpolyposis type 1 (E.
coli), mRNA (cDNA clone MGC:31906 IMAGE:4110354), complete cds.
ACCESSION BC021566
VERSION BC021566.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 3116)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 3116)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (14-JAN-2002) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Genome Sequence Centre,
BC Cancer Agency, Vancouver, BC, Canada
info@bcgsc.bc.ca
Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson
Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen
Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel
Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave
Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth
Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao,
Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR
Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang,
Jacquie Schein, Asim Siddiqui, Steven Jones, Rob Holt, Marco Marra.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 40 Row: c Column: 4.
FEATURES Location/Qualifiers
source 1..3116
/db_xref="H-InvDB:HIT000039221"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:31906 IMAGE:4110354"
/tissue_type="Muscle, rhabdomyosarcoma"
/clone_lib="NIH_MGC_17"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..3116
/gene="MSH2"
/gene_synonym="FCC1"
/gene_synonym="HNPCC"
/gene_synonym="HNPCC1"
/db_xref="GeneID:4436"
/db_xref="HGNC:HGNC:7325"
/db_xref="MIM:609309"
CDS 15..2819
/gene="MSH2"
/gene_synonym="FCC1"
/gene_synonym="HNPCC"
/gene_synonym="HNPCC1"
/codon_start=1
/product="mutS homolog 2, colon cancer, nonpolyposis type
1 (E. coli)"
/protein_id="AAH21566.1"
/db_xref="GeneID:4436"
/db_xref="HGNC:HGNC:7325"
/db_xref="MIM:609309"
/translation="MAVQPKETLQLESAAEVGFVRFFQGMPEKPTTTVRLFDRGDFYT
AHGEDALLAAREVFKTQGVIKYMGPAGAKNLQSVVLSKMNFESFVKDLLLVRQYRVEV
YKNRAGNKASKENDWYLAYKASPGNLSQFEDILFGNNDMSASIGVVGVKMSAVDGQRQ
VGVGYVDSIQRKLGLCEFPDNDQFSNLEALLIQIGPKECVLPGGETAGDMGKLRQIIQ
RGGILITERKKADFSTKDIYQDLNRLLKGKKGEQMNSAVLPEMENQVAVSSLSAVIKF
LELLSDDSNFGQFELTTFDFSQYMKLDIAAVRALNLFQGSVEDTTGSQSLAALLNKCK
TPQGQRLVNQWIKQPLMDKNRIEERLNLVEAFVEDAELRQTLQEDLLRRFPDLNRLAK
KFQRQAANLQDCYRLYQGINQLPNVIQALEKHEGKHQKLLLAVFVTPLTDLRSDFSKF
QEMIETTLDMDQVENHEFLVKPSFDPNLSELREIMNDLEKKMQSTLISAARDLGLDPG
KQIKLDSSAQFGYYFRVTCKEEKVLRNNKNFSTVDIQKNGVKFTNSKLTSLNEEYTKN
KTEYEEAQDAIVKEIVNISSGYVEPMQTLNDVLAQLDAVVSFAHVSNGAPVPYVRPAI
LEKGQGRIILKASRHACVEVQDEIAFIPNDVYFEKDKQMFHIITGPNMGGKSTYIRQT
GVIVLMAQIGCFVPCESAEVSIVDCILARVGAGDSQLKGVSTFMAEMLETASILRSAT
KDSLIIIDELGRGTSTYDGFGLAWAISEYIATKIGAFCMFATHFHELTALANQIPTVN
NLHVTALTTEETLTMLYQVKKGVCDQSFGIHVAELANFPKHVIECAKQKALELEEFQY
IGESQGYDIMEPAAKKCYLEREQGEKIIQEFLSKVKQMPFTEMSEENITIKLKQLKAE
VIAKNNSFVNEIISRIKVTT"
BASE COUNT 1008 a 540 c 704 g 864 t
ORIGIN
1 gaggaggttt cgacatggcg gtgcagccga aggagacgct gcagttggag agcgcggccg
61 aggtcggctt cgtgcgcttc tttcagggca tgccggagaa gccgaccacc acagtgcgcc
121 ttttcgaccg gggcgacttc tatacggcgc acggcgagga cgcgctgctg gccgcccggg
181 aggtgttcaa gacccagggg gtgatcaagt acatggggcc ggcaggagca aagaatctgc
241 agagtgttgt gcttagtaaa atgaattttg aatcttttgt aaaagatctt cttctggttc
301 gtcagtatag agttgaagtt tataagaata gagctggaaa taaggcatcc aaggagaatg
361 attggtattt ggcatataag gcttctcctg gcaatctctc tcagtttgaa gacattctct
421 ttggtaacaa tgatatgtca gcttccattg gtgttgtggg tgttaaaatg tccgcagttg
481 atggccagag acaggttgga gttgggtatg tggattccat acagaggaaa ctaggactgt
541 gtgaattccc tgataatgat cagttctcca atcttgaggc tctcctcatc cagattggac
601 caaaggaatg tgttttaccc ggaggagaga ctgctggaga catggggaaa ctgagacaga
661 taattcaaag aggaggaatt ctgatcacag aaagaaaaaa agctgacttt tccacaaaag
721 acatttatca ggacctcaac cggttgttga aaggcaaaaa gggagagcag atgaatagtg
781 ctgtattgcc agaaatggag aatcaggttg cagtttcatc actgtctgcg gtaatcaagt
841 ttttagaact cttatcagat gattccaact ttggacagtt tgaactgact acttttgact
901 tcagccagta tatgaaattg gatattgcag cagtcagagc ccttaacctt tttcagggtt
961 ctgttgaaga taccactggc tctcagtctc tggctgcctt gctgaataag tgtaaaaccc
1021 ctcaaggaca aagacttgtt aaccagtgga ttaagcagcc tctcatggat aagaacagaa
1081 tagaggagag attgaattta gtggaagctt ttgtagaaga tgcagaattg aggcagactt
1141 tacaagaaga tttacttcgt cgattcccag atcttaaccg acttgccaag aagtttcaaa
1201 gacaagcagc aaacttacaa gattgttacc gactctatca gggtataaat caactaccta
1261 atgttataca ggctctggaa aaacatgaag gaaaacacca gaaattattg ttggcagttt
1321 ttgtgactcc tcttactgat cttcgttctg acttctccaa gtttcaggaa atgatagaaa
1381 caactttaga tatggatcag gtggaaaacc atgaattcct tgtaaaacct tcatttgatc
1441 ctaatctcag tgaattaaga gaaataatga atgacttgga aaagaagatg cagtcaacat
1501 taataagtgc agccagagat cttggcttgg accctggcaa acagattaaa ctggattcca
1561 gtgcacagtt tggatattac tttcgtgtaa cctgtaagga agaaaaagtc cttcgtaaca
1621 ataaaaactt tagtactgta gatatccaga agaatggtgt taaatttacc aacagcaaat
1681 tgacttcttt aaatgaagag tataccaaaa ataaaacaga atatgaagaa gcccaggatg
1741 ccattgttaa agaaattgtc aatatttctt caggctatgt agaaccaatg cagacactca
1801 atgatgtgtt agctcagcta gatgctgttg tcagctttgc tcacgtgtca aatggagcac
1861 ctgttccata tgtacgacca gccattttgg agaaaggaca aggaagaatt atattaaaag
1921 catccaggca tgcttgtgtt gaagttcaag atgaaattgc atttattcct aatgacgtat
1981 actttgaaaa agataaacag atgttccaca tcattactgg ccccaatatg ggaggtaaat
2041 caacatatat tcgacaaact ggggtgatag tactcatggc ccaaattggg tgttttgtgc
2101 catgtgagtc agcagaagtg tccattgtgg actgcatctt agcccgagta ggggctggtg
2161 acagtcaatt gaaaggagtc tccacgttca tggctgaaat gttggaaact gcttctatcc
2221 tcaggtctgc aaccaaagat tcattaataa tcatagatga attgggaaga ggaacttcta
2281 cctacgatgg atttgggtta gcatgggcta tatcagaata cattgcaaca aagattggtg
2341 ctttttgcat gtttgcaacc cattttcatg aacttactgc cttggccaat cagataccaa
2401 ctgttaataa tctacatgtc acagcactca ccactgaaga gaccttaact atgctttatc
2461 aggtgaagaa aggtgtctgt gatcaaagtt ttgggattca tgttgcagag cttgctaatt
2521 tccctaagca tgtaatagag tgtgctaaac agaaagccct ggaacttgag gagtttcagt
2581 atattggaga atcgcaagga tatgatatca tggaaccagc agcaaagaag tgctatctgg
2641 aaagagagca aggtgaaaaa attattcagg agttcctgtc caaggtgaaa caaatgccct
2701 ttactgaaat gtcagaagaa aacatcacaa taaagttaaa acagctaaaa gctgaagtaa
2761 tagcaaagaa taatagcttt gtaaatgaaa tcatttcacg aataaaagtt actacgtgaa
2821 aaatcccagt aatggaatga aggtaatatt gataagctat tgtctgtaat agttttatat
2881 tgttttatat taaccctttt tccatagtgt taactgtcag tgcccatggg ctatcaactt
2941 aataagatat ttagtaatat tttactttga ggacattttc aaagattttt attttgaaaa
3001 atgagagctg taactgagga ctgtttgcaa ttgacatagg caataataag tgatgtgctg
3061 aattttataa ataaaatcat gtagtttgtg gaaaaaaaaa aaaaaaaaaa aaaaaa
//