LOCUS       BC006850                2530 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens mutL homolog 1, colon cancer, nonpolyposis type 2 (E.
            coli), mRNA (cDNA clone MGC:5172 IMAGE:3451538), complete cds.
ACCESSION   BC006850
VERSION     BC006850.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2530)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2530)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (27-APR-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Baylor College of Medicine Human Genome
            Sequencing Center
            Center code: BCM-HGSC
            Web site: http://www.hgsc.bcm.tmc.edu/cdna/
            Contact: amg@bcm.tmc.edu
            Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
            Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
            A.N., Gibbs, R.A.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 3 Row: l Column: 24
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 28559089.
FEATURES             Location/Qualifiers
     source          1..2530
                     /db_xref="H-InvDB:HIT000032957"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:5172 IMAGE:3451538"
                     /tissue_type="Placenta, choriocarcinoma"
                     /clone_lib="NIH_MGC_10"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..2530
                     /gene="MLH1"
                     /gene_synonym="FCC2"
                     /gene_synonym="hMLH1"
                     /gene_synonym="HNPCC"
                     /gene_synonym="HNPCC2"
                     /gene_synonym="MGC5172"
                     /db_xref="GeneID:4292"
                     /db_xref="HGNC:HGNC:7127"
                     /db_xref="MIM:120436"
     CDS             51..2321
                     /gene="MLH1"
                     /gene_synonym="FCC2"
                     /gene_synonym="hMLH1"
                     /gene_synonym="HNPCC"
                     /gene_synonym="HNPCC2"
                     /gene_synonym="MGC5172"
                     /codon_start=1
                     /product="mutL homolog 1, colon cancer, nonpolyposis type
                     2 (E. coli)"
                     /protein_id="AAH06850.1"
                     /db_xref="GeneID:4292"
                     /db_xref="HGNC:HGNC:7127"
                     /db_xref="MIM:120436"
                     /translation="MSFVAGVIRRLDETVVNRIAAGEVIQRPANAIKEMIENCLDAKS
                     TSIQVIVKEGGLKLIQIQDNGTGIRKEDLDIVCERFTTSKLQSFEDLASISTYGFRGE
                     ALASISHVAHVTITTKTADGKCAYRASYSDGKLKAPPKPCAGNQGTQITVEDLFYNIA
                     TRRKALKNPSEEYGKILEVVGRYSVHNAGISFSVKKQGETVADVRTLPNASTVDNIRS
                     IFGNAVSRELIEIGCEDKTLAFKMNGYISNANYSVKKCIFLLFINHRLVESTSLRKAI
                     ETVYAAYLPKNTHPFLYLSLEISPQNVDVNVHPTKHEVHFLHEESILERVQQHIESKL
                     LGSNSSRMYFTQTLLPGLAGPSGEMVKSTTSLTSSSTSGSSDKVYAHQMVRTDSREQK
                     LDAFLQPLSKPLSSQPQAIVTEDKTDISSGRARQQDEEMLELPAPAEVAAKNQSLEGD
                     TTKGTSEMSEKRGPTSSNPRKRHREDSDVEMVEDDSRKEMTAACTPRRRIINLTSVLS
                     LQEEINEQGHEVLREMLHNHSFVGCVNPQWALAQHQTKLYLLNTTKLSEELFYQILIY
                     DFANFGVLRLSEPAPLFDLAMLALDSPESGWTEEDGPKEGLAEYIVEFLKKKAEMLAD
                     YFSLEIDEEGNLIGLPLLIDNYVPPLEGLPIFILRLATEVNWDEEKECFESLSKECAM
                     FYSIRKQYISEESTLSGQQSEVPGSIPNSWKWTVEHIVYKALRSHILPPKHFTEDGNI
                     LQLANLPDLYKVFERC"
BASE COUNT      742 a    543 c    601 g    644 t
ORIGIN
        1 ggcacttccg ttgagcatct agacgtttcc ttggctcttc tggcgccaaa atgtcgttcg
       61 tggcaggggt tattcggcgg ctggacgaga cagtggtgaa ccgcatcgcg gcgggggaag
      121 ttatccagcg gccagctaat gctatcaaag agatgattga gaactgttta gatgcaaaat
      181 ccacaagtat tcaagtgatt gttaaagagg gaggcctgaa gttgattcag atccaagaca
      241 atggcaccgg gatcaggaaa gaagatctgg atattgtatg tgaaaggttc actactagta
      301 aactgcagtc ctttgaggat ttagccagta tttctaccta tggctttcga ggtgaggctt
      361 tggccagcat aagccatgtg gctcatgtta ctattacaac gaaaacagct gatggaaagt
      421 gtgcatacag agcaagttac tcagatggaa aactgaaagc ccctcctaaa ccatgtgctg
      481 gcaatcaagg gacccagatc acggtggagg acctttttta caacatagcc acgaggagaa
      541 aagctttaaa aaatccaagt gaagaatatg ggaaaatttt ggaagttgtt ggcaggtatt
      601 cagtacacaa tgcaggcatt agtttctcag ttaaaaaaca aggagagaca gtagctgatg
      661 ttaggacact acccaatgcc tcaaccgtgg acaatattcg ctccatcttt ggaaatgctg
      721 ttagtcgaga actgatagaa attggatgtg aggataaaac cctagccttc aaaatgaatg
      781 gttacatatc caatgcaaac tactcagtga agaagtgcat cttcttactc ttcatcaacc
      841 atcgtctggt agaatcaact tccttgagaa aagccataga aacagtgtat gcagcctatt
      901 tgcccaaaaa cacacaccca ttcctgtacc tcagtttaga aatcagtccc cagaatgtgg
      961 atgttaatgt gcaccccaca aagcatgaag ttcacttcct gcacgaggag agcatcctgg
     1021 agcgggtgca gcagcacatc gagagcaagc tcctgggctc caattcctcc aggatgtact
     1081 tcacccagac tttgctacca ggacttgctg gcccctctgg ggagatggtt aaatccacaa
     1141 caagtctgac ctcgtcttct acttctggaa gtagtgataa ggtctatgcc caccagatgg
     1201 ttcgtacaga ttcccgggaa cagaagcttg atgcatttct gcagcctctg agcaaacccc
     1261 tgtccagtca gccccaggcc attgtcacag aggataagac agatatttct agtggcaggg
     1321 ctaggcagca agatgaggag atgcttgaac tcccagcccc tgctgaagtg gctgccaaaa
     1381 atcagagctt ggagggggat acaacaaagg ggacttcaga aatgtcagag aagagaggac
     1441 ctacttccag caaccccaga aagagacatc gggaagattc tgatgtggaa atggtggaag
     1501 atgattcccg aaaggaaatg actgcagctt gtaccccccg gagaaggatc attaacctca
     1561 ctagtgtttt gagtctccag gaagaaatta atgagcaggg acatgaggtt ctccgggaga
     1621 tgttgcataa ccactccttc gtgggctgtg tgaatcctca gtgggccttg gcacagcatc
     1681 aaaccaagtt ataccttctc aacaccacca agcttagtga agaactgttc taccagatac
     1741 tcatttatga ttttgccaat tttggtgttc tcaggttatc ggagccagca ccgctctttg
     1801 accttgccat gcttgcctta gatagtccag agagtggctg gacagaggaa gatggtccca
     1861 aagaaggact tgctgaatac attgttgagt ttctgaagaa gaaggctgag atgcttgcag
     1921 actatttctc tttggaaatt gatgaggaag ggaacctgat tggattaccc cttctgattg
     1981 acaactatgt gccccctttg gagggactgc ctatcttcat tcttcgacta gccactgagg
     2041 tgaattggga cgaagaaaag gaatgttttg aaagcctcag taaagaatgc gctatgttct
     2101 attccatccg gaagcagtac atatctgagg agtcgaccct ctcaggccag cagagtgaag
     2161 tgcctggctc cattccaaac tcctggaagt ggactgtgga acacattgtc tataaagcct
     2221 tgcgctcaca cattctgcct cctaaacatt tcacagaaga tggaaatatc ctgcagcttg
     2281 ctaacctgcc tgatctatac aaagtctttg agaggtgtta aatatggtta tttatgcact
     2341 gtgggatgtg ttcttctttc tctgtattcc gatacaaagt gttgtatcaa agtgtgatat
     2401 acaaagtgta ccaacataag tgttggtagc acttaagact tatacttgcc ttctgatagt
     2461 attcctttat acacagtgga ttgattataa ataaatagat gtgtcttaac ataaaaaaaa
     2521 aaaaaaaaaa
//