LOCUS       BC110523                2337 bp    mRNA    linear   HUM 07-AUG-2008
DEFINITION  Homo sapiens excision repair cross-complementing rodent repair
            deficiency, complementation group 2, mRNA (cDNA clone MGC:126219
            IMAGE:40034048), complete cds.
ACCESSION   BC110523
VERSION     BC110523.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2337)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2337)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (07-DEC-2005) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Baylor Human Genome Sequencing Center
            cDNA Library Preparation: Baylor Human Genome Sequencing Center
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA  94305
            Web site:       http://www-shgc.stanford.edu
            Contact:  (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
            R. M.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAM Plate: 14 Row: h Column: 2.
FEATURES             Location/Qualifiers
     source          1..2337
                     /db_xref="H-InvDB:HIT000339587"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:126219 IMAGE:40034048"
                     /tissue_type="PCR rescued clones"
                     /clone_lib="NIH_MGC_283"
                     /note="Vector: pCR-Blunt II-TOPO with reversed insert;
                     Clone identification sequence tag: GCTTTAAC sequenced from
                     the forward primer"
     gene            1..2337
                     /gene="ERCC2"
                     /gene_synonym="COFS2"
                     /gene_synonym="EM9"
                     /gene_synonym="TTD"
                     /db_xref="GeneID:2068"
                     /db_xref="HGNC:HGNC:3434"
                     /db_xref="MIM:126340"
     CDS             29..2311
                     /gene="ERCC2"
                     /gene_synonym="COFS2"
                     /gene_synonym="EM9"
                     /gene_synonym="TTD"
                     /codon_start=1
                     /product="excision repair cross-complementing rodent
                     repair deficiency, complementation group 2"
                     /protein_id="AAI10524.1"
                     /db_xref="GeneID:2068"
                     /db_xref="HGNC:HGNC:3434"
                     /db_xref="MIM:126340"
                     /translation="MKLNVDGLLVYFPYDYIYPEQFSYMRELKRTLDAKGHGVLEMPS
                     GTGKTVSLLALIMAYQRAYPLEVTKLIYCSRTVPEIEKVIEELRKLLNFYEKQEGEKL
                     PFLGLALSSRKNLCIHPEVTPLRFGKDVDGKCHSLTASYVRAQYQHDTSLPHCRFYEE
                     FDAHGREVPLPAGIYNLDDLKALGRRQGWCPYFLARYSILHANVVVYSYHYLLDPKIA
                     DLVSKELARKAVVVFDEAHNIDNVCIDSMSVNLTRRTLDRCQGNLETLQKTVLRIKET
                     DEQRLRDEYRRLVEGLREASAARETDAHLANPVLPDEVLQEAVPGSIRTAEHFLGFLR
                     RLLEYVKWRLRVQHVVQESPPAFLSGLAQRVCIQRKPLRFCAERLRSLLHTLEITDLA
                     DFSPLTLLANFATLVSTYAKGFTIIIEPFDDRTPTIANPILHFSCMDASLAIKPVFER
                     FQSVIITSGTLSPLDIYPKILDFHPVTMATFTMTLARVCLCPMIIGRGNDQVAISSKF
                     ETREDIAVIRNYGNLLLEMSAVVPDGIVAFFTSYQYMESTVASWYEQGILENIQRNKL
                     LFIETQDGAETSVALEKYQEACENGRGAILLSVARGKVSEGIDFVHHYGRAVIMFGVP
                     YVYTQSRILKARLEYLRDQFQIRENDFLTFDAMRHAAQCVGRAIRGKTDYGLMVFADK
                     RFARGDKRGKLPRWIQEHLTDANLNLTVDEGVQVAKYFLRQMAQPFHREDQLGLSLLS
                     LEQLESEETLQRIEQIAQQL"
BASE COUNT          475 a          732 c          679 g          451 t
ORIGIN      
        1 gaccccgctg cacagtccgg ccggcgccat gaagctcaac gtggacgggc tcctggtcta
       61 cttcccgtac gactacatct accccgagca gttctcctac atgcgggagc tcaaacgcac
      121 gctggacgcc aagggtcatg gagtcctgga gatgccctca ggcaccggga agacagtatc
      181 cctgttggcc ctgatcatgg cataccagag agcatatccg ctggaggtga ccaaactcat
      241 ctactgctca agaactgtgc cagagattga gaaggtgatt gaagagcttc gaaagttgct
      301 caacttctat gagaagcagg agggcgagaa gctgccgttt ctgggactgg ctctgagctc
      361 ccgcaaaaac ttgtgtattc accctgaggt gacacccctg cgctttggga aggacgtcga
      421 tgggaaatgc cacagcctca cagcctccta tgtgcgggcg cagtaccagc atgacaccag
      481 cctgccccac tgccgattct atgaggaatt tgatgcccat gggcgtgagg tgcccctccc
      541 cgctggcatc tacaacctgg atgacctgaa ggccctgggg cggcgccagg gctggtgccc
      601 atacttcctt gctcgatact caatcctgca tgccaatgtg gtggtttata gctaccacta
      661 cctcctggac cccaagattg cagacctggt gtccaaggaa ctggcccgca aggccgtcgt
      721 ggtcttcgac gaggcccaca acattgacaa cgtctgcatc gactccatga gcgtcaacct
      781 cacccgccgg acccttgacc ggtgccaggg caacctggag accctgcaga agacggtgct
      841 caggatcaaa gagacagacg agcagcgcct gcgggacgag taccggcgtc tggtggaggg
      901 gctgcgggag gccagcgccg cccgggagac ggacgcccac ctggccaacc ccgtgctgcc
      961 cgacgaagtg ctgcaggagg cagtgcctgg ctccatccgc acggccgagc atttcctggg
     1021 cttcctgagg cggctgctgg agtacgtgaa gtggcggctg cgtgtgcagc atgtggtgca
     1081 ggagagcccg cccgccttcc tgagcggcct ggcccagcgc gtgtgcatcc agcgcaagcc
     1141 cctcagattc tgtgctgaac gcctccggtc cctgctgcat actctggaga tcaccgacct
     1201 tgctgacttc tccccgctca ccctccttgc taactttgcc acccttgtca gcacctacgc
     1261 caaaggcttc accatcatca tcgagccctt tgacgacaga accccgacca ttgccaaccc
     1321 catcctgcac ttcagctgca tggacgcctc gctggccatc aaacccgtat ttgagcgttt
     1381 ccagtctgtc atcatcacat ctgggacact gtccccgctg gacatctacc ccaagatcct
     1441 ggacttccac cccgtcacca tggcaacctt caccatgacg ctggcacggg tctgcctctg
     1501 ccctatgatc atcggccgtg gcaatgacca ggtggccatc agctccaaat ttgagacccg
     1561 ggaggatatt gctgtgatcc ggaactatgg gaacctcctg ctggagatgt ccgctgtggt
     1621 ccctgatggc atcgtggcct tcttcaccag ctaccagtac atggagagca ccgtggcctc
     1681 ctggtatgag caggggatcc ttgagaacat ccagaggaac aagctgctct ttattgagac
     1741 ccaggatggt gccgaaacca gtgtcgccct ggagaagtac caggaggcct gcgagaatgg
     1801 ccgcggggcc atcctgctgt cagtggcccg gggcaaagtg tccgagggaa tcgactttgt
     1861 gcaccactac gggcgggccg tcatcatgtt tggcgtcccc tacgtctaca cacagagccg
     1921 cattctcaag gcgcggctgg aatacctgcg ggaccagttc cagattcgtg agaatgactt
     1981 tcttaccttc gatgccatgc gccacgcggc ccagtgtgtg ggtcgggcca tcaggggcaa
     2041 gacggactac ggcctcatgg tctttgccga caagcggttt gcccgtgggg acaagcgggg
     2101 gaagctgccc cgctggatcc aggagcacct cacagatgcc aacctcaacc tgaccgtgga
     2161 cgagggtgtc caggtggcca agtacttcct gcggcagatg gcacagccct tccaccggga
     2221 ggatcagctg ggcctgtccc tgctcagcct ggagcagcta gaatcagagg agacgctgca
     2281 gaggatagag cagattgctc agcagctctg agtggggcgg gtggggccat aaacggt
//