LOCUS       BC110522                2103 bp    mRNA    linear   HUM 07-AUG-2008
DEFINITION  Homo sapiens excision repair cross-complementing rodent repair
            deficiency, complementation group 2, mRNA (cDNA clone MGC:126218
            IMAGE:40034047), complete cds.
ACCESSION   BC110522
VERSION     BC110522.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2103)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2103)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (07-DEC-2005) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Baylor Human Genome Sequencing Center
            cDNA Library Preparation: Baylor Human Genome Sequencing Center
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA  94305
            Web site:       http://www-shgc.stanford.edu
            Contact:  (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
            R. M.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAM Plate: 14 Row: h Column: 1.
FEATURES             Location/Qualifiers
     source          1..2103
                     /db_xref="H-InvDB:HIT000339586"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:126218 IMAGE:40034047"
                     /tissue_type="PCR rescued clones"
                     /clone_lib="NIH_MGC_283"
                     /note="Vector: pCR-Blunt II-TOPO with reversed insert;
                     Clone identification sequence tag: CAATTCTA sequenced from
                     the forward primer"
     gene            1..2103
                     /gene="ERCC2"
                     /gene_synonym="COFS2"
                     /gene_synonym="EM9"
                     /gene_synonym="TTD"
                     /db_xref="GeneID:2068"
                     /db_xref="HGNC:HGNC:3434"
                     /db_xref="MIM:126340"
     CDS             29..2077
                     /gene="ERCC2"
                     /gene_synonym="COFS2"
                     /gene_synonym="EM9"
                     /gene_synonym="TTD"
                     /codon_start=1
                     /product="ERCC2 protein"
                     /protein_id="AAI10523.1"
                     /db_xref="GeneID:2068"
                     /db_xref="HGNC:HGNC:3434"
                     /db_xref="MIM:126340"
                     /translation="MKLNVDGLLVYFPYDYIYPEQFSYMRELKRTLDAKGHGVLEMPS
                     GTGKTVSLLALIMAYQRAYPLEVTKLIYCSRTVPEIEKVIEELRKLLNFYEKQEGEKL
                     PFLGLALSSRKNLCIHPEILHANVVVYSYHYLLDPKIADLVSKELARKAVVVFDEAHN
                     IDNVCIDSMSVNLTRRTLDRCQGNLETLQKTVLRIKETDEQRLRDEYRRLVEGLREAS
                     AARETDAHLANPVLPNEVLQEAVPGSIRTAEHFLGFLRRLLEYVKWRLRVQHVVQESP
                     PAFLSGLAQRVCIQRKPLRFCAERLRSLLHTLEITDLADFSPLTLLANFATLVSTYAK
                     GFTIIIEPFDDRTPTIANPILHFSCMDASLAIKPVFERFQSVIITSGTLSPLDIYPKI
                     LDFHPVTMATFTMTLARVCLCPMIIGRGNDQVAISSKFETREDIAVIRNYGNLLLEMS
                     AVVPDGIVAFFTSYQYMESTVASWYEQGILENIQRNKLLFIETQDGAETSVALEKYQE
                     ACENGRGAILLSVARGKVSEGIDFVHHYGRAVIMFGVPYVYTQSRILKARLEYLRDQF
                     QIRENDFLTFDAMRHAAQCVGRAIRGKTDYGLMVFADKRFARGDKRGKLPRWIQEHLT
                     DANLNLTVDEGVQVAKYFLRQMAQPFHREDQLGLSLLSLEQLESEETLQRIEQIAQQL
                     "
BASE COUNT          432 a          654 c          611 g          406 t
ORIGIN      
        1 gaccccgctg cacagtccgg ccggcgccat gaagctcaac gtggacgggc tcctggtcta
       61 cttcccgtac gactacatct accccgagca gttctcctac atgcgggagc tcaaacgcac
      121 gctggacgcc aagggtcatg gagtcctgga gatgccctca ggcaccggga agacagtatc
      181 cctgttggcc ctgatcatgg cataccagag agcatatccg ctggaggtga ccaaactcat
      241 ctactgctca agaactgtgc cagagattga gaaggtgatt gaagagcttc gaaagttgct
      301 caacttctat gagaagcagg agggcgagaa gctgccgttt ctgggactgg ctctgagctc
      361 ccgcaaaaac ttgtgtattc accctgagat cctgcatgcc aatgtggtgg tttatagcta
      421 ccactacctc ctggacccca agattgcaga cctggtgtcc aaggaactgg cccgcaaggc
      481 cgtcgtggtc ttcgacgagg cccacaacat tgacaacgtc tgcatcgact ccatgagcgt
      541 caacctcacc cgccggaccc ttgaccggtg ccagggcaac ctggagaccc tgcagaagac
      601 ggtgctcagg atcaaagaga cagacgagca gcgcctgcgg gacgagtacc ggcgtctggt
      661 ggaggggctg cgggaggcca gcgccgcccg ggagacggac gcccacctgg ccaaccccgt
      721 gctgcccaac gaagtgctgc aggaggcagt gcctggctcc atccgcacgg ccgagcattt
      781 cctgggcttc ctgaggcggc tgctggagta cgtgaagtgg cggctgcgtg tgcagcatgt
      841 ggtgcaggag agcccgcccg ccttcctgag cggcctggcc cagcgcgtgt gcatccagcg
      901 caagcccctc agattctgtg ctgaacgcct ccggtccctg ctgcatactc tggagatcac
      961 cgaccttgct gacttctccc cgctcaccct ccttgctaac tttgccaccc ttgtcagcac
     1021 ctacgccaaa ggcttcacca tcatcatcga gccctttgac gacagaaccc cgaccattgc
     1081 caaccccatc ctgcacttca gctgcatgga cgcctcgctg gccatcaaac ccgtatttga
     1141 gcgtttccag tctgtcatca tcacatctgg gacactgtcc ccgctggaca tctaccccaa
     1201 gatcctggac ttccaccccg tcaccatggc aaccttcacc atgacgctgg cacgggtctg
     1261 cctctgccct atgatcatcg gccgtggcaa tgaccaggtg gccatcagct ccaaatttga
     1321 gacccgggag gatattgctg tgatccggaa ctatgggaac ctcctgctgg agatgtccgc
     1381 tgtggtccct gatggcatcg tggccttctt caccagctac cagtacatgg agagcaccgt
     1441 ggcctcctgg tatgagcagg ggatccttga gaacatccag aggaacaagc tgctctttat
     1501 tgagacccag gatggtgccg aaaccagtgt cgccctggag aagtaccagg aggcctgcga
     1561 gaatggccgc ggggccatcc tgctgtcagt ggcccggggc aaagtgtccg agggaatcga
     1621 ctttgtgcac cactacgggc gggccgtcat catgtttggc gtcccctacg tctacacaca
     1681 gagccgcatt ctcaaggcgc ggctggaata cctgcgggac cagttccaga ttcgtgagaa
     1741 tgactttctt accttcgatg ccatgcgcca cgcggcccag tgtgtgggtc gggccatcag
     1801 gggcaagacg gactacggcc tcatggtctt tgccgacaag cggtttgccc gtggggacaa
     1861 gcgggggaag ctgccccgct ggatccagga gcacctcaca gatgccaacc tcaacctgac
     1921 cgtggatgag ggtgtccagg tggccaagta cttcctgcgg cagatggcac agcccttcca
     1981 ccgggaggat cagctgggcc tgtccctgct cagcctggag cagctagaat cagaggagac
     2041 gctgcagagg atagagcaga ttgctcagca gctctgagtg gggcgggtgg ggccataaac
     2101 ggt
//