LOCUS       BC016620                3658 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens xeroderma pigmentosum, complementation group C, mRNA
            (cDNA clone MGC:21338 IMAGE:4509957), complete cds.
ACCESSION   BC016620
VERSION     BC016620.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3658)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 3658)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (31-OCT-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Baylor College of Medicine Human Genome
            Sequencing Center
            Center code: BCM-HGSC
            Web site: http://www.hgsc.bcm.tmc.edu/cdna/
            Contact: amg@bcm.tmc.edu
            Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
            Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
            A.N., Gibbs, R.A.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 28 Row: h Column: 23
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 54607142.
FEATURES             Location/Qualifiers
     source          1..3658
                     /db_xref="H-InvDB:HIT000037602"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:21338 IMAGE:4509957"
                     /tissue_type="Testis, embryonal carcinoma"
                     /clone_lib="NIH_MGC_92"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..3658
                     /gene="XPC"
                     /gene_synonym="XP3"
                     /gene_synonym="XPCC"
                     /db_xref="GeneID:7508"
                     /db_xref="HGNC:HGNC:12816"
                     /db_xref="MIM:278720"
     CDS             16..2838
                     /gene="XPC"
                     /gene_synonym="XP3"
                     /gene_synonym="XPCC"
                     /codon_start=1
                     /product="xeroderma pigmentosum, complementation group C"
                     /protein_id="AAH16620.1"
                     /db_xref="GeneID:7508"
                     /db_xref="HGNC:HGNC:12816"
                     /db_xref="MIM:278720"
                     /translation="MARKRAAGGEPRGRELRSQKSKAKSKARREEEEEDAFEDEKPPK
                     KSLLSKVSQGKRKRGCSHPGGSADGPAKKKVAKVTVKSENLKVIKDEALSDGDDLRDF
                     PSDLKKAHHLKRGATMNEDSNEEEEESENDWEEVEELSEPVLGDVRESTAFSRSLLPV
                     KPVEIEIETPEQAKTRERSEKIKLEFETYLRRAMKRFNKGVHEDTHKVHLLCLLANGF
                     YRNNICSQPDLHAIGLSIIPARFTRVLPRDVDTYYLSNLVKWFIGTFTVNAELSASEQ
                     DNLQTTLERRFAIYSARDDEELVHIFLLILRALQLLTRLVLSLQPIPLKSATAKGKKP
                     SKERLTADPGGSSETSSQVLENHTKPKTSKGTKQEETFAKGTCRPSAKGKRNKGGRKK
                     RSKPSSSEEDEGPGDKQEKATQRRPHGRERRVASRVSYKEESGSDEAGSGSDFELSSG
                     EASDPSDEDSEPGPPKQRKAPAPQRTKAGSKSASRTHRGSHRKDPSLPAASSSSSSSK
                     RGKKMCSDGEKAEKRSIAGIDQWLEVFCEQEEKWVCVDCVHGVVGQPLTCYKYATKPM
                     TYVVGIDSDGWVRDVTQRYDPVWMTVTRKCRVDAEWWAETLRPYQSPFMDREKKEDLE
                     FQAKHMDQPLPTAIGLYKNHPLYALKRHLLKYEAIYPETAAILGYCRGEAVYSRDCVH
                     TLHSRDTWLKKARVVRLGEVPYKMVKGFSNRARKARLAEPQLREENDLGLFGYWQTEE
                     YQPPVAVDGKVPRNEFGNVYLFLPSMMPIGCVQLNLPNLHRVARKLDIDCVQAITGFD
                     FHGGYSHPVTDGYIVCEEFKDVLLTAWENEQAVIERKEKEKKEKRALGNWKLLAKGLL
                     IRERLKRRYGPKSEAAAPHTDAGGGLSSDEEEGTSSQAEAARILAASWPQNREDEEKQ
                     KLKGGPKKTKREKKAAASHLFPFEKL"
BASE COUNT         1050 a          862 c         1024 g          722 t
ORIGIN      
        1 gcccagacaa gcaacatggc tcggaaacgc gcggccggcg gggagccgcg gggacgcgaa
       61 ctgcgcagcc agaaatccaa ggccaagagc aaggcccggc gtgaggagga ggaggaggat
      121 gcctttgaag atgagaaacc cccaaagaag agccttctct ccaaagtttc acaaggaaag
      181 aggaaaagag gctgcagtca tcctgggggt tcagcagatg gtccagcaaa aaagaaagtg
      241 gccaaggtga ctgttaaatc tgaaaacctc aaggttataa aggatgaagc cctcagcgat
      301 ggggatgacc tcagggactt tccaagtgac ctcaagaagg cacaccatct gaagagaggg
      361 gctaccatga atgaagacag caatgaagaa gaggaagaaa gtgaaaatga ttgggaagag
      421 gttgaagaac ttagtgagcc tgtgctgggt gacgtgagag aaagtacagc cttctctcga
      481 tctcttctgc ctgtgaagcc agtggagata gagattgaaa cgccagagca ggcgaagaca
      541 agagaaagaa gtgaaaagat aaaactggag tttgagacat atcttcggag ggcgatgaaa
      601 cgtttcaata aaggggtcca tgaggacaca cacaaggttc accttctctg cctgctagca
      661 aatggcttct atcgaaataa catctgcagc cagccagatc tgcatgctat tggcctgtcc
      721 atcatcccag cccgctttac cagagtgctg cctcgagatg tggacaccta ctacctctca
      781 aacctggtga agtggttcat tggaacattt acagttaatg cagaactttc agccagtgaa
      841 caagataacc tgcagactac attggaaagg agatttgcta tttactctgc tcgagatgat
      901 gaggaattgg tccatatatt cttactgatt ctccgggctc tgcagctctt gacccggctg
      961 gtattgtctc tacagccaat tcctctgaag tcagcaacag caaagggaaa gaaaccttcc
     1021 aaggaaagat tgactgcgga tccaggaggc tcctcagaaa cttccagcca agttctagaa
     1081 aaccacacca aaccaaagac cagcaaagga accaaacaag aggaaacctt tgctaagggc
     1141 acctgcaggc caagtgccaa agggaagagg aacaagggag gcagaaagaa acggagcaag
     1201 ccctcctcca gcgaggaaga tgagggccca ggagacaagc aggagaaggc aacccagcga
     1261 cgtccgcatg gccgggagcg gcgggtggcc tccagggtgt cttataaaga ggagagtggg
     1321 agtgatgagg ctggcagcgg ctctgatttt gagctctcca gtggagaagc ctctgatccc
     1381 tctgatgagg attccgaacc tggccctcca aagcagagga aagcccccgc tcctcagagg
     1441 acaaaggctg ggtccaagag tgcctccagg acccatcgtg ggagccatcg taaggaccca
     1501 agcttgccag cggcatcctc aagctcttca agcagtaaaa gaggcaagaa aatgtgcagc
     1561 gatggtgaga aggcagaaaa aagaagcata gctggtatag accagtggct agaggtgttc
     1621 tgtgagcagg aggaaaagtg ggtatgtgta gactgtgtgc acggtgtggt gggccagcct
     1681 ctgacctgtt acaagtacgc caccaagccc atgacctatg tggtgggcat tgacagtgac
     1741 ggctgggtcc gagatgtcac acagaggtac gacccagtct ggatgacagt gacccgcaag
     1801 tgccgggttg atgctgagtg gtgggccgag accttgagac cataccagag cccatttatg
     1861 gacagggaga agaaagaaga cttggagttt caggcaaaac acatggacca gcctttgccc
     1921 actgccattg gcttatataa gaaccaccct ctgtatgccc tgaagcggca tctcctgaaa
     1981 tatgaggcca tctatcccga gacagctgcc atccttgggt attgtcgtgg agaagcggtc
     2041 tactccaggg attgtgtgca cactctgcat tccagagaca cgtggctgaa gaaagcaaga
     2101 gtggtgaggc ttggagaagt accctacaag atggtgaaag gcttttctaa ccgtgctcgg
     2161 aaagcccgac ttgctgagcc ccagctgcgg gaagaaaatg acctgggcct gtttggctac
     2221 tggcagacag aggagtatca gcccccagtg gccgtggacg ggaaggtgcc ccggaacgag
     2281 tttgggaatg tgtacctctt cctgcccagc atgatgccta ttggctgtgt ccagctgaac
     2341 ctgcccaatc tacaccgcgt ggcccgcaag ctggacatcg actgtgtcca ggccatcact
     2401 ggctttgatt tccatggcgg ctactcccat cccgtgactg atggatacat cgtctgcgag
     2461 gaattcaaag acgtgctcct gactgcctgg gaaaatgagc aggcagtcat tgaaaggaag
     2521 gagaaggaga aaaaggagaa gcgggctcta gggaactgga agttgctggc caaaggtctg
     2581 ctcatcaggg agaggctgaa gcgtcgctac gggcccaaga gtgaggcagc agctccccac
     2641 acagatgcag gaggtggact ctcttctgat gaagaggagg ggaccagctc tcaagcagaa
     2701 gcggccagga tactggctgc ctcctggcct caaaaccgag aagatgaaga aaagcagaag
     2761 ctgaagggtg ggcccaagaa gaccaaaagg gaaaagaaag cagcagcttc ccacctgttc
     2821 ccatttgaga agctgtgagc tgagcgccca ctagaggggc acccaccagt tgctgctgcc
     2881 ccactacagg ccccacacct gccctgggca tgcccagccc ctggtggtgg gggcttctct
     2941 gctgagaagg caaactgagg cagcatgcac ggaggcgggg tcaggggaga cgaggccaag
     3001 ctgaggaggt gctgcaggtc ccgtctggct ccagcccttg tcagattcac ccagggtgaa
     3061 gccttcaaag ctttttgcta ccaaagccca ctcacccttt gagctacaga acactttgct
     3121 aggagatact cttctgcctc ctagacctgt tctttccatc tttagaaaca tcagtttttg
     3181 tatggaagcc accgggagat ttctggatgg tggtgcatcc gtgaatgcgc tgatcgtttc
     3241 ttccagttag agtcttcatc tgtccgacaa gttcactcgc ctcggttgcg gacctaggac
     3301 catttctctg caggccactt accttcccct gagtcaggct tactaatgct gccctcactg
     3361 cctctttgca gtaggggaga gagcagagaa gtacaggtca tctgctggga tctagttttc
     3421 caagtaacat tttgtggtga cagaagccta aaaaaagcta aaatcaggaa agaaaaggaa
     3481 aaatacgaat tgaaaattaa ggaaatgtta gtaaaataga tcagtgttaa actagattgt
     3541 attcattact agataaaatg tataaagctc tctgtactaa ggagaaatga cttttataac
     3601 attttgagaa aataataaag catttatcta aaaaaaaaaa aaaaaaaaaa aaaaaaaa
//