LOCUS       BC016620                3658 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens xeroderma pigmentosum, complementation group C, mRNA
            (cDNA clone MGC:21338 IMAGE:4509957), complete cds.
ACCESSION   BC016620
VERSION     BC016620.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3658)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J.,
            Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S.,
            Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M.,
            Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M.,
            Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A.,
            Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W.,
            Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C.,
            Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S.,
            Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E.,
            Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 3658)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (31-OCT-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Baylor College of Medicine Human Genome
            Sequencing Center
            Center code: BCM-HGSC
            Web site: http://www.hgsc.bcm.tmc.edu/cdna/
            Contact: amg@bcm.tmc.edu
            Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
            Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
            A.N., Gibbs, R.A.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 28 Row: h Column: 23
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 54607142.
FEATURES             Location/Qualifiers
     source          1..3658
                     /db_xref="H-InvDB:HIT000037602"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:21338 IMAGE:4509957"
                     /tissue_type="Testis, embryonal carcinoma"
                     /clone_lib="NIH_MGC_92"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..3658
                     /gene="XPC"
                     /gene_synonym="XP3"
                     /gene_synonym="XPCC"
                     /db_xref="GeneID:7508"
                     /db_xref="HGNC:HGNC:12816"
                     /db_xref="MIM:278720"
     CDS             16..2838
                     /gene="XPC"
                     /gene_synonym="XP3"
                     /gene_synonym="XPCC"
                     /codon_start=1
                     /product="xeroderma pigmentosum, complementation group C"
                     /protein_id="AAH16620.1"
                     /db_xref="GeneID:7508"
                     /db_xref="HGNC:HGNC:12816"
                     /db_xref="MIM:278720"
                     /translation="MARKRAAGGEPRGRELRSQKSKAKSKARREEEEEDAFEDEKPPK
                     KSLLSKVSQGKRKRGCSHPGGSADGPAKKKVAKVTVKSENLKVIKDEALSDGDDLRDF
                     PSDLKKAHHLKRGATMNEDSNEEEEESENDWEEVEELSEPVLGDVRESTAFSRSLLPV
                     KPVEIEIETPEQAKTRERSEKIKLEFETYLRRAMKRFNKGVHEDTHKVHLLCLLANGF
                     YRNNICSQPDLHAIGLSIIPARFTRVLPRDVDTYYLSNLVKWFIGTFTVNAELSASEQ
                     DNLQTTLERRFAIYSARDDEELVHIFLLILRALQLLTRLVLSLQPIPLKSATAKGKKP
                     SKERLTADPGGSSETSSQVLENHTKPKTSKGTKQEETFAKGTCRPSAKGKRNKGGRKK
                     RSKPSSSEEDEGPGDKQEKATQRRPHGRERRVASRVSYKEESGSDEAGSGSDFELSSG
                     EASDPSDEDSEPGPPKQRKAPAPQRTKAGSKSASRTHRGSHRKDPSLPAASSSSSSSK
                     RGKKMCSDGEKAEKRSIAGIDQWLEVFCEQEEKWVCVDCVHGVVGQPLTCYKYATKPM
                     TYVVGIDSDGWVRDVTQRYDPVWMTVTRKCRVDAEWWAETLRPYQSPFMDREKKEDLE
                     FQAKHMDQPLPTAIGLYKNHPLYALKRHLLKYEAIYPETAAILGYCRGEAVYSRDCVH
                     TLHSRDTWLKKARVVRLGEVPYKMVKGFSNRARKARLAEPQLREENDLGLFGYWQTEE
                     YQPPVAVDGKVPRNEFGNVYLFLPSMMPIGCVQLNLPNLHRVARKLDIDCVQAITGFD
                     FHGGYSHPVTDGYIVCEEFKDVLLTAWENEQAVIERKEKEKKEKRALGNWKLLAKGLL
                     IRERLKRRYGPKSEAAAPHTDAGGGLSSDEEEGTSSQAEAARILAASWPQNREDEEKQ
                     KLKGGPKKTKREKKAAASHLFPFEKL"
BASE COUNT     1050 a    862 c   1024 g    722 t
ORIGIN
        1 gcccagacaa gcaacatggc tcggaaacgc gcggccggcg gggagccgcg gggacgcgaa
       61 ctgcgcagcc agaaatccaa ggccaagagc aaggcccggc gtgaggagga ggaggaggat
      121 gcctttgaag atgagaaacc cccaaagaag agccttctct ccaaagtttc acaaggaaag
      181 aggaaaagag gctgcagtca tcctgggggt tcagcagatg gtccagcaaa aaagaaagtg
      241 gccaaggtga ctgttaaatc tgaaaacctc aaggttataa aggatgaagc cctcagcgat
      301 ggggatgacc tcagggactt tccaagtgac ctcaagaagg cacaccatct gaagagaggg
      361 gctaccatga atgaagacag caatgaagaa gaggaagaaa gtgaaaatga ttgggaagag
      421 gttgaagaac ttagtgagcc tgtgctgggt gacgtgagag aaagtacagc cttctctcga
      481 tctcttctgc ctgtgaagcc agtggagata gagattgaaa cgccagagca ggcgaagaca
      541 agagaaagaa gtgaaaagat aaaactggag tttgagacat atcttcggag ggcgatgaaa
      601 cgtttcaata aaggggtcca tgaggacaca cacaaggttc accttctctg cctgctagca
      661 aatggcttct atcgaaataa catctgcagc cagccagatc tgcatgctat tggcctgtcc
      721 atcatcccag cccgctttac cagagtgctg cctcgagatg tggacaccta ctacctctca
      781 aacctggtga agtggttcat tggaacattt acagttaatg cagaactttc agccagtgaa
      841 caagataacc tgcagactac attggaaagg agatttgcta tttactctgc tcgagatgat
      901 gaggaattgg tccatatatt cttactgatt ctccgggctc tgcagctctt gacccggctg
      961 gtattgtctc tacagccaat tcctctgaag tcagcaacag caaagggaaa gaaaccttcc
     1021 aaggaaagat tgactgcgga tccaggaggc tcctcagaaa cttccagcca agttctagaa
     1081 aaccacacca aaccaaagac cagcaaagga accaaacaag aggaaacctt tgctaagggc
     1141 acctgcaggc caagtgccaa agggaagagg aacaagggag gcagaaagaa acggagcaag
     1201 ccctcctcca gcgaggaaga tgagggccca ggagacaagc aggagaaggc aacccagcga
     1261 cgtccgcatg gccgggagcg gcgggtggcc tccagggtgt cttataaaga ggagagtggg
     1321 agtgatgagg ctggcagcgg ctctgatttt gagctctcca gtggagaagc ctctgatccc
     1381 tctgatgagg attccgaacc tggccctcca aagcagagga aagcccccgc tcctcagagg
     1441 acaaaggctg ggtccaagag tgcctccagg acccatcgtg ggagccatcg taaggaccca
     1501 agcttgccag cggcatcctc aagctcttca agcagtaaaa gaggcaagaa aatgtgcagc
     1561 gatggtgaga aggcagaaaa aagaagcata gctggtatag accagtggct agaggtgttc
     1621 tgtgagcagg aggaaaagtg ggtatgtgta gactgtgtgc acggtgtggt gggccagcct
     1681 ctgacctgtt acaagtacgc caccaagccc atgacctatg tggtgggcat tgacagtgac
     1741 ggctgggtcc gagatgtcac acagaggtac gacccagtct ggatgacagt gacccgcaag
     1801 tgccgggttg atgctgagtg gtgggccgag accttgagac cataccagag cccatttatg
     1861 gacagggaga agaaagaaga cttggagttt caggcaaaac acatggacca gcctttgccc
     1921 actgccattg gcttatataa gaaccaccct ctgtatgccc tgaagcggca tctcctgaaa
     1981 tatgaggcca tctatcccga gacagctgcc atccttgggt attgtcgtgg agaagcggtc
     2041 tactccaggg attgtgtgca cactctgcat tccagagaca cgtggctgaa gaaagcaaga
     2101 gtggtgaggc ttggagaagt accctacaag atggtgaaag gcttttctaa ccgtgctcgg
     2161 aaagcccgac ttgctgagcc ccagctgcgg gaagaaaatg acctgggcct gtttggctac
     2221 tggcagacag aggagtatca gcccccagtg gccgtggacg ggaaggtgcc ccggaacgag
     2281 tttgggaatg tgtacctctt cctgcccagc atgatgccta ttggctgtgt ccagctgaac
     2341 ctgcccaatc tacaccgcgt ggcccgcaag ctggacatcg actgtgtcca ggccatcact
     2401 ggctttgatt tccatggcgg ctactcccat cccgtgactg atggatacat cgtctgcgag
     2461 gaattcaaag acgtgctcct gactgcctgg gaaaatgagc aggcagtcat tgaaaggaag
     2521 gagaaggaga aaaaggagaa gcgggctcta gggaactgga agttgctggc caaaggtctg
     2581 ctcatcaggg agaggctgaa gcgtcgctac gggcccaaga gtgaggcagc agctccccac
     2641 acagatgcag gaggtggact ctcttctgat gaagaggagg ggaccagctc tcaagcagaa
     2701 gcggccagga tactggctgc ctcctggcct caaaaccgag aagatgaaga aaagcagaag
     2761 ctgaagggtg ggcccaagaa gaccaaaagg gaaaagaaag cagcagcttc ccacctgttc
     2821 ccatttgaga agctgtgagc tgagcgccca ctagaggggc acccaccagt tgctgctgcc
     2881 ccactacagg ccccacacct gccctgggca tgcccagccc ctggtggtgg gggcttctct
     2941 gctgagaagg caaactgagg cagcatgcac ggaggcgggg tcaggggaga cgaggccaag
     3001 ctgaggaggt gctgcaggtc ccgtctggct ccagcccttg tcagattcac ccagggtgaa
     3061 gccttcaaag ctttttgcta ccaaagccca ctcacccttt gagctacaga acactttgct
     3121 aggagatact cttctgcctc ctagacctgt tctttccatc tttagaaaca tcagtttttg
     3181 tatggaagcc accgggagat ttctggatgg tggtgcatcc gtgaatgcgc tgatcgtttc
     3241 ttccagttag agtcttcatc tgtccgacaa gttcactcgc ctcggttgcg gacctaggac
     3301 catttctctg caggccactt accttcccct gagtcaggct tactaatgct gccctcactg
     3361 cctctttgca gtaggggaga gagcagagaa gtacaggtca tctgctggga tctagttttc
     3421 caagtaacat tttgtggtga cagaagccta aaaaaagcta aaatcaggaa agaaaaggaa
     3481 aaatacgaat tgaaaattaa ggaaatgtta gtaaaataga tcagtgttaa actagattgt
     3541 attcattact agataaaatg tataaagctc tctgtactaa ggagaaatga cttttataac
     3601 attttgagaa aataataaag catttatcta aaaaaaaaaa aaaaaaaaaa aaaaaaaa
//