LOCUS BC016620 3658 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens xeroderma pigmentosum, complementation group C, mRNA
(cDNA clone MGC:21338 IMAGE:4509957), complete cds.
ACCESSION BC016620
VERSION BC016620.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 3658)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 3658)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (31-OCT-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Baylor College of Medicine Human Genome
Sequencing Center
Center code: BCM-HGSC
Web site: http://www.hgsc.bcm.tmc.edu/cdna/
Contact: amg@bcm.tmc.edu
Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
A.N., Gibbs, R.A.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 28 Row: h Column: 23
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 54607142.
FEATURES Location/Qualifiers
source 1..3658
/db_xref="H-InvDB:HIT000037602"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:21338 IMAGE:4509957"
/tissue_type="Testis, embryonal carcinoma"
/clone_lib="NIH_MGC_92"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene 1..3658
/gene="XPC"
/gene_synonym="XP3"
/gene_synonym="XPCC"
/db_xref="GeneID:7508"
/db_xref="HGNC:HGNC:12816"
/db_xref="MIM:278720"
CDS 16..2838
/gene="XPC"
/gene_synonym="XP3"
/gene_synonym="XPCC"
/codon_start=1
/product="xeroderma pigmentosum, complementation group C"
/protein_id="AAH16620.1"
/db_xref="GeneID:7508"
/db_xref="HGNC:HGNC:12816"
/db_xref="MIM:278720"
/translation="MARKRAAGGEPRGRELRSQKSKAKSKARREEEEEDAFEDEKPPK
KSLLSKVSQGKRKRGCSHPGGSADGPAKKKVAKVTVKSENLKVIKDEALSDGDDLRDF
PSDLKKAHHLKRGATMNEDSNEEEEESENDWEEVEELSEPVLGDVRESTAFSRSLLPV
KPVEIEIETPEQAKTRERSEKIKLEFETYLRRAMKRFNKGVHEDTHKVHLLCLLANGF
YRNNICSQPDLHAIGLSIIPARFTRVLPRDVDTYYLSNLVKWFIGTFTVNAELSASEQ
DNLQTTLERRFAIYSARDDEELVHIFLLILRALQLLTRLVLSLQPIPLKSATAKGKKP
SKERLTADPGGSSETSSQVLENHTKPKTSKGTKQEETFAKGTCRPSAKGKRNKGGRKK
RSKPSSSEEDEGPGDKQEKATQRRPHGRERRVASRVSYKEESGSDEAGSGSDFELSSG
EASDPSDEDSEPGPPKQRKAPAPQRTKAGSKSASRTHRGSHRKDPSLPAASSSSSSSK
RGKKMCSDGEKAEKRSIAGIDQWLEVFCEQEEKWVCVDCVHGVVGQPLTCYKYATKPM
TYVVGIDSDGWVRDVTQRYDPVWMTVTRKCRVDAEWWAETLRPYQSPFMDREKKEDLE
FQAKHMDQPLPTAIGLYKNHPLYALKRHLLKYEAIYPETAAILGYCRGEAVYSRDCVH
TLHSRDTWLKKARVVRLGEVPYKMVKGFSNRARKARLAEPQLREENDLGLFGYWQTEE
YQPPVAVDGKVPRNEFGNVYLFLPSMMPIGCVQLNLPNLHRVARKLDIDCVQAITGFD
FHGGYSHPVTDGYIVCEEFKDVLLTAWENEQAVIERKEKEKKEKRALGNWKLLAKGLL
IRERLKRRYGPKSEAAAPHTDAGGGLSSDEEEGTSSQAEAARILAASWPQNREDEEKQ
KLKGGPKKTKREKKAAASHLFPFEKL"
BASE COUNT 1050 a 862 c 1024 g 722 t
ORIGIN
1 gcccagacaa gcaacatggc tcggaaacgc gcggccggcg gggagccgcg gggacgcgaa
61 ctgcgcagcc agaaatccaa ggccaagagc aaggcccggc gtgaggagga ggaggaggat
121 gcctttgaag atgagaaacc cccaaagaag agccttctct ccaaagtttc acaaggaaag
181 aggaaaagag gctgcagtca tcctgggggt tcagcagatg gtccagcaaa aaagaaagtg
241 gccaaggtga ctgttaaatc tgaaaacctc aaggttataa aggatgaagc cctcagcgat
301 ggggatgacc tcagggactt tccaagtgac ctcaagaagg cacaccatct gaagagaggg
361 gctaccatga atgaagacag caatgaagaa gaggaagaaa gtgaaaatga ttgggaagag
421 gttgaagaac ttagtgagcc tgtgctgggt gacgtgagag aaagtacagc cttctctcga
481 tctcttctgc ctgtgaagcc agtggagata gagattgaaa cgccagagca ggcgaagaca
541 agagaaagaa gtgaaaagat aaaactggag tttgagacat atcttcggag ggcgatgaaa
601 cgtttcaata aaggggtcca tgaggacaca cacaaggttc accttctctg cctgctagca
661 aatggcttct atcgaaataa catctgcagc cagccagatc tgcatgctat tggcctgtcc
721 atcatcccag cccgctttac cagagtgctg cctcgagatg tggacaccta ctacctctca
781 aacctggtga agtggttcat tggaacattt acagttaatg cagaactttc agccagtgaa
841 caagataacc tgcagactac attggaaagg agatttgcta tttactctgc tcgagatgat
901 gaggaattgg tccatatatt cttactgatt ctccgggctc tgcagctctt gacccggctg
961 gtattgtctc tacagccaat tcctctgaag tcagcaacag caaagggaaa gaaaccttcc
1021 aaggaaagat tgactgcgga tccaggaggc tcctcagaaa cttccagcca agttctagaa
1081 aaccacacca aaccaaagac cagcaaagga accaaacaag aggaaacctt tgctaagggc
1141 acctgcaggc caagtgccaa agggaagagg aacaagggag gcagaaagaa acggagcaag
1201 ccctcctcca gcgaggaaga tgagggccca ggagacaagc aggagaaggc aacccagcga
1261 cgtccgcatg gccgggagcg gcgggtggcc tccagggtgt cttataaaga ggagagtggg
1321 agtgatgagg ctggcagcgg ctctgatttt gagctctcca gtggagaagc ctctgatccc
1381 tctgatgagg attccgaacc tggccctcca aagcagagga aagcccccgc tcctcagagg
1441 acaaaggctg ggtccaagag tgcctccagg acccatcgtg ggagccatcg taaggaccca
1501 agcttgccag cggcatcctc aagctcttca agcagtaaaa gaggcaagaa aatgtgcagc
1561 gatggtgaga aggcagaaaa aagaagcata gctggtatag accagtggct agaggtgttc
1621 tgtgagcagg aggaaaagtg ggtatgtgta gactgtgtgc acggtgtggt gggccagcct
1681 ctgacctgtt acaagtacgc caccaagccc atgacctatg tggtgggcat tgacagtgac
1741 ggctgggtcc gagatgtcac acagaggtac gacccagtct ggatgacagt gacccgcaag
1801 tgccgggttg atgctgagtg gtgggccgag accttgagac cataccagag cccatttatg
1861 gacagggaga agaaagaaga cttggagttt caggcaaaac acatggacca gcctttgccc
1921 actgccattg gcttatataa gaaccaccct ctgtatgccc tgaagcggca tctcctgaaa
1981 tatgaggcca tctatcccga gacagctgcc atccttgggt attgtcgtgg agaagcggtc
2041 tactccaggg attgtgtgca cactctgcat tccagagaca cgtggctgaa gaaagcaaga
2101 gtggtgaggc ttggagaagt accctacaag atggtgaaag gcttttctaa ccgtgctcgg
2161 aaagcccgac ttgctgagcc ccagctgcgg gaagaaaatg acctgggcct gtttggctac
2221 tggcagacag aggagtatca gcccccagtg gccgtggacg ggaaggtgcc ccggaacgag
2281 tttgggaatg tgtacctctt cctgcccagc atgatgccta ttggctgtgt ccagctgaac
2341 ctgcccaatc tacaccgcgt ggcccgcaag ctggacatcg actgtgtcca ggccatcact
2401 ggctttgatt tccatggcgg ctactcccat cccgtgactg atggatacat cgtctgcgag
2461 gaattcaaag acgtgctcct gactgcctgg gaaaatgagc aggcagtcat tgaaaggaag
2521 gagaaggaga aaaaggagaa gcgggctcta gggaactgga agttgctggc caaaggtctg
2581 ctcatcaggg agaggctgaa gcgtcgctac gggcccaaga gtgaggcagc agctccccac
2641 acagatgcag gaggtggact ctcttctgat gaagaggagg ggaccagctc tcaagcagaa
2701 gcggccagga tactggctgc ctcctggcct caaaaccgag aagatgaaga aaagcagaag
2761 ctgaagggtg ggcccaagaa gaccaaaagg gaaaagaaag cagcagcttc ccacctgttc
2821 ccatttgaga agctgtgagc tgagcgccca ctagaggggc acccaccagt tgctgctgcc
2881 ccactacagg ccccacacct gccctgggca tgcccagccc ctggtggtgg gggcttctct
2941 gctgagaagg caaactgagg cagcatgcac ggaggcgggg tcaggggaga cgaggccaag
3001 ctgaggaggt gctgcaggtc ccgtctggct ccagcccttg tcagattcac ccagggtgaa
3061 gccttcaaag ctttttgcta ccaaagccca ctcacccttt gagctacaga acactttgct
3121 aggagatact cttctgcctc ctagacctgt tctttccatc tttagaaaca tcagtttttg
3181 tatggaagcc accgggagat ttctggatgg tggtgcatcc gtgaatgcgc tgatcgtttc
3241 ttccagttag agtcttcatc tgtccgacaa gttcactcgc ctcggttgcg gacctaggac
3301 catttctctg caggccactt accttcccct gagtcaggct tactaatgct gccctcactg
3361 cctctttgca gtaggggaga gagcagagaa gtacaggtca tctgctggga tctagttttc
3421 caagtaacat tttgtggtga cagaagccta aaaaaagcta aaatcaggaa agaaaaggaa
3481 aaatacgaat tgaaaattaa ggaaatgtta gtaaaataga tcagtgttaa actagattgt
3541 attcattact agataaaatg tataaagctc tctgtactaa ggagaaatga cttttataac
3601 attttgagaa aataataaag catttatcta aaaaaaaaaa aaaaaaaaaa aaaaaaaa
//