LOCUS AAH16620.1 940 aa PRT HUM 15-JUL-2006 DEFINITION Homo sapiens xeroderma pigmentosum, complementation group C protein. ACCESSION BC016620-1 PROTEIN_ID AAH16620.1 SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3658) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 3658) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (31-OCT-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Baylor College of Medicine Human Genome Sequencing Center Center code: BCM-HGSC Web site: http://www.hgsc.bcm.tmc.edu/cdna/ Contact: amg@bcm.tmc.edu Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H., Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati, A.N., Gibbs, R.A. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 28 Row: h Column: 23 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 54607142. FEATURES Qualifiers source /db_xref="H-InvDB:HIT000037602" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:21338 IMAGE:4509957" /tissue_type="Testis, embryonal carcinoma" /clone_lib="NIH_MGC_92" /lab_host="DH10B" /note="Vector: pCMV-SPORT6" protein /gene="XPC" /gene_synonym="XP3" /gene_synonym="XPCC" /db_xref="GeneID:7508" /db_xref="HGNC:HGNC:12816" /db_xref="MIM:278720" BEGIN 1 MARKRAAGGE PRGRELRSQK SKAKSKARRE EEEEDAFEDE KPPKKSLLSK VSQGKRKRGC 61 SHPGGSADGP AKKKVAKVTV KSENLKVIKD EALSDGDDLR DFPSDLKKAH HLKRGATMNE 121 DSNEEEEESE NDWEEVEELS EPVLGDVRES TAFSRSLLPV KPVEIEIETP EQAKTRERSE 181 KIKLEFETYL RRAMKRFNKG VHEDTHKVHL LCLLANGFYR NNICSQPDLH AIGLSIIPAR 241 FTRVLPRDVD TYYLSNLVKW FIGTFTVNAE LSASEQDNLQ TTLERRFAIY SARDDEELVH 301 IFLLILRALQ LLTRLVLSLQ PIPLKSATAK GKKPSKERLT ADPGGSSETS SQVLENHTKP 361 KTSKGTKQEE TFAKGTCRPS AKGKRNKGGR KKRSKPSSSE EDEGPGDKQE KATQRRPHGR 421 ERRVASRVSY KEESGSDEAG SGSDFELSSG EASDPSDEDS EPGPPKQRKA PAPQRTKAGS 481 KSASRTHRGS HRKDPSLPAA SSSSSSSKRG KKMCSDGEKA EKRSIAGIDQ WLEVFCEQEE 541 KWVCVDCVHG VVGQPLTCYK YATKPMTYVV GIDSDGWVRD VTQRYDPVWM TVTRKCRVDA 601 EWWAETLRPY QSPFMDREKK EDLEFQAKHM DQPLPTAIGL YKNHPLYALK RHLLKYEAIY 661 PETAAILGYC RGEAVYSRDC VHTLHSRDTW LKKARVVRLG EVPYKMVKGF SNRARKARLA 721 EPQLREENDL GLFGYWQTEE YQPPVAVDGK VPRNEFGNVY LFLPSMMPIG CVQLNLPNLH 781 RVARKLDIDC VQAITGFDFH GGYSHPVTDG YIVCEEFKDV LLTAWENEQA VIERKEKEKK 841 EKRALGNWKL LAKGLLIRER LKRRYGPKSE AAAPHTDAGG GLSSDEEEGT SSQAEAARIL 901 AASWPQNRED EEKQKLKGGP KKTKREKKAA ASHLFPFEKL //