LOCUS AAH16620.1 940 aa PRT HUM 15-JUL-2006
DEFINITION Homo sapiens xeroderma pigmentosum, complementation group
C protein.
ACCESSION BC016620-1
PROTEIN_ID AAH16620.1
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 3658)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 3658)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (31-OCT-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Baylor College of Medicine Human Genome
Sequencing Center
Center code: BCM-HGSC
Web site: http://www.hgsc.bcm.tmc.edu/cdna/
Contact: amg@bcm.tmc.edu
Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
A.N., Gibbs, R.A.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 28 Row: h Column: 23
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 54607142.
FEATURES Qualifiers
source /db_xref="H-InvDB:HIT000037602"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:21338 IMAGE:4509957"
/tissue_type="Testis, embryonal carcinoma"
/clone_lib="NIH_MGC_92"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
protein /gene="XPC"
/gene_synonym="XP3"
/gene_synonym="XPCC"
/db_xref="GeneID:7508"
/db_xref="HGNC:HGNC:12816"
/db_xref="MIM:278720"
BEGIN
1 MARKRAAGGE PRGRELRSQK SKAKSKARRE EEEEDAFEDE KPPKKSLLSK VSQGKRKRGC
61 SHPGGSADGP AKKKVAKVTV KSENLKVIKD EALSDGDDLR DFPSDLKKAH HLKRGATMNE
121 DSNEEEEESE NDWEEVEELS EPVLGDVRES TAFSRSLLPV KPVEIEIETP EQAKTRERSE
181 KIKLEFETYL RRAMKRFNKG VHEDTHKVHL LCLLANGFYR NNICSQPDLH AIGLSIIPAR
241 FTRVLPRDVD TYYLSNLVKW FIGTFTVNAE LSASEQDNLQ TTLERRFAIY SARDDEELVH
301 IFLLILRALQ LLTRLVLSLQ PIPLKSATAK GKKPSKERLT ADPGGSSETS SQVLENHTKP
361 KTSKGTKQEE TFAKGTCRPS AKGKRNKGGR KKRSKPSSSE EDEGPGDKQE KATQRRPHGR
421 ERRVASRVSY KEESGSDEAG SGSDFELSSG EASDPSDEDS EPGPPKQRKA PAPQRTKAGS
481 KSASRTHRGS HRKDPSLPAA SSSSSSSKRG KKMCSDGEKA EKRSIAGIDQ WLEVFCEQEE
541 KWVCVDCVHG VVGQPLTCYK YATKPMTYVV GIDSDGWVRD VTQRYDPVWM TVTRKCRVDA
601 EWWAETLRPY QSPFMDREKK EDLEFQAKHM DQPLPTAIGL YKNHPLYALK RHLLKYEAIY
661 PETAAILGYC RGEAVYSRDC VHTLHSRDTW LKKARVVRLG EVPYKMVKGF SNRARKARLA
721 EPQLREENDL GLFGYWQTEE YQPPVAVDGK VPRNEFGNVY LFLPSMMPIG CVQLNLPNLH
781 RVARKLDIDC VQAITGFDFH GGYSHPVTDG YIVCEEFKDV LLTAWENEQA VIERKEKEKK
841 EKRALGNWKL LAKGLLIRER LKRRYGPKSE AAAPHTDAGG GLSSDEEEGT SSQAEAARIL
901 AASWPQNRED EEKQKLKGGP KKTKREKKAA ASHLFPFEKL
//