LOCUS BC051764 4241 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens damage-specific DNA binding protein 1, 127kDa, mRNA
(cDNA clone MGC:54119 IMAGE:6063274), complete cds.
ACCESSION BC051764
VERSION BC051764.2
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 4241)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 4241)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (01-MAY-2003) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Aug 25, 2003 this sequence version replaced BC051764.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC/DCTD/DTP
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Sequencing Group at the Stanford Human Genome
Center, Stanford University School of Medicine, Stanford, CA 94305
Web site: http://www-shgc.stanford.edu
Contact: (Dickson, Mark) mcd@paxil.stanford.edu
Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
R. M.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 98 Row: k Column: 12
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 13435358.
FEATURES Location/Qualifiers
source 1..4241
/db_xref="H-InvDB:HIT000053674"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:54119 IMAGE:6063274"
/tissue_type="Skin, melanotic melanoma."
/clone_lib="NIH_MGC_72"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene 1..4241
/gene="DDB1"
/gene_synonym="DDBA"
/gene_synonym="UV-DDB1"
/gene_synonym="XAP1"
/gene_synonym="XPCE"
/gene_synonym="XPE"
/gene_synonym="XPE-BF"
/db_xref="GeneID:1642"
/db_xref="HGNC:HGNC:2717"
/db_xref="MIM:600045"
CDS 128..3550
/gene="DDB1"
/gene_synonym="DDBA"
/gene_synonym="UV-DDB1"
/gene_synonym="XAP1"
/gene_synonym="XPCE"
/gene_synonym="XPE"
/gene_synonym="XPE-BF"
/codon_start=1
/product="damage-specific DNA binding protein 1, 127kDa"
/protein_id="AAH51764.1"
/db_xref="GeneID:1642"
/db_xref="HGNC:HGNC:2717"
/db_xref="MIM:600045"
/translation="MSYNYVVTAQKPTAVNGCVTGHFTSAEDLNLLIAKNTRLEIYVV
TAEGLRPVKEVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIIT
RAHGNVQDRIGRPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLE
ELHVIDVKFLYGCQAPTICFVYQDPQGRHVKTYEVSLREKEFNKGPWKQENVEAEASM
VIAVPEPFGGAIIIGQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDM
EGRLFMLLLEKEEQMDGTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLV
KLNVDSNEQGSYVVAMETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRN
GIGIHEHASIDLPGIKGLWPLRSDPNRETDDTLVLSFVGQTRVLMLNGEEVEETELMG
FVDDQQTFFCGNVAHQQLIQITSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQV
VVAVGRALYYLQIHPQELRQISHTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISA
RILKLPSFELLHKEMLGGEIIPRSILMTTFESSHYLLCALGDGALFYFGLNIETGLLS
DRKKVTLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPL
NSDGYPDSLALANNSTLTIGTIDEIQKLHIRTVPLYESPRKICYQEVSQCFGVLSSRI
EVQDTSGGTTALRPSASTQALSSSVSSSKLFSSSTAPHETSFGEEVEVHNLLIIDQHT
FEVLHAHQFLQNEYALSLVSCKLGKDPNTYFIVGTAMVYPEEAEPKQGRIVVFQYSDG
KLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTTEKELRTECNHYNNIMALYL
KTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLGAENAF
NLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGSVLF
GTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTEPATGF
IDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKVVEELTRIH"
BASE COUNT 999 a 1062 c 1142 g 1038 t
ORIGIN
1 gtgtctctgg ggcggaggca gcggcagtgg agttcgctgc gcgctgttgg gggccacctg
61 tcttttcgct tgtgtccctc tttctagtgt cgcgctcgag tcccgacggg ccgctccaag
121 cctcgacatg tcgtacaact acgtggtaac ggcccagaag cccaccgccg tgaacggctg
181 cgtgaccgga cactttactt cggccgaaga cttaaacctg ttgattgcca aaaacacgag
241 attagagatc tatgtggtca ccgccgaggg gcttcggccc gtcaaagagg tgggcatgta
301 tgggaagatt gcggtcatgg agcttttcag gcccaagggg gagagcaagg acctgctgtt
361 tatcttgaca gcgaagtaca atgcctgcat cctggagtat aaacagagtg gcgagagcat
421 tgacatcatt acgcgagccc atggcaatgt ccaggaccgc attggccgcc cctcagagac
481 cggcattatt ggcatcattg accctgagtg ccggatgatt ggcctgcgtc tctatgatgg
541 ccttttcaag gttattccac tagatcgcga taataaagaa ctcaaggcct tcaacatccg
601 cctggaggag ctgcatgtca ttgatgtcaa gttcctatat ggttgccaag cacctactat
661 ttgctttgtc taccaggacc ctcaggggcg gcacgtaaaa acctatgagg tgtctctccg
721 agaaaaggaa ttcaataagg gcccttggaa acaggaaaat gtcgaagctg aagcttccat
781 ggtgatcgca gtcccagagc cctttggggg ggccatcatc attggacagg agtcaatcac
841 ctatcacaat ggtgacaaat acctggctat tgcccctcct atcatcaagc aaagcacgat
901 tgtgtgccac aatcgagtgg accctaatgg ctcaagatac ctgctgggag acatggaagg
961 ccggctcttc atgctgcttt tggagaagga ggaacagatg gatggcaccg tcactctcaa
1021 ggatctccgt gtagaactcc ttggagagac ctctattgct gagtgcttga cataccttga
1081 taatggtgtt gtgtttgtcg ggtctcgcct gggtgactcc cagcttgtga agctcaacgt
1141 tgacagtaat gaacaaggct cctatgtagt ggccatggaa acctttacca acttaggacc
1201 cattgtcgat atgtgcgtgg tggacctgga gaggcagggg caggggcagc tggtcacttg
1261 ctctggggct ttcaaggaag gttctttgcg gatcatccgg aatggaattg gaatccacga
1321 gcatgccagc attgacttac caggcatcaa aggattatgg ccactgcggt ctgaccctaa
1381 tcgtgagact gatgacactt tggtgctctc ttttgtgggc cagacaagag ttctcatgtt
1441 aaatggagag gaggtagaag aaaccgaact gatgggtttc gtggatgatc agcagacttt
1501 cttctgtggc aacgtggctc atcagcagct tatccagatc acttcagcat cggtgaggtt
1561 ggtctctcaa gaacccaaag ctctggtcag tgaatggaag gagcctcagg ccaagaacat
1621 cagtgtggcc tcctgcaata gcagccaggt ggtggtggct gtaggcaggg ccctctacta
1681 tctgcagatc catcctcagg agctccggca gatcagccac acagagatgg aacatgaagt
1741 ggcttgcttg gacatcaccc cattaggaga cagcaatgga ctgtcccctc tttgtgccat
1801 tggcctctgg acggacatct cggctcgtat cttgaagttg ccctcttttg aactactgca
1861 caaggagatg ctgggtggag agatcattcc tcgctccatc ctgatgacca cctttgagag
1921 tagccattac ctcctttgtg ccttgggaga tggagcgctt ttctactttg ggctcaacat
1981 tgagacaggt ctgttgagcg accgtaagaa ggtgactttg ggcacccagc ccaccgtatt
2041 gaggactttt cgttctcttt ctaccaccaa cgtctttgct tgttctgacc gccccactgt
2101 catctatagc agcaaccaca aattggtctt ctcaaatgtc aacctcaagg aagtgaacta
2161 catgtgtccc ctcaattcag atggctatcc tgacagcctg gcgctggcca acaatagcac
2221 cctcaccatt ggcaccatcg atgagatcca gaagctgcac attcgcacag ttcccctcta
2281 tgagtctcca aggaagatct gctaccagga agtgtcccag tgtttcgggg tcctctccag
2341 ccgcattgaa gtccaagaca cgagtggggg cacgacagcc ttgaggccca gcgctagcac
2401 ccaggctctg tccagcagtg taagctccag caagctgttc tccagcagca ctgctcctca
2461 tgagacctcc tttggagaag aggtggaggt gcacaaccta cttatcattg accaacacac
2521 ctttgaagtg cttcatgccc accagtttct gcagaatgaa tatgccctca gtctggtttc
2581 ctgcaagctg ggcaaagacc ccaacactta cttcattgtg ggcacagcaa tggtgtatcc
2641 tgaagaggca gagcccaagc agggtcgcat tgtggtcttt cagtattcgg atggaaaact
2701 acagactgtg gctgaaaagg aagtgaaagg ggccgtgtac tctatggtgg aatttaacgg
2761 gaagctgtta gccagcatca atagcacggt gcggctctat gagtggacaa cagagaagga
2821 gctgcgcact gagtgcaacc actacaacaa catcatggcc ctctacctga agaccaaggg
2881 cgacttcatc ctggtgggcg accttatgcg ctcagtgctg ctgcttgcct acaagcccat
2941 ggaaggaaac tttgaagaga ttgctcgaga ctttaatccc aactggatga gtgctgtgga
3001 aatcttggat gatgacaatt ttctgggggc tgaaaatgcc tttaacttgt ttgtgtgtca
3061 aaaggatagc gctgccacca ctgacgagga gcggcagcac ctccaggagg ttggtctttt
3121 ccacctgggc gagtttgtca atgtcttttg ccacggctct ctggtaatgc agaatctggg
3181 tgagacttcc acccccacac aaggctcggt gctcttcggc acggtcaacg gcatgatagg
3241 gctggtgacc tcactgtcag agagctggta caacctcctg ctggacatgc agaatcgact
3301 caataaagtc atcaaaagtg tggggaagat cgagcactcc ttctggagat cctttcacac
3361 cgagcggaag acagaaccag ccacaggttt catcgacggt gacttgattg agagtttcct
3421 ggatattagc cgccccaaga tgcaggaggt ggtggcaaac ctacagtatg acgatggcag
3481 cggtatgaag cgagaggcca ctgcagacga cctcatcaag gttgtggagg agctaactcg
3541 gatccattag ccaagggcag ggggcccctt tgctgaccct ccccaaaggc tttgccctgc
3601 tgccctcccc ctcctctcca ccatcgtctt cttggccatg ggaggccttt ccctaagcca
3661 gctgccccca gagccacagt tcccctatgt ggaagtgggg cgggcttcat agagacttgg
3721 gaatgagctg aaggtgaaac attttctccc tggattttta ccagtctcac atgattccag
3781 ccatcacctt agaccaccaa gccttgattg gtgttgccag ttgtcctcct tccggggaag
3841 gattttgcag ttctttggct gaaaggaagc tgtgcgtgtg tgtgtgtgta tgtgtgtgtg
3901 tgtatgtgta tctcacactc atgcattgtc ctctttttat ttagattggc agtgtaggga
3961 gttgtgggta gtggggaaga gggttaggag ggtttcattg tctgtgaagt gagaccttcc
4021 ttttactttt cttctattgc ctctgagagc atcaggccta gaggcctgac tgccaagcca
4081 tgggtagcct gggtgtaaaa cctggagatg gtggatgatc cccacgccac agcccttttg
4141 tctctgcaaa ctgccttctt cggaaagaag aaggtgggag gatgtgaatt gttagtttct
4201 gagttttacc aaataaagta gaatataaaa aaaaaaaaaa a
//