LOCUS BC011686 4196 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens damage-specific DNA binding protein 1, 127kDa, mRNA
(cDNA clone MGC:19563 IMAGE:3845478), complete cds.
ACCESSION BC011686
VERSION BC011686.2
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 4196)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 4196)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (30-JUL-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Aug 19, 2003 this sequence version replaced BC011686.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: National Institutes of Health Intramural
Sequencing Center (NISC),
Gaithersburg, Maryland;
Web site: http://www.nisc.nih.gov/
Contact: nisc_mgc@nhgri.nih.gov
Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
Young,A., Zhang,L.-H. and Green,E.D.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 27 Row: i Column: 22
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 13435358.
FEATURES Location/Qualifiers
source 1..4196
/db_xref="H-InvDB:HIT000035375"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:19563 IMAGE:3845478"
/tissue_type="Placenta, choriocarcinoma"
/clone_lib="NIH_MGC_21"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..4196
/gene="DDB1"
/gene_synonym="DDBA"
/gene_synonym="UV-DDB1"
/gene_synonym="XAP1"
/gene_synonym="XPCE"
/gene_synonym="XPE"
/gene_synonym="XPE-BF"
/db_xref="GeneID:1642"
/db_xref="HGNC:HGNC:2717"
/db_xref="MIM:600045"
CDS 74..3496
/gene="DDB1"
/gene_synonym="DDBA"
/gene_synonym="UV-DDB1"
/gene_synonym="XAP1"
/gene_synonym="XPCE"
/gene_synonym="XPE"
/gene_synonym="XPE-BF"
/codon_start=1
/product="damage-specific DNA binding protein 1, 127kDa"
/protein_id="AAH11686.1"
/db_xref="GeneID:1642"
/db_xref="HGNC:HGNC:2717"
/db_xref="MIM:600045"
/translation="MSYNYVVTAQKPTAVNGCVTGHFTSAEDLNLLIAKNTRLEIYVV
TAEGLRPVKEVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIIT
RAHGNVQDRIGRPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLE
ELHVIDVKFLYGCQAPTICFVYQDPQGRHVKTYEVSLREKEFNKGPWKQENVEAEASM
VIAVPEPFGGAIIIGQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDM
EGRLFMLLLEKEEQMDGTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLV
KLNVDSNEQGSYVVAMETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRN
GIGIHEHASIDLPGIKGLWPLRSDPNRETDDTLVLSFVGQTRVLMLNGEEVEETELMG
FVDDQQTFFCGNVAHQQLIQITSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQV
VVAVGRALYYLQIHPQELRQISHTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISA
RILKLPSFELLHKEMLGGEIIPRSILMTTFESSHYLLCALGDGALFYFGLNIETGLLS
DRKKVTLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPL
NSDGYPDSLALANNSTLTIGTIDEIQKLHIRTVPLYESPRKICYQEVSQCFGVLSSRI
EVQDTSGGTTALRPSASTQALSSSVSSSKLFSSSTAPHETSFGEEVEVHNLLIIDQHT
FEVLHAHQFLQNEYALSLVSCKLGKDPNTYFIVGTAMVYPEEAEPKQGRIVVFQYSDG
KLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTTEKELRTECNHYNNIMALYL
KTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLGAENAF
NLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGSVLF
GTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTEPATGF
IDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKVVEELTRIH"
BASE COUNT 1002 a 1050 c 1116 g 1028 t
ORIGIN
1 cacctgtctt ttcgcttgtg tccctctttc tagtgtcgcg ctcgagtccc gacgggccgc
61 tccaagcctc gacatgtcgt acaactacgt ggtaacggcc cagaagccca ccgccgtgaa
121 cggctgcgtg accggacact ttacttcggc cgaagactta aacctgttga ttgccaaaaa
181 cacgagatta gagatctatg tggtcaccgc cgaggggctt cggcccgtca aagaggtggg
241 catgtatggg aagattgcgg tcatggagct tttcaggccc aagggggaga gcaaggacct
301 gctgtttatc ttgacagcga agtacaatgc ctgcatcctg gagtataaac agagtggcga
361 gagcattgac atcattacgc gagcccatgg caatgtccag gaccgcattg gccgcccctc
421 agagaccggc attattggca tcattgaccc tgagtgccgg atgattggcc tgcgtctcta
481 tgatggcctt ttcaaggtta ttccactaga tcgcgataat aaagaactca aggccttcaa
541 catccgcctg gaggagctgc atgtcattga tgtcaagttc ctatatggtt gccaagcacc
601 tactatttgc tttgtctacc aggaccctca ggggcggcac gtaaaaacct atgaggtgtc
661 tctccgagaa aaggaattca ataagggccc ttggaaacag gaaaatgtcg aagctgaagc
721 ttccatggtg atcgcagtcc cagagccctt tgggggggcc atcatcattg gacaggagtc
781 aatcacctat cacaatggtg acaaatacct ggctattgcc cctcctatca tcaagcaaag
841 cacgattgtg tgccacaatc gagtggaccc taatggctca agatacctgc tgggagacat
901 ggaaggccgg ctcttcatgc tgcttttgga gaaggaggaa cagatggatg gcaccgtcac
961 tctcaaggat ctccgtgtag aactccttgg agagacctct attgctgagt gcttgacata
1021 ccttgataat ggtgttgtgt ttgtcgggtc tcgcctgggt gactcccagc ttgtgaagct
1081 caacgttgac agtaatgaac aaggctccta tgtagtggcc atggaaacct ttaccaactt
1141 aggacccatt gtcgatatgt gcgtggtgga cctggagagg caggggcagg ggcagctggt
1201 cacttgctct ggggctttca aggaaggttc tttgcggatc atccggaatg gaattggaat
1261 ccacgagcat gccagcattg acttaccagg catcaaagga ttatggccac tgcggtctga
1321 ccctaatcgt gagactgatg acactttggt gctctctttt gtgggccaga caagagttct
1381 catgttaaat ggagaggagg tagaagaaac cgaactgatg ggtttcgtgg atgatcagca
1441 gactttcttc tgtggcaacg tggctcatca gcagcttatc cagatcactt cagcatcggt
1501 gaggttggtc tctcaagaac ccaaagctct ggtcagtgaa tggaaggagc ctcaggccaa
1561 gaacatcagt gtggcctcct gcaatagcag ccaggtggtg gtggctgtag gcagggccct
1621 ctactatctg cagatccatc ctcaggagct ccggcagatc agccacacag agatggaaca
1681 tgaagtggct tgcttggaca tcaccccatt aggagacagc aatggactgt cccctctttg
1741 tgccattggc ctctggacgg acatctcggc tcgtatcttg aagttgccct cttttgaact
1801 actgcacaag gagatgctgg gtggagagat cattcctcgc tccatcctga tgaccacctt
1861 tgagagtagc cattacctcc tttgtgcctt gggagatgga gcgcttttct actttgggct
1921 caacattgag acaggtctgt tgagcgaccg taagaaggtg actttgggca cccagcccac
1981 cgtattgagg acttttcgtt ctctttctac caccaacgtc tttgcttgtt ctgaccgccc
2041 cactgtcatc tatagcagca accacaaatt ggtcttctca aatgtcaacc tcaaggaagt
2101 gaactacatg tgtcccctca attcagatgg ctatcctgac agcctggcgc tggccaacaa
2161 tagcaccctc accattggca ccatcgatga gatccagaag ctgcacattc gcacagttcc
2221 cctctatgag tctccaagga agatctgcta ccaggaagtg tcccagtgtt tcggggtcct
2281 ctccagccgc attgaagtcc aagacacgag tgggggcacg acagccttga ggcccagcgc
2341 tagcacccag gctctgtcca gcagtgtaag ctccagcaag ctgttctcca gcagcactgc
2401 tcctcatgag acctcctttg gagaagaggt ggaggtgcac aacctactta tcattgacca
2461 acacaccttt gaagtgcttc atgcccacca gtttctgcag aatgaatatg ccctcagtct
2521 ggtttcctgc aagctgggca aagaccccaa cacttacttc attgtgggca cagcaatggt
2581 gtatcctgaa gaggcagagc ccaagcaggg tcgcattgtg gtctttcagt attcggatgg
2641 aaaactacag actgtggctg aaaaggaagt gaaaggggcc gtgtactcta tggtggaatt
2701 taacgggaag ctgttagcca gcatcaatag cacggtgcgg ctctatgagt ggacaacaga
2761 gaaggagctg cgcactgagt gcaaccacta caacaacatc atggccctct acctgaagac
2821 caagggcgac ttcatcctgg tgggcgacct tatgcgctca gtgctgctgc ttgcctacaa
2881 gcccatggaa ggaaactttg aagagattgc tcgagacttt aatcccaact ggatgagtgc
2941 tgtggaaatc ttggatgatg acaattttct gggggctgaa aatgccttta acttgtttgt
3001 gtgtcaaaag gatagcgctg ccaccactga cgaggagcgg cagcacctcc aggaggttgg
3061 tcttttccac ctgggcgagt ttgtcaatgt cttttgccac ggctctctgg taatgcagaa
3121 tctgggtgag acttccaccc ccacacaagg ctcggtgctc ttcggcacgg tcaacggcat
3181 gatagggctg gtgacctcac tgtcagagag ctggtacaac ctcctgctgg acatgcagaa
3241 tcgactcaat aaagtcatca aaagtgtggg gaagatcgag cactccttct ggagatcctt
3301 tcacaccgag cggaagacag aaccagccac aggtttcatc gacggtgact tgattgagag
3361 tttcctggat attagccgcc ccaagatgca ggaggtggtg gcaaacctac agtatgacga
3421 tggcagcggt atgaagcgag aggccactgc agacgacctc atcaaggttg tggaggagct
3481 aactcggatc cattagccaa gggcaggggg cccctttgct gaccctcccc aaaggctttg
3541 ccctgctgcc ctccccctcc tctccaccat cgtcttcttg gccatgggag gcctttccct
3601 aagccagctg cccccagagc cacagttccc ctatgtggaa gtggggcggg cttcatagag
3661 acttgggaat gagctgaagg tgaaacattt tctccctgga tttttaccag tctcacatga
3721 ttccagccat caccttagac caccaagcct tgattggtgt tgccagttgt cctccttccg
3781 gggaaggatt ttgcagttct ttggctgaaa ggaagctgtg cgtgtgtgtg tgtttatgtg
3841 tgtgtgtgta tgtgtatctc acactcatgc attgtcctct ttttatttag attggcagtg
3901 tagggagttg tgggtagtgg ggaagagggt taggagggtt tcattgtctg tgaagtgaga
3961 ccttcctttt acttttcttc tattgcctct gagagcatca ggcctagagg cctgactgcc
4021 aagccatggg tagcctgggt gtaaaacctg gagatggtgg atgatcccca cgccacagcc
4081 cttttgtctc tgcaaactgc cttcttcgga aagaagaagg tgggaggatg tgaattgtta
4141 gtttctgagt tttaccaaat aaagtagaat ataagaagaa aaaaaaaaaa aaaaaa
//