LOCUS BC031522 4082 bp mRNA linear HUM 07-AUG-2008
DEFINITION Homo sapiens excision repair cross-complementing rodent repair
deficiency, complementation group 5, mRNA (cDNA clone MGC:9055
IMAGE:3869893), complete cds.
ACCESSION BC031522
VERSION BC031522.2
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 4082)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 4082)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (06-JUN-2002) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Aug 25, 2003 this sequence version replaced BC031522.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Baylor College of Medicine Human Genome
Sequencing Center
Center code: BCM-HGSC
Web site: http://www.hgsc.bcm.tmc.edu/cdna/
Contact: amg@bcm.tmc.edu
Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
A.N., Gibbs, R.A.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 13 Row: j Column: 22
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 51988899
This clone grew slowly and was rescued by PCR.
FEATURES Location/Qualifiers
source 1..4082
/db_xref="H-InvDB:HIT000041303"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:9055 IMAGE:3869893"
/tissue_type="Eye, retinoblastoma"
/clone_lib="NIH_MGC_67"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene 1..4082
/gene="ERCC5"
/gene_synonym="COFS3"
/gene_synonym="UVDR"
/gene_synonym="XPG"
/db_xref="GeneID:2073"
/db_xref="HGNC:HGNC:3437"
/db_xref="MIM:133530"
CDS 413..3973
/gene="ERCC5"
/gene_synonym="COFS3"
/gene_synonym="UVDR"
/gene_synonym="XPG"
/codon_start=1
/product="excision repair cross-complementing rodent
repair deficiency, complementation group 5"
/protein_id="AAH31522.1"
/db_xref="GeneID:2073"
/db_xref="HGNC:HGNC:3437"
/db_xref="MIM:133530"
/translation="MGVQGLWKLLECSGRQVSPEALEGKILAVDISIWLNQALKGVRD
RHGNSIENPHLLTLFHRLCKLLFFRIRPIFVFDGDAPLLKKQTLVKRRQRKDLASSDS
RKTTEKLLKTFLKRQAIKTAFRSKRDEALPSLTQVRRENDLYVLPPLQEEEKHSSEEE
DEKEWQERMNQKQALQEEFFHNPQAIDIESEDFSSLPPEVKHEILTDMKEFTKRRRTL
FEAMPEESDDFSQYQLKGLLKKNYLNQHIEHVQKEMNQQHSGHIRRQYEDEGGFLKEV
ESRRVVSEDTSHYILIKGIQAKTVAEVDSESLPSSSKMHGMSFDVKSSPCEKLKTEKE
PDATPPSPRTLLAMQAALLGSSSEEELESENRRQARGRNAPAAVDEGSISPRTLSAIK
RALDDDEDVKVCAGDDVQTGGPGAEEMRINSSTENSDEGLKVRDGKGIPFTATLASSS
VNSAEEHVASTNEGREPTDSVPKEQMSLVHVGTEAFPISDESMIKDRKDRLPLESAVV
RHSDAPGLPNGRELTPASPTCTNSVSKNETHAEVLEQQNELCPYESKFDSSLLSSDDE
TKCKPNSASEVIGPVSLQETSSIVSVPSEAVDNVENVVSFNAKEHENFLETIQEQQTT
ESAGQDLISIPKAVEPMEIDSEESESDGSFIEVQSVISDEELQAEFPETSKPPSEQGE
EELVGTREGEAPAESESLLRDNSERDDVDGEPQEAEKDAEDSLHEWQDINLEELETLE
SNLLAQQNSLKAQKQQQERIAATVTGQMFLESQELLRLFGIPYIQAPMEAEAQCAILD
LTDQTSGTITDDSDIWLFGARHVYRNFFNKNKFVEYYQYVDFHNQLGLDRNKLINLAY
LLGSDYTEGIPTVGCVTAMEILNEFPGHGLEPLLKFSEWWHEAQKNPKIRPNPHDTKV
KKKLRTLQLTPGFPNPAVAEAYLKPVVDDSKGSFLWGKPDLDKIREFCQRYFGWNRTK
TDESLFPVLKQLDAQQTQLRIDSFFRLAQQEKEDAKRIKSQRLNRAVTCMLRKEKEAA
ASEIEAVSVAMEKEFELLDKAKRKTQKRGITNTLEESSSLKRKRLSDSKRKNTCGGFL
GETCLSESSDGSSSEDAESSSLMNVQRRTAAKEPKTSASDSQNSVKEAPVKNGGATTS
SSSDSDDDGGKEKMVLVTARSVFGKKRRKLRRARGRKRKT"
BASE COUNT 1275 a 857 c 1037 g 913 t
ORIGIN
1 tgcggaccca ccagcgaagg cgggaggtgt cgcagggaca tcttctggct gtttccgtcg
61 cctgcgtggc ccttgcaccc cggtcttcca ttagcggcgc agacgtttgg gcctaagcgc
121 tgggcgaggc gaggccctgc ccctccccgc caacggccat tctctggacc tgtctttctt
181 ccgggaggcg gtgacagctg ctgagacgtg ttgcagccag agtctctccg ctttaatgcg
241 ctcccattag tgccgtcccc cactggaaaa ccgtggcttc tgtattattt gccatctttg
301 ttgtgtagga gcagggaggg cttcctcccg gggtcctagg cggcggtgca gtccgtcgta
361 gaagaattag agtagaagtt gtcggggtcc gctcttagga cgcagccgcc tcatgggggt
421 ccaggggctc tggaagctgc tggagtgctc cgggcggcag gtcagccccg aagcgctgga
481 agggaagatc ctggctgttg atattagcat ttggttaaac caagcactta aaggagtccg
541 ggatcgccac gggaactcaa tagaaaatcc tcatcttctc actttgtttc atcggctctg
601 caaactctta ttttttcgaa ttcgtcctat ttttgtgttt gatggggatg ctccactatt
661 gaagaaacag actttggtga agagaaggca gagaaaggac ttagcgtcca gtgactccag
721 gaaaacgaca gagaagcttc tgaaaacatt tttgaaaaga caagccatca aaactgcctt
781 cagaagcaaa agagatgaag cactacccag tcttacccaa gttcgaagag aaaacgacct
841 ctatgttttg cctcctttac aagaggaaga aaaacacagt tcagaagagg aagatgaaaa
901 agaatggcaa gaaagaatga atcaaaaaca agcattacag gaagagttct ttcataatcc
961 tcaagcgata gatattgagt ctgaggactt cagcagcctg ccccctgaag taaagcatga
1021 aatcttgact gatatgaaag agttcaccaa gcgcagaaga acattatttg aagcaatgcc
1081 agaggagtct gatgactttt cacagtacca actcaaaggc ttgcttaaaa agaactatct
1141 gaaccagcat atagaacatg tccaaaagga aatgaatcag caacattcag gacacatccg
1201 aaggcagtat gaagatgaag ggggctttct gaaggaggta gagtcaagga gagtggtctc
1261 tgaagacact tcacattaca tcttgataaa aggtattcaa gctaagacag ttgcagaagt
1321 ggattcagag tctcttcctt cttccagcaa aatgcacggc atgtcttttg acgtgaagtc
1381 atctccatgt gaaaaactga agacagagaa agagcctgat gctacccctc cttctccaag
1441 aactttacta gctatgcaag ctgccctgct gggaagtagc tcagaagagg agctggagag
1501 tgaaaatcga aggcaggccc gtgggaggaa cgcacctgct gctgtagacg aaggctccat
1561 atcaccccgg actctttcag ccattaagag agctcttgac gatgacgaag atgtaaaagt
1621 gtgtgctggg gatgatgtgc agacgggagg gccaggagca gaagaaatgc gtataaacag
1681 ctccaccgag aacagtgatg aaggacttaa agtgagagat ggaaaaggaa taccgtttac
1741 tgcaacactt gcgtcatcta gtgtgaactc tgcagaggag cacgtagcca gcactaatga
1801 ggggagagag cccacagact cagttccaaa agaacaaatg tcacttgttc acgtggggac
1861 tgaagccttt ccgataagtg atgagtctat gattaaggac agaaaagatc ggctgcctct
1921 ggagagtgca gtggttagac atagtgacgc acctgggctc ccgaatggaa gggaactgac
1981 accggcatct ccaacttgta caaattctgt gtcaaagaat gaaacacatg ctgaagtgct
2041 tgagcagcag aacgaacttt gcccatatga gagtaaattc gattcttctc ttctttcaag
2101 tgatgatgaa acaaaatgta aaccgaattc tgcttctgaa gtcattggcc ctgtcagttt
2161 gcaagaaaca agtagcatag taagtgtccc ttcagaggca gtagataatg tggaaaatgt
2221 ggtgtcattt aatgctaaag agcatgagaa ttttctggaa accatccaag aacagcagac
2281 cactgaatct gcaggccagg atttaatttc cattccaaag gccgtggaac caatggaaat
2341 tgactcggaa gaaagtgaat ctgatggaag tttcattgaa gtgcaaagtg tgattagtga
2401 tgaggaactt caagcagaat tccctgaaac ttccaaacct ccctcagaac aaggcgaaga
2461 ggaactggta ggaactaggg agggagaagc ccctgctgag tccgagagcc tcctgaggga
2521 caactctgag agggacgacg tggatggtga gccacaggaa gctgagaaag atgcggaaga
2581 ttcgctccat gaatggcaag atattaattt ggaggagttg gaaactctgg agagcaacct
2641 cttagcacag cagaattcac tgaaagctca aaaacagcag caagaacgga tcgctgctac
2701 tgtcaccgga cagatgttcc tggaaagcca ggaactcctg cgcctgttcg gcattcccta
2761 catccaggct cccatggaag cagaggcgca gtgcgccatc ctggacctga ctgatcagac
2821 ttccggaacc atcactgatg acagtgatat ctggctgttt ggagcgcggc atgtctatag
2881 aaactttttt aataaaaaca agtttgtaga atattatcaa tatgtggact ttcacaatca
2941 attgggattg gaccggaata agttaataaa tttggcttat ttgcttggaa gtgattatac
3001 cgaaggaata ccaactgtgg gttgtgtaac cgccatggaa attctcaatg aattccctgg
3061 gcatggcctg gaacctctcc taaaattctc agaatggtgg catgaagctc aaaaaaatcc
3121 aaagataaga cctaatcctc atgacaccaa agtgaaaaaa aaattacgga cattgcaact
3181 cacccctggc tttcctaacc cagctgttgc cgaggcctac ctcaaacccg tggtggatga
3241 ctcgaaggga tcctttctgt gggggaaacc tgatctcgac aaaattagag aattttgtca
3301 gcggtatttc ggctggaaca gaacgaagac agatgaatct ctgtttcctg tattaaagca
3361 actcgatgcc cagcagacac agctccgaat tgattccttc tttagattag cacaacagga
3421 gaaagaagat gctaaacgta ttaagagcca gagactaaac agagctgtga catgtatgct
3481 aaggaaagag aaagaagcag cagccagcga aatagaagca gtttctgttg ccatggagaa
3541 agaatttgag ctacttgata aggcaaaacg aaaaacccag aagagaggca taacaaatac
3601 cttagaagag tcatcaagcc tgaaaagaaa gaggctttca gattctaaac gaaagaatac
3661 atgcggtgga tttttggggg agacctgcct ctcagaatca tctgatggat cttcaagtga
3721 agatgctgaa agttcatctt taatgaatgt acaaaggaga acagctgcga aagagccaaa
3781 aaccagtgct tcagattcgc agaactcagt gaaggaagct cccgtgaaga atggaggtgc
3841 gaccaccagc agctctagtg atagtgatga cgatggaggg aaagagaaga tggtcctcgt
3901 gaccgccaga tctgtgtttg ggaagaaaag aaggaaacta agacgtgcga ggggaagaaa
3961 aaggaaaacc taattaaaaa atatgtatcc tctataatta gttatgacag ccatttgtaa
4021 tgaatttgtc gcaaagacgt aataaaatta actggtagca cggtaaaaaa aaaaaaaaaa
4081 aa
//