LOCUS       BC031522                4082 bp    mRNA    linear   HUM 07-AUG-2008
DEFINITION  Homo sapiens excision repair cross-complementing rodent repair
            deficiency, complementation group 5, mRNA (cDNA clone MGC:9055
            IMAGE:3869893), complete cds.
ACCESSION   BC031522
VERSION     BC031522.2
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4082)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 4082)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (06-JUN-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Aug 25, 2003 this sequence version replaced BC031522.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Baylor College of Medicine Human Genome
            Sequencing Center
            Center code: BCM-HGSC
            Web site: http://www.hgsc.bcm.tmc.edu/cdna/
            Contact: amg@bcm.tmc.edu
            Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
            Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
            A.N., Gibbs, R.A.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 13 Row: j Column: 22
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 51988899
            This clone grew slowly and was rescued by PCR.
FEATURES             Location/Qualifiers
     source          1..4082
                     /db_xref="H-InvDB:HIT000041303"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:9055 IMAGE:3869893"
                     /tissue_type="Eye, retinoblastoma"
                     /clone_lib="NIH_MGC_67"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..4082
                     /gene="ERCC5"
                     /gene_synonym="COFS3"
                     /gene_synonym="UVDR"
                     /gene_synonym="XPG"
                     /db_xref="GeneID:2073"
                     /db_xref="HGNC:HGNC:3437"
                     /db_xref="MIM:133530"
     CDS             413..3973
                     /gene="ERCC5"
                     /gene_synonym="COFS3"
                     /gene_synonym="UVDR"
                     /gene_synonym="XPG"
                     /codon_start=1
                     /product="excision repair cross-complementing rodent
                     repair deficiency, complementation group 5"
                     /protein_id="AAH31522.1"
                     /db_xref="GeneID:2073"
                     /db_xref="HGNC:HGNC:3437"
                     /db_xref="MIM:133530"
                     /translation="MGVQGLWKLLECSGRQVSPEALEGKILAVDISIWLNQALKGVRD
                     RHGNSIENPHLLTLFHRLCKLLFFRIRPIFVFDGDAPLLKKQTLVKRRQRKDLASSDS
                     RKTTEKLLKTFLKRQAIKTAFRSKRDEALPSLTQVRRENDLYVLPPLQEEEKHSSEEE
                     DEKEWQERMNQKQALQEEFFHNPQAIDIESEDFSSLPPEVKHEILTDMKEFTKRRRTL
                     FEAMPEESDDFSQYQLKGLLKKNYLNQHIEHVQKEMNQQHSGHIRRQYEDEGGFLKEV
                     ESRRVVSEDTSHYILIKGIQAKTVAEVDSESLPSSSKMHGMSFDVKSSPCEKLKTEKE
                     PDATPPSPRTLLAMQAALLGSSSEEELESENRRQARGRNAPAAVDEGSISPRTLSAIK
                     RALDDDEDVKVCAGDDVQTGGPGAEEMRINSSTENSDEGLKVRDGKGIPFTATLASSS
                     VNSAEEHVASTNEGREPTDSVPKEQMSLVHVGTEAFPISDESMIKDRKDRLPLESAVV
                     RHSDAPGLPNGRELTPASPTCTNSVSKNETHAEVLEQQNELCPYESKFDSSLLSSDDE
                     TKCKPNSASEVIGPVSLQETSSIVSVPSEAVDNVENVVSFNAKEHENFLETIQEQQTT
                     ESAGQDLISIPKAVEPMEIDSEESESDGSFIEVQSVISDEELQAEFPETSKPPSEQGE
                     EELVGTREGEAPAESESLLRDNSERDDVDGEPQEAEKDAEDSLHEWQDINLEELETLE
                     SNLLAQQNSLKAQKQQQERIAATVTGQMFLESQELLRLFGIPYIQAPMEAEAQCAILD
                     LTDQTSGTITDDSDIWLFGARHVYRNFFNKNKFVEYYQYVDFHNQLGLDRNKLINLAY
                     LLGSDYTEGIPTVGCVTAMEILNEFPGHGLEPLLKFSEWWHEAQKNPKIRPNPHDTKV
                     KKKLRTLQLTPGFPNPAVAEAYLKPVVDDSKGSFLWGKPDLDKIREFCQRYFGWNRTK
                     TDESLFPVLKQLDAQQTQLRIDSFFRLAQQEKEDAKRIKSQRLNRAVTCMLRKEKEAA
                     ASEIEAVSVAMEKEFELLDKAKRKTQKRGITNTLEESSSLKRKRLSDSKRKNTCGGFL
                     GETCLSESSDGSSSEDAESSSLMNVQRRTAAKEPKTSASDSQNSVKEAPVKNGGATTS
                     SSSDSDDDGGKEKMVLVTARSVFGKKRRKLRRARGRKRKT"
BASE COUNT     1275 a    857 c   1037 g    913 t
ORIGIN
        1 tgcggaccca ccagcgaagg cgggaggtgt cgcagggaca tcttctggct gtttccgtcg
       61 cctgcgtggc ccttgcaccc cggtcttcca ttagcggcgc agacgtttgg gcctaagcgc
      121 tgggcgaggc gaggccctgc ccctccccgc caacggccat tctctggacc tgtctttctt
      181 ccgggaggcg gtgacagctg ctgagacgtg ttgcagccag agtctctccg ctttaatgcg
      241 ctcccattag tgccgtcccc cactggaaaa ccgtggcttc tgtattattt gccatctttg
      301 ttgtgtagga gcagggaggg cttcctcccg gggtcctagg cggcggtgca gtccgtcgta
      361 gaagaattag agtagaagtt gtcggggtcc gctcttagga cgcagccgcc tcatgggggt
      421 ccaggggctc tggaagctgc tggagtgctc cgggcggcag gtcagccccg aagcgctgga
      481 agggaagatc ctggctgttg atattagcat ttggttaaac caagcactta aaggagtccg
      541 ggatcgccac gggaactcaa tagaaaatcc tcatcttctc actttgtttc atcggctctg
      601 caaactctta ttttttcgaa ttcgtcctat ttttgtgttt gatggggatg ctccactatt
      661 gaagaaacag actttggtga agagaaggca gagaaaggac ttagcgtcca gtgactccag
      721 gaaaacgaca gagaagcttc tgaaaacatt tttgaaaaga caagccatca aaactgcctt
      781 cagaagcaaa agagatgaag cactacccag tcttacccaa gttcgaagag aaaacgacct
      841 ctatgttttg cctcctttac aagaggaaga aaaacacagt tcagaagagg aagatgaaaa
      901 agaatggcaa gaaagaatga atcaaaaaca agcattacag gaagagttct ttcataatcc
      961 tcaagcgata gatattgagt ctgaggactt cagcagcctg ccccctgaag taaagcatga
     1021 aatcttgact gatatgaaag agttcaccaa gcgcagaaga acattatttg aagcaatgcc
     1081 agaggagtct gatgactttt cacagtacca actcaaaggc ttgcttaaaa agaactatct
     1141 gaaccagcat atagaacatg tccaaaagga aatgaatcag caacattcag gacacatccg
     1201 aaggcagtat gaagatgaag ggggctttct gaaggaggta gagtcaagga gagtggtctc
     1261 tgaagacact tcacattaca tcttgataaa aggtattcaa gctaagacag ttgcagaagt
     1321 ggattcagag tctcttcctt cttccagcaa aatgcacggc atgtcttttg acgtgaagtc
     1381 atctccatgt gaaaaactga agacagagaa agagcctgat gctacccctc cttctccaag
     1441 aactttacta gctatgcaag ctgccctgct gggaagtagc tcagaagagg agctggagag
     1501 tgaaaatcga aggcaggccc gtgggaggaa cgcacctgct gctgtagacg aaggctccat
     1561 atcaccccgg actctttcag ccattaagag agctcttgac gatgacgaag atgtaaaagt
     1621 gtgtgctggg gatgatgtgc agacgggagg gccaggagca gaagaaatgc gtataaacag
     1681 ctccaccgag aacagtgatg aaggacttaa agtgagagat ggaaaaggaa taccgtttac
     1741 tgcaacactt gcgtcatcta gtgtgaactc tgcagaggag cacgtagcca gcactaatga
     1801 ggggagagag cccacagact cagttccaaa agaacaaatg tcacttgttc acgtggggac
     1861 tgaagccttt ccgataagtg atgagtctat gattaaggac agaaaagatc ggctgcctct
     1921 ggagagtgca gtggttagac atagtgacgc acctgggctc ccgaatggaa gggaactgac
     1981 accggcatct ccaacttgta caaattctgt gtcaaagaat gaaacacatg ctgaagtgct
     2041 tgagcagcag aacgaacttt gcccatatga gagtaaattc gattcttctc ttctttcaag
     2101 tgatgatgaa acaaaatgta aaccgaattc tgcttctgaa gtcattggcc ctgtcagttt
     2161 gcaagaaaca agtagcatag taagtgtccc ttcagaggca gtagataatg tggaaaatgt
     2221 ggtgtcattt aatgctaaag agcatgagaa ttttctggaa accatccaag aacagcagac
     2281 cactgaatct gcaggccagg atttaatttc cattccaaag gccgtggaac caatggaaat
     2341 tgactcggaa gaaagtgaat ctgatggaag tttcattgaa gtgcaaagtg tgattagtga
     2401 tgaggaactt caagcagaat tccctgaaac ttccaaacct ccctcagaac aaggcgaaga
     2461 ggaactggta ggaactaggg agggagaagc ccctgctgag tccgagagcc tcctgaggga
     2521 caactctgag agggacgacg tggatggtga gccacaggaa gctgagaaag atgcggaaga
     2581 ttcgctccat gaatggcaag atattaattt ggaggagttg gaaactctgg agagcaacct
     2641 cttagcacag cagaattcac tgaaagctca aaaacagcag caagaacgga tcgctgctac
     2701 tgtcaccgga cagatgttcc tggaaagcca ggaactcctg cgcctgttcg gcattcccta
     2761 catccaggct cccatggaag cagaggcgca gtgcgccatc ctggacctga ctgatcagac
     2821 ttccggaacc atcactgatg acagtgatat ctggctgttt ggagcgcggc atgtctatag
     2881 aaactttttt aataaaaaca agtttgtaga atattatcaa tatgtggact ttcacaatca
     2941 attgggattg gaccggaata agttaataaa tttggcttat ttgcttggaa gtgattatac
     3001 cgaaggaata ccaactgtgg gttgtgtaac cgccatggaa attctcaatg aattccctgg
     3061 gcatggcctg gaacctctcc taaaattctc agaatggtgg catgaagctc aaaaaaatcc
     3121 aaagataaga cctaatcctc atgacaccaa agtgaaaaaa aaattacgga cattgcaact
     3181 cacccctggc tttcctaacc cagctgttgc cgaggcctac ctcaaacccg tggtggatga
     3241 ctcgaaggga tcctttctgt gggggaaacc tgatctcgac aaaattagag aattttgtca
     3301 gcggtatttc ggctggaaca gaacgaagac agatgaatct ctgtttcctg tattaaagca
     3361 actcgatgcc cagcagacac agctccgaat tgattccttc tttagattag cacaacagga
     3421 gaaagaagat gctaaacgta ttaagagcca gagactaaac agagctgtga catgtatgct
     3481 aaggaaagag aaagaagcag cagccagcga aatagaagca gtttctgttg ccatggagaa
     3541 agaatttgag ctacttgata aggcaaaacg aaaaacccag aagagaggca taacaaatac
     3601 cttagaagag tcatcaagcc tgaaaagaaa gaggctttca gattctaaac gaaagaatac
     3661 atgcggtgga tttttggggg agacctgcct ctcagaatca tctgatggat cttcaagtga
     3721 agatgctgaa agttcatctt taatgaatgt acaaaggaga acagctgcga aagagccaaa
     3781 aaccagtgct tcagattcgc agaactcagt gaaggaagct cccgtgaaga atggaggtgc
     3841 gaccaccagc agctctagtg atagtgatga cgatggaggg aaagagaaga tggtcctcgt
     3901 gaccgccaga tctgtgtttg ggaagaaaag aaggaaacta agacgtgcga ggggaagaaa
     3961 aaggaaaacc taattaaaaa atatgtatcc tctataatta gttatgacag ccatttgtaa
     4021 tgaatttgtc gcaaagacgt aataaaatta actggtagca cggtaaaaaa aaaaaaaaaa
     4081 aa
//