LOCUS BC050634 2058 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens uracil-DNA glycosylase, mRNA (cDNA clone MGC:60117
IMAGE:6500377), complete cds.
ACCESSION BC050634
VERSION BC050634.2
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2058)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2058)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (08-APR-2003) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Aug 25, 2003 this sequence version replaced BC050634.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Sequencing Group at the Stanford Human Genome
Center, Stanford University School of Medicine, Stanford, CA 94305
Web site: http://www-shgc.stanford.edu
Contact: (Dickson, Mark) mcd@paxil.stanford.edu
Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
R. M.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 110 Row: o Column: 8
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 19718744.
FEATURES Location/Qualifiers
source 1..2058
/db_xref="H-InvDB:HIT000053529"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:60117 IMAGE:6500377"
/tissue_type="Uterus, leiomyosarcoma"
/clone_lib="NIH_MGC_71"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene 1..2058
/gene="UNG"
/gene_synonym="HIGM4"
/gene_synonym="UDG"
/gene_synonym="UNG1"
/gene_synonym="UNG15"
/db_xref="GeneID:7374"
/db_xref="HGNC:HGNC:12572"
/db_xref="MIM:191525"
CDS 104..1018
/gene="UNG"
/gene_synonym="HIGM4"
/gene_synonym="UDG"
/gene_synonym="UNG1"
/gene_synonym="UNG15"
/codon_start=1
/product="uracil-DNA glycosylase"
/protein_id="AAH50634.1"
/db_xref="GeneID:7374"
/db_xref="HGNC:HGNC:12572"
/db_xref="MIM:191525"
/translation="MGVFCLGPWGLGRKLRTPGKGPLQLLSRLCGDHLQAIPAKKAPA
GQEEPGTPPSSPLSAEQLDRIQRNKAAALLRLAARNVPVGFGESWKKHLSGEFGKPYF
IKLMGFVAEERKHYTVYPPPHQVFTWTQMCDIKDVKVVILGQDPYHGPNQAHGLCFSV
QRPVPPPPSLENIYKELSTDIEDFVHPGHGDLSGWAKQGVLLLNAVLTVRAHQANSHK
ERGWEQFTDAVVSWLNQNSNGLVFLLWGSYAQKKGSAIDRKRHHVLQTAHPSPLSVYR
GFFGCRHFSKTNELLQKSGKKPIDWKEL"
BASE COUNT 484 a 479 c 537 g 558 t
ORIGIN
1 ccagcccgtc tccccgctcc agtttagaac ctaattccca attcccggac cgggcccagc
61 cctgggctct tactgtccgc ttttgctggg acctgttcca caaatgggcg tcttctgcct
121 tgggccgtgg gggttgggcc ggaagctgcg gacgcctggg aaggggccgc tgcagctctt
181 gagccgcctc tgcggggacc acttgcaggc catcccagcc aagaaggccc cggctgggca
241 ggaggagcct gggacgccgc cctcctcgcc gctgagtgcc gagcagttgg accggatcca
301 gaggaacaag gccgcggccc tgctcagact cgcggcccgc aacgtgcccg tgggctttgg
361 agagagctgg aagaagcacc tcagcgggga gttcgggaaa ccgtatttta tcaagctaat
421 gggatttgtt gcagaagaaa gaaagcatta cactgtttat ccacccccac accaagtctt
481 cacctggacc cagatgtgtg acataaaaga tgtgaaggtt gtcatcctgg gacaggatcc
541 atatcatgga cctaatcaag ctcacgggct ctgctttagt gttcaaaggc ctgttccgcc
601 tccgcccagt ttggagaaca tttataaaga gttgtctaca gacatagagg attttgttca
661 tcctggccat ggagatttat ctgggtgggc caagcaaggt gttctccttc tcaacgctgt
721 cctcacggtt cgtgcccatc aagccaactc tcataaggag cgaggctggg agcagttcac
781 tgatgcagtt gtgtcctggc taaatcagaa ctcgaatggc cttgttttct tgctctgggg
841 ctcttatgct cagaagaagg gcagtgccat tgataggaag cggcaccatg tactacagac
901 ggctcatccc tcccctttgt cagtgtatag agggttcttt ggatgtagac acttttcaaa
961 gaccaatgag ctgctgcaga agtctggcaa gaagcccatt gactggaagg agctgtgatc
1021 atcagctgag gggtggcctt tgagaagctg ctgttaacgt atttgccagt tacgaagttc
1081 cactgaaaat tttcctatta attcttaagt actctgcata agggggaaaa gcttccagaa
1141 agcagccatg aaccaggctg tccaggaatg gcagctgtat ccaaccacaa acaacaaagg
1201 ctaccctttg accaaatgtc tttctctgca acatggcttc ggcctaaaat atgcagaaga
1261 cagatgaggt caaatactca gttggctctc tttatctccc ttgcctttat ggtgaaacag
1321 gggagatgtg cacctttcag gcacagccct agtttggcgc ctgctgctcc ttggttttgc
1381 ctggttagac tttcagtgac agatgttggg gtgtttttgc ttagaaaggt ccccttgtct
1441 cagccttgca gggcaggcat gccagtctct gccagttcca ctgccccctt gatctttgaa
1501 ggagtcctca ggcccctcgc agcataagga tgttttgcaa ctttccagaa tctggcccag
1561 aaattagggc tcaatttcct gattgtagta gaggttaaga ttgctgtgag ctttatcaga
1621 taagagaccg agagaagtaa gctgggtctt gttattcctt gggtgttggt ggaataagca
1681 gtggaatttg aacaaggaag aggagaaaag ggaattttgt ctttatgggg tggggtgatt
1741 ttctcctagg gttatgtcca gttggggttt ttaaggcagc acagactgcc aagtactgtt
1801 ttttttaacc gactgaaatc actttgggat attttttcct gcaacactgg aaagttttag
1861 ttttttaaga agtactcatg cagatatata tatatatatt tttcccagtc ctttttttaa
1921 gagacggtct ttattgggtc tgcacctcca tccttgatct tgttagcaat gctgtttttg
1981 ctgttagtcg ggttagagtt ggctctacgc gaggtttgtt aataaaagtt tgttaaaagt
2041 taaaaaaaaa aaaaaaaa
//