LOCUS       BC020973                2084 bp    mRNA    linear   SYN 02-SEP-2016
DEFINITION  Synthetic construct Homo sapiens RAD23 homolog B (S. cerevisiae),
            mRNA (cDNA clone MGC:9444 IMAGE:3906269), complete cds.
ACCESSION   BC020973
VERSION     BC020973.2
KEYWORDS    MGC.
SOURCE      synthetic construct
  ORGANISM  synthetic construct
            other sequences; artificial sequences.
REFERENCE   1  (bases 1 to 2084)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2084)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (03-JAN-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Aug 25, 2003 this sequence version replaced BC020973.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA  94305
            Web site:       http://www-shgc.stanford.edu
            Contact:  (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriguez, A., and Myers,
            R. M.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 14 Row: p Column: 2
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 19924138.
FEATURES             Location/Qualifiers
     source          1..2084
                     /db_xref="H-InvDB:HIT000039060"
                     /organism="synthetic construct"
                     /mol_type="mRNA"
                     /isolation_source="Homo sapiens; Uterus, leiomyosarcoma"
                     /db_xref="taxon:32630"
                     /clone="MGC:9444 IMAGE:3906269"
                     /clone_lib="NIH_MGC_71"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..2084
                     /gene="RAD23B"
                     /gene_synonym="HHR23B"
                     /gene_synonym="HR23B"
                     /gene_synonym="P58"
                     /db_xref="GeneID:5887"
                     /db_xref="MIM:600062"
     CDS             51..1280
                     /gene="RAD23B"
                     /gene_synonym="HHR23B"
                     /gene_synonym="HR23B"
                     /gene_synonym="P58"
                     /codon_start=1
                     /transl_table=11
                     /product="RAD23B protein"
                     /protein_id="AAH20973.1"
                     /db_xref="GeneID:5887"
                     /db_xref="MIM:600062"
                     /translation="MQVTLKTLQQQTFKIDIDPEETVKALKEKIESEKGKDAFPVAGQ
                     KLIYAGKILNDDTALKEYKIDEKNFVVVMVTKPKAVSTPAPATTQQSAPASTTAVTSS
                     TTTTVAQAPTPVPALAPTSTPASITPASATASSEPAPASAAKQEKPAEKPAETPVATS
                     PTATDSTSGDSSRSNLFEDATSALVTGQSYENMVTEIMSMGYEREQVIAALRASFNNP
                     DRAVEYLLMGIPGDRESQAVVDPPQAASTGVPQSSAVAAAAATTTATTTTTSSGGHPL
                     EFLRNQPQFQQMRQIIQQNPSLLPALLQQIGRENPQLLQQISQHQEHFIQMLNEPVQE
                     AGGQGGGGGGGSGGIAEAGSGHMNYIQVTPQEKEAIERLKALGFPEGLVIQAYFACEK
                     NENLAANFLLQQNFDED"
     misc_feature    51..284
                     /gene="RAD23B"
                     /gene_synonym="HHR23B"
                     /gene_synonym="HR23B"
                     /gene_synonym="P58"
                     /note="ubiquitin; Region: Ubiquitin family. This family
                     contains a number of ubiquitin-like proteins: SUMO (smt3
                     homologue), Nedd8, Elongin B, Rub1"
                     /db_xref="CDD:pfam00240"
     misc_feature    615..734
                     /gene="RAD23B"
                     /gene_synonym="HHR23B"
                     /gene_synonym="HR23B"
                     /gene_synonym="P58"
                     /note="UBA; Region: UBA/TS-N domain. This small domain is
                     composed of three alpha helices. This family includes the
                     previously defined UBA and TS-N domains. The UBA-domain
                     (ubiquitin associated domain) is a novel sequence motif
                     found in several proteins having connections to ubiquitin
                     and the ubiquitination pathway. The structure of the UBA
                     domain consists of a compact three helix bundle. This
                     domain is found at the N terminus of EF-TS hence the name
                     TS-N. The structure of EF-TS is known and this domain is
                     implicated in its interaction with EF-TU. The domain has
                     been found in non EF-TS proteins such as alpha-NAC and
                     MJ0280"
                     /db_xref="CDD:pfam00627"
     misc_feature    1146..1256
                     /gene="RAD23B"
                     /gene_synonym="HHR23B"
                     /gene_synonym="HR23B"
                     /gene_synonym="P58"
                     /note="UBA; Region: Ubiquitin associated domain"
                     /db_xref="CDD:smart00165"
BASE COUNT          639 a          427 c          464 g          554 t
ORIGIN      
        1 cccacgcgtc cgcggacgcg tgggcggacg cgtgggtgcg cggcggcacc atgcaggtca
       61 ccctgaagac cctccagcag cagaccttca agatagacat tgaccccgag gagacggtga
      121 aagcactgaa agagaagatt gaatctgaaa aggggaaaga tgcctttcca gtagcaggtc
      181 aaaaattaat ttatgcaggc aaaatcctca atgatgatac tgctctcaaa gaatataaaa
      241 ttgatgagaa aaactttgtg gtggttatgg tgaccaaacc caaagcagtg tccacaccag
      301 caccagctac aactcagcag tcagctcctg ccagcactac agcagttact tcctccacca
      361 ccacaactgt ggctcaggct ccaacccctg tccctgcctt ggcccccact tccacacctg
      421 catccatcac tccagcatca gcgacagcat cttctgaacc tgcacctgct agtgcagcta
      481 aacaagagaa gcctgcagaa aagccagcag agacaccagt ggctactagc ccaacagcaa
      541 ctgacagtac atcgggtgat tcttctcggt caaacctttt tgaagatgca acgagtgcac
      601 ttgtgacggg tcagtcttac gagaatatgg taactgagat catgtcaatg ggctatgaac
      661 gagagcaagt aattgcagcc ctgagagcca gtttcaacaa ccctgacaga gcagtggagt
      721 atcttttaat gggaatccct ggagatagag aaagtcaggc tgtggttgac ccccctcaag
      781 cagctagtac tggggttcct cagtcttcag cagtggctgc agctgcagca actacgacag
      841 caacaactac aacaacaagt tctggaggac atccccttga atttttacgg aatcagcctc
      901 agtttcaaca gatgagacaa attattcagc agaatccttc cttgcttcca gcgttactac
      961 agcagatagg tcgagagaat cctcaattac ttcagcaaat tagccaacac caggagcatt
     1021 ttattcagat gttaaatgaa ccagttcaag aagctggtgg tcaaggagga ggaggtggag
     1081 gtggcagtgg aggaattgca gaagctggaa gtggtcatat gaactacatt caagtaacac
     1141 ctcaggaaaa agaagctata gaaaggttaa aggcattagg atttcctgaa ggacttgtga
     1201 tacaagcgta ttttgcttgt gagaagaatg agaatttggc tgccaatttt cttctacagc
     1261 agaactttga tgaagattga aagggacttt tttatatctc acacttcaca ccagtgcatt
     1321 acactaactt gttcactgga ttgtctggga tgacttgggc tcatatccac aatacttggt
     1381 ataaggtagt agattgttgg gggtggggag ggagggatct aggatacagg gcagggataa
     1441 atacagtgca tgtctgcttc aattagcaga tgccgcaact ccacacagtg tgtaaaatat
     1501 atacaaccaa aaatcagctt ttgcaggtct ttatttcttc tgtaaaacag taggtaactt
     1561 ttcctaggtt tcactctttt tagtgtacta gatccagaaa cttagtgtaa tgccctgctt
     1621 tatatttctt tgacttaaca ttggtttcag aaagaatctt agctacctag aatttacagt
     1681 ctctgtttca tggcaacact ggataatggc tttgtgaaat ttaaaaaatt tttgtagcga
     1741 ctgtaaacag aaatgccaaa ttgatggtta attgttgctg cttcaaaaat aagtataaaa
     1801 ttaatatgta aggaagccca ttctttcatg ttaaatactt ggggtgggag gggagaaagg
     1861 gaaccttttc ttaaaatgaa aataattact gctattttaa aatttcttga tcattgaatg
     1921 tgagaccctt ctaacatgat ttgagaagct gtacaagtat aggcagagtt attttcctgt
     1981 ttacattttt tttttgtttt ggggaaaaaa ttggtaggtg tctaattact gtttacttca
     2041 ttgttatatt gcagtaaaag ttttaaaaca aaaaaaaaaa aaaa
//