LOCUS       BC039593                4490 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens ubiquitin specific peptidase 20, mRNA (cDNA clone
            MGC:48817 IMAGE:5588842), complete cds.
ACCESSION   BC039593
VERSION     BC039593.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4490)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 4490)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (01-NOV-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Invitrogen
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
            Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 84 Row: i Column: 3
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 56713257.
FEATURES             Location/Qualifiers
     source          1..4490
                     /db_xref="H-InvDB:HIT000052220"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:48817 IMAGE:5588842"
                     /tissue_type="Ovary, pooled from 3 adults"
                     /clone_lib="NIH_MGC_125"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..4490
                     /gene="USP20"
                     /gene_synonym="KIAA1003"
                     /gene_synonym="LSFR3A"
                     /gene_synonym="VDU2"
                     /db_xref="GeneID:10868"
                     /db_xref="HGNC:HGNC:12619"
     CDS             159..2900
                     /gene="USP20"
                     /gene_synonym="KIAA1003"
                     /gene_synonym="LSFR3A"
                     /gene_synonym="VDU2"
                     /codon_start=1
                     /product="ubiquitin specific peptidase 20"
                     /protein_id="AAH39593.1"
                     /db_xref="GeneID:10868"
                     /db_xref="HGNC:HGNC:12619"
                     /translation="MGDSRDLCPHLDSIGEVTKEDLLLKSKGTCQSCGVTGPNLWACL
                     QVACPYVGCGESFADHSTIHAQAKKHNLTVNLTTFRLWCYACEKEVFLEQRLAAPLLG
                     SSSKFSEQDSPPPSHPLKAVPIAVADEGESESEDDDLKPRGLTGMKNLGNSCYMNAAL
                     QALSNCPPLTQFFLECGGLVRTDKKPALCKSYQKLVSEVWHKKRPSYVVPTSLSHGIK
                     LVNPMFRGYAQQDTQEFLRCLMDQLHEELKEPVVATVALTEARDSDSSDTDEKREGDR
                     SPSEDEFLSCDSSSDRGEGDGQGRGGGSSQAETELLIPDEAGRAISEKERMKDRKFSW
                     GQQRTNSEQVDEDADVDTAMAALDQPAEAQPPSPRSSSPCRTPEPDNDAHLRSSSRPC
                     SPVHHHEGHAKLSSSPPRASPVRMAPSYVLKKAQVLSAGSRRRKEQRYRSVISDIFDG
                     SILSLVQCLTCDRVSTTVETFQDLSLPIPGKEDLAKLHSAIYQNVPAKPGACGDSYAA
                     QGWLAFIVEYIRRFVVSCTPSWFWGPVVTLEDCLAAFFAADELKGDNMYSCERCKKLR
                     NGVKYCKVLRLPEILCIHLKRFRHEVMYSFKINSHVSFPLEGLDLRPFLAKECTSQIT
                     TYDLLSVICHHGTAGSGHYIAYCQNVINGQWYEFDDQYVTEVHETVVQNAEGYVLFYR
                     KSSEEAMRERQQVVSLAAMREPSLLRFYVSREWLNKFNTFAEPGPITNQTFLCSHGGI
                     PPHKYHYIDDLVVILPQNVWEHLYNRFGGGPAVNHLYVCSICQVEIEALAKRRRIEID
                     TFIKLNKAFQAEESPGVIYCISMQWFREWEAFVKGKDNEPPGPIDNSRIAQVKGSGHV
                     QLKQGADYGQISEETWTYLNSLYGGGPEIAIRQSVAQPLGPENLHGEQKIEAETRAV"
BASE COUNT          948 a         1347 c         1341 g          854 t
ORIGIN      
        1 caggcggcgg cggcgcagtt gcgagtgcag gctccttgcc agaggcctcc actcactcca
       61 gacccctata gcccgtcgct gtcagctgtc aacaaaggat gcgaatgctg gccgcttcct
      121 gtgggcttcg tgtcacccag aggtgagccc aggccaggat gggggactcc agggaccttt
      181 gccctcacct tgactccata ggagaggtga ccaaagagga cttgctgctc aaatctaagg
      241 gaacctgtca gtcgtgtggg gtcaccggac caaacctatg ggcctgtctg caggttgcct
      301 gcccctatgt tggctgcgga gaatccttcg ctgaccacag caccattcat gcacaggcaa
      361 aaaagcacaa cttgaccgtg aacctgacca cgttccgact gtggtgttac gcctgtgaga
      421 aggaggtatt cctggagcag cggctggcag cccctctgct gggctcctct tccaagttct
      481 ctgaacagga ctccccgcca ccctcccacc ctctgaaagc tgttcctatt gctgtggctg
      541 atgaaggaga gtctgagtca gaggatgatg acctgaaacc tcgaggcctc acgggcatga
      601 agaacctcgg gaactcctgc tacatgaacg ccgccctgca ggccctgtcc aattgcccgc
      661 cgctgactca gttcttcttg gagtgtggcg gcctggtgcg cacagataag aagccagccc
      721 tgtgcaagag ctaccagaag ctggtctctg aggtctggca taagaaacgg ccaagctacg
      781 tggtccccac cagtctgtct catgggatca agttggtcaa cccaatgttc cgaggctatg
      841 cccagcagga cacccaagag ttccttcgct gcctgatgga ccagctgcac gaggagctca
      901 aggagccggt ggtggccacg gtggcgctga cggaggctcg ggactcagat tcgagtgaca
      961 cggatgagaa acgggagggt gaccggagcc catcagaaga tgagttcttg tcctgtgact
     1021 cgagcagtga ccggggtgag ggtgacgggc aggggcgtgg cgggggcagc tcgcaggccg
     1081 agacggagct gctgatccca gatgaggcgg gccgagccat ctctgagaag gagcggatga
     1141 aggaccgcaa gttctcctgg ggccagcagc gtacaaactc ggagcaagtg gacgaggacg
     1201 ctgatgtgga cactgccatg gctgcccttg accagcccgc ggaggcccag cccccgtcac
     1261 cacggtcctc cagcccctgc cggacgccag agccggacaa tgatgctcac ctacgcagct
     1321 cctctcgccc ctgcagcccc gtccaccacc acgagggcca tgccaagctg tctagcagcc
     1381 cccctcgtgc aagccccgtg aggatggcac cgtcgtacgt gctcaagaaa gcccaggtat
     1441 tgagtgctgg cagccggagg cggaaggagc agcgctaccg cagcgtcatc tcagacatct
     1501 ttgacggctc cattctcagc ctcgtgcagt gtctcacctg tgaccgggta tccaccacag
     1561 tggaaacgtt ccaggactta tcactgccca ttcctggaaa ggaggacctg gccaagctcc
     1621 attcagccat ctaccagaat gtgccggcca agccaggcgc ctgtggggac agctatgccg
     1681 cccagggctg gctggccttc attgtggagt acatccgacg gtttgtggta tcctgtaccc
     1741 ccagctggtt ttgggggcct gtcgtcaccc tggaagactg ccttgctgcc ttctttgccg
     1801 ctgatgagtt aaagggtgac aacatgtaca gctgtgagcg gtgtaagaag ctgcggaacg
     1861 gagtgaagta ctgcaaagtc ctgcggttgc ccgagatcct gtgcattcac ctaaagcgct
     1921 ttcggcacga ggtgatgtac tcattcaaga tcaacagcca cgtctccttc cccctcgagg
     1981 ggctcgacct gcgccccttc cttgccaagg agtgcacatc ccagatcacc acctacgacc
     2041 tcctctcggt catctgccac cacggcacgg caggcagtgg gcactacatc gcctactgcc
     2101 agaacgtgat caatgggcag tggtacgagt ttgatgacca gtacgtcaca gaagtccacg
     2161 agacggtggt gcagaacgcc gagggctacg tactcttcta caggaagagc agcgaggagg
     2221 ccatgcggga gcgacagcag gtggtgtccc tggccgccat gcgggagccc agcctgctgc
     2281 ggttctacgt gtcccgcgag tggctcaaca agttcaacac cttcgcggag ccaggcccca
     2341 tcaccaacca gaccttcctc tgctcccacg gaggcatccc gccccacaaa taccactaca
     2401 tcgacgacct ggtggtcatc ctgccccaga acgtctggga gcacctgtac aacagattcg
     2461 ggggtggccc cgccgtgaac cacctgtacg tgtgctccat ctgccaggtg gagatcgagg
     2521 cactggccaa gcgcaggagg atcgagatcg acaccttcat caagttgaac aaggccttcc
     2581 aggccgagga gtcgccgggc gtcatctact gcatcagcat gcagtggttc cgggagtggg
     2641 aggcgttcgt caaggggaag gacaacgagc cccccgggcc cattgacaac agcaggattg
     2701 cacaggtcaa aggaagcggc catgtccagc tgaagcaggg agctgactac gggcagattt
     2761 cggaggagac ctggacctac ctgaacagcc tgtatggagg tggccccgag attgccatcc
     2821 gccagagtgt ggcgcagccg ctgggcccag agaacctgca cggggagcag aagatcgaag
     2881 ccgagacgcg ggccgtgtga tctgctgggc tagtctgtaa gtcgccccgg ctggtccctc
     2941 catggcactc tgggtcctct cctcactctc cagagaccct cacatgtcct tttgaacatc
     3001 caaagagcag gtccctgaaa gcaccttcct ggaggatgtg ggagggccct ggacatggcc
     3061 cggccccact gctgagtgcc cgtgtcccca cagccccatg tgccccaccc cgcggaaggc
     3121 gtgtttgtgc ccagaagaga ggccgggctg ctgcagaacc ccgccgtgta aagaggcaga
     3181 aaagttggtt tggtttgcag taacgctgca actagaaaat atatgcactt caggcttgtt
     3241 gaaacgacca agactctgtg acgttaattt gggtctttgt cctggcagtg cctctgccag
     3301 tcactgtcat cgttgtgtcc cccacaactg tcctcttgct agctcggccc agctttgtcc
     3361 ctggagcccg atgctacccc tgtcagacag aggctgcggc ctgggccaga gtcagggagt
     3421 agctgctgct tcacggcgtc tccactgtgc gattggcccg gagccccgaa gactcggagg
     3481 gagctgctca gggccggtga gcgcagccag aagccctggc cagtgaggag ctcacaggtc
     3541 ctccctggtg gtcccgccgc acctctgcat ctcctgggcg tcaccaggaa ggctctgaag
     3601 tcccgggctg ctctcagcac ttctcctgca gactgaagac tctggactca ttgctgattg
     3661 gaacaccagg aggaggttgg atttctgcca gtgggggatg tttctggagg cagctggtcc
     3721 cccacaccgc gtcctgctga gcctgccccc tggattggct gtaatttgcc tcgaagttca
     3781 gcagttcatc ttcatgggaa atttgctgag cccccaccag ggaaccggat gatgaaacag
     3841 ggatacctca cagcttggcc atttgaggca aaggcagctt cccgagctga tgctaaagaa
     3901 gacagacttt cccttcctcc cagcagcagc agtgcagagc ccgcctggag ggatgtgggg
     3961 gctgtgcagg gtgcagcgct caggtggatc ctgggaagca gcctctggat gctgagtgga
     4021 gggagccact gagcacagca aggcaccaaa gcccctggag aaaccgccag ggcgaggtgc
     4081 gaccatcatc aggatcaaag cagacggggc gtgggtgggg aaggggctct gggaccagac
     4141 cccccacact actgcgtctt tgtttctatc agtctttgta gaagcaggtg gtggtggaaa
     4201 ttccagcagg tgggtcccgc agaggccctg aggcctcact tttcggatct tctgtcccag
     4261 atcctgctcc ctccctgctg agcctggggt tcccctggca ttggccccag ccttctgaaa
     4321 gccggcgctg cagccagagg ccgcacgctg cactgtcgcg acgcagagag gcttctgtgc
     4381 aggctgggat cgggccccat gtctgtgctg tctagtttgt gttcaaaatg tcagaataaa
     4441 cacagaataa atgttaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
//