LOCUS BC039593 4490 bp mRNA linear HUM 15-JUL-2006 DEFINITION Homo sapiens ubiquitin specific peptidase 20, mRNA (cDNA clone MGC:48817 IMAGE:5588842), complete cds. ACCESSION BC039593 VERSION BC039593.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4490) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 4490) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (01-NOV-2002) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Invitrogen cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Institute for Systems Biology http://www.systemsbiology.org contact: amadan@systemsbiology.org Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 84 Row: i Column: 3 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 56713257. FEATURES Location/Qualifiers source 1..4490 /db_xref="H-InvDB:HIT000052220" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:48817 IMAGE:5588842" /tissue_type="Ovary, pooled from 3 adults" /clone_lib="NIH_MGC_125" /lab_host="DH10B" /note="Vector: pCMV-SPORT6" gene 1..4490 /gene="USP20" /gene_synonym="KIAA1003" /gene_synonym="LSFR3A" /gene_synonym="VDU2" /db_xref="GeneID:10868" /db_xref="HGNC:HGNC:12619" CDS 159..2900 /gene="USP20" /gene_synonym="KIAA1003" /gene_synonym="LSFR3A" /gene_synonym="VDU2" /codon_start=1 /product="ubiquitin specific peptidase 20" /protein_id="AAH39593.1" /db_xref="GeneID:10868" /db_xref="HGNC:HGNC:12619" /translation="MGDSRDLCPHLDSIGEVTKEDLLLKSKGTCQSCGVTGPNLWACL QVACPYVGCGESFADHSTIHAQAKKHNLTVNLTTFRLWCYACEKEVFLEQRLAAPLLG SSSKFSEQDSPPPSHPLKAVPIAVADEGESESEDDDLKPRGLTGMKNLGNSCYMNAAL QALSNCPPLTQFFLECGGLVRTDKKPALCKSYQKLVSEVWHKKRPSYVVPTSLSHGIK LVNPMFRGYAQQDTQEFLRCLMDQLHEELKEPVVATVALTEARDSDSSDTDEKREGDR SPSEDEFLSCDSSSDRGEGDGQGRGGGSSQAETELLIPDEAGRAISEKERMKDRKFSW GQQRTNSEQVDEDADVDTAMAALDQPAEAQPPSPRSSSPCRTPEPDNDAHLRSSSRPC SPVHHHEGHAKLSSSPPRASPVRMAPSYVLKKAQVLSAGSRRRKEQRYRSVISDIFDG SILSLVQCLTCDRVSTTVETFQDLSLPIPGKEDLAKLHSAIYQNVPAKPGACGDSYAA QGWLAFIVEYIRRFVVSCTPSWFWGPVVTLEDCLAAFFAADELKGDNMYSCERCKKLR NGVKYCKVLRLPEILCIHLKRFRHEVMYSFKINSHVSFPLEGLDLRPFLAKECTSQIT TYDLLSVICHHGTAGSGHYIAYCQNVINGQWYEFDDQYVTEVHETVVQNAEGYVLFYR KSSEEAMRERQQVVSLAAMREPSLLRFYVSREWLNKFNTFAEPGPITNQTFLCSHGGI PPHKYHYIDDLVVILPQNVWEHLYNRFGGGPAVNHLYVCSICQVEIEALAKRRRIEID TFIKLNKAFQAEESPGVIYCISMQWFREWEAFVKGKDNEPPGPIDNSRIAQVKGSGHV QLKQGADYGQISEETWTYLNSLYGGGPEIAIRQSVAQPLGPENLHGEQKIEAETRAV" BASE COUNT 948 a 1347 c 1341 g 854 t ORIGIN 1 caggcggcgg cggcgcagtt gcgagtgcag gctccttgcc agaggcctcc actcactcca 61 gacccctata gcccgtcgct gtcagctgtc aacaaaggat gcgaatgctg gccgcttcct 121 gtgggcttcg tgtcacccag aggtgagccc aggccaggat gggggactcc agggaccttt 181 gccctcacct tgactccata ggagaggtga ccaaagagga cttgctgctc aaatctaagg 241 gaacctgtca gtcgtgtggg gtcaccggac caaacctatg ggcctgtctg caggttgcct 301 gcccctatgt tggctgcgga gaatccttcg ctgaccacag caccattcat gcacaggcaa 361 aaaagcacaa cttgaccgtg aacctgacca cgttccgact gtggtgttac gcctgtgaga 421 aggaggtatt cctggagcag cggctggcag cccctctgct gggctcctct tccaagttct 481 ctgaacagga ctccccgcca ccctcccacc ctctgaaagc tgttcctatt gctgtggctg 541 atgaaggaga gtctgagtca gaggatgatg acctgaaacc tcgaggcctc acgggcatga 601 agaacctcgg gaactcctgc tacatgaacg ccgccctgca ggccctgtcc aattgcccgc 661 cgctgactca gttcttcttg gagtgtggcg gcctggtgcg cacagataag aagccagccc 721 tgtgcaagag ctaccagaag ctggtctctg aggtctggca taagaaacgg ccaagctacg 781 tggtccccac cagtctgtct catgggatca agttggtcaa cccaatgttc cgaggctatg 841 cccagcagga cacccaagag ttccttcgct gcctgatgga ccagctgcac gaggagctca 901 aggagccggt ggtggccacg gtggcgctga cggaggctcg ggactcagat tcgagtgaca 961 cggatgagaa acgggagggt gaccggagcc catcagaaga tgagttcttg tcctgtgact 1021 cgagcagtga ccggggtgag ggtgacgggc aggggcgtgg cgggggcagc tcgcaggccg 1081 agacggagct gctgatccca gatgaggcgg gccgagccat ctctgagaag gagcggatga 1141 aggaccgcaa gttctcctgg ggccagcagc gtacaaactc ggagcaagtg gacgaggacg 1201 ctgatgtgga cactgccatg gctgcccttg accagcccgc ggaggcccag cccccgtcac 1261 cacggtcctc cagcccctgc cggacgccag agccggacaa tgatgctcac ctacgcagct 1321 cctctcgccc ctgcagcccc gtccaccacc acgagggcca tgccaagctg tctagcagcc 1381 cccctcgtgc aagccccgtg aggatggcac cgtcgtacgt gctcaagaaa gcccaggtat 1441 tgagtgctgg cagccggagg cggaaggagc agcgctaccg cagcgtcatc tcagacatct 1501 ttgacggctc cattctcagc ctcgtgcagt gtctcacctg tgaccgggta tccaccacag 1561 tggaaacgtt ccaggactta tcactgccca ttcctggaaa ggaggacctg gccaagctcc 1621 attcagccat ctaccagaat gtgccggcca agccaggcgc ctgtggggac agctatgccg 1681 cccagggctg gctggccttc attgtggagt acatccgacg gtttgtggta tcctgtaccc 1741 ccagctggtt ttgggggcct gtcgtcaccc tggaagactg ccttgctgcc ttctttgccg 1801 ctgatgagtt aaagggtgac aacatgtaca gctgtgagcg gtgtaagaag ctgcggaacg 1861 gagtgaagta ctgcaaagtc ctgcggttgc ccgagatcct gtgcattcac ctaaagcgct 1921 ttcggcacga ggtgatgtac tcattcaaga tcaacagcca cgtctccttc cccctcgagg 1981 ggctcgacct gcgccccttc cttgccaagg agtgcacatc ccagatcacc acctacgacc 2041 tcctctcggt catctgccac cacggcacgg caggcagtgg gcactacatc gcctactgcc 2101 agaacgtgat caatgggcag tggtacgagt ttgatgacca gtacgtcaca gaagtccacg 2161 agacggtggt gcagaacgcc gagggctacg tactcttcta caggaagagc agcgaggagg 2221 ccatgcggga gcgacagcag gtggtgtccc tggccgccat gcgggagccc agcctgctgc 2281 ggttctacgt gtcccgcgag tggctcaaca agttcaacac cttcgcggag ccaggcccca 2341 tcaccaacca gaccttcctc tgctcccacg gaggcatccc gccccacaaa taccactaca 2401 tcgacgacct ggtggtcatc ctgccccaga acgtctggga gcacctgtac aacagattcg 2461 ggggtggccc cgccgtgaac cacctgtacg tgtgctccat ctgccaggtg gagatcgagg 2521 cactggccaa gcgcaggagg atcgagatcg acaccttcat caagttgaac aaggccttcc 2581 aggccgagga gtcgccgggc gtcatctact gcatcagcat gcagtggttc cgggagtggg 2641 aggcgttcgt caaggggaag gacaacgagc cccccgggcc cattgacaac agcaggattg 2701 cacaggtcaa aggaagcggc catgtccagc tgaagcaggg agctgactac gggcagattt 2761 cggaggagac ctggacctac ctgaacagcc tgtatggagg tggccccgag attgccatcc 2821 gccagagtgt ggcgcagccg ctgggcccag agaacctgca cggggagcag aagatcgaag 2881 ccgagacgcg ggccgtgtga tctgctgggc tagtctgtaa gtcgccccgg ctggtccctc 2941 catggcactc tgggtcctct cctcactctc cagagaccct cacatgtcct tttgaacatc 3001 caaagagcag gtccctgaaa gcaccttcct ggaggatgtg ggagggccct ggacatggcc 3061 cggccccact gctgagtgcc cgtgtcccca cagccccatg tgccccaccc cgcggaaggc 3121 gtgtttgtgc ccagaagaga ggccgggctg ctgcagaacc ccgccgtgta aagaggcaga 3181 aaagttggtt tggtttgcag taacgctgca actagaaaat atatgcactt caggcttgtt 3241 gaaacgacca agactctgtg acgttaattt gggtctttgt cctggcagtg cctctgccag 3301 tcactgtcat cgttgtgtcc cccacaactg tcctcttgct agctcggccc agctttgtcc 3361 ctggagcccg atgctacccc tgtcagacag aggctgcggc ctgggccaga gtcagggagt 3421 agctgctgct tcacggcgtc tccactgtgc gattggcccg gagccccgaa gactcggagg 3481 gagctgctca gggccggtga gcgcagccag aagccctggc cagtgaggag ctcacaggtc 3541 ctccctggtg gtcccgccgc acctctgcat ctcctgggcg tcaccaggaa ggctctgaag 3601 tcccgggctg ctctcagcac ttctcctgca gactgaagac tctggactca ttgctgattg 3661 gaacaccagg aggaggttgg atttctgcca gtgggggatg tttctggagg cagctggtcc 3721 cccacaccgc gtcctgctga gcctgccccc tggattggct gtaatttgcc tcgaagttca 3781 gcagttcatc ttcatgggaa atttgctgag cccccaccag ggaaccggat gatgaaacag 3841 ggatacctca cagcttggcc atttgaggca aaggcagctt cccgagctga tgctaaagaa 3901 gacagacttt cccttcctcc cagcagcagc agtgcagagc ccgcctggag ggatgtgggg 3961 gctgtgcagg gtgcagcgct caggtggatc ctgggaagca gcctctggat gctgagtgga 4021 gggagccact gagcacagca aggcaccaaa gcccctggag aaaccgccag ggcgaggtgc 4081 gaccatcatc aggatcaaag cagacggggc gtgggtgggg aaggggctct gggaccagac 4141 cccccacact actgcgtctt tgtttctatc agtctttgta gaagcaggtg gtggtggaaa 4201 ttccagcagg tgggtcccgc agaggccctg aggcctcact tttcggatct tctgtcccag 4261 atcctgctcc ctccctgctg agcctggggt tcccctggca ttggccccag ccttctgaaa 4321 gccggcgctg cagccagagg ccgcacgctg cactgtcgcg acgcagagag gcttctgtgc 4381 aggctgggat cgggccccat gtctgtgctg tctagtttgt gttcaaaatg tcagaataaa 4441 cacagaataa atgttaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa //