LOCUS BC069237 3244 bp mRNA linear HUM 27-APR-2004 DEFINITION Homo sapiens ubiquilin 2, mRNA (cDNA clone MGC:78469 IMAGE:4543266), complete cds. ACCESSION BC069237 VERSION BC069237.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3244) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 3244) AUTHORS Strausberg,R. TITLE Direct Submission JOURNAL Submitted (26-APR-2004) National Institutes of Health, Mammalian Gene Collection (MGC), Cancer Genomics Office, National Cancer Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Genome Sequence Centre, BC Cancer Agency, Vancouver, BC, Canada info@bcgsc.bc.ca Steve Jones, Sarah Barber, Mabel Brown-John, Yaron Butterfield, Andy Chan, Steve S. Chand, William Chow, Alison Cloutier, Ruth Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao, Kim MacDonald, Amara Masson, Mike R. Mayo, Josh Moran, Ryan Morin, Teika Olson, Diana Palmquist, Anca Petrescu, Anna Liisa Prahbu, Parvaneh Saeedi, JR Santos, Angelique Schnerch, Ursula Skalska, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang, Jacquie Schein, Asim Siddiqui, Rob Holt, Marco Marra. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 54 Row: c Column: 4 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 16753206. FEATURES Location/Qualifiers source 1..3244 /db_xref="H-InvDB:HIT000263391" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:78469 IMAGE:4543266" /tissue_type="Placenta, choriocarcinoma" /clone_lib="NIH_MGC_21" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..3244 /gene="UBQLN2" /gene_synonym="CHAP1" /gene_synonym="CHAP1/DSK2" /gene_synonym="Dsk2" /gene_synonym="HRIHFB2157" /gene_synonym="LIC-2" /gene_synonym="N4BP4" /gene_synonym="PLIC-2" /gene_synonym="PLIC2" /db_xref="GeneID:29978" /db_xref="MIM:300264" CDS 153..2027 /gene="UBQLN2" /gene_synonym="CHAP1" /gene_synonym="CHAP1/DSK2" /gene_synonym="Dsk2" /gene_synonym="HRIHFB2157" /gene_synonym="LIC-2" /gene_synonym="N4BP4" /gene_synonym="PLIC-2" /gene_synonym="PLIC2" /codon_start=1 /product="UBQLN2 protein" /protein_id="AAH69237.1" /db_xref="GeneID:29978" /db_xref="MIM:300264" /translation="MAENGESSGPPRPSRGPAAAQGSAAAPAEPKIIKVTVKTPKEKE EFAVPENSSVQQFKEAISKRFKSQTDQLVLIFAGKILKDQDTLIQHGIHDGLTVHLVI KSQNRPQGQSTQPSNAAGTNTTSASTPRSNSTPISTNSNPFGLGSLGGLAGLSSLGLS STNFSELQSQMQQQLMASPEMMIQIMENPFVQSMLSNPDLMRQLIMANPQMQQLIQRN PEISHLLNNPDIMRQTLEIARNPAMMQEMMRNQDLALSNLESIPGGYNALRRMYTDIQ EPMLNAAQEQFGGNPFASVGSSSSSGEGTQPSRTENRDPLPNPWAPPPATQSSATTST TTSTGSGSGNSSSNATGNTVAAANYVASIFSTPGMQSLLQQITENPQLIQNMLSAPYM RSMMQSLSQNPDLAAQMMLNSPLFTANPQLQEQMRPQLPAFLQQMQNPDTLSAMSNPR AMQALMQIQQGLQTLATEAPGLIPSFTPGVGVGVLGTAIGPVGPVTPIGPIGPIVPFT PIGPIGPIGPTGPAAPPGSTGSGGPTGPTVSSAAPSETTSPTSESGPNQQFIQQMVQA LAGANAPQLPNPEVRFQQQLEQLNAMGFLNREANLQALIATGGDINAAIERLLGSQPS " misc_feature 285..461 /gene="UBQLN2" /gene_synonym="CHAP1" /gene_synonym="CHAP1/DSK2" /gene_synonym="Dsk2" /gene_synonym="HRIHFB2157" /gene_synonym="LIC-2" /gene_synonym="N4BP4" /gene_synonym="PLIC-2" /gene_synonym="PLIC2" /note="ubiquitin; Region: Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1" /db_xref="CDD:pfam00240" misc_feature 1899..2012 /gene="UBQLN2" /gene_synonym="CHAP1" /gene_synonym="CHAP1/DSK2" /gene_synonym="Dsk2" /gene_synonym="HRIHFB2157" /gene_synonym="LIC-2" /gene_synonym="N4BP4" /gene_synonym="PLIC-2" /gene_synonym="PLIC2" /note="UBA; Region: Ubiquitin Associated domain. The UBA domain is a commonly occurring sequence motif in some members of the ubiquitination pathway, UV excision repair proteins, and certain protein kinases. Although its specific role is so far unknown, it has been suggested that UBA domains are involved in conferring protein target specificity. The domain, a compact three helix bundle, has a conserved GFP-loop and the proline is thought to be critical for binding. The UBA domain is distinct from the conserved three helical domain seen in the N-terminus of EF-TS and eukaryotic NAC proteins" /db_xref="CDD:cd00194" BASE COUNT 879 a 831 c 698 g 836 t ORIGIN 1 agagttgctg ggagtgcgcg cggtcggatc acaaggcggc ggcggaggag gcccagcccg 61 ctgcggcggt gcctccttcc ttcctccttc cctcgcgctc tctctttcgc ccgcccgcgc 121 cttccctgcc cgcctgcgtc accgcggccg ccatggctga gaatggcgag agcagcggcc 181 ccccgcgccc ctcccgcggc cctgctgcgg cccaaggctc ggctgctgcc ccggctgagc 241 ctaaaatcat caaagtcacg gtgaagactc ccaaagagaa agaggagttc gcggtgcccg 301 agaacagctc ggttcagcag tttaaggaag cgatttcgaa acgcttcaaa tcccaaaccg 361 atcagctagt gctgattttt gccggaaaaa tcttaaaaga tcaagatacc ttgatccagc 421 atggcatcca tgatgggctg actgttcacc ttgtcatcaa aagccagaac cgacctcagg 481 gccagtccac gcagcctagc aatgccgcgg gaactaacac tacctcggcg tcgactccca 541 ggagtaactc cacacctatt tccacaaata gcaacccgtt tgggttgggg agcctgggag 601 gacttgcagg ccttagcagc ctgggcttga gctcgaccaa cttctctgag ctccagagcc 661 agatgcagca gcagcttatg gccagccctg agatgatgat ccaaataatg gaaaatccct 721 ttgttcagag catgctttcg aatcccgatc tgatgaggca gctcattatg gctaatccac 781 agatgcagca attgattcag agaaacccag aaatcagtca cctgctcaac aacccagaca 841 taatgaggca gacactcgaa attgccagga atccagccat gatgcaagag atgatgagaa 901 atcaagacct ggctcttagc aatctagaaa gcatcccagg tggctataat gctttacggc 961 gcatgtacac tgacattcaa gagccgatgc tgaatgccgc acaagagcag tttgggggta 1021 atccatttgc ctccgtgggg agtagttcct cctctgggga aggtacgcag ccttcccgca 1081 cagaaaatcg cgatccacta cccaatccat gggcaccacc gccagctacc cagagttctg 1141 caactaccag cacgaccaca agcactggta gtgggtctgg caatagttcc agcaatgcta 1201 ctgggaacac cgttgctgcc gctaattatg tcgccagcat ctttagtacc ccaggcatgc 1261 agagcctgct gcaacagata actgaaaacc cccagctgat tcagaatatg ctgtcggcgc 1321 cctacatgag aagcatgatg cagtcgctga gccagaatcc agatttggct gcacagatga 1381 tgctgaatag cccgctgttt actgcaaatc ctcagctgca ggagcagatg cggccacagc 1441 tcccagcctt cctgcagcag atgcagaatc cagacacact atcagccatg tcaaacccaa 1501 gagcaatgca ggctttaatg cagatccagc aggggctaca gacattagcc actgaagcac 1561 ctggcctgat tccgagcttc actccaggtg tgggggtggg ggtgctggga accgctatag 1621 gccctgtagg cccagtcacc cccataggcc ccataggccc tatagtccct tttaccccca 1681 taggccccat tgggcccata ggacccactg gccctgcagc cccccctggc tccaccggct 1741 ctggtggccc cacggggcct actgtgtcca gcgctgcacc tagtgaaacc acgagtccta 1801 catcagaatc tggacccaac cagcagttca ttcagcaaat ggtgcaggcc ctggctggag 1861 caaatgctcc acagctgccg aatccagaag tcagatttca gcaacaactg gaacagctca 1921 acgcaatggg gttcttaaac cgtgaagcaa acttgcaggc cctaatagca acaggaggcg 1981 acatcaatgc agccattgaa aggctgctgg gctcccagcc atcgtaatca catttctgta 2041 cctggaaaaa aaatgtatct tatttttgat aatggctctt aaatctttaa acacacacac 2101 aaaatcgttc tttactttca ttttgattct tttaaatctg tctagttgta agtctaatat 2161 gatgcatttt aagatggagt ccctccctcc tacttccctc actccctttc tcctttgctt 2221 atttttccta ccttcccttc ctcttgtctc cccactccct ccctctttgt ttccttcctt 2281 ccttatttcc tttagtttcc ttccttagcc gttttgagtg gtgggaatca atgctgtttc 2341 actcaaaagt gttgcatgca aacacttctc tttattctgc atttattgtg atttttggaa 2401 acaggtatca accttcacag ttgggtgaac aagtgttgtc ctacagatgt ccaatttatt 2461 tgcattttta aacattagcc tatgatagta atttaatgta gaatgaagat attaaaaaca 2521 gaagcaaatt atttgaagct ctctaatttg tggtacgata ttgcttattg tgactttggc 2581 atgtattttt gctagcaaaa tgctgtaaga tttataccat tgatcttttt tgctatattt 2641 gtatacagta cagtaagcac aattggcact gtacatctaa aaatattaca gtagaatctg 2701 agtgtaatat gtgtaaccaa aatgagaaag aatacaagaa atgtttctgg agctagttat 2761 gtctcacaat tttgtagaat cttacagcat ctttgataaa cttctcagtg aaaatgttgg 2821 ctaggcaagt tcagttaaaa tatagtagaa atgtttatcc tggtatctct aagtatacat 2881 ttaattgtac agaaaattta cagtgtaaca ttgtgtcaac atttgcagat tgactgtata 2941 tgaccttaat ctttgtgcag cctgaaggat cagtgtagta atgccaggaa agtgcttttt 3001 acctaagact tccttctcag cttctcccat aaagagaccc taatatgcat tttgatttgt 3061 aattggaaat gtaactttca ctgaaagtgt catgtgatgt ttgcattact tttaactgct 3121 atgtataaag gaaagtgtgt cttttgactt catcagttat ttctcttgtg cacagagaaa 3181 aatgcattaa aaatgactaa aaaaaataaa aaattaaaaa atggaaaaaa aaaaaaaaaa 3241 aaaa //