LOCUS       BC038296                4578 bp    mRNA    linear   HUM 24-JUL-2006
DEFINITION  Homo sapiens Smg-5 homolog, nonsense mediated mRNA decay factor (C.
            elegans), mRNA (cDNA clone MGC:33341 IMAGE:4824487), complete cds.
ACCESSION   BC038296
VERSION     BC038296.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4578)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 4578)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (01-OCT-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Miklos Palkovits, M.D., Ph.D.
            cDNA Library Preparation: Michael J. Brownstein (NHGRI) & Shiraki
            Toshiyuki and Piero Carninci (RIKEN)
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
            Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 46 Row: m Column: 8
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 24308040.
FEATURES             Location/Qualifiers
     source          1..4578
                     /db_xref="H-InvDB:HIT000052017"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:33341 IMAGE:4824487"
                     /tissue_type="Testis"
                     /clone_lib="NIH_MGC_97"
                     /lab_host="DH10B"
                     /note="Vector: pBluescriptR"
     gene            1..4578
                     /gene="SMG5"
                     /gene_synonym="EST1B"
                     /gene_synonym="KIAA1089"
                     /gene_synonym="LPTS-RP1"
                     /gene_synonym="LPTSRP1"
                     /gene_synonym="RP11-54H19.7"
                     /gene_synonym="SMG-5"
                     /db_xref="GeneID:23381"
                     /db_xref="HGNC:HGNC:24644"
     CDS             151..3201
                     /gene="SMG5"
                     /gene_synonym="EST1B"
                     /gene_synonym="KIAA1089"
                     /gene_synonym="LPTS-RP1"
                     /gene_synonym="LPTSRP1"
                     /gene_synonym="RP11-54H19.7"
                     /gene_synonym="SMG-5"
                     /codon_start=1
                     /product="Smg-5 homolog, nonsense mediated mRNA decay
                     factor (C.
elegans)" /protein_id="AAH38296.1" /db_xref="GeneID:23381" /db_xref="HGNC:HGNC:24644" /translation="MSQGPPTGESSEPEAKVLHTKRLYRAVVEAVHRLDLILCNKTAY QEVFKPENISLRNKLRELCVKLMFLHPVDYGRKAEELLWRKVYYEVIQLIKTNKKHIH SRSTLECAYRTHLVAGIGFYQHLLLYIQSHYQLELQCCIDWTHVTDPLIGCKKPVSAS GKEMDWAQMACHRCLVYLGDLSRYQNELAGVDTELLAERFYYQALSVAPQIGMPFNQL GTLAGSKYYNVEAMYCYLRCIQSEVSFEGAYGNLKRLYDKAAKMYHQLKKCETRKLSP GKKRCKDIKRLLVNFMYLQSLLQPKSSSVDSELTSLCQSVLEDFNLCLFYLPSSPNLS LASEDEEEYESGYAFLPDLLIFQMVIICLMCVHSLERAGSKQYSAAIAFTLALFSHLV NHVNIRLQAELEEGENPVPAFQSDGTDEPESKEPVEKEEEPDPEPPPVTPQVGEGRKS RKFSRLSCLRRRRHPPKVGDDSDLSEGFESDSSHDSARASEGSDSGSDKSLEGGGTAF DAETDSEMNSQESRSDLEDMEEEEGTRSPTLEPPRGRSEAPDSLNGPLGPSEASIASN LQAMSTQMFQTKRCFRLAPTFSNLLLQPTTNPHTSASHRPCVNGDVDKPSEPASEEGS ESEGSESSGRSCRNERSIQEKLQVLMAEGLLPAVKVFLDWLRTNPDLIIVCAQSSQSL WNRLSVLLNLLPAAGELQESGLALCPEVQDLLEGCELPDLPSSLLLPEDMALRNLPPL RAAHRRFNFDTDRPLLSTLEESVVRICCIRSFGHFIARLQGSILQFNPEVGIFVSIAQ SEQESLLQQAQAQFRMAQEEARRNRLMRDMAQLRLQLEVSQLEGSLQQPKAQSAMSPY LVPDTQALCHHLPVIRQLATSGRFIVIIPRTVIDGLDLLKKEHPGARDGIRYLEAEFK KGNRYIRCQKEVGKSFERHKLKRQDADAWTLYKILDSCKQLTLAQGAGEEDPSGMVTI ITGLPLDNPSVLSGPMQAALQAAAHASVDIKDVLDFYKQWKEIG" BASE COUNT 1019 a 1308 c 1321 g 930 t ORIGIN 1 agcgggtcgt gggcagccgc ctcacagcga tggcggccga gcagggccgg tggcggcggc 61 ggctgcggct acggccggag acggcagtgt tggcggtagt ggtgggtggc aggggcctgt 121 gaccgggagc tgcccccgga cccgggcacc atgagccaag gcccccccac aggggagagc 181 agcgagcccg aagcaaaagt cctccacact aagcggcttt accgggctgt ggtggaggct 241 gtgcatcgac ttgacctcat cctttgcaac aaaactgctt atcaagaagt attcaaacca 301 gaaaacatta gcctgaggaa caagctgcgt gagctctgcg tcaagcttat gttcctgcac 361 ccagtggact atgggagaaa ggctgaggag ctgctgtgga gaaaggtata ctatgaagtt 421 atccagctta tcaagactaa caaaaagcac atccacagcc ggagcacttt ggaatgtgcc 481 tacaggacgc acctggttgc tggtattggc ttctaccagc atctccttct ctatatccag 541 tcccactacc agctggaact gcagtgctgc atcgactgga cccatgtcac tgaccccctc 601 ataggatgca agaagccagt gtctgcctca gggaaggaga tggattgggc acagatggca 661 tgtcaccgat gtctggtgta tctgggggat ttgtcccgat 
atcagaatga attagctggc 721 gtagataccg agctgctagc cgagagattt tactaccaag ccctgtcagt agctcctcag 781 attggaatgc ccttcaatca gctgggcacc ctggcaggca gcaagtacta taatgtggaa 841 gccatgtatt gctacctgcg ctgcatccag tcagaagtgt cctttgaggg agcctatggg 901 aacctcaagc ggctgtatga caaggcagcc aaaatgtacc accaactgaa gaagtgtgag 961 actcggaaac tgtctcctgg caaaaagcga tgtaaagaca ttaaaaggtt gctagtgaac 1021 tttatgtatc tgcaaagcct cctacagccc aaaagcagct ccgtggactc agagctgacc 1081 tcactttgcc agtcagtcct ggaggacttc aacctctgcc tcttctacct gccctcctca 1141 cccaacctca gcctggccag tgaggatgag gaggagtatg agagtggata tgctttcctc 1201 ccggaccttc tcatctttca aatggtcatc atctgcctta tgtgtgtgca cagcttggag 1261 agagcaggat ccaagcagta cagtgcagcc attgccttca ccctggccct cttttcccac 1321 ctcgtcaatc atgtcaacat acggctgcag gctgagctgg aagagggcga gaatcccgtc 1381 ccggcattcc agagtgatgg cacagatgaa ccagagtcca aggaacctgt ggagaaagag 1441 gaggagccag atcctgagcc tcctcctgta acaccccaag tgggtgaggg cagaaagagc 1501 cgtaagttct ctcgcctctc ctgtctccgc cgtcgccgcc acccacccaa agttggtgat 1561 gacagtgacc tgagtgaagg ctttgaatcg gactcaagcc atgactcagc ccgggccagt 1621 gagggctcag acagtggctc tgacaagagt cttgaaggtg ggggaacggc ctttgatgct 1681 gaaacagact cggaaatgaa tagccaggag tcccgatcag acttggaaga tatggaggaa 1741 gaggagggga cacggtcacc aaccctggag ccccctcggg gcagatcaga ggctcccgat 1801 tccctcaatg gcccactggg ccccagtgag gctagcattg ccagcaatct acaagccatg 1861 tccacccaga tgttccagac taagcgctgc ttccgactgg cccccacctt tagcaacctg 1921 ctcctccagc ccaccaccaa ccctcatacc tcggccagcc acaggccttg cgtcaatggg 1981 gatgtagaca agccttcaga gccagcctct gaggagggct ctgagtcgga ggggagtgag 2041 tccagtggac gctcctgtcg gaatgagcgc agcatccagg agaagcttca ggtcctgatg 2101 gccgaaggtc tgcttcctgc tgtgaaagtc ttcctggact ggcttcggac caaccccgac 2161 ctcatcatcg tgtgtgcgca gagctctcaa agtctgtgga accgcctgtc tgtgttgctg 2221 aatctgttgc ctgctgctgg tgaactccag gagtctggcc tggccttgtg tcctgaggtc 2281 caagatcttc ttgaaggttg tgaactgcct gacctcccct ctagccttct gctcccagag 2341 gacatggctc ttcgtaacct gcccccgctc cgagctgccc acagacgctt 
taactttgac 2401 acggatcggc ccctgctcag caccttagag gagtcagtgg tgcgcatctg ctgcatccgc 2461 agctttggtc atttcatcgc ccgcctgcaa ggcagcatcc tgcagttcaa cccagaggtt 2521 ggcatcttcg tcagcattgc ccagtctgag caggagagcc tgctgcagca ggcccaggca 2581 cagttccgaa tggcacagga ggaagctcgt cggaacaggc tcatgagaga catggctcag 2641 ctacgacttc agctcgaagt gtctcagctg gagggcagcc tgcagcagcc caaggcccag 2701 tcagccatgt ctccctacct cgtccctgac acccaggccc tctgccacca tctccctgtc 2761 atccgccaac tggccaccag tggccgcttc attgtcatca tcccaaggac agtgatcgat 2821 ggcctggatt tgctgaagaa ggaacaccca ggggcccggg atgggattcg gtacctggag 2881 gcagagttta aaaaaggaaa caggtacatt cgctgccaga aagaggtggg aaagagcttt 2941 gagcggcata agctgaagag gcaggatgca gatgcctgga ctctctataa gatcctagac 3001 agctgcaaac agctgactct ggcccagggg gcaggtgagg aggatccgag tggcatggtg 3061 accatcatca caggccttcc actggacaac cccagcgtgc tttcaggccc catgcaggca 3121 gccctgcagg ccgctgccca cgccagtgtg gacatcaagg atgttctgga cttctacaag 3181 cagtggaagg aaattggttg atactgaccc ccaggccctg cagtggggct gactccagat 3241 ctctcctgcc ctccctggca gccaggacca gcacctgtag tcaccccacc acacgcagac 3301 tcatgcacgc acacaggagg gaggcctagc tgctcagagg ctgcagggag ggcccaggag 3361 ccggctggga gggtggggtc cctttgttgc caagacgtta ggaaagcgag gaaagtgctt 3421 ggattaggag agtcttgtgg gcccctggcc agccttcctg cctcagctcc cctgctgtct 3481 ccaggggcag gtggtaggca tgggtacctg catttcactg gaatgggttc ttggatctct 3541 gaggggaagg aacagcaaaa gaggcccttc ttcctcaccc aagatgcagg gtggttgggg 3601 ccaggagttt ggaccctcta ggtcttgggg gaagagctgg gtaatacctg gtgtctgagt 3661 gattctctgc agacccttcc cctcctcaag gatcacccat cctcctttca gcccccttta 3721 tggggaccag gcagctctgg agccagccac aggggctgtt agagaagcaa ggcctggagt 3781 ggcctgcacc gagtagcagg gtcagggttc gtgtgctcct cctcctgctg caggggctgc 3841 acatcccatt gccccacttc tgctttgtgt ctccctctgt ctagcttcca gggcagggag 3901 caggccccac ctagggctgc aggcagtctg gcctgtgcca gcacggtctc ctgtgcccac 3961 cagccccaca ggtgctgtgc tttgtgctct tggctgctgt gctgggacag aatgggatgc 4021 caggaagaga agaaaggggg tgcagtctga ggccaccacc ccccttccta tctaagggag 
4081 ggctgaagac aaggggccgg cattcagtgg gcagcagaaa ggagaggctc cttgaagctg 4141 ctcagtcaga ggcccccgtc cctccttttg ccttccgcag gactgaagac ctgaaggggc 4201 tggcttttgg agtgttgagg tgaatatctg ggagcagaga tcatgaatag ctcagggcag 4261 tgaatggcgc accaagagca gggctgtgtg tgggaggctg cagccaggat tgcctcagct 4321 cctccccctc aggctgggag gatagcacag gctaggggct cggggtggag ggtctcagct 4381 ctgctgcccc caccccagta ctagcctagc ttcccaagct gtggcttaga ggatagttgg 4441 cttcctgcct ctctcctcta aaatagcaag tctgggaaat cctggggtga gtggagtcac 4501 cccactccca gttgctggca gagactgaga ctaaagcatc acttaataaa ccccccaagc 4561 ccaaaaaaaa aaaaaaaa //