LOCUS BC039817 5357 bp mRNA linear HUM 15-JUL-2006 DEFINITION Homo sapiens UPF1 regulator of nonsense transcripts homolog (yeast), mRNA (cDNA clone MGC:48687 IMAGE:5555509), complete cds. ACCESSION BC039817 VERSION BC039817.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 5357) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 5357) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (12-NOV-2002) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Institute for Systems Biology http://www.systemsbiology.org contact: amadan@systemsbiology.org Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 84 Row: g Column: 21 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 18375672. FEATURES Location/Qualifiers source 1..5357 /db_xref="H-InvDB:HIT000052253" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:48687 IMAGE:5555509" /tissue_type="Uterus, leiomyosarcoma" /clone_lib="NIH_MGC_71" /lab_host="DH10B" /note="Vector: pCMV-SPORT6" gene 1..5357 /gene="UPF1" /gene_synonym="HUPF1" /gene_synonym="KIAA0221" /gene_synonym="NORF1" /gene_synonym="pNORF1" /db_xref="GeneID:5976" /db_xref="HGNC:HGNC:9962" /db_xref="MIM:601430" CDS 273..3629 /gene="UPF1" /gene_synonym="HUPF1" /gene_synonym="KIAA0221" /gene_synonym="NORF1" /gene_synonym="pNORF1" /codon_start=1 /product="UPF1 regulator of nonsense transcripts homolog (yeast)" /protein_id="AAH39817.1" /db_xref="GeneID:5976" /db_xref="HGNC:HGNC:9962" /db_xref="MIM:601430" /translation="MSVEAYGPSSQTLTFLDTEEAELLGADTQGSEFEFTDFTLPSQT QTPPGGPGGPGGGGAGGPGGAGAGAAAGQLDAQVGPEGILQNGAVDDSVAKTSQLLAE LNFEEDEEDTYYTKDLPIHACSYCGIHDPACVVYCNTSKKWFCNGRGNTSGSHIVNHL VRAKCKEVTLHKDGPLGETVLECYNCGCRNVFLLGFIPAKADSVVVLLCRQPCASQSS LKDINWDSSQWQPLIQDRCFLSWLVKIPSEQEQLRARQITAQQINKLEELWKENPSAT LEDLEKPGVDEEPQHVLLRYEDAYQYQNIFGPLVKLEADYDKKLKESQTQDNITVRWD LGLNKKRIAYFTLPKTDSDMRLMQGDEICLRYKGDLAPLWKGIGHVIKVPDNYGDEIA IELRSSVGAPVEVTHNFQVDFVWKSTSFDRMQSALKTFAVDETSVSGYIYHKLLGHEV EDVIIKCQLPKRFTAQGLPDLNHSQVYAVKTVLQRPLSLIQGPPGTGKTVTSATIVYH LARQGNGPVLVCAPSNIAVDQLTEKIHQTGLKVVRLCAKSREAIDSPVSFLALHNQIR NMDSMPELQKLQQLKDETGELSSADEKRYRALKRTAERELLMNADVICCTCVGAGDPR LAKMQFRSILIDESTQATEPECMVPVVLGAKQLILVGDHCQLGPVVMCKKAAKAGLSQ SLFERLVVLGIRPIRLQVQYRMHPALSAFPSNIFYEGSLQNGVTAADRVKKGFDFQWP QPDKPMFFYVTQGQEEIASSGTSYLNRTEAANVEKITTKLLKAGAKPDQIGIITPYEG QRSYLVQYMQFSGSLHTKLYQEVEIASVDAFQGREKDFIILSCVRANEHQGIGFLNDP RRLNVALTRARYGVIIVGNPKALSKQPLWNHLLNYYKEQKVLVEGPLNNLRESLMQFS KPRKLVNTINPGARFMTTAMYDAREAIIPGSVYDRSSQGRPSSMYFQTHDQIGMISAG PSHVAAMNIPIPFNLVMPPMPPPGYFGQANGPAAGRGTPKGKTGRGGRQKNRFGLPGP SQTNLPNSQASQDVASQPFSQGALTQGYISMSQPSQMSQPGLSQPELSQDSYLGDEFK SQIDVALSQDSTYQGERAYQHGGVTGLSQY" BASE COUNT 1110 a 1566 c 1665 g 1016 t ORIGIN 1 ggcgacggcg gcggtggcgg cagttcctgc tctaggctgc gagcggctgg cggcttcgag 61 gggagctgag gcgcggaggg gctcggcggc agcggcggcg gctcggcact gttacctctc 121 ggtccggctg gcgccggggc gggcggtttg gtcctttccg ggcgcgcggg ggcgacagcg 181 gcagcgaccc gaggcctgcg gcctaggcct cagcgcggcg gcgggctcga gtgcagcgcg 241 gaaccggccc gagggcccta cccggaggca ccatgagcgt ggaggcgtac gggcccagct 301 cgcagactct cactttcctg gacacggagg aggccgagct gcttggcgcc gacacacagg 361 gctccgagtt cgagttcacc gactttactc ttcctagcca gacgcagacg ccccccggcg 421 gccccggcgg cccgggcggt ggcggcgcgg gaggcccggg cggcgcgggc gcgggcgctg 481 cggcgggaca gctcgacgcg caggttgggc ccgaaggcat cctgcagaac ggggctgtgg 541 acgacagtgt agccaagacc agccagttgt tggctgagtt gaacttcgag gaagatgaag 601 aagacaccta ttacacgaag gacctcccca tacacgcctg cagttactgt ggaatacacg 661 atcctgcctg cgtggtttac tgtaatacca gcaagaagtg gttctgcaac ggacgtggaa 721 atacttctgg cagccacatt gtaaatcacc ttgtgagggc aaaatgcaaa gaggtgaccc 781 tgcacaagga cgggcccctg ggggagacag tcctggagtg ctacaactgc ggctgtcgca 841 acgtcttcct cctcggcttc atcccggcca aagctgactc agtggtggtg ctgctgtgca 901 ggcagccctg tgccagccag agcagcctca aggacatcaa ctgggacagc tcgcagtggc 961 agccgctgat ccaggaccgc tgcttcctgt cctggctggt caagatcccc tccgagcagg 1021 agcagctgcg ggcacgccag atcacggcac agcagatcaa caagctggag gagctgtgga 1081 aggaaaaccc ttctgccacg ctggaggacc tggagaagcc gggggtggac gaggagccgc 1141 agcatgtcct cctgcggtac gaggacgcct accagtacca gaacatattc gggcccctgg 1201 tcaagctgga ggccgactac gacaagaagc tgaaggagtc ccagactcaa gataacatca 1261 ctgtcaggtg ggacctgggc cttaacaaga agagaatcgc ctacttcact ttgcccaaga 1321 ctgactctga catgcggctc atgcaggggg atgagatatg cctgcggtac aaaggggacc 1381 ttgcgcccct gtggaaaggg atcggccacg tcatcaaggt ccctgataat tatggcgatg 1441 agatcgccat tgagctgcgg agcagcgtgg gtgcacctgt ggaggtgact cacaacttcc 1501 aggtggattt tgtgtggaag tcgacctcct ttgacaggat gcagagcgca ttgaaaacgt 1561 ttgccgtgga tgagacctcg gtgtctggct acatctacca caagctgttg ggccacgagg 1621 tggaggacgt aatcatcaag tgccagctgc ccaagcgctt cacggcgcag ggcctccccg 1681 acctcaacca ctcccaggtt tatgccgtga agactgtgct gcaaagacca ctgagcctga 1741 tccagggccc gccaggcacg gggaagacgg tgacgtcggc caccatcgtc taccacctgg 1801 cccggcaagg caacgggccg gtgctggtgt gtgctccgag caacatcgcc gtggaccagc 1861 taacggagaa gatccaccag acggggctaa aggtcgtgcg cctctgcgcc aagagccgtg 1921 aggccatcga ctccccggtg tcttttctgg ccctgcacaa ccagatcagg aacatggaca 1981 gcatgcctga gctgcagaag ctgcagcagc tgaaagacga gactggggag ctgtcgtctg 2041 ccgacgagaa gcggtaccgg gccttgaagc gcaccgcaga gagagagctg ctgatgaacg 2101 cagatgtcat ctgctgcaca tgtgtgggcg ccggtgaccc gaggctggcc aagatgcagt 2161 tccgctccat tttaatcgac gaaagcaccc aggccaccga gccggagtgc atggttcccg 2221 tggtcctcgg ggccaagcag ctgatccttg taggcgacca ctgccagctg ggcccagtgg 2281 tgatgtgcaa gaaggcggcc aaggccgggc tgtcacagtc gctcttcgag cgcctggtgg 2341 tgctgggcat ccggcccatc cgcctgcagg tccagtaccg gatgcaccct gcactcagcg 2401 ccttcccatc caacatcttc tacgagggct ccctccagaa tggtgtcact gcagcggatc 2461 gtgtgaagaa gggatttgac ttccagtggc cccaacccga taaaccgatg ttcttctacg 2521 tgacccaggg ccaagaggag attgccagct cgggcacctc ctacctgaac aggaccgagg 2581 ctgcgaacgt ggagaagatc accacgaagt tgctgaaggc aggcgccaag ccggaccaga 2641 ttggcatcat cacgccctac gagggccagc gctcctacct ggtgcagtac atgcagttca 2701 gcggctccct gcacaccaag ctctaccagg aggtggagat cgccagtgtg gacgcctttc 2761 agggacgcga gaaggacttc atcatcctgt cctgtgtgcg ggccaacgag caccaaggca 2821 ttggcttttt aaatgacccc aggcgtctga acgtggccct gaccagagca aggtatggcg 2881 tcatcattgt gggcaacccg aaggcactat caaagcagcc gctctggaac cacctgctga 2941 actactataa ggagcagaag gtgctggtgg aggggccgct caacaacctg cgtgagagcc 3001 tcatgcagtt cagcaagcca cggaagctgg tcaacactat caacccggga gcccgcttca 3061 tgaccacagc catgtatgat gcccgggagg ccatcatccc aggctccgtc tatgatcgga 3121 gcagccaggg ccggccttcc agcatgtact tccagaccca tgaccagatt ggcatgatca 3181 gtgccggccc tagccacgtg gctgccatga acattcccat ccccttcaac ctggtcatgc 3241 cacccatgcc accgcctggc tattttggac aagccaacgg gcctgctgca gggcgaggca 3301 ccccgaaagg caagactggt cgtgggggac gccagaagaa ccgctttggg cttcctggac 3361 ccagccagac taacctcccc aacagccaag ccagccagga tgtggcgtca cagcccttct 3421 ctcagggcgc cctgacgcag ggctacatct ccatgagcca gccttcccag atgagccagc 3481 ccggcctctc ccagccggag ctgtcccagg acagttacct tggtgacgag tttaaatcac 3541 aaatcgacgt ggcgctctca caggactcca cgtaccaggg agagcgggct taccagcatg 3601 gcggggtgac ggggctgtcc cagtattaaa aggtggcggc ggaagagcta agcaacgtgg 3661 cttagtccat cagcatctta ttctgggtaa taaaaaataa aaataaacgg atacctgttt 3721 tccactgcta aaactgaagc accactgtgt gagcaacagg aagggagagc gcacgaggga 3781 gaggagccga ggccgagcgc cccctgctgg cccgcggcgg cgaggagcag agggagcgga 3841 ggaggggccg gcccgcggga gccgcggcca ccaggaggcc ccgctccgtc ccatcggggc 3901 tgcggccagg gcggagggag gaagaccctc atctcagagt agccctttcc tctgttcttt 3961 tatttctttt tctctttgat tgaaagggga ctacgtctta gcaggaaaaa aaacttcgca 4021 tttctgtgcc cgagcaggct ccttgcaaag acagcagcgt gcggggcaga gccccgggag 4081 ggcgcgtctg tccacgccta ccggacgcgc cgaggtcgcg ctgcctgtgt tctccgaggg 4141 ccttcattta aagaaaataa gggtgttttg ggtttttctc tttgtttttt tcaagattct 4201 tttaaaggag tactgaagaa tactttccta agtttgtctc taaaatctta gcggtggacc 4261 tgggagattt gagaagcttc cagaaacagt ttaaacaagc cagcgctact ggagaagagg 4321 agcaacacct gtgccgcggc cggaggagtt ttgttgttgg ttttagcttc cagtggcttc 4381 tttctgcggg gcatcaggct gctggggtag ccgcccgccg agcctggaag ctgctcgttc 4441 tccgctggac tcagaagcca agctgcttcc cgcctagact cggcgcaggg ccccgcaccg 4501 gtgaggaagg tgcttttggc cccattgcga ggggccttgg ccaggactgg ccctgtggcc 4561 aggaggcgag aaggtggctg ttcccggatt gacggctttt tcccgggggc ctttggaaga 4621 tttggtggaa ggacaagagg gcctgtccct gtccccgtcc ccaggaggta ccgacagtcc 4681 ctgtgctggt tagacacgga gcgctgcaca ccgaaagccc aaattgggag ctctgcctgc 4741 cggcaacttt gctgatgggg tgattgctgc ttctgggggg taaggaaaca agttacagaa 4801 attaccgcgt tctgtgtgaa gggactgagg gtgtggtgtc attggcagag ggtcatttta 4861 ggagagctgc cccagcccct cgaacgcctg gcttggggtg tcattctgcc tggcggccag 4921 gcctccagct tcccctgccc cgggcctggg gctgtcactg gccctgatcc gaacacctcc 4981 agattccggc ttctacatgg gacagacggg gacgcacagg ccaccttcct tctggcaggg 5041 actcttattt attcccattg ctctagggct ttcggtttcc ccttcttccg gtaggccgcg 5101 tagaggcatg caccgggtag gtttccgcgg tgaccccgcg gcggcctgag ggacgctccc 5161 tgccccatcc cggctgttgg gctgggccgc tttgcctctg cttcgccctg tgctgtgttc 5221 tccagctttg tagcagcagc cttgacaaac ccaggcgcac tgtaccaagg caatgtaact 5281 tttgattttc ggtcaattta agttcttttg tcaccaaata ttaataaaca gttttgactt 5341 caaaaaaaaa aaaaaaa //