LOCUS BC039817 5357 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens UPF1 regulator of nonsense transcripts homolog
(yeast), mRNA (cDNA clone MGC:48687 IMAGE:5555509), complete cds.
ACCESSION BC039817
VERSION BC039817.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 5357)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 5357)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (12-NOV-2002) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Institute for Systems Biology
http://www.systemsbiology.org
contact: amadan@systemsbiology.org
Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 84 Row: g Column: 21
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 18375672.
FEATURES Location/Qualifiers
source 1..5357
/db_xref="H-InvDB:HIT000052253"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:48687 IMAGE:5555509"
/tissue_type="Uterus, leiomyosarcoma"
/clone_lib="NIH_MGC_71"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene 1..5357
/gene="UPF1"
/gene_synonym="HUPF1"
/gene_synonym="KIAA0221"
/gene_synonym="NORF1"
/gene_synonym="pNORF1"
/db_xref="GeneID:5976"
/db_xref="HGNC:HGNC:9962"
/db_xref="MIM:601430"
CDS 273..3629
/gene="UPF1"
/gene_synonym="HUPF1"
/gene_synonym="KIAA0221"
/gene_synonym="NORF1"
/gene_synonym="pNORF1"
/codon_start=1
/product="UPF1 regulator of nonsense transcripts homolog
(yeast)"
/protein_id="AAH39817.1"
/db_xref="GeneID:5976"
/db_xref="HGNC:HGNC:9962"
/db_xref="MIM:601430"
/translation="MSVEAYGPSSQTLTFLDTEEAELLGADTQGSEFEFTDFTLPSQT
QTPPGGPGGPGGGGAGGPGGAGAGAAAGQLDAQVGPEGILQNGAVDDSVAKTSQLLAE
LNFEEDEEDTYYTKDLPIHACSYCGIHDPACVVYCNTSKKWFCNGRGNTSGSHIVNHL
VRAKCKEVTLHKDGPLGETVLECYNCGCRNVFLLGFIPAKADSVVVLLCRQPCASQSS
LKDINWDSSQWQPLIQDRCFLSWLVKIPSEQEQLRARQITAQQINKLEELWKENPSAT
LEDLEKPGVDEEPQHVLLRYEDAYQYQNIFGPLVKLEADYDKKLKESQTQDNITVRWD
LGLNKKRIAYFTLPKTDSDMRLMQGDEICLRYKGDLAPLWKGIGHVIKVPDNYGDEIA
IELRSSVGAPVEVTHNFQVDFVWKSTSFDRMQSALKTFAVDETSVSGYIYHKLLGHEV
EDVIIKCQLPKRFTAQGLPDLNHSQVYAVKTVLQRPLSLIQGPPGTGKTVTSATIVYH
LARQGNGPVLVCAPSNIAVDQLTEKIHQTGLKVVRLCAKSREAIDSPVSFLALHNQIR
NMDSMPELQKLQQLKDETGELSSADEKRYRALKRTAERELLMNADVICCTCVGAGDPR
LAKMQFRSILIDESTQATEPECMVPVVLGAKQLILVGDHCQLGPVVMCKKAAKAGLSQ
SLFERLVVLGIRPIRLQVQYRMHPALSAFPSNIFYEGSLQNGVTAADRVKKGFDFQWP
QPDKPMFFYVTQGQEEIASSGTSYLNRTEAANVEKITTKLLKAGAKPDQIGIITPYEG
QRSYLVQYMQFSGSLHTKLYQEVEIASVDAFQGREKDFIILSCVRANEHQGIGFLNDP
RRLNVALTRARYGVIIVGNPKALSKQPLWNHLLNYYKEQKVLVEGPLNNLRESLMQFS
KPRKLVNTINPGARFMTTAMYDAREAIIPGSVYDRSSQGRPSSMYFQTHDQIGMISAG
PSHVAAMNIPIPFNLVMPPMPPPGYFGQANGPAAGRGTPKGKTGRGGRQKNRFGLPGP
SQTNLPNSQASQDVASQPFSQGALTQGYISMSQPSQMSQPGLSQPELSQDSYLGDEFK
SQIDVALSQDSTYQGERAYQHGGVTGLSQY"
BASE COUNT 1110 a 1566 c 1665 g 1016 t
ORIGIN
1 ggcgacggcg gcggtggcgg cagttcctgc tctaggctgc gagcggctgg cggcttcgag
61 gggagctgag gcgcggaggg gctcggcggc agcggcggcg gctcggcact gttacctctc
121 ggtccggctg gcgccggggc gggcggtttg gtcctttccg ggcgcgcggg ggcgacagcg
181 gcagcgaccc gaggcctgcg gcctaggcct cagcgcggcg gcgggctcga gtgcagcgcg
241 gaaccggccc gagggcccta cccggaggca ccatgagcgt ggaggcgtac gggcccagct
301 cgcagactct cactttcctg gacacggagg aggccgagct gcttggcgcc gacacacagg
361 gctccgagtt cgagttcacc gactttactc ttcctagcca gacgcagacg ccccccggcg
421 gccccggcgg cccgggcggt ggcggcgcgg gaggcccggg cggcgcgggc gcgggcgctg
481 cggcgggaca gctcgacgcg caggttgggc ccgaaggcat cctgcagaac ggggctgtgg
541 acgacagtgt agccaagacc agccagttgt tggctgagtt gaacttcgag gaagatgaag
601 aagacaccta ttacacgaag gacctcccca tacacgcctg cagttactgt ggaatacacg
661 atcctgcctg cgtggtttac tgtaatacca gcaagaagtg gttctgcaac ggacgtggaa
721 atacttctgg cagccacatt gtaaatcacc ttgtgagggc aaaatgcaaa gaggtgaccc
781 tgcacaagga cgggcccctg ggggagacag tcctggagtg ctacaactgc ggctgtcgca
841 acgtcttcct cctcggcttc atcccggcca aagctgactc agtggtggtg ctgctgtgca
901 ggcagccctg tgccagccag agcagcctca aggacatcaa ctgggacagc tcgcagtggc
961 agccgctgat ccaggaccgc tgcttcctgt cctggctggt caagatcccc tccgagcagg
1021 agcagctgcg ggcacgccag atcacggcac agcagatcaa caagctggag gagctgtgga
1081 aggaaaaccc ttctgccacg ctggaggacc tggagaagcc gggggtggac gaggagccgc
1141 agcatgtcct cctgcggtac gaggacgcct accagtacca gaacatattc gggcccctgg
1201 tcaagctgga ggccgactac gacaagaagc tgaaggagtc ccagactcaa gataacatca
1261 ctgtcaggtg ggacctgggc cttaacaaga agagaatcgc ctacttcact ttgcccaaga
1321 ctgactctga catgcggctc atgcaggggg atgagatatg cctgcggtac aaaggggacc
1381 ttgcgcccct gtggaaaggg atcggccacg tcatcaaggt ccctgataat tatggcgatg
1441 agatcgccat tgagctgcgg agcagcgtgg gtgcacctgt ggaggtgact cacaacttcc
1501 aggtggattt tgtgtggaag tcgacctcct ttgacaggat gcagagcgca ttgaaaacgt
1561 ttgccgtgga tgagacctcg gtgtctggct acatctacca caagctgttg ggccacgagg
1621 tggaggacgt aatcatcaag tgccagctgc ccaagcgctt cacggcgcag ggcctccccg
1681 acctcaacca ctcccaggtt tatgccgtga agactgtgct gcaaagacca ctgagcctga
1741 tccagggccc gccaggcacg gggaagacgg tgacgtcggc caccatcgtc taccacctgg
1801 cccggcaagg caacgggccg gtgctggtgt gtgctccgag caacatcgcc gtggaccagc
1861 taacggagaa gatccaccag acggggctaa aggtcgtgcg cctctgcgcc aagagccgtg
1921 aggccatcga ctccccggtg tcttttctgg ccctgcacaa ccagatcagg aacatggaca
1981 gcatgcctga gctgcagaag ctgcagcagc tgaaagacga gactggggag ctgtcgtctg
2041 ccgacgagaa gcggtaccgg gccttgaagc gcaccgcaga gagagagctg ctgatgaacg
2101 cagatgtcat ctgctgcaca tgtgtgggcg ccggtgaccc gaggctggcc aagatgcagt
2161 tccgctccat tttaatcgac gaaagcaccc aggccaccga gccggagtgc atggttcccg
2221 tggtcctcgg ggccaagcag ctgatccttg taggcgacca ctgccagctg ggcccagtgg
2281 tgatgtgcaa gaaggcggcc aaggccgggc tgtcacagtc gctcttcgag cgcctggtgg
2341 tgctgggcat ccggcccatc cgcctgcagg tccagtaccg gatgcaccct gcactcagcg
2401 ccttcccatc caacatcttc tacgagggct ccctccagaa tggtgtcact gcagcggatc
2461 gtgtgaagaa gggatttgac ttccagtggc cccaacccga taaaccgatg ttcttctacg
2521 tgacccaggg ccaagaggag attgccagct cgggcacctc ctacctgaac aggaccgagg
2581 ctgcgaacgt ggagaagatc accacgaagt tgctgaaggc aggcgccaag ccggaccaga
2641 ttggcatcat cacgccctac gagggccagc gctcctacct ggtgcagtac atgcagttca
2701 gcggctccct gcacaccaag ctctaccagg aggtggagat cgccagtgtg gacgcctttc
2761 agggacgcga gaaggacttc atcatcctgt cctgtgtgcg ggccaacgag caccaaggca
2821 ttggcttttt aaatgacccc aggcgtctga acgtggccct gaccagagca aggtatggcg
2881 tcatcattgt gggcaacccg aaggcactat caaagcagcc gctctggaac cacctgctga
2941 actactataa ggagcagaag gtgctggtgg aggggccgct caacaacctg cgtgagagcc
3001 tcatgcagtt cagcaagcca cggaagctgg tcaacactat caacccggga gcccgcttca
3061 tgaccacagc catgtatgat gcccgggagg ccatcatccc aggctccgtc tatgatcgga
3121 gcagccaggg ccggccttcc agcatgtact tccagaccca tgaccagatt ggcatgatca
3181 gtgccggccc tagccacgtg gctgccatga acattcccat ccccttcaac ctggtcatgc
3241 cacccatgcc accgcctggc tattttggac aagccaacgg gcctgctgca gggcgaggca
3301 ccccgaaagg caagactggt cgtgggggac gccagaagaa ccgctttggg cttcctggac
3361 ccagccagac taacctcccc aacagccaag ccagccagga tgtggcgtca cagcccttct
3421 ctcagggcgc cctgacgcag ggctacatct ccatgagcca gccttcccag atgagccagc
3481 ccggcctctc ccagccggag ctgtcccagg acagttacct tggtgacgag tttaaatcac
3541 aaatcgacgt ggcgctctca caggactcca cgtaccaggg agagcgggct taccagcatg
3601 gcggggtgac ggggctgtcc cagtattaaa aggtggcggc ggaagagcta agcaacgtgg
3661 cttagtccat cagcatctta ttctgggtaa taaaaaataa aaataaacgg atacctgttt
3721 tccactgcta aaactgaagc accactgtgt gagcaacagg aagggagagc gcacgaggga
3781 gaggagccga ggccgagcgc cccctgctgg cccgcggcgg cgaggagcag agggagcgga
3841 ggaggggccg gcccgcggga gccgcggcca ccaggaggcc ccgctccgtc ccatcggggc
3901 tgcggccagg gcggagggag gaagaccctc atctcagagt agccctttcc tctgttcttt
3961 tatttctttt tctctttgat tgaaagggga ctacgtctta gcaggaaaaa aaacttcgca
4021 tttctgtgcc cgagcaggct ccttgcaaag acagcagcgt gcggggcaga gccccgggag
4081 ggcgcgtctg tccacgccta ccggacgcgc cgaggtcgcg ctgcctgtgt tctccgaggg
4141 ccttcattta aagaaaataa gggtgttttg ggtttttctc tttgtttttt tcaagattct
4201 tttaaaggag tactgaagaa tactttccta agtttgtctc taaaatctta gcggtggacc
4261 tgggagattt gagaagcttc cagaaacagt ttaaacaagc cagcgctact ggagaagagg
4321 agcaacacct gtgccgcggc cggaggagtt ttgttgttgg ttttagcttc cagtggcttc
4381 tttctgcggg gcatcaggct gctggggtag ccgcccgccg agcctggaag ctgctcgttc
4441 tccgctggac tcagaagcca agctgcttcc cgcctagact cggcgcaggg ccccgcaccg
4501 gtgaggaagg tgcttttggc cccattgcga ggggccttgg ccaggactgg ccctgtggcc
4561 aggaggcgag aaggtggctg ttcccggatt gacggctttt tcccgggggc ctttggaaga
4621 tttggtggaa ggacaagagg gcctgtccct gtccccgtcc ccaggaggta ccgacagtcc
4681 ctgtgctggt tagacacgga gcgctgcaca ccgaaagccc aaattgggag ctctgcctgc
4741 cggcaacttt gctgatgggg tgattgctgc ttctgggggg taaggaaaca agttacagaa
4801 attaccgcgt tctgtgtgaa gggactgagg gtgtggtgtc attggcagag ggtcatttta
4861 ggagagctgc cccagcccct cgaacgcctg gcttggggtg tcattctgcc tggcggccag
4921 gcctccagct tcccctgccc cgggcctggg gctgtcactg gccctgatcc gaacacctcc
4981 agattccggc ttctacatgg gacagacggg gacgcacagg ccaccttcct tctggcaggg
5041 actcttattt attcccattg ctctagggct ttcggtttcc ccttcttccg gtaggccgcg
5101 tagaggcatg caccgggtag gtttccgcgg tgaccccgcg gcggcctgag ggacgctccc
5161 tgccccatcc cggctgttgg gctgggccgc tttgcctctg cttcgccctg tgctgtgttc
5221 tccagctttg tagcagcagc cttgacaaac ccaggcgcac tgtaccaagg caatgtaact
5281 tttgattttc ggtcaattta agttcttttg tcaccaaata ttaataaaca gttttgactt
5341 caaaaaaaaa aaaaaaa
//