LOCUS AAH39817.1 1118 aa PRT HUM 15-JUL-2006
DEFINITION Homo sapiens UPF1 regulator of nonsense transcripts homolog
(yeast) protein.
ACCESSION BC039817-1
PROTEIN_ID AAH39817.1
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 5357)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 5357)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (12-NOV-2002) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Institute for Systems Biology
http://www.systemsbiology.org
contact: amadan@systemsbiology.org
Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 84 Row: g Column: 21
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 18375672.
FEATURES Qualifiers
source /db_xref="H-InvDB:HIT000052253"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:48687 IMAGE:5555509"
/tissue_type="Uterus, leiomyosarcoma"
/clone_lib="NIH_MGC_71"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
protein /gene="UPF1"
/gene_synonym="HUPF1"
/gene_synonym="KIAA0221"
/gene_synonym="NORF1"
/gene_synonym="pNORF1"
/db_xref="GeneID:5976"
/db_xref="HGNC:HGNC:9962"
/db_xref="MIM:601430"
BEGIN
1 MSVEAYGPSS QTLTFLDTEE AELLGADTQG SEFEFTDFTL PSQTQTPPGG PGGPGGGGAG
61 GPGGAGAGAA AGQLDAQVGP EGILQNGAVD DSVAKTSQLL AELNFEEDEE DTYYTKDLPI
121 HACSYCGIHD PACVVYCNTS KKWFCNGRGN TSGSHIVNHL VRAKCKEVTL HKDGPLGETV
181 LECYNCGCRN VFLLGFIPAK ADSVVVLLCR QPCASQSSLK DINWDSSQWQ PLIQDRCFLS
241 WLVKIPSEQE QLRARQITAQ QINKLEELWK ENPSATLEDL EKPGVDEEPQ HVLLRYEDAY
301 QYQNIFGPLV KLEADYDKKL KESQTQDNIT VRWDLGLNKK RIAYFTLPKT DSDMRLMQGD
361 EICLRYKGDL APLWKGIGHV IKVPDNYGDE IAIELRSSVG APVEVTHNFQ VDFVWKSTSF
421 DRMQSALKTF AVDETSVSGY IYHKLLGHEV EDVIIKCQLP KRFTAQGLPD LNHSQVYAVK
481 TVLQRPLSLI QGPPGTGKTV TSATIVYHLA RQGNGPVLVC APSNIAVDQL TEKIHQTGLK
541 VVRLCAKSRE AIDSPVSFLA LHNQIRNMDS MPELQKLQQL KDETGELSSA DEKRYRALKR
601 TAERELLMNA DVICCTCVGA GDPRLAKMQF RSILIDESTQ ATEPECMVPV VLGAKQLILV
661 GDHCQLGPVV MCKKAAKAGL SQSLFERLVV LGIRPIRLQV QYRMHPALSA FPSNIFYEGS
721 LQNGVTAADR VKKGFDFQWP QPDKPMFFYV TQGQEEIASS GTSYLNRTEA ANVEKITTKL
781 LKAGAKPDQI GIITPYEGQR SYLVQYMQFS GSLHTKLYQE VEIASVDAFQ GREKDFIILS
841 CVRANEHQGI GFLNDPRRLN VALTRARYGV IIVGNPKALS KQPLWNHLLN YYKEQKVLVE
901 GPLNNLRESL MQFSKPRKLV NTINPGARFM TTAMYDAREA IIPGSVYDRS SQGRPSSMYF
961 QTHDQIGMIS AGPSHVAAMN IPIPFNLVMP PMPPPGYFGQ ANGPAAGRGT PKGKTGRGGR
1021 QKNRFGLPGP SQTNLPNSQA SQDVASQPFS QGALTQGYIS MSQPSQMSQP GLSQPELSQD
1081 SYLGDEFKSQ IDVALSQDST YQGERAYQHG GVTGLSQY
//