LOCUS AAH17232.1 1443 aa PRT HUM 15-JUL-2006 DEFINITION Homo sapiens cleavage and polyadenylation specific factor 1, 160kDa protein. ACCESSION BC017232-1 PROTEIN_ID AAH17232.1 SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4487) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 4487) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (05-NOV-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Dec 9, 2003 this sequence version replaced BC017232.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 26 Row: n Column: 8 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 56676370. FEATURES Qualifiers source /db_xref="H-InvDB:HIT000037892" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:15424 IMAGE:4300196" /tissue_type="Pancreas, epithelioid carcinoma" /clone_lib="NIH_MGC_42" /lab_host="DH10B-R" /note="Vector: pOTB7" protein /gene="CPSF1" /gene_synonym="CPSF160" /gene_synonym="HSU37012" /gene_synonym="P/cl.18" /db_xref="GeneID:29894" /db_xref="HGNC:HGNC:2324" /db_xref="MIM:606027" BEGIN 1 MYAVYKQAHP PTGLEFSMYC NFFNNSERNL VVAGTSQLYV YRLNRDAEAL TKNDRSTEGK 61 AHREKLELAA SFSFFGNVMS MASVQLAGAK RDALLLSFKD AKLSVVEYDP GTHDLKTLSL 121 HYFEEPELRD GFVQNVHTPR VRVDPDGRCA AMLVYGTRLV VLPFRRESLA EEHEGLVGEG 181 QRSSFLPSYI IDVRALDEKL LNIIDLQFLH GYYEPTLLIL FEPNQTWPGR VAVRQDTCSI 241 VAISLNITQK VHPVIWSLTS LPFDCTQALA VPKPIGGVVV FAVNSLLYLN QSVPPYGVAL 301 NSLTTGTTAF PLRTQEGVRI TLDCAQATFI SYDKMVISLK GGEIYVLTLI TDGMRSVRAF 361 HFDKAAASVL TTSMVTMEPG YLFLGSRLGN SLLLKYTEKL QEPPASAVRE AADKEEPPSK 421 KKRVDATAGW SAAGKSVPQD EVDEIEVYGS EAQSGTQLAT YSFEVCDSIL NIGPCANAAV 481 GEPAFLSEEF QNSPEPDLEI VVCSGHGKNG ALSVLQKSIR PQVVTTFELP GCYDMWTVIA 541 PVRKEEEDNP KGEGTEQEPS TTPEADDDGR RHGFLILSRE DSTMILQTGQ EIMELDTSGF 601 ATQGPTVFAG NIGDNRYIVQ VSPLGIRLLE GVNQLHFIPV DLGAPIVQCA VADPYVVIMS 661 AEGHVTMFLL KSDSYGGRHH RLALHKPPLH HQSKVITLCL YRDLSGMFTT ESRLGGARDE 721 LGGRSGPEAE GLGSETSPTV DDEEEMLYGD SGSLFSPSKE EARRSSQPPA DRDPAPFRAE 781 PTHWCLLVRE NGTMEIYQLP DWRLVFLVKN FPVGQRVLVD SSFGQPTTQG EARREEATRQ 841 GELPLVKEVL LVALGSRQSR PYLLVHVDQE LLIYEAFPHD SQLGQGNLKV RFKKVPHNIN 901 FREKKPKPSK KKAEGGGAEE GAGARGRVAR FRYFEDIYGY SGVFICGPSP HWLLVTGRGA 961 LRLHPMAIDG PVDSFAPFHN VNCPRGFLYF NRQGELRISV LPAYLSYDAP WPVRKIPLRC 1021 TAHYVAYHVE SKVYAVATST NTPCARIPRM TGEEKEFETI ERDERYIHPQ QEAFSIQLIS 1081 PVSWEAIPNA RIELQEWEHV TCMKTVSLRS EETVSGLKGY VAAGTCLMQG EEVTCRGRIL 1141 IMDVIEVVPE PGQPLTKNKF KVLYEKEQKG PVTALCHCNG HLVSAIGQKI FLWSLRASEL 1201 TGMAFIDTQL YIHQMISVKN FILAADVMKS ISLLRYQEES KTLSLVSRDA KPLEVYSVDF 1261 MVDNAQLGFL VSDRDRNLMV YMYLPEAKES FGGMRLLRRA DFHVGAHVNT FWRTPCRGAT 1321 EGLSKKSVVW ENKHITWFAT LDGGIGLLLP MQEKTYRRLL MLQNALTTML PHHAGLNPRA 1381 FRMLHVDRRT LQNAVRNVLD GELLNRYLYL STMERSELAK KIGTTPDIIL DDLLETDRVT 1441 AHF //