LOCUS       AAH17232.1              1443 aa    PRT              HUM 15-JUL-2006
DEFINITION  Homo sapiens cleavage and polyadenylation specific factor
            1, 160kDa protein.
ACCESSION   BC017232-1
PROTEIN_ID  AAH17232.1
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4487)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 4487)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (05-NOV-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Dec 9, 2003 this sequence version replaced BC017232.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D., McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 26 Row: n Column: 8
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 56676370.
FEATURES             Qualifiers
     source          /db_xref="H-InvDB:HIT000037892"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:15424 IMAGE:4300196"
                     /tissue_type="Pancreas, epithelioid carcinoma"
                     /clone_lib="NIH_MGC_42"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     protein         /gene="CPSF1"
                     /gene_synonym="CPSF160"
                     /gene_synonym="HSU37012"
                     /gene_synonym="P/cl.18"
                     /db_xref="GeneID:29894"
                     /db_xref="HGNC:HGNC:2324"
                     /db_xref="MIM:606027"
BEGIN
        1 MYAVYKQAHP PTGLEFSMYC NFFNNSERNL VVAGTSQLYV YRLNRDAEAL TKNDRSTEGK
       61 AHREKLELAA SFSFFGNVMS MASVQLAGAK RDALLLSFKD AKLSVVEYDP GTHDLKTLSL
      121 HYFEEPELRD GFVQNVHTPR VRVDPDGRCA AMLVYGTRLV VLPFRRESLA EEHEGLVGEG
      181 QRSSFLPSYI IDVRALDEKL LNIIDLQFLH GYYEPTLLIL FEPNQTWPGR VAVRQDTCSI
      241 VAISLNITQK VHPVIWSLTS LPFDCTQALA VPKPIGGVVV FAVNSLLYLN QSVPPYGVAL
      301 NSLTTGTTAF PLRTQEGVRI TLDCAQATFI SYDKMVISLK GGEIYVLTLI TDGMRSVRAF
      361 HFDKAAASVL TTSMVTMEPG YLFLGSRLGN SLLLKYTEKL QEPPASAVRE AADKEEPPSK
      421 KKRVDATAGW SAAGKSVPQD EVDEIEVYGS EAQSGTQLAT YSFEVCDSIL NIGPCANAAV
      481 GEPAFLSEEF QNSPEPDLEI VVCSGHGKNG ALSVLQKSIR PQVVTTFELP GCYDMWTVIA
      541 PVRKEEEDNP KGEGTEQEPS TTPEADDDGR RHGFLILSRE DSTMILQTGQ EIMELDTSGF
      601 ATQGPTVFAG NIGDNRYIVQ VSPLGIRLLE GVNQLHFIPV DLGAPIVQCA VADPYVVIMS
      661 AEGHVTMFLL KSDSYGGRHH RLALHKPPLH HQSKVITLCL YRDLSGMFTT ESRLGGARDE
      721 LGGRSGPEAE GLGSETSPTV DDEEEMLYGD SGSLFSPSKE EARRSSQPPA DRDPAPFRAE
      781 PTHWCLLVRE NGTMEIYQLP DWRLVFLVKN FPVGQRVLVD SSFGQPTTQG EARREEATRQ
      841 GELPLVKEVL LVALGSRQSR PYLLVHVDQE LLIYEAFPHD SQLGQGNLKV RFKKVPHNIN
      901 FREKKPKPSK KKAEGGGAEE GAGARGRVAR FRYFEDIYGY SGVFICGPSP HWLLVTGRGA
      961 LRLHPMAIDG PVDSFAPFHN VNCPRGFLYF NRQGELRISV LPAYLSYDAP WPVRKIPLRC
     1021 TAHYVAYHVE SKVYAVATST NTPCARIPRM TGEEKEFETI ERDERYIHPQ QEAFSIQLIS
     1081 PVSWEAIPNA RIELQEWEHV TCMKTVSLRS EETVSGLKGY VAAGTCLMQG EEVTCRGRIL
     1141 IMDVIEVVPE PGQPLTKNKF KVLYEKEQKG PVTALCHCNG HLVSAIGQKI FLWSLRASEL
     1201 TGMAFIDTQL YIHQMISVKN FILAADVMKS ISLLRYQEES KTLSLVSRDA KPLEVYSVDF
     1261 MVDNAQLGFL VSDRDRNLMV YMYLPEAKES FGGMRLLRRA DFHVGAHVNT FWRTPCRGAT
     1321 EGLSKKSVVW ENKHITWFAT LDGGIGLLLP MQEKTYRRLL MLQNALTTML PHHAGLNPRA
     1381 FRMLHVDRRT LQNAVRNVLD GELLNRYLYL STMERSELAK KIGTTPDIIL DDLLETDRVT
     1441 AHF
//