LOCUS BC017232 4487 bp mRNA linear HUM 15-JUL-2006 DEFINITION Homo sapiens cleavage and polyadenylation specific factor 1, 160kDa, mRNA (cDNA clone MGC:15424 IMAGE:4300196), complete cds. ACCESSION BC017232 VERSION BC017232.2 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4487) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 4487) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (05-NOV-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Dec 9, 2003 this sequence version replaced BC017232.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 26 Row: n Column: 8 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 56676370. FEATURES Location/Qualifiers source 1..4487 /db_xref="H-InvDB:HIT000037892" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:15424 IMAGE:4300196" /tissue_type="Pancreas, epithelioid carcinoma" /clone_lib="NIH_MGC_42" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..4487 /gene="CPSF1" /gene_synonym="CPSF160" /gene_synonym="HSU37012" /gene_synonym="P/cl.18" /db_xref="GeneID:29894" /db_xref="HGNC:HGNC:2324" /db_xref="MIM:606027" CDS 46..4377 /gene="CPSF1" /gene_synonym="CPSF160" /gene_synonym="HSU37012" /gene_synonym="P/cl.18" /codon_start=1 /product="cleavage and polyadenylation specific factor 1, 160kDa" /protein_id="AAH17232.1" /db_xref="GeneID:29894" /db_xref="HGNC:HGNC:2324" /db_xref="MIM:606027" /translation="MYAVYKQAHPPTGLEFSMYCNFFNNSERNLVVAGTSQLYVYRLN RDAEALTKNDRSTEGKAHREKLELAASFSFFGNVMSMASVQLAGAKRDALLLSFKDAK LSVVEYDPGTHDLKTLSLHYFEEPELRDGFVQNVHTPRVRVDPDGRCAAMLVYGTRLV VLPFRRESLAEEHEGLVGEGQRSSFLPSYIIDVRALDEKLLNIIDLQFLHGYYEPTLL ILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIG GVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLRTQEGVRITLDCAQATFISYDK MVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSL LLKYTEKLQEPPASAVREAADKEEPPSKKKRVDATAGWSAAGKSVPQDEVDEIEVYGS EAQSGTQLATYSFEVCDSILNIGPCANAAVGEPAFLSEEFQNSPEPDLEIVVCSGHGK NGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEDNPKGEGTEQEPSTTPEAD DDGRRHGFLILSREDSTMILQTGQEIMELDTSGFATQGPTVFAGNIGDNRYIVQVSPL GIRLLEGVNQLHFIPVDLGAPIVQCAVADPYVVIMSAEGHVTMFLLKSDSYGGRHHRL ALHKPPLHHQSKVITLCLYRDLSGMFTTESRLGGARDELGGRSGPEAEGLGSETSPTV DDEEEMLYGDSGSLFSPSKEEARRSSQPPADRDPAPFRAEPTHWCLLVRENGTMEIYQ LPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEARREEATRQGELPLVKEVLLVALGS RQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREKKPKPSKKKAE GGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMAIDGPV DSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVE SKVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSWEAIP NARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIE VVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMA FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMV DNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGAT EGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNP RAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDDLLET DRVTAHF" BASE COUNT 903 a 1412 c 1367 g 805 t ORIGIN 1 cggttcctct cgagtcggct ccaactgcca gcccgggttg gcgccatgta cgccgtgtac 61 aaacaggcgc atccgcccac cggtctggag ttctccatgt actgcaactt cttcaacaac 121 agcgagcgca acctggtagt ggccgggacc tcgcagctct acgtgtaccg cctcaaccgc 181 gacgccgagg ctctgaccaa gaatgacagg agcacagagg ggaaggccca ccgggagaag 241 ctcgagcttg ctgcctcctt ctccttcttt ggcaacgtca tgtccatggc cagcgtgcag 301 ctggcaggag ccaagcggga tgccctgctc ctaagcttca aggatgccaa gctgtctgtg 361 gtggagtacg acccgggcac ccatgacctg aagaccctgt cactgcacta ctttgaggag 421 cctgagcttc gggacgggtt tgtgcagaat gtacacacgc cgcgagtgcg ggtggacccc 481 gacgggcgct gtgcagccat gcttgtctac ggcacgcggc tggtggtcct gcccttccgc 541 agggagagcc tggctgagga gcacgagggg ctcgtgggtg aggggcagag gtccagcttc 601 ctgcccagct acatcatcga cgtgcgggcc ctagacgaga agctgctcaa catcatcgac 661 ctgcagttcc tgcatggcta ctacgagcct accctcctca tcctgtttga gcccaaccag 721 acctggcctg ggcgcgtggc cgtgcggcag gacacgtgct ccattgtggc catctcactg 781 aacatcacgc agaaggtgca ccccgtcatc tggtccctca ccagcctgcc ctttgactgc 841 acccaggctc tggctgtgcc caagcccata ggtggggtgg tggtgtttgc cgtcaactcg 901 ctgttgtacc tgaaccagag cgtccccccg tatggcgtgg ctctcaacag cctcaccaca 961 ggaaccacgg ctttcccgct tcgcacccag gagggtgtgc ggatcaccct ggactgcgcc 1021 caggccacct tcatctccta cgacaagatg gtcatctccc tcaagggcgg cgagatctac 1081 gtgctgaccc tcatcaccga cggcatgcgc agtgtccgag cgttccactt tgacaaggcg 1141 gccgccagcg tcctcaccac cagcatggtc accatggagc ccgggtacct gttcctgggt 1201 tctcgcctgg gcaattccct cctcctcaag tacacggaga agctgcagga gcccccggcc 1261 agtgctgtcc gtgaggctgc cgacaaggaa gagcctccct caaagaagaa gcgagtggat 1321 gcgacggccg gctggtcagc tgcgggtaag tcggtgccgc aggatgaggt ggacgagatt 1381 gaagtgtacg gcagcgaggc ccagtcggga acacagctgg ccacctactc ctttgaggtg 1441 tgtgacagca tcctgaacat tggaccctgt gccaatgccg ccgtgggcga gcctgccttc 1501 ctctctgaag agtttcagaa cagccccgag ccggacctgg agattgtggt ttgctccggc 1561 cacgggaaga acggggcttt gtcggtgctg cagaagagca tccggcccca ggtggtgaca 1621 acctttgagc ttcccggctg ctatgacatg tggacagtca tcgccccggt gcgtaaggag 1681 gaggaggaca atcccaaggg ggagggcaca gagcaggaac ccagcaccac ccctgaagca 1741 gacgacgacg gccgcagaca cggattcctg attctgagcc gggaagactc caccatgatc 1801 ctgcagacgg ggcaggagat catggagctg gacaccagtg gcttcgccac tcagggcccc 1861 acggtctttg ctgggaacat cggggacaac cgctacattg tccaagtgtc accactgggc 1921 atccgcctgc tggaaggagt gaatcagctg cacttcatcc ccgtggacct gggcgccccc 1981 atcgtgcagt gcgccgtggc cgacccctat gtggtcatca tgagtgccga gggccacgtc 2041 accatgttcc tgctgaagag tgactcctac ggtggccgcc accaccgcct ggcgctgcac 2101 aagcccccgc tgcaccatca gtccaaggtg attacgctgt gcctgtaccg agacctcagc 2161 ggcatgttca ccactgagag ccgcctgggt ggggcccgtg acgagctcgg gggccgcagt 2221 ggcccggagg ccgagggcct gggctcagag actagcccca cagtggatga cgaggaggag 2281 atgctgtatg gggattcggg ctccctcttc agccccagca aggaggaggc ccgaagaagc 2341 agccagcccc ctgctgaccg ggaccctgca cccttccggg cagagcctac ccactggtgc 2401 ctgctggtgc gggagaatgg caccatggag atctaccagc ttcccgactg gcggctggtg 2461 ttcctggtga agaacttccc tgtggggcag cgggtccttg tggacagctc ctttggacag 2521 cccactacac agggcgaggc ccgcagggag gaggccacgc gccaggggga gctgcccctc 2581 gtcaaggagg tgctgctggt ggcgctgggc agccgccaga gcaggcccta cctgctggtg 2641 catgtggacc aagagctgct tatctacgag gccttccccc acgactctca gctcggccag 2701 ggcaatctca aagtccgctt taagaaggtc cctcacaaca tcaacttccg tgagaagaag 2761 ccaaagccat ccaagaagaa agcagaaggt ggcggcgcag aggagggggc tggggcccgg 2821 ggccgcgtgg cgcgtttccg ctacttcgag gatatttatg gctactcagg ggtcttcatc 2881 tgcggcccct cccctcactg gctcttggtg accggccgag gggctctgcg gctacacccc 2941 atggccatcg acggcccggt cgactctttc gctccattcc acaatgtcaa ctgtccccgc 3001 ggcttcctgt acttcaacag acagggcgag ctgaggatca gtgtcctgcc tgcctacctg 3061 tcctatgatg ccccatggcc tgtcaggaag atcccgctgc gctgcacggc ccactatgtg 3121 gcttaccacg tggagtctaa ggtgtatgct gtggccacca gcaccaacac gccgtgtgcc 3181 cgcatcccac gcatgactgg cgaggagaag gagtttgaga ccatcgagag agatgagcgg 3241 tacatccacc cccagcagga ggccttctcc atccagctca tctccccggt cagctgggag 3301 gctattccca atgccaggat cgagctgcag gagtgggagc atgtgacctg catgaagaca 3361 gtgtctctgc gcagtgagga gaccgtgtcg ggcctcaaag gctacgtggc cgccgggacc 3421 tgcctcatgc agggggagga ggtcacgtgc cgagggcgga tcttgatcat ggatgtgatt 3481 gaggtggtgc ccgagcctgg ccagcccttg accaagaaca agttcaaagt cctttacgag 3541 aaggagcaga aggggcccgt gaccgccctg tgccactgca atggccacct ggtgtcggcc 3601 atcggccaga agattttcct gtggagcctg cgggccagcg agctgacggg catggccttc 3661 atcgacacgc agctctacat acaccagatg atcagcgtca agaacttcat cctggcagcc 3721 gacgtcatga agagcatttc gctgctgcgc taccaggagg aaagcaagac gctgagcctg 3781 gtgtcgcggg atgccaagcc cctggaggtg tacagcgtgg acttcatggt ggacaatgcc 3841 cagctgggtt ttctggtgtc tgaccgcgac cgcaacctca tggtgtacat gtacctgccc 3901 gaagccaagg agagtttcgg gggcatgcgc ctgctgcgtc gggcagactt ccacgtgggt 3961 gcccacgtga acacgttctg gaggaccccg tgccgggggg ccactgaagg gctcagcaaa 4021 aagtcggtcg tgtgggagaa taagcacatc acgtggtttg ccaccctgga cggcggcatc 4081 gggctgctgc tgcccatgca ggagaagacc taccggcggc tgctgatgct gcagaacgcg 4141 ctgaccacca tgctgccaca ccacgccggc ctcaaccccc gcgccttccg gatgctgcac 4201 gtggaccgcc gcaccctcca gaatgccgtg cgcaacgtgc tggatgggga gctgctcaac 4261 cgctacctgt acctgagcac catggagcgc agcgagctag ccaagaagat cggcaccaca 4321 ccagacataa tcctggacga cttgctggag acggaccgcg tcaccgccca cttctagccc 4381 cgtggatgcc gtcaccacca gcacacggaa ctacctccca cccccttttt gtacaaaaca 4441 caaggaaaaa cattttttgc ttgaaaaaaa aaaaaaaaaa aaaaaaa //