LOCUS BC070095 2922 bp mRNA linear HUM 17-JUL-2006 DEFINITION Homo sapiens cleavage and polyadenylation specific factor 2, 100kDa, mRNA (cDNA clone MGC:87517 IMAGE:5267724), complete cds. ACCESSION BC070095 VERSION BC070095.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2922) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2922) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (10-MAY-2004) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Miklos Palkovits, M.D., Ph.D. cDNA Library Preparation: Michael J. Brownstein (NHGRI) & Shiraki Toshiyuki and Piero Carninci (RIKEN) cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Sequencing Group at the Stanford Human Genome Center, Stanford University School of Medicine, Stanford, CA 94305 Web site: http://www-shgc.stanford.edu Contact: (Dickson, Mark) mcd@paxil.stanford.edu Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers, R. M. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 167 Row: m Column: 15 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 34101287. FEATURES Location/Qualifiers source 1..2922 /db_xref="H-InvDB:HIT000264053" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:87517 IMAGE:5267724" /tissue_type="Testis" /clone_lib="NIH_MGC_97" /lab_host="DH10B" /note="Vector: pBluescriptR" gene 1..2922 /gene="CPSF2" /gene_synonym="CPSF100" /gene_synonym="KIAA1367" /db_xref="GeneID:53981" /db_xref="HGNC:HGNC:2325" /db_xref="MIM:606028" CDS 239..2587 /gene="CPSF2" /gene_synonym="CPSF100" /gene_synonym="KIAA1367" /codon_start=1 /product="cleavage and polyadenylation specific factor 2, 100kDa" /protein_id="AAH70095.1" /db_xref="GeneID:53981" /db_xref="HGNC:HGNC:2325" /db_xref="MIM:606028" /translation="MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDI IDSLRKHVHQIDAVLLSHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLY QSRHNTEDFTLFTLDDVDAAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIW KIVKDGEEEIVYAVDFNHKREIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQ LLTNVLETLRGDGNVLIAVDTAGRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNV VEFSKSQVEWMSGKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLE CGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEGKEL EEYLEKEKLKKEAAKKLEQSKEADIDSSDESDIEEDIDQPSAHKTKHDLMMKGEGSRK GSFFKQAKKSYPMFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGD EPMDQDLSDVPTKCISTTESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHG PPEASQDLAECCRAFGGKDIKVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAK DAELAWIDGVLDMRVSKVDTGVILEEGELRDDGEDSEMQVEAPSDSSVIAQQKAMKSL FGDDEKETGEESEIIPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGG VLVCNNQVAVRRTETGRIGLEGCLCQDFYRIRDLLYEQYAIV" BASE COUNT 918 a 528 c 662 g 814 t ORIGIN 1 aataatggcg gctgccactg tggggcttct gccggccggt agtccctggc gctgctgacc 61 cagcatcggc ttttctacgt cttgaacctg gattcgccta ggggttggga agggctgtgg 121 acggcgttgg gggaggcctg acgagattaa taaagaactc ttcagaattc ctggtgtttc 181 atcatatata cgactaagat atcaactctt ctagcttgct gtttctggac caaaaaaaat 241 gacgtctatt atcaaattaa ctaccctttc tggggtccaa gaagaatctg ccctttgcta 301 tcttctccaa gttgatgagt ttagattttt attggactgt ggctgggatg agcacttttc 361 tatggatatt attgattccc tgaggaagca tgttcaccag attgatgcag tgctgttgtc 421 tcaccctgat cctctccacc ttggtgccct cccgtatgct gtcggaaagt tgggtctgaa 481 ctgtgctatc tatgcaacca ttcctgttta taaaatggga cagatgttca tgtatgatct 541 ttatcagtct cgacacaata cagaagattt tacactcttt acattagatg atgtggatgc 601 agcctttgat aaaatacagc agctaaaatt ctctcagatt gtgaatttga aaggtaaagg 661 acatggcctg tctatcacac ctctgccagc tggtcatatg ataggtggaa caatatggaa 721 aatagtcaaa gatggagaag aagaaattgt ttatgcagtt gacttcaacc acaagaggga 781 gatccattta aatggatgtt ccctggaaat gctaagcagg ccttccctac ttatcacaga 841 ttcattcaat gctacatatg tacagcctag aagaaaacag agagatgagc agcttctgac 901 aaatgtcctg gaaacacttc gaggtgatgg aaatgtgtta atagcagtgg acacagcagg 961 cagagttttg gaacttgctc aacttcttga tcagatttgg aggactaaag atgcaggatt 1021 gggtgtttac tcattggcac tcctaaataa tgtcagttac aatgtggtgg agttttctaa 1081 gtcccaggta gaatggatga gtggtaaatt gatgagatgt tttgaagaca aaagaaataa 1141 tccgtttcag tttcgccatc tctctttatg tcatggtctt tctgacttgg cccgtgtacc 1201 tagccctaaa gttgtacttg ccagccaacc tgacctggaa tgcggatttt caagggatct 1261 ctttattcag tggtgtcagg accctaaaaa ctcaatcatt ctaacctaca gaactactcc 1321 tgggacttta gcacgtttcc taattgataa tccttctgaa aaaattacag aaatagagtt 1381 gaggaaacgt gtgaagcttg aagggaaaga acttgaagaa tacttggaaa aagagaaact 1441 aaagaaagaa gctgccaaaa agcttgagca gtcaaaagag gcagatatag attccagtga 1501 tgagagtgat attgaggaag atattgacca gccatcagct cataagacga agcatgactt 1561 gatgatgaaa ggtgaaggca gtcgtaaagg aagttttttc aaacaggcaa aaaagtccta 1621 tcctatgttt cctgccccag aagaaagaat taaatgggat gaatatggag agattatcaa 1681 accagaggat ttcttagtgc cagagcttca agctactgaa gaagaaaaaa gcaaattaga 1741 atctggtttg acaaatggag atgaacctat ggatcaggat ttatctgatg ttcctactaa 1801 atgtatttct acaacagagt ctattgaaat aaaagcccgg gttacctaca tagactatga 1861 aggacgctct gatggggatt ccattaaaaa aatcattaat cagatgaaac cacgacagtt 1921 gatcatcgtc catggcccac cagaggccag tcaagatctg gcagagtgct gtcgcgcctt 1981 tggtgggaaa gatattaaag tgtacatgcc aaagctacat gaaacagttg atgccactag 2041 tgaaactcac atctaccagg tgaggttaaa agactcactt gtcagctctc ttcagttttg 2101 taaggcaaaa gatgctgaat tagcttggat agatggtgtc ttagatatga gagtttccaa 2161 agtggacaca ggggttattt tagaagaagg agaactaagg gatgatggag aagactcaga 2221 gatgcaagtg gaagctccct cagattctag cgttatagca caacaaaagg ccatgaaaag 2281 tttgttcgga gatgatgaaa aagaaacagg tgaagaaagt gagatcattc ctactttgga 2341 acccttgcca cctcatgagg ttcctggaca tcagtcagtt tttatgaatg aaccaaggct 2401 gtcagacttc aagcaagttc tcttacggga gggaattcaa gctgaatttg taggaggtgt 2461 acttgtttgc aacaatcaag tagcagtccg cagaacggaa actggacgca ttggattaga 2521 aggctgcctt tgtcaagatt tttataggat aagagacctt ttatatgaac aatatgccat 2581 tgtataaagg acatgatgtc aagaagtatc tgcttgacct ttctaagaaa aagggattct 2641 tatcttactc tgagcttttg atgttttgtt ttgtaacata caaaaagaat ctgccagaaa 2701 aacttacatg tatcagattt ttaaaaatat aaatagagaa cattttgcaa atgctcaaat 2761 gagcattcta tcttttggct ttcagagtga tagagctcct aacaggtgta caggcccaag 2821 agttgaaggt gattggtttt ctttacagac tccttgttct ctagaagggc tttttacttg 2881 aataaaacaa tgcaacttag caaaccaaaa aaaaaaaaaa aa //