LOCUS BC020211 2286 bp mRNA linear HUM 15-JUL-2006 DEFINITION Homo sapiens cleavage and polyadenylation specific factor 3, 73kDa, mRNA (cDNA clone MGC:31848 IMAGE:4890593), complete cds. ACCESSION BC020211 VERSION BC020211.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2286) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2286) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (19-DEC-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Genome Sequence Centre, BC Cancer Agency, Vancouver, BC, Canada info@bcgsc.bc.ca Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao, Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang, Jacquie Schein, Asim Siddiqui,Steven Jones, Rob Holt, Marco Marra. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 40 Row: n Column: 5 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 21314666. FEATURES Location/Qualifiers source 1..2286 /db_xref="H-InvDB:HIT000038753" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:31848 IMAGE:4890593" /tissue_type="Brain, neuroblastoma" /clone_lib="NIH_MGC_19" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..2286 /gene="CPSF3" /gene_synonym="CPSF" /gene_synonym="CPSF-73" /gene_synonym="CPSF73" /db_xref="GeneID:51692" /db_xref="HGNC:HGNC:2326" /db_xref="MIM:606029" CDS 36..2090 /gene="CPSF3" /gene_synonym="CPSF" /gene_synonym="CPSF-73" /gene_synonym="CPSF73" /codon_start=1 /product="cleavage and polyadenylation specific factor 3, 73kDa" /protein_id="AAH20211.1" /db_xref="GeneID:51692" /db_xref="HGNC:HGNC:2326" /db_xref="MIM:606029" /translation="MSAIPAEESDQLLIRPLGAGQEVGRSCIILEFKGRKIMLDCGIH PGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTSFKGRTFMTHATKAI YRWLLSDYVKVSNISADDMLYTETDLEESMDKIETINFHEVKEVAGIKFWCYHAGHVL GAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREER EARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAK KCMAVYQTYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQS GLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYIS FSAHTDYQQTSEFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRN TEAVTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHILSPCDLSNYTDLAM STVKQTQAIPYTGPFNLLCYQLQKLTGDVEELEIQEKPALKVFKNITVIQEPGMVVLE WLANPSNDMYADTVTTVILEVQSNPKIRKGAVQKVSKKLEMHVYSKRLEIMLQDIFGE DCVSVKDDSILSVTVDGKTANLNLETRTVECEEGSEDDESLREMVELAAQRLYEALTP VH" BASE COUNT 797 a 428 c 484 g 577 t ORIGIN 1 gttcctcacc cccgcttcgc cctcacactt tcgggatgtc tgcgattcct gctgaggaga 61 gcgaccagct gctgatccga ccccttggag ctgggcaaga agtaggaaga tcatgtatta 121 ttctcgagtt caaaggaaga aaaataatgc tcgactgtgg gatccaccct ggcctagaag 181 gaatggatgc tcttccttat attgatttaa ttgacccagc tgagattgat ctcctattaa 241 ttagtcattt ccatttggat cactgtggag ctctgccctg gtttctacag aagacaagtt 301 tcaaaggaag aacatttatg actcatgcca caaaagctat ttatagatgg cttctttctg 361 attatgtcaa agttagtaac atatcagcag acgacatgct gtataccgag acagatttgg 421 aagaaagcat ggacaaaatt gaaactatca actttcatga agttaaggaa gttgcgggaa 481 tcaagttttg gtgttaccat gcaggtcacg tcctaggagc cgccatgttc atgattgaga 541 tcgcaggcgt gaagcttttg tacactggtg atttctcaag acaagaagat aggcacttaa 601 tggcagctga aattcctaat attaagcctg atattcttat cattgaatct acttatggga 661 cccatatcca tgagaaacgt gaagagcgag aagcaagatt ctgtaacact gtccacgata 721 ttgtaaacag aggaggcagg ggtctcattc ctgtctttgc tcttggaagg gctcaggagc 781 tgctcttgat tctagatgag tactggcaga atcacccaga actacatgac attccaatat 841 actatgcatc atctttggcc aagaagtgta tggcagtgta ccagacatat gtaaatgcca 901 tgaatgacaa aatccgcaaa cagatcaaca tcaataatcc ctttgttttc aaacacatta 961 gtaacctcaa gagcatggat cattttgatg acattggtcc cagtgttgta atggcctccc 1021 caggcatgat gcaaagtggc ttatccagag aattatttga aagctggtgt actgataaga 1081 ggaatggtgt cattatagcg ggatactgtg tagaagggac acttgccaag cacatcatgt 1141 ctgaacctga agaaatcact actatgtctg gacagaagtt accactgaaa atgtctgttg 1201 attacatttc tttctcagct cacacggatt accagcaaac cagtgaattt attcgtgctt 1261 tgaaaccgcc tcatgtgatt ttagtccatg gagaacagaa tgaaatggcc agattgaaag 1321 cagcactgat tcgagaatat gaagataacg atgaagttca catagaggtt cataatcctc 1381 ggaatacaga agcagtgacc ttaaacttca gaggagaaaa actagccaag gttatgggat 1441 ttttagcaga caaaaaacca gaacaaggcc agcgggtctc aggaatactt gttaaaagaa 1501 actttaatta tcacatactt tctccttgcg acctgtccaa ttatactgac ctggccatga 1561 gcacggtgaa gcagacccaa gccattccat atactggtcc ctttaatttg ctctgttacc 1621 agctgcagaa attgacaggt gatgtggaag aattagaaat tcaagaaaaa cctgctctga 1681 aagtgttcaa aaatattact gtaatacaag aaccaggcat ggtggtatta gaatggctgg 1741 caaacccttc taatgatatg tatgcagata cagtaacaac tgtgatattg gaagttcagt 1801 caaatcccaa aataagaaaa ggtgcagtac agaaggtttc taaaaaatta gaaatgcacg 1861 tttacagcaa gaggttggag atcatgctcc aggacatatt tggagaagac tgtgtaagtg 1921 taaaggatga ctctattctt agcgtcacag tggacgggaa aactgccaac cttaacttgg 1981 agacacggac tgtagaatgt gaagagggaa gtgaagacga tgaatccctc cgagaaatgg 2041 tggagctggc tgcacagaga ctgtacgagg ccctgacgcc agttcactga gactgtgcct 2101 gtatatgaac tttgaaaaaa tacttgactc tacttttgtt acctaaaata aaatgcattc 2161 gtttctctgg gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2221 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2281 aaaaaa //