LOCUS       BC070095                2922 bp    mRNA    linear   HUM 17-JUL-2006
DEFINITION  Homo sapiens cleavage and polyadenylation specific factor 2,
            100kDa, mRNA (cDNA clone MGC:87517 IMAGE:5267724), complete cds.
ACCESSION   BC070095
VERSION     BC070095.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2922)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2922)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (10-MAY-2004) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Miklos Palkovits, M.D., Ph.D.
            cDNA Library Preparation: Michael J. Brownstein (NHGRI) &  Shiraki
            Toshiyuki and Piero Carninci (RIKEN)
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA  94305
            Web site:       http://www-shgc.stanford.edu
            Contact:  (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
            R. M.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 167 Row: m Column: 15
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 34101287.
FEATURES             Location/Qualifiers
     source          1..2922
                     /db_xref="H-InvDB:HIT000264053"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:87517 IMAGE:5267724"
                     /tissue_type="Testis"
                     /clone_lib="NIH_MGC_97"
                     /lab_host="DH10B"
                     /note="Vector: pBluescriptR"
     gene            1..2922
                     /gene="CPSF2"
                     /gene_synonym="CPSF100"
                     /gene_synonym="KIAA1367"
                     /db_xref="GeneID:53981"
                     /db_xref="HGNC:HGNC:2325"
                     /db_xref="MIM:606028"
     CDS             239..2587
                     /gene="CPSF2"
                     /gene_synonym="CPSF100"
                     /gene_synonym="KIAA1367"
                     /codon_start=1
                     /product="cleavage and polyadenylation specific factor 2,
                     100kDa"
                     /protein_id="AAH70095.1"
                     /db_xref="GeneID:53981"
                     /db_xref="HGNC:HGNC:2325"
                     /db_xref="MIM:606028"
                     /translation="MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDI
                     IDSLRKHVHQIDAVLLSHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLY
                     QSRHNTEDFTLFTLDDVDAAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIW
                     KIVKDGEEEIVYAVDFNHKREIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQ
                     LLTNVLETLRGDGNVLIAVDTAGRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNV
                     VEFSKSQVEWMSGKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLE
                     CGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEGKEL
                     EEYLEKEKLKKEAAKKLEQSKEADIDSSDESDIEEDIDQPSAHKTKHDLMMKGEGSRK
                     GSFFKQAKKSYPMFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGD
                     EPMDQDLSDVPTKCISTTESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHG
                     PPEASQDLAECCRAFGGKDIKVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAK
                     DAELAWIDGVLDMRVSKVDTGVILEEGELRDDGEDSEMQVEAPSDSSVIAQQKAMKSL
                     FGDDEKETGEESEIIPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGG
                     VLVCNNQVAVRRTETGRIGLEGCLCQDFYRIRDLLYEQYAIV"
BASE COUNT          918 a          528 c          662 g          814 t
ORIGIN      
        1 aataatggcg gctgccactg tggggcttct gccggccggt agtccctggc gctgctgacc
       61 cagcatcggc ttttctacgt cttgaacctg gattcgccta ggggttggga agggctgtgg
      121 acggcgttgg gggaggcctg acgagattaa taaagaactc ttcagaattc ctggtgtttc
      181 atcatatata cgactaagat atcaactctt ctagcttgct gtttctggac caaaaaaaat
      241 gacgtctatt atcaaattaa ctaccctttc tggggtccaa gaagaatctg ccctttgcta
      301 tcttctccaa gttgatgagt ttagattttt attggactgt ggctgggatg agcacttttc
      361 tatggatatt attgattccc tgaggaagca tgttcaccag attgatgcag tgctgttgtc
      421 tcaccctgat cctctccacc ttggtgccct cccgtatgct gtcggaaagt tgggtctgaa
      481 ctgtgctatc tatgcaacca ttcctgttta taaaatggga cagatgttca tgtatgatct
      541 ttatcagtct cgacacaata cagaagattt tacactcttt acattagatg atgtggatgc
      601 agcctttgat aaaatacagc agctaaaatt ctctcagatt gtgaatttga aaggtaaagg
      661 acatggcctg tctatcacac ctctgccagc tggtcatatg ataggtggaa caatatggaa
      721 aatagtcaaa gatggagaag aagaaattgt ttatgcagtt gacttcaacc acaagaggga
      781 gatccattta aatggatgtt ccctggaaat gctaagcagg ccttccctac ttatcacaga
      841 ttcattcaat gctacatatg tacagcctag aagaaaacag agagatgagc agcttctgac
      901 aaatgtcctg gaaacacttc gaggtgatgg aaatgtgtta atagcagtgg acacagcagg
      961 cagagttttg gaacttgctc aacttcttga tcagatttgg aggactaaag atgcaggatt
     1021 gggtgtttac tcattggcac tcctaaataa tgtcagttac aatgtggtgg agttttctaa
     1081 gtcccaggta gaatggatga gtggtaaatt gatgagatgt tttgaagaca aaagaaataa
     1141 tccgtttcag tttcgccatc tctctttatg tcatggtctt tctgacttgg cccgtgtacc
     1201 tagccctaaa gttgtacttg ccagccaacc tgacctggaa tgcggatttt caagggatct
     1261 ctttattcag tggtgtcagg accctaaaaa ctcaatcatt ctaacctaca gaactactcc
     1321 tgggacttta gcacgtttcc taattgataa tccttctgaa aaaattacag aaatagagtt
     1381 gaggaaacgt gtgaagcttg aagggaaaga acttgaagaa tacttggaaa aagagaaact
     1441 aaagaaagaa gctgccaaaa agcttgagca gtcaaaagag gcagatatag attccagtga
     1501 tgagagtgat attgaggaag atattgacca gccatcagct cataagacga agcatgactt
     1561 gatgatgaaa ggtgaaggca gtcgtaaagg aagttttttc aaacaggcaa aaaagtccta
     1621 tcctatgttt cctgccccag aagaaagaat taaatgggat gaatatggag agattatcaa
     1681 accagaggat ttcttagtgc cagagcttca agctactgaa gaagaaaaaa gcaaattaga
     1741 atctggtttg acaaatggag atgaacctat ggatcaggat ttatctgatg ttcctactaa
     1801 atgtatttct acaacagagt ctattgaaat aaaagcccgg gttacctaca tagactatga
     1861 aggacgctct gatggggatt ccattaaaaa aatcattaat cagatgaaac cacgacagtt
     1921 gatcatcgtc catggcccac cagaggccag tcaagatctg gcagagtgct gtcgcgcctt
     1981 tggtgggaaa gatattaaag tgtacatgcc aaagctacat gaaacagttg atgccactag
     2041 tgaaactcac atctaccagg tgaggttaaa agactcactt gtcagctctc ttcagttttg
     2101 taaggcaaaa gatgctgaat tagcttggat agatggtgtc ttagatatga gagtttccaa
     2161 agtggacaca ggggttattt tagaagaagg agaactaagg gatgatggag aagactcaga
     2221 gatgcaagtg gaagctccct cagattctag cgttatagca caacaaaagg ccatgaaaag
     2281 tttgttcgga gatgatgaaa aagaaacagg tgaagaaagt gagatcattc ctactttgga
     2341 acccttgcca cctcatgagg ttcctggaca tcagtcagtt tttatgaatg aaccaaggct
     2401 gtcagacttc aagcaagttc tcttacggga gggaattcaa gctgaatttg taggaggtgt
     2461 acttgtttgc aacaatcaag tagcagtccg cagaacggaa actggacgca ttggattaga
     2521 aggctgcctt tgtcaagatt tttataggat aagagacctt ttatatgaac aatatgccat
     2581 tgtataaagg acatgatgtc aagaagtatc tgcttgacct ttctaagaaa aagggattct
     2641 tatcttactc tgagcttttg atgttttgtt ttgtaacata caaaaagaat ctgccagaaa
     2701 aacttacatg tatcagattt ttaaaaatat aaatagagaa cattttgcaa atgctcaaat
     2761 gagcattcta tcttttggct ttcagagtga tagagctcct aacaggtgta caggcccaag
     2821 agttgaaggt gattggtttt ctttacagac tccttgttct ctagaagggc tttttacttg
     2881 aataaaacaa tgcaacttag caaaccaaaa aaaaaaaaaa aa
//