LOCUS BC070095 2922 bp mRNA linear HUM 17-JUL-2006
DEFINITION Homo sapiens cleavage and polyadenylation specific factor 2,
100kDa, mRNA (cDNA clone MGC:87517 IMAGE:5267724), complete cds.
ACCESSION BC070095
VERSION BC070095.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2922)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2922)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (10-MAY-2004) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: Miklos Palkovits, M.D., Ph.D.
cDNA Library Preparation: Michael J. Brownstein (NHGRI) & Shiraki
Toshiyuki and Piero Carninci (RIKEN)
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Sequencing Group at the Stanford Human Genome
Center, Stanford University School of Medicine, Stanford, CA 94305
Web site: http://www-shgc.stanford.edu
Contact: (Dickson, Mark) mcd@paxil.stanford.edu
Dickson, M., Schmutz, J., Grimwood, J., Rodriguez, A., and Myers,
R. M.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 167 Row: m Column: 15
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 34101287.
FEATURES Location/Qualifiers
source 1..2922
/db_xref="H-InvDB:HIT000264053"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:87517 IMAGE:5267724"
/tissue_type="Testis"
/clone_lib="NIH_MGC_97"
/lab_host="DH10B"
/note="Vector: pBluescriptR"
gene 1..2922
/gene="CPSF2"
/gene_synonym="CPSF100"
/gene_synonym="KIAA1367"
/db_xref="GeneID:53981"
/db_xref="HGNC:HGNC:2325"
/db_xref="MIM:606028"
CDS 239..2587
/gene="CPSF2"
/gene_synonym="CPSF100"
/gene_synonym="KIAA1367"
/codon_start=1
/product="cleavage and polyadenylation specific factor 2,
100kDa"
/protein_id="AAH70095.1"
/db_xref="GeneID:53981"
/db_xref="HGNC:HGNC:2325"
/db_xref="MIM:606028"
/translation="MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDI
IDSLRKHVHQIDAVLLSHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLY
QSRHNTEDFTLFTLDDVDAAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIW
KIVKDGEEEIVYAVDFNHKREIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQ
LLTNVLETLRGDGNVLIAVDTAGRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNV
VEFSKSQVEWMSGKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLE
CGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEGKEL
EEYLEKEKLKKEAAKKLEQSKEADIDSSDESDIEEDIDQPSAHKTKHDLMMKGEGSRK
GSFFKQAKKSYPMFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGD
EPMDQDLSDVPTKCISTTESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHG
PPEASQDLAECCRAFGGKDIKVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAK
DAELAWIDGVLDMRVSKVDTGVILEEGELRDDGEDSEMQVEAPSDSSVIAQQKAMKSL
FGDDEKETGEESEIIPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGG
VLVCNNQVAVRRTETGRIGLEGCLCQDFYRIRDLLYEQYAIV"
BASE COUNT 918 a 528 c 662 g 814 t
ORIGIN
1 aataatggcg gctgccactg tggggcttct gccggccggt agtccctggc gctgctgacc
61 cagcatcggc ttttctacgt cttgaacctg gattcgccta ggggttggga agggctgtgg
121 acggcgttgg gggaggcctg acgagattaa taaagaactc ttcagaattc ctggtgtttc
181 atcatatata cgactaagat atcaactctt ctagcttgct gtttctggac caaaaaaaat
241 gacgtctatt atcaaattaa ctaccctttc tggggtccaa gaagaatctg ccctttgcta
301 tcttctccaa gttgatgagt ttagattttt attggactgt ggctgggatg agcacttttc
361 tatggatatt attgattccc tgaggaagca tgttcaccag attgatgcag tgctgttgtc
421 tcaccctgat cctctccacc ttggtgccct cccgtatgct gtcggaaagt tgggtctgaa
481 ctgtgctatc tatgcaacca ttcctgttta taaaatggga cagatgttca tgtatgatct
541 ttatcagtct cgacacaata cagaagattt tacactcttt acattagatg atgtggatgc
601 agcctttgat aaaatacagc agctaaaatt ctctcagatt gtgaatttga aaggtaaagg
661 acatggcctg tctatcacac ctctgccagc tggtcatatg ataggtggaa caatatggaa
721 aatagtcaaa gatggagaag aagaaattgt ttatgcagtt gacttcaacc acaagaggga
781 gatccattta aatggatgtt ccctggaaat gctaagcagg ccttccctac ttatcacaga
841 ttcattcaat gctacatatg tacagcctag aagaaaacag agagatgagc agcttctgac
901 aaatgtcctg gaaacacttc gaggtgatgg aaatgtgtta atagcagtgg acacagcagg
961 cagagttttg gaacttgctc aacttcttga tcagatttgg aggactaaag atgcaggatt
1021 gggtgtttac tcattggcac tcctaaataa tgtcagttac aatgtggtgg agttttctaa
1081 gtcccaggta gaatggatga gtggtaaatt gatgagatgt tttgaagaca aaagaaataa
1141 tccgtttcag tttcgccatc tctctttatg tcatggtctt tctgacttgg cccgtgtacc
1201 tagccctaaa gttgtacttg ccagccaacc tgacctggaa tgcggatttt caagggatct
1261 ctttattcag tggtgtcagg accctaaaaa ctcaatcatt ctaacctaca gaactactcc
1321 tgggacttta gcacgtttcc taattgataa tccttctgaa aaaattacag aaatagagtt
1381 gaggaaacgt gtgaagcttg aagggaaaga acttgaagaa tacttggaaa aagagaaact
1441 aaagaaagaa gctgccaaaa agcttgagca gtcaaaagag gcagatatag attccagtga
1501 tgagagtgat attgaggaag atattgacca gccatcagct cataagacga agcatgactt
1561 gatgatgaaa ggtgaaggca gtcgtaaagg aagttttttc aaacaggcaa aaaagtccta
1621 tcctatgttt cctgccccag aagaaagaat taaatgggat gaatatggag agattatcaa
1681 accagaggat ttcttagtgc cagagcttca agctactgaa gaagaaaaaa gcaaattaga
1741 atctggtttg acaaatggag atgaacctat ggatcaggat ttatctgatg ttcctactaa
1801 atgtatttct acaacagagt ctattgaaat aaaagcccgg gttacctaca tagactatga
1861 aggacgctct gatggggatt ccattaaaaa aatcattaat cagatgaaac cacgacagtt
1921 gatcatcgtc catggcccac cagaggccag tcaagatctg gcagagtgct gtcgcgcctt
1981 tggtgggaaa gatattaaag tgtacatgcc aaagctacat gaaacagttg atgccactag
2041 tgaaactcac atctaccagg tgaggttaaa agactcactt gtcagctctc ttcagttttg
2101 taaggcaaaa gatgctgaat tagcttggat agatggtgtc ttagatatga gagtttccaa
2161 agtggacaca ggggttattt tagaagaagg agaactaagg gatgatggag aagactcaga
2221 gatgcaagtg gaagctccct cagattctag cgttatagca caacaaaagg ccatgaaaag
2281 tttgttcgga gatgatgaaa aagaaacagg tgaagaaagt gagatcattc ctactttgga
2341 acccttgcca cctcatgagg ttcctggaca tcagtcagtt tttatgaatg aaccaaggct
2401 gtcagacttc aagcaagttc tcttacggga gggaattcaa gctgaatttg taggaggtgt
2461 acttgtttgc aacaatcaag tagcagtccg cagaacggaa actggacgca ttggattaga
2521 aggctgcctt tgtcaagatt tttataggat aagagacctt ttatatgaac aatatgccat
2581 tgtataaagg acatgatgtc aagaagtatc tgcttgacct ttctaagaaa aagggattct
2641 tatcttactc tgagcttttg atgttttgtt ttgtaacata caaaaagaat ctgccagaaa
2701 aacttacatg tatcagattt ttaaaaatat aaatagagaa cattttgcaa atgctcaaat
2761 gagcattcta tcttttggct ttcagagtga tagagctcct aacaggtgta caggcccaag
2821 agttgaaggt gattggtttt ctttacagac tccttgttct ctagaagggc tttttacttg
2881 aataaaacaa tgcaacttag caaaccaaaa aaaaaaaaaa aa
//