LOCUS BC065920 1937 bp mRNA linear HUM 01-SEP-2006 DEFINITION Homo sapiens CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small phosphatase 2, mRNA (cDNA clone MGC:70608 IMAGE:5756196), complete cds. ACCESSION BC065920 VERSION BC065920.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1937) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 1937) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (02-FEB-2004) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Life Technologies, Inc. cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 139 Row: n Column: 9 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 19923329. FEATURES Location/Qualifiers source 1..1937 /db_xref="H-InvDB:HIT000262098" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:70608 IMAGE:5756196" /tissue_type="Blood, adult leukocytes" /clone_lib="NIH_MGC_118" /lab_host="DH10B" /note="Vector: pCMV-SPORT6" gene 1..1937 /gene="CTDSP2" /gene_synonym="OS4" /gene_synonym="PSR2" /gene_synonym="SCP2" /db_xref="GeneID:10106" /db_xref="HGNC:HGNC:17077" /db_xref="MIM:608711" CDS 245..1060 /gene="CTDSP2" /gene_synonym="OS4" /gene_synonym="PSR2" /gene_synonym="SCP2" /codon_start=1 /product="CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small phosphatase 2" /protein_id="AAH65920.1" /db_xref="GeneID:10106" /db_xref="HGNC:HGNC:17077" /db_xref="MIM:608711" /translation="MEHGSIITQARREDALVLTKQGLVSKSSPKKPRGRNIFKALFCC FRAQHVGQSSSSTELAAYKEEANTIAKSDLLQCLQYQFYQIPGTCLLPEVTEEDQGRI CVVIDLDETLVHSSFKPINNADFIVPIEIEGTTHQVYVLKRPYVDEFLRRMGELFECV LFTASLAKYADPVTDLLDRCGVFRARLFRESCVFHQGCYVKDLSRLGRDLRKTLILDN SPASYIFHPENAVPVQSWFDDMADTELLNLIPIFEELSGAEDVYTSLGQLRAP" BASE COUNT 465 a 531 c 525 g 416 t ORIGIN 1 ggagctgccg ctgggggatc ggggccgggg gcacccgggg gagccgctgc ccgggccgcc 61 cgccctttgt acaggccgcc tcccttcccg gtccggggag gaaacgagag gggggatgtg 121 aacagctgtg gaagtcggag tctcgggagc cggagcgggc ccccgcccag gccccccagc 181 ccagcccagc ccgcgcgccc gcccgtcctc ccgcccagcc agcccgggcc cgcgggattg 241 ttagatggaa cacggctcca tcatcaccca ggcgcggagg gaagacgccc tggtgctcac 301 caagcaaggc ctggtctcca agtcctctcc taagaagcct cgtggacgta acatcttcaa 361 ggcccttttc tgctgttttc gcgcccagca tgttggccag tcaagttcct ccactgagct 421 cgctgcgtat aaggaggaag caaacaccat tgctaagtcg gatctgctcc agtgtctcca 481 gtaccagttc taccagatcc cagggacctg cctgctccca gaggtgacag aggaagatca 541 aggaaggatc tgtgtggtca ttgacctcga tgaaaccctt gtgcatagct cctttaagcc 601 aatcaacaat gctgacttca tagtgcctat agagattgag gggaccactc accaggtgta 661 tgtgctcaag aggccttatg tggatgagtt cctgagacgc atgggggaac tctttgaatg 721 tgttctcttc actgccagcc tggccaagta tgccgaccct gtgacagacc tgctggaccg 781 gtgtggggtg ttccgggccc gcctattccg tgagtcttgc gtgttccacc agggctgcta 841 cgtcaaggac ctcagccgcc tggggaggga cctgagaaag accctcatcc tggacaactc 901 gcctgcttct tacatattcc accccgagaa tgcagtgcct gtgcagtcct ggtttgatga 961 catggcagac actgagttgc tgaacctgat cccaatcttt gaggagctga gcggagcaga 1021 ggacgtctac accagccttg ggcagctgcg ggccccttag cctgccctgc ttccaagcga 1081 cggccatccc agtaggggac tttcccacac tgtgccttta cgatcagcgt gacagagtag 1141 aagctggagt gcctcaccac acggcccgga aacagcggga agtaactgga aagagcttta 1201 ggacagctta gatgccgagt gggcgaatgc cagaccaatg atacccagag ctacctgccg 1261 ccaacttgtt gagatgtgtg tttgactgtg agagagtgtg tgtttgtgtg tgtgttttgc 1321 catgaactgt ggccccagtg tatagtgttt cagtggggga gaagctgaaa gaccaagact 1381 cttcccaagt tagcttgtct cctctcctgt caccctaaga gccactgagt tgtgtaggga 1441 tgaagactat tgaagactcc attgccaaac catggccttt cctcagtgtt gtaaggccta 1501 tgccaaggat aaaggaaggg tatgcctttg ggtactccag gcacacacct ttctgaaatc 1561 cttctccagc cagctgctgc agacaaaaga tcacatttct gggaagatga gaacttgttt 1621 ccagaccagc atccagtggc catcaggtct tgtggcccaa aggctatgct tgcctccggc 1681 tgagtgcctg ggataggcct tttctatgtc tccccaaggc tggggtgctg agcctgcctt 1741 cctcaccacc tagccatagt ctcaaacctg tggggaagga ggttttctcc ctgcccggga 1801 agaggacaga taactgattt ccgttctttt gactgtgttt taaaattctc tttctaaaca 1861 cagaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1921 aaaaaaaaaa aaaaaaa //