LOCUS       BC012977                2364 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens CTD (carboxy-terminal domain, RNA polymerase II,
            polypeptide A) small phosphatase 1, mRNA (cDNA clone MGC:15065
            IMAGE:3687816), complete cds.
ACCESSION   BC012977
VERSION     BC012977.2
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2364)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2364)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (20-AUG-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Dec 9, 2003 this sequence version replaced BC012977.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Louis M. Staudt, M.D., Ph.D.
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
            Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 24 Row: a Column: 18
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 32813442.
FEATURES             Location/Qualifiers
     source          1..2364
                     /db_xref="H-InvDB:HIT000036057"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:15065 IMAGE:3687816"
                     /tissue_type="Lymph, Burkitt lymphoma"
                     /clone_lib="NIH_MGC_8"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..2364
                     /gene="CTDSP1"
                     /gene_synonym="NLIIF"
                     /gene_synonym="SCP1"
                     /db_xref="GeneID:58190"
                     /db_xref="HGNC:HGNC:21614"
                     /db_xref="MIM:605323"
     CDS             46..831
                     /gene="CTDSP1"
                     /gene_synonym="NLIIF"
                     /gene_synonym="SCP1"
                     /codon_start=1
                     /product="CTD (carboxy-terminal domain, RNA polymerase II,
                     polypeptide A) small phosphatase 1"
                     /protein_id="AAH12977.1"
                     /db_xref="GeneID:58190"
                     /db_xref="HGNC:HGNC:21614"
                     /db_xref="MIM:605323"
                     /translation="MDSSAVITQISKEEARGPLRGKGDQKSAASQKPRSRGILHSLFC
                     CVCRDDGEALPAHSGAPLLVEENGAIPKQTPVQYLLPEAKAQDSDKICVVIDLDETLV
                     HSSFKPVNNADFIIPVEIDGVVHQVYVLKRPHVDEFLQRMGELFECVLFTASLAKYAD
                     PVADLLDKWGAFRARLFRESCVFHRGNYVKDLSRLGRDLRRVLILDNSPASYVFHPDN
                     AVPVASWFDNMSDTELHDLLPFFEQLSRVDDVYSVLRQPRPGS"
BASE COUNT      446 a    741 c    682 g    495 t
ORIGIN
        1 ggcgcccggg ccagagtccg gccggagcgg agcgcgcccg gccccatgga cagctcggcc
       61 gtcattactc agatcagcaa ggaggaggct cggggcccgc tgcggggcaa aggtgaccag
      121 aagtcagcag cttcccagaa gccccgaagc cggggcatcc tccactcact cttctgctgt
      181 gtctgccggg atgatgggga ggccctgcct gctcacagcg gggcgcccct gcttgtggag
      241 gagaatggag ccatccctaa gcagacccca gtccaatacc tgctccctga ggccaaggcc
      301 caggactcag acaagatctg cgtggtcatc gacctggacg agaccctggt gcacagctcc
      361 ttcaagccag tgaacaacgc ggacttcatc atccctgtgg agattgatgg ggtggtccac
      421 caggtctacg tgttgaagcg tcctcacgtg gatgagttcc tgcagcgaat gggcgagctc
      481 tttgaatgtg tgctgttcac tgctagcctc gccaagtacg cagacccagt agctgacctg
      541 ctggacaaat ggggggcctt ccgggcccgg ctgtttcgag agtcctgcgt cttccaccgg
      601 gggaactacg tgaaggacct gagccggttg ggtcgagacc tgcggcgggt gctcatcctg
      661 gacaattcac ctgcctccta tgtcttccat ccagacaatg ctgtaccggt ggcctcgtgg
      721 tttgacaaca tgagtgacac agagctccac gacctcctcc ccttcttcga gcaactcagc
      781 cgtgtggacg acgtgtactc agtgctcagg cagccacggc cagggagcta gtgagggtga
      841 tggggccagg acctgcccct gaccaatgat acccacacct cctcccagga agactgccca
      901 ggcctttgtt aggaaaaccc atgggccgcc gccacactca gtgccatggg gaagcgggcg
      961 tctcccccac cagccccacc aggcggtgta ggggcagcag gctgcactga ggaccgtgag
     1021 ctccaggccc cgtgtcagtg ccttcaaacc tcctccccta ttctcagggg acctgggggg
     1081 ccctgcctgc tgctcccttt ttctgtctct gtccatgctg ccatgtttct ctgctgccaa
     1141 attgggcccc ttggcccctt ccggttctgc ttcctggggg cagggttcct gccttggacc
     1201 cccagtctgg gaacggtgga catcaagtgc cttgcataga gccccctctt ccccgcccag
     1261 ctttcccagg ggcacagctc taggctggga ggggagaacc agcccctccc cctgccccac
     1321 ctcctccctt gggactgaga gggcccctac caacctttgc ctctgccttg gagggagggg
     1381 aggtctgtta ccactgggga aggcagcagg agtctgtcct tcaggcccca cagtgcagct
     1441 tctccagggc cgacagctga gggctgctcc ctgcatcatc caagcaatga cctcagactt
     1501 ctgccttaac cagccccggg gcttggctcc cccagctctg agcgtggggg cataggcagg
     1561 accccccttg tggtgccata taaatatgta catgtgtata tagattttta ggggaaggag
     1621 agagggaagg gtcagggtag agacacccct cccttgcccc tttcctgggc ccagaagttg
     1681 gggggaggga gggaaaggat ttttacattt tttaaactgc tattttctga atggaacaag
     1741 ctgggccaag gggcccaggc cctgtcctct gtccctcaca cccctttgct ccgttcattc
     1801 attcaaaaaa acatttcttg agcaccttct gtgcccagca tatgctaggc ccaccagcta
     1861 agtgtgtgtg gggggtctct acgccagctc atcagtgcct ccttgcccat ccttcaccgg
     1921 tgcctttggg ggatctgtag gaggtgggac cttctgtggg gtttggggat ctccaggaag
     1981 cccgaccaag ctgtcccctt cccctgtgcc aacccatctc ctacagcccc ctgcctgatc
     2041 ccctgctggc tgggggcagc tcccaggata tcctgccttc caactgtttc tgaagcccct
     2101 cctcctaaca tggcgattcc ggaggtcaag gccttgggct ctccccaggg tctaacggtt
     2161 aaggggaccc acataccagt gccaaggggg atgtcaagtg gtgatgtcgt tgtgctcccc
     2221 tcccccagag cgggtgggcg gggggtgaat atggttggcc tgcatcaggt ggccttccca
     2281 tttaagtgcc ttctctgtga ctgagagccc tagtgtgatg agaactaaag agaaagccag
     2341 acccctaaaa aaaaaaaaaa aaaa
//