LOCUS BC000246 4691 bp mRNA linear HUM 11-AUG-2006 DEFINITION Homo sapiens RNA polymerase II associated protein 1, mRNA (cDNA clone MGC:858 IMAGE:3357380), complete cds. ACCESSION BC000246 VERSION BC000246.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4691) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 4691) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (14-NOV-2000) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Genome Sequence Centre, BC Cancer Agency, Vancouver, BC, Canada info@bcgsc.bc.ca Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao, Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang, Jacquie Schein, Asim Siddiqui,Steven Jones, Rob Holt, Marco Marra. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 6 Row: a Column: 16 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 24430138. FEATURES Location/Qualifiers source 1..4691 /db_xref="H-InvDB:HIT000029456" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:858 IMAGE:3357380" /tissue_type="Eye, retinoblastoma" /clone_lib="NIH_MGC_16" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..4691 /gene="RPAP1" /gene_synonym="DKFZP727M111" /gene_synonym="FLJ12732" /gene_synonym="KIAA1403" /gene_synonym="MGC858" /db_xref="GeneID:26015" /db_xref="HGNC:HGNC:24567" CDS 118..4299 /gene="RPAP1" /gene_synonym="DKFZP727M111" /gene_synonym="FLJ12732" /gene_synonym="KIAA1403" /gene_synonym="MGC858" /codon_start=1 /product="RNA polymerase II associated protein 1" /protein_id="AAH00246.2" /db_xref="GeneID:26015" /db_xref="HGNC:HGNC:24567" /translation="MLSRPKPGESEVDLLHFQSQFLAAGAAPAVQLVKKGNRGGGDAN SDRPPLQDHRDVVMLDNLPDLPPALVPSPPKRARPSPGHCLPEDEDPEERLRRHDQHI TAVLTKIIERDTSSVAVNLPVPSGVAFPAVFLRSRDTQGKSATSGKRSIFAQEIAARR IAEAKGPSVGEVVPNVGPPEGAVTCETPTPRNQGCQLPGSSHSFQGPNLVTGKGLRDQ EAEQEAQTIHEENIARLQAMAPEEILQEQQRLLAQLDPSLVAFLRSHSHTQEQTGETA SEEQRPGGPSANVTKEEPLMSAFASEPRKRDKLEPEAPALALPVTPQKEWLHMDTVEL EKLHWTQDLPPVRRQQTQERMQARFSLQGELLAPDVDLPTHLGLHHHGEEAERAGYSL QELFHLTRSQVSQQRALALHVLAQVISRAQAGEFGDRLAGSVLSLLLDAGFLFLLRFS LDDRVDGVIATAIRALRALLVAPGDEELLDSTFSWYHGALTFPLMPSQEDKEDEDKDE ECPAGKAKRKSPEEESRPPPDLARHDVIKGLLATSLLPRLRYVLEVTYPGPAVVLDIL AVLIRLARHSLESATRVLECPRLIETIVREFLPTSWSPVGAGPTPSLYKVPCATAMKL LRVLASAGRNIAARLLSSFDLRSRLCRIIAEAPQELALPPEEAEMLSTEALRLWAVAA SYGQGGYLYRELYPVLMRALQVVPRELSTHPPQPLSMQRIASLLTLLTQLTLAAGSTP AETISDSAEASLSATPSLVTWTQVSGLQPLVEPCLRQTLKLLSRPEMWRAVGPVPVAC LLFLGAYYQAWSQQPSSCPEDWLQDMERLSEELLLPLLSQPTLGSLWDSLRHCSLLCN PLSCVPALEAPPSLVSLGCSGGCPRLSLAGSASPFPFLTALLSLLNTLAQIHKGLCGQ LAAILAAPGLQNYFLQCVAPGAAPHLTPFSAWALRHEYHLQYLALALAQKAAALQPLP ATHAALYHGMALALLSRLLPGSEYLTHELLLSCVFRLEFLPERTSGGPEAADFSDQLS LGSSRVPRCGQGTLLAQACQDLPSIRNCYLTHCSPARASLLASQALHRGELQRVPTLL LPMPTEPLLPTDWPFLPLIRLYHRASDTPSGLSPTDTMGTAMRVLQWVLVLESWRPQA LWAVPPAARLARLMCVFLVDSELFRESPVQHLVAALLAQLCQPQVLPNLNLDCRLPGL TSFPDLYANFLDHFEAVSFGDHLFGALVLLPLQRRFSVTLRLALFGEHVGALRALSLP LTQLPVSLECYTVPPEDNLALLQLYFRTLVTGALRPRWCPVLYAVAVAHVNSFIFSQD PQSSDEVKAARRSMLQKTWLLADEGLRQHLLHYKLPNSTLPEGFELYSQLPPLRQHYL QRLTSTVLQNGVSET" BASE COUNT 940 a 1443 c 1321 g 987 t ORIGIN 1 ctgcggggca agatggcggc gcccagacag gcctggagca cggatgaata agagggaacc 61 cccacacgga gacactgctg gagagagtcg tactggggag gcagctggag cagcaagatg 121 ctgtcgagac cgaagccagg ggagtccgag gtggacctgc tgcacttcca gagtcagttt 181 ctcgcagctg gtgcagcccc agcagtgcag ctggtgaaga aaggaaatag gggcggtggt 241 gatgccaact cagaccggcc tccgctccag gaccatcggg atgtggtgat gttggacaat 301 ctcccagatt tgcccccagc tttggtccct tctcctccaa agagagccag gcccagccct 361 ggccactgcc tgcctgagga tgaggaccca gaagagaggc tgaggaggca tgatcagcac 421 atcactgctg tcttgactaa gattattgaa cgagatacaa gttcagtggc cgtgaatctg 481 cctgtgccca gtggtgttgc tttccctgct gtgttccttc gctcgcggga cacacagggg 541 aaatcagcaa catctggtaa gagaagcatc tttgcccagg aaattgcggc aaggaggata 601 gctgaagcca agggcccatc agttggggaa gttgtgccca acgtgggccc accagagggt 661 gccgtgacct gtgagacacc cactcctagg aaccagggct gccagcttcc tgggagcagc 721 cacagctttc agggacccaa tctggtcaca gggaaggggc tcagggatca agaagctgag 781 caggaagccc agactatcca tgaagagaac atagcaagac tgcaggccat ggctcctgag 841 gagatcctgc aggaacagca gcggttgctg gcccagcttg accccagctt ggttgctttc 901 ttgagatctc acagccacac gcaagagcaa acaggagaga cagcctctga ggagcagagg 961 ccaggaggac cctctgctaa tgtcaccaag gaggaacccc tcatgtcagc ttttgccagt 1021 gagcccagga agagagacaa gctggagcca gaagccccag ctctggcatt gcccgtgacc 1081 cctcagaaag aatggctgca catggacact gtcgagctgg agaagctcca ctggacccag 1141 gacttgcccc ctgtccggcg gcagcagaca caggagagga tgcaggctcg gttcagtctt 1201 cagggagaac tactggcccc tgacgtggac ctgcccaccc acctgggtct gcaccaccat 1261 ggagaggagg cagagagagc ggggtattcc ctacaggagc tgttccacct gacccgcagc 1321 caggtttccc agcagagagc actggcactg catgtgttag cccaggtcat cagcagggcc 1381 caggctggtg agtttgggga ccggctagca ggcagtgtct taagcctcct tttggatgct 1441 ggtttcctct tcctactgcg cttctccttg gatgacagag tggatggggt cattgcaacc 1501 gccatccgtg ctcttcgggc tctgctggtg gctcctggag atgaggagct cctcgacagc 1561 accttctctt ggtaccatgg agctttgacg ttccctctga tgcccagcca ggaggacaag 1621 gaggatgagg acaaggatga agaatgccca gcaggaaaag caaaaaggaa aagccctgaa 1681 gaagaaagcc ggcctccacc tgacctggcc cgacatgatg tcatcaaggg gctcctggct 1741 accagcctgc tgcctcggct gcgctacgtg ctggaggtga catacccagg acctgcggtg 1801 gtccttgaca tcctggctgt gctcatccgc ctggcccggc attccctgga atcagccaca 1861 agggtcctgg agtgccctcg gctgatagag actatagttc gagagttctt gcccaccagt 1921 tggtctcctg tgggggcagg gcctacccct agtctataca aagtaccctg tgctactgcc 1981 atgaaactac ttcgtgtcct ggcctcagct gggaggaata ttgctgcccg gctgttgagc 2041 agctttgatc tccggagccg cctgtgccgc atcatagctg aggctcccca agaactggcc 2101 ttgcccccag aggaagctga gatgctgagc accgaggccc tccgtctgtg ggctgtggct 2161 gcctcctatg gccagggcgg ttacctttac agggagctct acccagtgct gatgcgggcc 2221 ttgcaggtgg tgccgcggga gctcagcacc cacccacctc aacccctgtc catgcagcgg 2281 atagcctcac tgctcactct cctcacccag ctaaccctgg cagccggcag tacccctgct 2341 gaaaccatca gtgattctgc tgaggccagc ctctcggcca ccccttcctt agtcacttgg 2401 acacaggtgt ctgggctcca gcctcttgtt gagccgtgtc taaggcagac cttgaagttg 2461 ctgtccagac ctgagatgtg gagagccgtg ggcccagtgc ccgttgcctg cctgttgttc 2521 ctgggagcct actaccaggc ctggagccag caaccaagct catgcccgga ggattggctc 2581 caggacatgg agcgcctgtc agaggagctg ctgctgccac tgctgagtca gcccacactg 2641 ggcagcctgt gggattccct taggcactgc tcccttctct gcaacccgct gtcctgtgtg 2701 ccagcccttg aagctccccc cagcctcgtg tcactgggct gctcgggagg ctgcccccgt 2761 ctcagtctgg ctggctcagc ctcacccttc ccattcctca ctgccctcct ctctcttctt 2821 aataccctgg cccagatcca caaggggctg tgtggccagc tggctgccat attggctgcc 2881 ccgggactcc agaattactt cctccagtgt gtggctcctg gggctgcccc acacctcaca 2941 cctttctctg catgggccct gcgccatgag taccacctgc agtacctggc actcgctctg 3001 gcccagaaag cggcagcgct gcagccactg ccagccaccc atgctgccct ctatcatggt 3061 atggccttgg ccctgctgag ccggctgctg cccggaagtg agtacctcac ccatgagctg 3121 ctgctgagct gtgtattccg gctggagttc ctcccggaaa gaacatcagg gggtccagag 3181 gcagccgact tctctgacca gctgtcgtta ggaagcagca gggtccctcg gtgtgggcaa 3241 gggactctgc tggctcaggc ctgccaggac ctccccagca tccgcaactg ctacctgact 3301 cattgctcgc cagcccgagc cagtctgctg gcctcccagg ctctgcaccg aggggagcta 3361 cagcgagtcc caaccctgct actgcccatg cctacggagc cgctgctgcc caccgactgg 3421 cccttcctgc cactgattcg cctctaccac cgggcttcag acaccccctc gggactctct 3481 cccacagaca ccatgggcac agccatgcgg gtcctgcagt gggtgctagt tttggagagc 3541 tggcgccccc aggctctctg ggctgtgccc cctgctgccc gcctggcacg gctcatgtgt 3601 gtgttcctgg tggacagtga gctgttccgg gagtccccag tacagcatct ggtggcagcc 3661 ctcctcgccc agctctgtca gcctcaagtc ttgccaaacc tcaacctgga ctgccgactc 3721 cctggcctga cgtctttccc tgacctctat gccaacttcc tggatcattt tgaggctgtc 3781 tcttttgggg accacctctt tggggccctg gtcctcctgc ccctgcagcg tcggttcagt 3841 gtcaccttgc gccttgccct ctttggggaa cacgtgggag ccttgcgagc tctgagcctg 3901 cctctgaccc agttgcctgt gtccctggag tgttacacag tgcctcctga agacaacctg 3961 gccctccttc agctctactt ccggaccctg gttactggtg cgctccgccc acgttggtgc 4021 cccgtgctct atgctgtggc tgtggctcat gtcaatagct tcatcttctc tcaggaccca 4081 cagagctcag atgaggtcaa agctgcccgc aggagtatgc tgcagaaaac atggctgctg 4141 gcagatgagg gtctccggca gcacctcctg cactataagc ttcccaattc cacgctccca 4201 gagggctttg agctctattc tcagttgccc cctctgcgtc agcactacct ccagagactg 4261 acttcaacag tgctccaaaa tggggtatca gagacctagg atagttgata tagatggaaa 4321 gatgggtacg ttgtcctgta tccagccttt caacagatgt ctggccagac gaagaacatt 4381 gtgtcctaat ggtaggcagg agaccaagga gcagaaggct tgccttcctg ggagcaggtt 4441 gtttgagctg ttttagagca gtgagcccta ccattacatc ctgatatctg gggcttctga 4501 aggtctgtgc tgggagtgaa gagtggctta gctatttacc cgctctttgg ggacagggca 4561 aactaaatgc atcccttctt acctaactcc caacccctgc cctgggctga ggcatatgaa 4621 tgctatagtt gtgcattaaa ataaatgttt tttatctcct ggaaaaaaaa aaaaaaaaaa 4681 aaaaaaaaaa a //