LOCUS       BC022792                4356 bp    mRNA    linear   HUM 19-JAN-2006
DEFINITION  Homo sapiens Vpr (HIV-1) binding protein, mRNA (cDNA clone
            MGC:23092 IMAGE:4853730), complete cds.
ACCESSION   BC022792
VERSION     BC022792.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4356)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 4356)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (04-FEB-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Louis Staudt
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Genome Sequence Centre,
            BC Cancer Agency, Vancouver, BC, Canada
            info@bcgsc.bc.ca
            Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson
            Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen
            Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel
            Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave
            Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth
            Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy
            Liao, Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR
            Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang,
            Jacquie Schein, Asim Siddiqui, Steven Jones, Rob Holt, Marco Marra.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 34 Row: c Column: 19.
FEATURES Location/Qualifiers source 1..4356 /db_xref="H-InvDB:HIT000039641" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:23092 IMAGE:4853730" /tissue_type="Primary B-Cells from Tonsils" /clone_lib="NIH_MGC_48" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..4356 /gene="VPRBP" /gene_synonym="KIAA0800" /gene_synonym="MGC102804" /db_xref="GeneID:9730" /db_xref="HGNC:HGNC:30911" /db_xref="IMGT/GENE-DB:30911" /db_xref="MIM:30911" CDS 200..3376 /gene="VPRBP" /gene_synonym="KIAA0800" /gene_synonym="MGC102804" /codon_start=1 /product="VPRBP protein" /protein_id="AAH22792.1" /db_xref="GeneID:9730" /db_xref="HGNC:HGNC:30911" /db_xref="IMGT/GENE-DB:30911" /db_xref="MIM:30911" /translation="MTTVVVHVDSKAELTTLLEQWEKEHGSGQDMVPILTRMSQLIEK ETEEYRKGDPDPFDDRHPGRADPECMLGHLLRILFKNDDFMNALVNAYVMTSREPPLN TAACRLLLDIMPGLETAVVFQEKEGIVENLFKWAREADQPLRTYSTGLLGGAMENQDI AANYRDENSQLVAIVLRRLRELQLQEVALRQENKRPSPRKLSSEPLLPLDEEAVDMDY GDMAVDDAEIQKSALQIIINCVCGPDNRISSIGKFISGTPRRKLPQNPKSSEHTLAKM WNVVQSNNGIKVLLSLLSIKMPITDADQIRALACKALVGLSRSSTVRQIISKLPLFSS CQIQQLMKEPVLQDKRSDHVKFCKYAAELIERVSGKPLLIGTDVSLARLQKADVVAQS RISFPEKELLLLIRNHLISKGLGETATVLTKEADLPMTAASHSSAFTPVTAAASPVSL PRTPRIANGIATRLGSHAAVGASAPSAPTAHPQPRPPQGPLALPGPSYAGNSPLIGRI SFIRERPSPCNGRKIRVLRQKSDHGAYSQSPAIKKQLDRHLPSPPTLDSIITEYLREQ HARCKNPVATCPPFSLFTPHQCPEPKQRRQAPINFTSRLNRRASFPKYGGVDGGCFDR HLIFSRFRPISVFREANEDESGFTCCAFSARERFLMLGTCTGQLKLYNVFSGQEEASY NCHNSAITHLEPSRDGSLLLTSATWSQPLSALWGMKSVFDMKHSFTEDHYVEFSKHSQ DRVIGTKGDIAHIYDIQTGNKLLTLFNPDLANNYKRNCATFNPTDDLVLNDGVLWDVR SAQAIHKFDKFNMNISGVFHPNGLEVIINTEIWDLRTFHLLHTVPALDQCRVVFNHTG TVMYGAMLQADDEDDLMEERMKSPFGSSFRTFNATDYKPIATIDVKRNIFDLCTDTKD CYLAVIENQGSMDALNMDTVCRLYEVGRQRLAEDEDEEEDQEEEEQEEEDDDEDDDDT DDLDELDTDQLLEAELEEDDNNENAGEDGDNDFSPSDEELANLLEEGEDGEDEDSDAD EEVELILGDTDSSDNSDLEDDIILSLNE" BASE COUNT 1230 a 987 c 1027 g 1112 t ORIGIN 1 ggcggagccg cctcggacga taaggaaatt gagatttgga gaagttaaat aaccaagact 61 tcacagacaa agtctcactc cttttgccca ggccggtgtg cattggggta atctcggctg 121 actgcaacct 
ccgtcccctc cgagttcaag tgattctcct gcctcaacct cccgagtagc 181 tgggattaca ggcaaagcca tgactacagt agtggtacat gtggactcca aagctgagct 241 cactaccctg ctggagcagt gggaaaagga acatggcagt gggcaggaca tggtacctat 301 ccttaccagg atgtctcaat tgattgaaaa agaaactgaa gagtatcgta aaggggatcc 361 agacccattt gatgatcgac atcctggtcg agctgatcca gagtgtatgc tgggccactt 421 gctgagaata ctcttcaaga atgatgattt catgaatgca ctggtgaatg catatgtgat 481 gacaagccga gagccccctt taaacactgc agcttgcaga ctcctattag acatcatgcc 541 agggctggaa actgctgtcg tctttcaaga aaaggaggga attgtcgaga atcttttcaa 601 atgggcccga gaggccgatc aaccattgag gacatattct actggactgt taggaggtgc 661 tatggaaaat caagacattg ctgccaacta tagagatgaa aattcacagc tggtggcaat 721 agtgcttcga agactgaggg agctacagct acaggaagtg gctttgcggc aggaaaacaa 781 gcgtcccagt ccacggaagc tctcttctga accccttttg cctctggatg aggaggctgt 841 ggatatggac tatggtgaca tggctgtaga tgatgctgaa attcagaagt cagcacttca 901 gattatcatc aattgtgtgt gtggcccaga taaccgaata tccagtattg gtaaatttat 961 ctctggtact cctcggagaa agctgcctca gaaccctaaa agcagtgagc acaccctggc 1021 caagatgtgg aatgtggttc agtccaacaa cggcatcaag gtgctcctgt ccttactgtc 1081 cattaagatg cccatcacag atgcagacca aatccgggcc ctggcctgca aagccctagt 1141 gggcctgtct cgcagtagca ctgtccggca gatcatcagt aaactgcccc ttttcagcag 1201 ctgccagatc cagcagctga tgaaggagcc tgtgctgcag gacaagcgca gtgaccatgt 1261 caagttctgc aagtatgctg ctgaactcat tgaacgggtg tcaggaaaac cacttctcat 1321 tggcactgat gtttccctag cacgactgca gaaagcagat gttgttgccc agtcaaggat 1381 ctccttccct gagaaagagc tgcttttgtt gatacgaaac catcttattt ctaaagggct 1441 tggagaaaca gcaaccgtgc tgacaaaaga ggctgacctg cccatgactg ctgcctccca 1501 ttcttctgcc tttaccccag tcactgctgc tgcttctcct gtctctctac cccgaacccc 1561 tcgtatcgct aatggcattg caactcgtct gggcagccat gctgctgtgg gtgcctctgc 1621 gccttctgcc cctactgctc atcctcagcc acggcccccc cagggtccgc tagctctgcc 1681 cggcccatct tatgcaggca actccccttt gattggtaga atcagtttta tcagagagag 1741 gccatcaccc tgcaatggca ggaaaatcag agtgttgcgg cagaagtcgg accatggtgc 1801 ctacagccaa agcccagcca taaaaaaaca 
gctggacaga catcttcctt ccccacctac 1861 gctggacagt ataatcacag agtatcttag agaacaacat gctcgctgca agaatccagt 1921 tgccacctgc ccacctttct ccctctttac tcctcaccaa tgtcctgagc caaaacagag 1981 gcggcaagcg ccaataaact ttacgtcaag gctaaaccgc agggcatcat ttccaaagta 2041 tggaggggtg gatggcggat gctttgatag gcaccttatc tttagcagat tccgtcctat 2101 ttcagtgttc cgggaagcca atgaagatga gagtggcttc acctgctgtg cattctcagc 2161 acgggagcgg ttcctgatgc ttggcacctg cacagggcag ctgaagctct ataatgtgtt 2221 tagtggacag gaggaggcca gctataactg tcacaactca gccatcacac atcttgaacc 2281 ttccagggat gggtccttgc tgctgacatc tgctacttgg agccagcctt tgtctgcact 2341 ttggggaatg aagtcagtat ttgatatgaa gcattccttc acagaagatc actatgttga 2401 gttcagtaag cactcccagg atcgggtcat cggcacaaaa ggagacattg cccacattta 2461 tgatattcag actggcaaca agctgttgac tctgtttaac ccagatcttg ccaacaacta 2521 caagaggaac tgtgccacct ttaatcctac agatgatctt gtcttaaatg atggcgtcct 2581 ctgggatgtc cgctctgcac aggccatcca caagtttgac aagttcaata tgaacatcag 2641 tggtgttttc catccaaatg gactggaggt gatcattaat actgagattt gggaccttcg 2701 aacttttcat cttttgcata ctgttcccgc tctggatcag tgtcgcgtgg tgttcaatca 2761 cacgggaaca gtgatgtatg gagctatgtt gcaggcagat gatgaagatg acttaatgga 2821 agagaggatg aaaagcccct ttgggtcatc cttccgaaca tttaatgcaa ctgactacaa 2881 acctatagca accattgatg tgaaacggaa catctttgac ctgtgtacag acaccaaaga 2941 ctgctatctt gctgtcattg agaatcaagg cagcatggat gccctgaaca tggacacagt 3001 atgcaggctg tatgaagtgg gcaggcagcg tctggcagag gatgaggatg aagaggagga 3061 ccaggaagag gaagaacagg aggaagaaga tgatgatgaa gatgatgatg acaccgatga 3121 tttagatgag cttgacactg accagttgct ggaggcggag ttggaggagg acgacaataa 3181 tgagaacgca ggggaagatg gggacaatga cttctctccc tctgatgagg agctagcaaa 3241 ccttctagag gagggagagg acggggagga tgaagactct gatgcagatg aggaggtgga 3301 actgatcctg ggggacactg acagctctga caactctgat ttggaagatg acatcatctt 3361 atctctgaat gagtgaggag ccatcactgc ttggaagaga ttcttggcag gcgagaaact 3421 gagtcaaatg aattcagaac atattccctt ctctttctcc cagggctgtc tgtcttttaa 3481 ggagctgcat gccctgcatt cagaagatta tggcttagag 
agcctcattg gcacccgagg 3541 gtccttccag aatcaataac caccacaaaa atgacaacag ggactaggcc ctactctgca 3601 ccccccacat ccacccccca ccatttccta ggatggacat atctttcaag gggaaaaaaa 3661 aaccatgtct ctggggtatt tcacaatata ttttctttgt atagtgttct ttacttaaag 3721 aacaagaaat agttttttat aaaactttaa aaaggaaaaa aaaacaggca tttataactg 3781 agggtatgaa cttatatcca ccaggtctct tgtccctgca cttcattttc tttggaagaa 3841 aatgatgtct aaagaacaat atagaggcat ttttacatac gtatttaaat gaaaaggaaa 3901 atcgtggttt cttaaattga taaggattaa gaatatttta ttataaatat aatatatgat 3961 tttttaacct gttttgttgc ctcatatgct gtcaggttaa tttgttttcc ttcgtgccag 4021 aggtggggag gaaggcactc tgtctgctgg gtaaatgcct aaattcactc accttcatgg 4081 tttgggggca gcatggtcat tgtggatatt ggttttgtgg agttgaggga acttaggata 4141 taagttcact ccctctattt ttctttgtga ttcagttttt caaaaatctt tttttcttcc 4201 ctttctcccc attgtggaaa ttacaaatca aaggcctttt tctttaatgt aaagtgtatt 4261 tatttaaaaa aaatacaaaa taaactacaa gtctgtcttt gttaaaaaaa aaaaaaaaaa 4321 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa //