LOCUS BC022792 4356 bp mRNA linear HUM 19-JAN-2006
DEFINITION Homo sapiens Vpr (HIV-1) binding protein, mRNA (cDNA clone
MGC:23092 IMAGE:4853730), complete cds.
ACCESSION BC022792
VERSION BC022792.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 4356)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 4356)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (04-FEB-2002) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: Louis Staudt
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Genome Sequence Centre,
BC Cancer Agency, Vancouver, BC, Canada
info@bcgsc.bc.ca
Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson
Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen
Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel
Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave
Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth
Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao,
Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR
Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang,
Jacquie Schein, Asim Siddiqui, Steven Jones, Rob Holt, Marco Marra.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 34 Row: c Column: 19.
FEATURES Location/Qualifiers
source 1..4356
/db_xref="H-InvDB:HIT000039641"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:23092 IMAGE:4853730"
/tissue_type="Primary B-Cells from Tonsils"
/clone_lib="NIH_MGC_48"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..4356
/gene="VPRBP"
/gene_synonym="KIAA0800"
/gene_synonym="MGC102804"
/db_xref="GeneID:9730"
/db_xref="HGNC:HGNC:30911"
/db_xref="IMGT/GENE-DB:30911"
/db_xref="MIM:30911"
CDS 200..3376
/gene="VPRBP"
/gene_synonym="KIAA0800"
/gene_synonym="MGC102804"
/codon_start=1
/product="VPRBP protein"
/protein_id="AAH22792.1"
/db_xref="GeneID:9730"
/db_xref="HGNC:HGNC:30911"
/db_xref="IMGT/GENE-DB:30911"
/db_xref="MIM:30911"
/translation="MTTVVVHVDSKAELTTLLEQWEKEHGSGQDMVPILTRMSQLIEK
ETEEYRKGDPDPFDDRHPGRADPECMLGHLLRILFKNDDFMNALVNAYVMTSREPPLN
TAACRLLLDIMPGLETAVVFQEKEGIVENLFKWAREADQPLRTYSTGLLGGAMENQDI
AANYRDENSQLVAIVLRRLRELQLQEVALRQENKRPSPRKLSSEPLLPLDEEAVDMDY
GDMAVDDAEIQKSALQIIINCVCGPDNRISSIGKFISGTPRRKLPQNPKSSEHTLAKM
WNVVQSNNGIKVLLSLLSIKMPITDADQIRALACKALVGLSRSSTVRQIISKLPLFSS
CQIQQLMKEPVLQDKRSDHVKFCKYAAELIERVSGKPLLIGTDVSLARLQKADVVAQS
RISFPEKELLLLIRNHLISKGLGETATVLTKEADLPMTAASHSSAFTPVTAAASPVSL
PRTPRIANGIATRLGSHAAVGASAPSAPTAHPQPRPPQGPLALPGPSYAGNSPLIGRI
SFIRERPSPCNGRKIRVLRQKSDHGAYSQSPAIKKQLDRHLPSPPTLDSIITEYLREQ
HARCKNPVATCPPFSLFTPHQCPEPKQRRQAPINFTSRLNRRASFPKYGGVDGGCFDR
HLIFSRFRPISVFREANEDESGFTCCAFSARERFLMLGTCTGQLKLYNVFSGQEEASY
NCHNSAITHLEPSRDGSLLLTSATWSQPLSALWGMKSVFDMKHSFTEDHYVEFSKHSQ
DRVIGTKGDIAHIYDIQTGNKLLTLFNPDLANNYKRNCATFNPTDDLVLNDGVLWDVR
SAQAIHKFDKFNMNISGVFHPNGLEVIINTEIWDLRTFHLLHTVPALDQCRVVFNHTG
TVMYGAMLQADDEDDLMEERMKSPFGSSFRTFNATDYKPIATIDVKRNIFDLCTDTKD
CYLAVIENQGSMDALNMDTVCRLYEVGRQRLAEDEDEEEDQEEEEQEEEDDDEDDDDT
DDLDELDTDQLLEAELEEDDNNENAGEDGDNDFSPSDEELANLLEEGEDGEDEDSDAD
EEVELILGDTDSSDNSDLEDDIILSLNE"
BASE COUNT 1230 a 987 c 1027 g 1112 t
ORIGIN
1 ggcggagccg cctcggacga taaggaaatt gagatttgga gaagttaaat aaccaagact
61 tcacagacaa agtctcactc cttttgccca ggccggtgtg cattggggta atctcggctg
121 actgcaacct ccgtcccctc cgagttcaag tgattctcct gcctcaacct cccgagtagc
181 tgggattaca ggcaaagcca tgactacagt agtggtacat gtggactcca aagctgagct
241 cactaccctg ctggagcagt gggaaaagga acatggcagt gggcaggaca tggtacctat
301 ccttaccagg atgtctcaat tgattgaaaa agaaactgaa gagtatcgta aaggggatcc
361 agacccattt gatgatcgac atcctggtcg agctgatcca gagtgtatgc tgggccactt
421 gctgagaata ctcttcaaga atgatgattt catgaatgca ctggtgaatg catatgtgat
481 gacaagccga gagccccctt taaacactgc agcttgcaga ctcctattag acatcatgcc
541 agggctggaa actgctgtcg tctttcaaga aaaggaggga attgtcgaga atcttttcaa
601 atgggcccga gaggccgatc aaccattgag gacatattct actggactgt taggaggtgc
661 tatggaaaat caagacattg ctgccaacta tagagatgaa aattcacagc tggtggcaat
721 agtgcttcga agactgaggg agctacagct acaggaagtg gctttgcggc aggaaaacaa
781 gcgtcccagt ccacggaagc tctcttctga accccttttg cctctggatg aggaggctgt
841 ggatatggac tatggtgaca tggctgtaga tgatgctgaa attcagaagt cagcacttca
901 gattatcatc aattgtgtgt gtggcccaga taaccgaata tccagtattg gtaaatttat
961 ctctggtact cctcggagaa agctgcctca gaaccctaaa agcagtgagc acaccctggc
1021 caagatgtgg aatgtggttc agtccaacaa cggcatcaag gtgctcctgt ccttactgtc
1081 cattaagatg cccatcacag atgcagacca aatccgggcc ctggcctgca aagccctagt
1141 gggcctgtct cgcagtagca ctgtccggca gatcatcagt aaactgcccc ttttcagcag
1201 ctgccagatc cagcagctga tgaaggagcc tgtgctgcag gacaagcgca gtgaccatgt
1261 caagttctgc aagtatgctg ctgaactcat tgaacgggtg tcaggaaaac cacttctcat
1321 tggcactgat gtttccctag cacgactgca gaaagcagat gttgttgccc agtcaaggat
1381 ctccttccct gagaaagagc tgcttttgtt gatacgaaac catcttattt ctaaagggct
1441 tggagaaaca gcaaccgtgc tgacaaaaga ggctgacctg cccatgactg ctgcctccca
1501 ttcttctgcc tttaccccag tcactgctgc tgcttctcct gtctctctac cccgaacccc
1561 tcgtatcgct aatggcattg caactcgtct gggcagccat gctgctgtgg gtgcctctgc
1621 gccttctgcc cctactgctc atcctcagcc acggcccccc cagggtccgc tagctctgcc
1681 cggcccatct tatgcaggca actccccttt gattggtaga atcagtttta tcagagagag
1741 gccatcaccc tgcaatggca ggaaaatcag agtgttgcgg cagaagtcgg accatggtgc
1801 ctacagccaa agcccagcca taaaaaaaca gctggacaga catcttcctt ccccacctac
1861 gctggacagt ataatcacag agtatcttag agaacaacat gctcgctgca agaatccagt
1921 tgccacctgc ccacctttct ccctctttac tcctcaccaa tgtcctgagc caaaacagag
1981 gcggcaagcg ccaataaact ttacgtcaag gctaaaccgc agggcatcat ttccaaagta
2041 tggaggggtg gatggcggat gctttgatag gcaccttatc tttagcagat tccgtcctat
2101 ttcagtgttc cgggaagcca atgaagatga gagtggcttc acctgctgtg cattctcagc
2161 acgggagcgg ttcctgatgc ttggcacctg cacagggcag ctgaagctct ataatgtgtt
2221 tagtggacag gaggaggcca gctataactg tcacaactca gccatcacac atcttgaacc
2281 ttccagggat gggtccttgc tgctgacatc tgctacttgg agccagcctt tgtctgcact
2341 ttggggaatg aagtcagtat ttgatatgaa gcattccttc acagaagatc actatgttga
2401 gttcagtaag cactcccagg atcgggtcat cggcacaaaa ggagacattg cccacattta
2461 tgatattcag actggcaaca agctgttgac tctgtttaac ccagatcttg ccaacaacta
2521 caagaggaac tgtgccacct ttaatcctac agatgatctt gtcttaaatg atggcgtcct
2581 ctgggatgtc cgctctgcac aggccatcca caagtttgac aagttcaata tgaacatcag
2641 tggtgttttc catccaaatg gactggaggt gatcattaat actgagattt gggaccttcg
2701 aacttttcat cttttgcata ctgttcccgc tctggatcag tgtcgcgtgg tgttcaatca
2761 cacgggaaca gtgatgtatg gagctatgtt gcaggcagat gatgaagatg acttaatgga
2821 agagaggatg aaaagcccct ttgggtcatc cttccgaaca tttaatgcaa ctgactacaa
2881 acctatagca accattgatg tgaaacggaa catctttgac ctgtgtacag acaccaaaga
2941 ctgctatctt gctgtcattg agaatcaagg cagcatggat gccctgaaca tggacacagt
3001 atgcaggctg tatgaagtgg gcaggcagcg tctggcagag gatgaggatg aagaggagga
3061 ccaggaagag gaagaacagg aggaagaaga tgatgatgaa gatgatgatg acaccgatga
3121 tttagatgag cttgacactg accagttgct ggaggcggag ttggaggagg acgacaataa
3181 tgagaacgca ggggaagatg gggacaatga cttctctccc tctgatgagg agctagcaaa
3241 ccttctagag gagggagagg acggggagga tgaagactct gatgcagatg aggaggtgga
3301 actgatcctg ggggacactg acagctctga caactctgat ttggaagatg acatcatctt
3361 atctctgaat gagtgaggag ccatcactgc ttggaagaga ttcttggcag gcgagaaact
3421 gagtcaaatg aattcagaac atattccctt ctctttctcc cagggctgtc tgtcttttaa
3481 ggagctgcat gccctgcatt cagaagatta tggcttagag agcctcattg gcacccgagg
3541 gtccttccag aatcaataac caccacaaaa atgacaacag ggactaggcc ctactctgca
3601 ccccccacat ccacccccca ccatttccta ggatggacat atctttcaag gggaaaaaaa
3661 aaccatgtct ctggggtatt tcacaatata ttttctttgt atagtgttct ttacttaaag
3721 aacaagaaat agttttttat aaaactttaa aaaggaaaaa aaaacaggca tttataactg
3781 agggtatgaa cttatatcca ccaggtctct tgtccctgca cttcattttc tttggaagaa
3841 aatgatgtct aaagaacaat atagaggcat ttttacatac gtatttaaat gaaaaggaaa
3901 atcgtggttt cttaaattga taaggattaa gaatatttta ttataaatat aatatatgat
3961 tttttaacct gttttgttgc ctcatatgct gtcaggttaa tttgttttcc ttcgtgccag
4021 aggtggggag gaaggcactc tgtctgctgg gtaaatgcct aaattcactc accttcatgg
4081 tttgggggca gcatggtcat tgtggatatt ggttttgtgg agttgaggga acttaggata
4141 taagttcact ccctctattt ttctttgtga ttcagttttt caaaaatctt tttttcttcc
4201 ctttctcccc attgtggaaa ttacaaatca aaggcctttt tctttaatgt aaagtgtatt
4261 tatttaaaaa aaatacaaaa taaactacaa gtctgtcttt gttaaaaaaa aaaaaaaaaa
4321 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa
//