LOCUS FR720313 5142 bp DNA circular VRL 12-NOV-2010 DEFINITION BK polyomavirus complete genome, isolate RU7. ACCESSION FR720313 VERSION FR720313.1 KEYWORDS complete genome. SOURCE Human polyomavirus 1 ORGANISM Human polyomavirus 1 Viruses; Polyomaviridae; Betapolyomavirus. REFERENCE 1 (bases 1 to 5142) AUTHORS Shevtsov A. JOURNAL Submitted (05-NOV-2010) to the INSDC. Shevtsov A., National Scientific Laboratory of Collective Use, National Center for Biotechnology of Republic of Kazakhstan, Valikhanov str., 13/1, Astana 010000, Republic of Kazakhstan. REFERENCE 2 AUTHORS Shevtsov A., Seydalina A., Gribanov O., Gorbatenko E., Momynaliev K. TITLE BK polyomavirus DNA, complete genome, isolate RU7 JOURNAL Unpublished. FEATURES Location/Qualifiers source 1..5142 /organism="Human polyomavirus 1" /mol_type="genomic DNA" /country="Russia" /db_xref="taxon:1891762" CDS 268..468 /product="agnoprotein" /db_xref="GOA:Q65614" /db_xref="InterPro:IPR002643" /db_xref="UniProtKB/TrEMBL:Q65614" /protein_id="CBX88299.1" /translation="MVLRQLSRQASVKLGKTWTGTKKRAQRIFIFILELLLEFCRGED SVDGKNKSTTALPAVKDSVKDS" CDS 504..1559 /product="VP2" /db_xref="GOA:Q068X9" /db_xref="InterPro:IPR001070" /db_xref="UniProtKB/TrEMBL:Q068X9" /protein_id="CBX88300.1" /translation="MGAALALLGDLVASVSEAAAATGFSVAEIAAGEAAAAIEVQIAS LATVEGITSTSEAIAAIGLTPQTYAVIAGAPGAIAGFAALIQTVTGISSLAQVGYRFF SDWDHKVSTVGLYQQSGMALELFNPDEYYDILFPGVNTFVNNIQYLDPRHWGPSLFAT ISQALWHVIRDDIPAITSQELQRRTERFFRDSLARFLEETTWTIVNAPINFYNYIQEY YSDLSPIRPSMVRQVAEREGTRVHFGHTYSIDDADSIEEVTQRMDLRNQQTVHSGEFI EKTIAPGGANQRTAPQWMLPLLLGLYGTVTPALEAYEDGPNKKKRRVSRGSSQKAKGT RASAKTANKRRSRSSRS" CDS 861..1559 /product="VP3" /db_xref="GOA:Q068X8" /db_xref="InterPro:IPR001070" /db_xref="UniProtKB/TrEMBL:Q068X8" /protein_id="CBX88301.1" /translation="MALELFNPDEYYDILFPGVNTFVNNIQYLDPRHWGPSLFATISQ ALWHVIRDDIPAITSQELQRRTERFFRDSLARFLEETTWTIVNAPINFYNYIQEYYSD LSPIRPSMVRQVAEREGTRVHFGHTYSIDDADSIEEVTQRMDLRNQQTVHSGEFIEKT IAPGGANQRTAPQWMLPLLLGLYGTVTPALEAYEDGPNKKKRRVSRGSSQKAKGTRAS AKTANKRRSRSSRS" CDS 1444..2532 /product="VP1" /db_xref="GOA:E5BBC3" /db_xref="InterPro:IPR000662" /db_xref="InterPro:IPR011222" /db_xref="InterPro:IPR036931" /db_xref="UniProtKB/TrEMBL:E5BBC3" /protein_id="CBX88302.1" /translation="MAPTKRKGECPGAAPKKPKEPVQVPKLLIKGGVEVLEVKTGLDA ITEVECFLNPEMGDPDENLRGFSLKLSAENDFSSDSPERKMLPCYSTARIPLPNLNED LTCGNLLMWEAVTVQTEVIGITSMLNLHAGSQKVHEHGGGKPIQGSNFHFFAVGGDPL EMQGVLMNYRTKYPEGTITPKNPTAQSQVMNTDHKAYLDKNNAYPVECWIPDPTRNEN TRYFGTFTGGENVPPVLHITNTATTVLLDEQGVGPLCKADSLYVSAADICGLFTNSSG TQQWRGLARYFKIRLRKRSVKNPYPISFLLSDLINRRTQRVDGQPMYGMESQVEEVRV FDGTEKLPGDPDMIRYIDKQGQLQTKML" CDS complement(join(2602..4446,4791..5033)) /product="large T antigen" /db_xref="GOA:Q0PCN5" /db_xref="InterPro:IPR001623" /db_xref="InterPro:IPR003133" /db_xref="InterPro:IPR010932" /db_xref="InterPro:IPR014015" /db_xref="InterPro:IPR016392" /db_xref="InterPro:IPR017910" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR036869" /db_xref="UniProtKB/TrEMBL:Q0PCN5" /protein_id="CBX88303.1" /translation="MDKVLNREESMELMDLLGLERAAWGNLPLMRKAYLRKCKEFHPD KGGDEDKMKRMNTLYKKMEQDVKVAHQPDFGTWSSSEVPTYGTEEWESWWSSFNEKWD EDLFCHEDMFASDEEATADSQHSTPPKKKRKVEDPKDFPSDLHQFLSQAVFSNRTLAC FAVYTTKEKAQILYKKLMEKYSVTFISRHMCAGHNIIFFLTPHRHRVSAINNFCQKLC TFSFLICKGVNKEYLLYSALTRDPYHTIEESIQGGLKEHDFNPEEPEETKQVSWKLIT EYAVETKCEDVFLLLGMYLEFQYNVEECKKCQKKDQPYHFKYHEKHFANATIFAESKN QKSICQQAVDTVLAKKRVDSLHMTREEMLTERFNHILDKMDLIFGAHGNAVLEQYMAG VAWLHCLLPKMDSVIFDFLHCIVFNVPKRRYWLFKGPIDSGKTTLAAGLLDLCGGKAL NVNLPMERLTFELGVAIDQYMVVFEDVKGTGAESKDLPSGHGINNLDSLRDYLDGSVK VNLEKKHLNKRTQIFPPGLVTMNEYPVPKTLQARFVRQIDFRPKIYLRKSLQNSEFLL EKRILQSGMTLLLLLIWFRPVADFATDIQSRIVEWKERLDSEISMYTFSRMKYNICMG KCILDITREEDSETEDSGHGSSTESQSQCSSQVSDTSAPAEDSQRSDPHSQELHLCKG FQCFKRPKTPPPK" CDS complement(4515..5033) /product="small T antigen" /db_xref="InterPro:IPR001623" /db_xref="InterPro:IPR003354" /db_xref="InterPro:IPR036092" /db_xref="InterPro:IPR036869" /db_xref="UniProtKB/TrEMBL:Q3C215" /protein_id="CBX88304.1" /translation="MDKVLNREESMELMDLLGLERAAWGNLPLMRKAYLRKCKEFHPD KGGDEDKMKRMNTLYKKMEQDVKVAHQPDFGTWSSSEVCADFPLCPDTLYCKEWPICS KKPSVHCPCMLCQLRLRHLNRKFLRKEPLVWIDCYCIDCFTQWFGLDLTEETLQWWVQ IIGETPFRDLKL" BASE COUNT 1552 a 1014 c 1002 g 1574 t ORIGIN 1 gcctcggcct cttatatatt ataaaaaaaa aggccacagg gaggagctgc ttacccatgg 61 aatgcagcca aaccatgacc tcaggaagga aagtgcatga ctgggcagcc agccagtggc 121 agttaatagt gaaaccccgc ccctgaaatt ctcaaataaa cacaagagga agtggaaact 181 ggccaaagga gtggaaagca gccagacaga catgttttgc gggcctagga atcttggcct 241 tgtccccagt taaactggac aaaggccatg gttctgcgcc agctgtcacg acaagcttct 301 gtgaaacttg gtaaaacctg gactggaaca aaaaaaagag ctcagaggat ttttattttt 361 attttagagc ttttgctgga attttgtaga ggtgaagaca gtgtagacgg gaaaaacaaa 421 agtaccactg ctttacctgc tgtaaaagac tctgtaaaag actcctaggt aagtaatccc 481 tttttttttg tatttccagg ttgatgggtg ctgctctagc acttttgggg gacctagttg 541 ccagtgtatc tgaggctgct gctgccacag gattttcagt ggctgaaatt gctgctgggg 601 aggctgctgc tgctatagaa gttcaaattg catcccttgc tactgtagag ggcataacaa 661 gtacctcaga ggctatagct gctataggcc taactcctca aacatatgct gtaattgctg 721 gtgctcctgg ggctattgct gggtttgctg ctttaattca aactgttact ggtattagtt 781 ccttggctca agtagggtat aggtttttta gtgattggga tcacaaagtt tccactgtag 841 gcctctatca gcaatcaggc atggctttgg aattgtttaa cccagatgag tactatgata 901 tattgtttcc tggtgtaaat acttttgtaa ataatattca ataccttgat cctaggcatt 961 ggggtccttc tttgtttgct actatttctc aggctttgtg gcatgttatt agggatgata 1021 tacctgctat aacctcacaa gaattgcaaa gaagaacaga aagatttttt agagactcct 1081 tggctagatt tttggaggaa actacctgga caattgtaaa tgcccctata aacttttata 1141 attatattca agaatattat tctgatcttt cccctattag gccctcaatg gttagacaag 1201 tagctgaaag ggaaggtacc cgtgtacatt ttggccatac ttatagtata gatgatgctg 1261 acagtataga agaagttaca caaagaatgg acttaagaaa tcaacaaact gtacattcag 1321 gagagtttat agaaaaaact attgccccag gaggtgctaa tcaaagaact gctcctcaat 1381 ggatgttgcc tttacttcta ggcctgtacg ggactgtaac acctgctctt gaagcatatg 1441 aagatggccc caacaaaaag aaaaggagag tgtccagggg cagctcccaa aaagccaaag 1501 gaacccgtgc aagtgccaaa actgctaata aaaggaggag tagaagttct agaagttaaa 1561 actgggctag atgctataac agaggtagaa tgcttcctaa acccagaaat gggggatcca 1621 gatgaaaacc ttaggggctt tagtctaaag ctaagtgctg aaaatgactt tagcagtgat 1681 agcccagaaa gaaaaatgct tccctgttac agcacagcaa gaattcccct ccccaattta 1741 aatgaggacc taacctgtgg aaatctactg atgtgggagg ctgtaacagt acaaacagag 1801 gtcattggaa taactagcat gcttaacctt catgcagggt cacaaaaagt gcatgagcat 1861 ggtggaggta aacctattca aggcagtaat ttccactttt ttgctgttgg tggagacccc 1921 ttggaaatgc agggagtgct aatgaattac aggacaaagt acccagaagg tactataacc 1981 ccaaaaaacc caacagccca gtcccaagta atgaatactg accataaggc ctatttggac 2041 aaaaacaatg cttatccagt tgagtgctgg attcctgatc ccaccagaaa tgaaaatact 2101 aggtattttg ggactttcac aggaggggaa aatgttcctc cagttcttca tataacaaac 2161 actgctacca cagtgttgct agatgaacag ggtgtggggc ctctttgtaa agctgatagc 2221 ctgtatgttt cagctgctga tatttgtggc ctgtttacta acagctctgg aacacaacag 2281 tggagaggcc ttgcaagata ttttaagatt cgcctgagaa aaagatctgt aaaaaatcct 2341 tacccaattt cctttttgct aagtgacctt ataaacagga gaacccagag agtggatggg 2401 cagcctatgt atggtatgga atcccaggta gaagaggtta gggtgtttga tggcacagaa 2461 aaacttccag gggacccaga tatgataaga tatattgaca aacaaggaca attgcaaacc 2521 aaaatgcttt aaacaggtgc ttttattgtt gatatacatt taataaatgc tgcttttgta 2581 taagccagtt ttaagcttgt gttattttgg gggtggtgtt ttaggccttt taaaacactg 2641 aaagccttta cacaaatgca actcttgact atgggggtct gacctttggg aatcttcagc 2701 aggggctgaa gtatctgaga cttgggaaga gcattgtgat tgggattcag tgcttgatcc 2761 atgtccagag tcttcagttt ctgaatcttc ttctcttgtg atatcaagaa tacattttcc 2821 catgcatata ttatatttca tccttgaaaa agtatacata cttatctcag aatccagcct 2881 ttccttccat tcaacaattc tagattgtat atctgttgca aaatcagcta caggcctaaa 2941 ccaaattagc agtagcaaca aggtcattcc actttgtaaa attctttttt caagtaagaa 3001 ctctgagttt tgtaaggatt ttcttaaata tattttgggc ctaaaatcta tctgtcttac 3061 aaatctagcc tgcagggttt tagggacagg atactcattc attgtaacca ggcctggtgg 3121 aaatatttgg gttcttttgt ttaaatgttt cttttctaaa ttaaccttaa cacttccatc 3181 taaataatct ctcaaactgt ctaaattgtt tattccatgt cctgaaggca aatcctttga 3241 ttcagctcct gttcctttta catcttcaaa aacaaccatg tactgatcta tagctacacc 3301 tagttcaaag gttagccttt ccatgggtag gtttacattt aaagctttac ctccacacaa 3361 atctaataac cctgcagcta gtgttgtttt tccactatca atgggacctt taaataacca 3421 gtatcttctt ttaggtacat taaaaacaat acagtgcaaa aaatcaaata taacagaatc 3481 cattttaggt agcaaacagt gcagccaagc aacacctgcc atatattgtt ctaatacagc 3541 atttccatga gccccaaata ttaaatccat tttatctaat atatgattaa atctttctgt 3601 tagcatttct tctctagtca tatgaaggct atctactctt tttttagcta aaactgtatc 3661 tactgcttgc tgacaaatac ttttttgatt tttactttct gcaaagatag tagcatttgc 3721 aaaatgcttt tcatgatact taaagtgata aggttggtct tttttctgac actttttaca 3781 ctcctctaca ttgtattgaa attctaaata catacctaat aataaaaaca catcctcaca 3841 ctttgtttct actgcatact cagtaattaa tttccaagag acctgctttg tttcttcagg 3901 ctcttctggg ttaaaatcat gctcctttaa gcccccttga atgctttctt ctattgtatg 3961 gtatggatct ctagttaagg cactatatag taagtattcc ttattaacac ccttacaaat 4021 taaaaaacta aaggtacaca gcttttgaca gaaattatta attgcagaaa ctctatgtct 4081 atgtggagtt aaaaagaata taatattatg cccagcacac atgtgtctac taataaaagt 4141 tacagaatat ttttccataa gttttttata cagaatttga gctttttctt tagtagtata 4201 cacagcaaag caggcgaggg ttctattact aaatacagct tgactaagaa actggtgtag 4261 atcagaagga aagtctttag ggtcttctac ctttcttttt ttcttgggtg gtgttgagtg 4321 ttgagaatct gctgttgctt cttcatcact ggcaaacata tcttcatggc aaaataaatc 4381 ttcatcccat ttttcattaa aggaactcca ccaagactcc cactcttctg ttccataggt 4441 tggcacctat aaaacaaata attacttagg gcctttaaat attttattat ttatctaaat 4501 ataaggtagt taccttaaag ctttagatct ctgaagggag tttctccaat tatttggacc 4561 caccattgca gagtttcttc agttaggtct aagccaaacc actgtgtgaa gcagtcaatg 4621 cagtagcaat ctatccaaac caagggctct tttcttaaaa attttctatt taaatgcctt 4681 aatctaagct gacatagcat gcaagggcag tgcacagaag gctttttgga acaaataggc 4741 cattccttgc agtacagggt atctgggcaa agaggaaaat cagcacaaac ctctgagcta 4801 ctccaggttc caaaatcagg ctgatgagct acctttacat cttgctccat ttttttatat 4861 aaagtattca ttctcttcat tttatcctcg tcgccccctt tgtcagggtg aaattcctta 4921 cacttcctta aataggcttt tctcattaag ggaaggtttc cccaggcagc tctttcaagg 4981 cccaaaaggt ccatgagctc catggattct tccctgttaa gcactttatc catttttgca 5041 aaaattgcaa aagaataggg atttccccca aatagttttg ctaggcctca gaaaaagcct 5101 ccacaccctt actacttgag agaaagggtg gaggcagagg cg //