LOCUS OHVCGD 3323 bp DNA circular VRL 04-MAY-1994 DEFINITION Woodchuck hepatitis virus (WHV), complete genome, clone WHV 7. ACCESSION M18752 VERSION M18752.1 KEYWORDS DNA polymerase; coat protein; complete genome; core protein; envelope-associated protein; origin of replication; polymerase. SOURCE Woodchuck hepatitis virus ORGANISM Woodchuck hepatitis virus Viruses; Riboviria; Pararnavirae; Artverviricota; Revtraviricetes; Blubervirales; Hepadnaviridae; Orthohepadnavirus. REFERENCE 1 (bases 1 to 3323) AUTHORS Cohen,J.I., Miller,R.H., Rosenblum,B., Denniston,K., Gerin,J.L. and Purcell,R.H. TITLE Sequence comparison of woodchuck hepatitis virus replicative forms shows conservation of the genome JOURNAL Virology 162 (1), 12-20 (1988) PUBMED 3336938 COMMENT Original source text: Woodchuck hepatitis virus DNA, clone WHV 7, from a carrier trapped in Maryland. FEATURES Location/Qualifiers source 1..3323 /organism="Woodchuck hepatitis virus" /mol_type="genomic DNA" /db_xref="taxon:35269" CDS join(2427..3323,1..1758) /codon_start=1 /product="DNA polymerase" /protein_id="AAA46767.1" /translation="MHPFSRLFRNIQSLGEEEVQELLGPPEDALPLLAGEDLNHRVAD ALNLHLPTADLQWVHKTNAITGLYSNQAAQFNPHWIQPEFPELHLHNDLIQKLQQYFG PLTINEKRKLQLNFPARFFPKATKYFPLIKGIKNNYPNFALEHFFATANYLWTLWEAG ILYLRKNQTTLTFKGKPYSWEHRQLVQHNGQQHKSHLQSRQNSSMVACSGHLLHNHLS SESVSVSTRNLSNNISDKSQKSTRTGLCSYKQIQTDRLEHLARISCGSKITIGQQGSS PKTLYKSISSNFRNQTWAYNSSRNSGHTTWFSSASNSNKSRSREKAYSSNSTSKRYSP PLNYEKSDFSSPGVRRRITRLDNNGTPTQCLWRSFYNTKPCGSYCIHHIVSSLDDWGP CTVTGDVTIKSPRTPRRITGGVFLVDKNPNNSSESRLVVDFSQFSRGHTRVHWPKFAV PNLQTLANLLSTNLQWLSLDVSAAFYHIPISPAAVPHLLVGSPGLERFNTCLSSSTHN RNNSQLQTMHNLCTRHVYSSLLLLFKTYGRKLHLLAHPFIMGFRKLPMGVGLSPFLLA QFTSALASMVRRNFPHCVVFAYMDDLVLGARTSEHLTAIYSHICSVFLDLGIHLNVNK TKWWGNHLHFMGYVITSSGVLPQDKHVKKISRYLRSVPVNQPLDYKICERLTGILNYV APFTLCGYAALMPLYHAITSRTAFIFSSLYKSWLLSLYEELWPVVRQRGVVCTVFADA TPTGWGIATTYQLLSGTFAFPLPIATAELIAACLARCWTGARLLGTDNSVVLSGKLTS FPWLLACVANWILRGTSFCYVPSALNPADLPSRGLLPVLRPLPRLRLRPQTSRISLWA ASPPVSPRRPVRVAWSSPVQNCEPWIPP" CDS join(2992..3323,1..964) /note="precursor" /codon_start=1 /product="surface protein" /protein_id="AAA46766.1" /translation="MGNNIKVTFNPDKIAAWWPAVGTYYTTTYPQNQSVFQPGIYQTT SLINPKNQQELDSVLINRYKQIDWNTWQGFPVDQKLPLVSRDPPPKPYINQSAQTFEI KPGPIIVPGIRDIPRGLVPPQTPTNRDQGRKPTPPTPPLRDTHPHLTMKNQTFHLQGF VDGLRDLTTTERQHNAYGDPFTTLSPAVPTVSTILSPPSTTGDPALSPEMSPSSLLGL LAGLQVVYFLWTKILTIAQNLDWWWTSLSFPGGIPECTGQNSQFQTCKHLPTSCPPTC NGFRWMYLRRFIIYLLVLLLCLIFLLVLLDWKGLIPVCPLQPTTETTVNCRQCTISAQ DMYTPPYCCCLKPTAGNCTCWPIPSSWALGNYLWEWALARFSWLNLLVPLLQWLGGIS LIAWFLLIWMIWFWGPALLSILPPFIPIFVLFFLIWVYI" sig_peptide join(2992..3323,1..295) /note="pre-surface protein" mat_peptide 296..961 /product="surface protein" CDS 1503..1928 /codon_start=1 /product="X protein" /protein_id="AAA46768.1" /translation="MAARLCCQLDSARDVLLLRPFGPQSSGPSFPRPAAGSAASSASS PSPSDESDLPLGRLPACFASASGPCCLVFTCAELRTMDSTVNFVSWHANRQLGMPSKD LWTPYIKDQLLTKWEEGSIDPRLSIFVLGGCRHKCMRLL" CDS 2021..2587 /codon_start=1 /product="core protein" /protein_id="AAA46769.1" /translation="MDIDPYKEFGSSYQLLNFLPLDFFPDLNALVDTATALYEEELTG REHCSPHHTAIRQALVCWDELTKLIAWMSSNITSEQVRTIIVNHVNDTWGLKVRQSLW FHLSCLTFGQHTVQEFLVSFGVWIRTPAPYRPPNAPILSTLPEHTVIRRRGGARASRS PRRRTPSPRRRRSQSPRRRRSQSPSANC" BASE COUNT 845 a 817 c 661 g 1000 t ORIGIN 1 aattcgggac ataccacgtg gtttagttcc gcctcaaact ccaacaaatc gagatcaagg 61 gagaaagcct actcctccaa ctccacctct aagagatact cacccccact taactatgaa 121 aaatcagact tttcatctcc aggggttcgt agacggatta cgagacttga caacaacgga 181 acgccaacac aatgcctatg gagatccttt tacaacacta agccctgcgg ttcctactgt 241 atccaccata ttgtctcctc cctcgacgac tggggaccct gcactgtcac cggagatgtc 301 accatcaagt ctcctaggac tcctcgcagg attacaggtg gtgtatttct tgtggacaaa 361 aatcctaaca atagctcaga atctagattg gtggtggact tctctcagtt ttccaggggg 421 cataccagag tgcactggcc aaaattcgca gttccaaact tgcaaacact tgccaacctc 481 ctgtccacca acttgcaatg gctttcgttg gatgtatctg cggcgtttta tcatatacct 541 attagtcctg ctgctgtgcc tcatcttctt gttggttctc ctggactgga aaggtttaat 601 acctgtctgt cctcttcaac ccacaacaga aacaacagtc aattgcagac aatgcacaat 661 ctctgcacaa gacatgtata ctcctcctta ctgttgttgt ttaaaaccta cggcaggaaa 721 ttgcacttgt tggcccatcc cttcatcatg ggctttagga aattacctat gggagtgggc 781 cttagcccgt ttctcttggc tcaatttact agtgcccttg cttcaatggt taggaggaat 841 ttccctcatt gcgtggtttt tgcttatatg gatgatttgg ttttgggggc ccgcacttct 901 gagcatctta ccgccattta ttcccatatt tgttctgttt ttcttgattt gggtatacat 961 ttgaatgtca ataaaacaaa atggtggggc aatcatctac atttcatggg atatgtgatt 1021 actagttcag gtgtattgcc acaagacaaa catgttaaga aaatttcccg ttatttgcgc 1081 tctgttcctg ttaatcaacc tctggattac aaaatttgtg aaagattgac tggtattctt 1141 aactatgttg ctccttttac gctatgtgga tacgctgctt taatgccttt gtatcatgct 1201 attacttccc gtacggcttt cattttctcc tccttgtata aatcctggtt gctgtctctt 1261 tatgaggagt tgtggcccgt tgtcaggcaa cgtggcgtgg tgtgcactgt gtttgctgac 1321 gcaaccccca ctggttgggg cattgccacc acctatcaac tcctttccgg gactttcgct 1381 ttccccctcc ctattgccac ggcggaactc attgccgcct gccttgcccg ctgctggaca 1441 ggggctcggc tgttgggcac tgacaattcc gtggtgttgt cggggaagct gacgtccttt 1501 ccatggctgc tcgcctgtgt tgccaactgg attctgcgcg ggacgtcctt ctgctacgtc 1561 ccttcggccc tcaatccagc ggaccttcct tcccgcggcc tgctgccggt tctgcggcct 1621 cttccgcgtc ttcgccttcg ccctcagacg agtcggatct ccctttgggc cgcctccccg 1681 cctgtttcgc ctcggcgtcc ggtccgtgtt gcttggtctt cacctgtgca gaattgcgaa 1741 ccatggattc caccgtgaac tttgtctcct ggcatgcaaa tcgtcaactt ggcatgccaa 1801 gtaaggacct ttggactcct tatataaaag atcaattatt aactaaatgg gaggagggca 1861 gcattgatcc tagattatca atatttgtat taggaggctg taggcataaa tgcatgcgac 1921 ttctgtaacc atgtatcttt ttcacctgtg ccttgttttt gcctgtgttc catgtcctac 1981 ttttcaagcc tccaagctgt gccttggatg gctttggggc atggacatag atccctataa 2041 agaatttggt tcatcttatc agttgttgaa ttttcttcct ttggacttct ttcctgacct 2101 taatgctttg gtggacactg ctactgcctt gtatgaagaa gagctaacag gtagggaaca 2161 ttgctctccg caccatacag ctattagaca agctttagta tgctgggatg aattaactaa 2221 attgatagct tggatgagct ctaacataac ttctgaacaa gtaagaacaa tcatagtaaa 2281 tcatgtcaat gatacctggg gacttaaggt gagacaaagt ttatggtttc atttgtcatg 2341 tctcactttt ggacaacata cagttcaaga atttttagta agttttggag tatggatcag 2401 aactccagct ccatatagac ctcctaatgc acccattctc tcgactcttc cggaacatac 2461 agtcattagg agaagaggag gtgcaagagc ttctaggtcc cccagaagac gcactccctc 2521 tcctcgcagg agaagatctc aatcaccgcg tcgcagacgc tctcaatctc catctgccaa 2581 ctgctgatct tcaatgggta cataaaacta atgctattac aggtctttac tctaaccaag 2641 ctgctcagtt caatccgcat tggattcaac ctgagtttcc tgaacttcat ttacataatg 2701 atttaattca aaaattgcaa cagtattttg gtcctttgac tataaatgaa aagagaaaat 2761 tgcaattaaa ttttcctgcc agatttttcc ccaaagctac taaatatttc cctttaatta 2821 aaggcataaa aaacaattat cctaattttg ctttagaaca tttctttgct accgcaaatt 2881 atttgtggac tttatgggaa gctggaattt tgtatttaag gaagaatcaa acaactttga 2941 cttttaaagg taaaccatat tcttgggaac acagacagct agtgcaacat aatgggcaac 3001 aacataaaag tcaccttcaa tccagacaaa atagcagcat ggtggcctgc agtgggcact 3061 tattacacaa ccacttatcc tcagaatcag tcagtgtttc aaccaggaat ttatcaaaca 3121 acatctctga taaatcccaa aaatcaacaa gaactggact ctgttcttat aaacagatac 3181 aaacagatag actggaacac ttggcaagga tttcctgtgg atcaaaaatt accattggtc 3241 agcagggatc ctcccccaaa accttatata aatcaatcag ctcaaacttt cgaaatcaaa 3301 cctgggccta taatagttcc cgg //