LOCUS OHVCGC 3323 bp DNA circular VRL 04-MAY-1994 DEFINITION Woodchuck hepatitis virus (WHV), complete genome, clone WHV 59. ACCESSION M19183 VERSION M19183.1 KEYWORDS DNA polymerase; WHc protein; WHsAg protein; coat protein; complete genome; core protein; envelope-associated protein; origin of replication; polymerase. SOURCE Woodchuck hepatitis virus ORGANISM Woodchuck hepatitis virus Viruses; Riboviria; Pararnavirae; Artverviricota; Revtraviricetes; Blubervirales; Hepadnaviridae; Orthohepadnavirus. REFERENCE 1 (bases 1 to 3323) AUTHORS Cohen,J.I., Miller,R.H., Rosenblum,B., Denniston,K., Gerin,J.L. and Purcell,R.H. TITLE Sequence comparison of woodchuck hepatitis virus replicative forms shows conservation of the genome JOURNAL Virology 162 (1), 12-20 (1988) PUBMED 3336938 COMMENT Original source text: Woodchuck hepatitis virus DNA, clone WHV 59, from a carrier woodchuck trapped in Pennsylvania. FEATURES Location/Qualifiers source 1..3323 /organism="Woodchuck hepatitis virus" /mol_type="genomic DNA" /db_xref="taxon:35269" CDS join(2427..3323,1..1758) /codon_start=1 /product="DNA polymerase" /protein_id="AAA46763.1" /translation="MHPFSRLFRNIQSLGEEEVQELLGPPEDALPLLAGEDLNHRVAD ALNLHLPTADLQWVHKTNAITGLYSNQAAQFNPHWIQPEFPELHLHNDLIQKLQQYFG PLTINEKRKLQLNFPARFFPKATKYFPLIKGIKNNYPNFALEHFFATANYLWTLWEAG ILYLRKNQTTLTFKGKPYSWEHRQLVQHNGQQHKSHLQSRQNSSMVACSGHLLHNHLP SEPVSVSTRNLSNNISDKSQKSTRTGLCSYKQVQTDRLEHLARISCGSKITIGQQGSS PKTSYKSISSNFRNQTWAYNSSRNSGHTTWFSSASNSNKSRSREKAYSSNSTSQRYSP PLNYEKSDFSSPGVRGRITRLDNNGTLPQCLWRSFYNTKPCGSYCIHHIVSSLDDWGP CTVTGDVTIKSPRTPRRITGGVFLVDKNPNNSSESRLVVDFSQFSRGHTRVHWPKFAV PNLQTLANLLSTNLQWLSLDVSAAFYHIPISPAAVPHLLVGSPGLERFNTCMSSSTHN GNDSQLQTMHALCTRHVYSSLLLLFKTYGRKLHLLAHPFIMGFRKLPMGVGLSPFLLA QFTSAIASMVRRNFPHCVVFAYMDDLVLGARTSEHLTAIYSHICSVFLDLGIHLNVNK TKWWGNHLHFMGYVITSSGVLPQDKHVKKLSRYLRSVPVNQPLDYKICERLTGILNYV APFTLCGYAALMPLYHAIASRTAFIFSSLYKSWLLSLYEELWPVVRQRGVVCTVFADA TPTGWGIATTCQLLSGTFAFPLPIATAELIAACLARCWTGARLLGTDNSVVLSGKLTS FPWLLACVANWILRGTSFCYVPSALNPADLPSRGLLPVLRPLPRLRLRPQTSRISLWA ASPPVSPRRPVRVAWSSPVQTCEPWIPP" CDS join(2992..3323,1..964) /note="precursor" /codon_start=1 /product="surface protein" /protein_id="AAA46762.1" /translation="MGNNIKVTFNPDKIAAWWPAVGTYYTTTYPQNQSVFQPGIYQTT SLINPKNQQELDSVLINRYKQIDWNTWQGFPVDQKLPLVSRDPPLKPHINQSAQTFEI KPGPIIVPGIRDIPRGLVPPQTPTNRDQGRKPTPPTPPLRDTHPHLTMKNQTFRLQGF VDGLRDLTTTERYHNAYGDPFTTLSPVVPTVSTILSPPSTTGDPALSPEMSPSSLLGL LAGLQVVYFLWTKILTIAQNLDWWWTSLSFPGGIPECTGQNSQFQTCKHLPTSCPPTC NGFRWMYLRRFIIYLLVLLLCLIFLLVLLDWKGLIPVCPLQPTTETTVNCRQCTLSVQ DTYTPPYCCCLKPTAGNCTCWPIPSSWALGNYLWEWALARFSWLNLLVPLLQWLGGIS LIAWFLLIWMIWFWGPALLSILPPFIPIFVLFFLIWVYI" sig_peptide join(2992..3323,1..295) /note="surface protein" mat_peptide 296..961 /product="surface protein" CDS 1503..1928 /codon_start=1 /product="X protein" /protein_id="AAA46764.1" /translation="MAARLCCQLDSARDVLLLRPFGPQSSGPPFPRPAAGSAASSTSS PSPSDESDLPLGRLPACFASASGPCCLVFTCADLRTMDSTVNFVSWHAKRQLGMPSKD LWTPYIKDQLLTKWEEGSIDPRLSIFVLGGCRHKCMRLL" CDS 2021..2587 /codon_start=1 /product="core protein" /protein_id="AAA46765.1" /translation="MDIDPYKEFGSSYQLLNFLPLDFFPDLNALVDTATALYEEELTG REHCSPHHTAIRQALVCWDELTKLIAWMSSNITSEQVRTIIVNHVNDTWGLKVRQSLW FHLSCLTFGQHTVQEFLVSFGVWIRTPAPYRPPNAPILSTLPEHTVIRRRGGARASRS PRRRTPSPRRRRSQSPRRRRSQSPSANC" BASE COUNT 834 a 824 c 668 g 997 t ORIGIN 1 aattcgggac ataccacgtg gtttagttcc gcctcaaact ccaacaaatc gagatcaagg 61 gagaaagcct actcctccaa ctccacctct cagagatact cacccccact taactatgaa 121 aaatcagact tttcgtctcc aggggttcgt ggacggatta cgagacttga caacaacgga 181 acgctaccac aatgcctatg gagatccttt tacaacacta agccctgtgg ttcctactgt 241 atccaccata ttgtctcctc cctcgacgac tggggaccct gcactgtcac cggagatgtc 301 accatcaagt ctcctaggac tcctcgcagg attacaggtg gtgtatttct tgtggacaaa 361 aatcctaaca atagctcaga atctagattg gtggtggact tctctcagtt ttccaggggg 421 cataccagag tgcactggcc aaaattcgca gttccaaact tgcaaacact tgccaacctc 481 ctgtccacca acttgcaatg gctttcgttg gatgtatctg cggcgtttta tcatatacct 541 attagtcctg ctgctgtgcc tcatcttctt gttggttctc ctggactgga aaggtttaat 601 acctgtatgt cctcttcaac ccacaacgga aacgacagtc aattgcagac aatgcacgct 661 ctctgtacaa gacacgtata ctcctcctta ctgttgttgt ttaaaaccta cggcaggaaa 721 ttgcacttgc tggcccatcc cttcatcatg ggctttagga aattacctat gggagtgggc 781 cttagcccgt ttctcttggc tcaatttact agtgccattg cttcaatggt taggaggaat 841 ttccctcatt gcgtggtttt tgcttatatg gatgatttgg ttttgggggc ccgcacttct 901 gagcatctta ccgccattta ttcccatatt tgttctgttt ttcttgattt gggtatacat 961 ttgaatgtca ataaaacaaa atggtggggc aatcatttac atttcatggg atatgtgatt 1021 actagttcag gtgtattgcc acaagacaaa catgttaaga aactttcccg ttatttgcgc 1081 tctgttcctg ttaatcaacc tctagattac aaaatttgtg aaagattgac tggtattctt 1141 aactatgttg ctccttttac gctgtgtgga tatgctgctt taatgcctct gtatcatgct 1201 attgcttccc gtacggcttt cattttctcc tccttgtata aatcctggtt actgtctctt 1261 tatgaggagt tgtggcccgt tgtcaggcaa cgtggcgtgg tgtgcactgt gtttgctgac 1321 gcaaccccca ctggctgggg cattgccacc acctgtcaac tcctttccgg gactttcgct 1381 ttccccctcc ctatcgccac ggcggaactc atcgccgcct gccttgcccg ctgctggaca 1441 ggggctcggc tgttgggcac tgacaattcc gtggtgttgt cggggaagct gacgtccttt 1501 ccatggctgc tcgcctgtgt tgccaactgg attctgcgcg ggacgtcctt ctgctacgtc 1561 ccttcggccc tcaatccagc ggacctccct tcccgcggcc tgctgccggt tctgcggcct 1621 cttccacgtc ttcgccttcg ccctcagacg agtcggatct ccctttgggc cgcctccccg 1681 cctgtttcgc ctcggcgtcc ggtccgtgtt gcttggtctt cacctgtgca gacttgcgaa 1741 ccatggattc caccgtgaac tttgtctcct ggcatgcaaa acgtcaactt ggcatgccca 1801 gtaaggacct ttggactcct tatataaaag atcaattatt aactaaatgg gaggagggca 1861 gcattgatcc tagattatca atatttgtat taggaggctg taggcataaa tgcatgcgac 1921 ttctgtaacc atgtatcttt ttcacctgtg ccttgttttt gcctgtgttc catgtcctac 1981 ttttcaagcc tccaagctgt gccttggatg gctttggggc atggacatag atccctataa 2041 agaatttggt tcatcttatc agttgttgaa ttttcttcct ttggacttct ttcctgacct 2101 taatgctttg gtggacactg ctactgcctt gtatgaagaa gagctaacag gtagggaaca 2161 ttgctctccg caccatacag ctattagaca agctttagta tgctgggatg aattaactaa 2221 attgatagct tggatgagct ctaacataac ttctgaacaa gtaagaacaa ttatagtaaa 2281 tcatgtcaat gatacctggg gacttaaggt gagacaaagt ttatggtttc atttgtcatg 2341 tctcactttt ggacaacata cagttcaaga atttttagta agttttggag tatggatcag 2401 aactccggct ccttatagac ctcctaatgc acccattctc tcgactcttc cggaacatac 2461 agtcattagg agaagaggag gtgcaagagc ttctaggtcc cccagaagac gcactccctc 2521 tcctcgcagg agaagatctc aatcaccgcg tcgcagacgc tctcaatctc catctgccaa 2581 ctgctgatct tcaatgggta cataaaacta atgctattac aggtctttac tctaaccaag 2641 ctgctcagtt caatccgcat tggattcaac ctgagtttcc tgaacttcat ttacataatg 2701 atttaattca aaaattgcaa cagtattttg gtcctttgac tattaatgaa aagagaaaat 2761 tgcaattaaa ttttcctgcc agatttttcc ccaaagctac taaatatttc cctttaatca 2821 aaggcataaa aaacaattat cctaattttg ctttagaaca tttctttgct accgcaaatt 2881 atttgtggac tttatgggaa gcaggaattt tgtatttaag gaagaatcaa acaactttga 2941 cttttaaagg taaaccatat tcttgggaac acagacagct agtgcaacat aatgggcaac 3001 aacataaaag tcaccttcaa tccagacaaa atagcagcat ggtggcctgc agtgggcact 3061 tattacacaa ccacttaccc tcagaaccag tcagtgtttc aaccaggaat ttatcaaaca 3121 acatctctga taaatcccaa aaatcaacaa gaactggact ctgttcttat aaacaggtac 3181 aaacagatag attggaacac ttggcaagga tttcctgtgg atcaaaaatt accattggtc 3241 agcagggatc ctcccctaaa acctcatata aatcaatcag ctcaaacttt cgaaatcaaa 3301 cctgggccta taatagttcc cgg //