LOCUS X05592 2206 bp DNA linear VRL 19-MAR-1997 DEFINITION Human spumaretrovirus DNA with 3' LTR. ACCESSION X05592 Y00070 VERSION X05592.1 KEYWORDS bel1 gene; bel2 gene; bel3 gene; long terminal repeat; overlapping genes; unidentified reading frame. SOURCE Human spumaretrovirus ORGANISM Human spumaretrovirus Viruses; Ortervirales; Retroviridae; Spumaretrovirinae; Spumavirus. REFERENCE 1 (bases 1 to 2206) AUTHORS Flugel R.M., Rethwilm A., Maurer B., Darai G. TITLE Nucleotide sequence analysis of the env gene and its flanking regions of the human spumaretrovirus reveals two novel genes JOURNAL EMBO J. 6(7), 2077-2084(1987). PUBMED 2820721 COMMENT Recombinant clones that represent the 3' part of the genome of the human spumaretrovirus (foamy virus) were established from cDNA (clone C55) and viral DNA (clone B52). See X05591 for C55 sequence. There are 12 octamer sequences, 6 nonamers and one decamer that form direct repeats within the LTR. The indirect repeats include a decamer with a loop size of 78bp and two octamers with a relatively small loop size of 34 and 5bp, respectively. Data kindly reviewed (02-SEP-1987) by Fluegel R.M. See U21247 which contains the corrected nucleotide sequence. FEATURES Location/Qualifiers source 1..2206 /organism="Human spumaretrovirus" /mol_type="genomic DNA" /clone="C55" /cell_type="HSRV-infected HEL fibroblasts" /db_xref="taxon:11963" CDS <25..>1116 /codon_start=1 /note="unidentified reading frame; no ATG; put. bel-2 gene product (AA 1-364)" /db_xref="GOA:P89873" /db_xref="InterPro:IPR004956" /db_xref="UniProtKB/Swiss-Prot:P89873" /protein_id="CAA29088.1" /translation="NINPSEELLDLLCLLCINIKETVLLLNLLPVAQKVQNQDLGTIL SFAVTCLKSITSLGRNDPGDDPSIMSHVLPVVTPWSMSHDHYAPTLFGILDRYYQGYL KSPATYQTWKFTCQVDPSGKRFMETQFWVPPLGQVNIQFYKNYQILTCCQAVDPFANI FHGTDEEMFDIDSGPDVWCSPSLCFKVIYEGAMGQKQEQKTWLCRLGHGHRMGACDYR KVDLYAMRQGKENPYGDRGDAALQYAYQVKRGCKAGCLASPVLNYKALQFHRTIMADF TNPRIGEGHLAHGYQAAMEAYGPQRGSNEERVWWNVTRNQGKQGGEYYREGGEEPHYP NTPAPHRRTWDERHKVLKLSSFATPSDIQR" misc_feature 42..43 /note="pot. splice acceptor site" CDS <701..>1202 /codon_start=1 /note="unidentified reading frame; bel-3 gene product" /db_xref="GOA:P14355" /db_xref="UniProtKB/Swiss-Prot:P14355" /protein_id="CAA29089.1" /translation="DKEKKTLMEIGVMQLCNMLIRLKGAVKQGAWHHLYLTTKLCSFI EPLWQTSPILGLEKDILLMVTKQLWKLMDLREEVTRRGCGGMSLETRENKEESITGKE VKNLITQILLLLIDVPGMRDTRFLNCPHSLLPLTSNAELLKHCLMAGKWSPKAEMIIL AAERSEH" misc_feature 923..946 /note="polypurine tract 1" misc_feature 947..2206 /note="long terminal repeat" misc_feature 947..1862 /note="U3 region" misc_feature 1028..1044 /note="polypurine tract 2" regulatory 1833..1837 /note="put. TATA-box" /regulatory_class="promoter" misc_feature 1863..2051 /note="R region" misc_feature 2027..2032 /note="pot. polyA signal" misc_feature 2052..2206 /note="U5 region" BASE COUNT 691 a 406 c 484 g 625 t ORIGIN 1 atcctgaagt tggaatttgg gtaaaatata aacccctcag aggaattgtt ggatctgctg 61 tgtttattat gcataaacat caaagaaact gttctcttgt taaaccttct accagttgct 121 cagaaggtcc aaaaccaaga cctaggcacg atcctgtcct tcgctgtgac atgtttgaaa 181 agcatcacaa gcctcggcag aaacgaccca ggagacgatc catcgataat gagtcatgtg 241 cttccagtag tgacaccatg gtcaatgagc catgatcact atgcaccaac cctctttgga 301 atccttgacc gctactatca gggctacttg aagagtccag caacctacca aacttggaag 361 ttcacatgtc aggtggaccc ttctgggaag aggtttatgg agactcaatt ttgggtcccc 421 cctctgggtc aggtgaacat tcagttttat aagaattatc agattctaac ttgctgtcag 481 gctgtagacc catttgctaa tatttttcat ggtactgatg aagaaatgtt tgacattgat 541 tcaggtcctg atgtttggtg ttctccctct ttgtgtttca aggtaattta tgaaggggca 601 atgggccaaa agcaagaaca aaaaacatgg ctgtgcagac taggacatgg tcatcgtatg 661 ggagcatgcg attaccgtaa agtagatctg tatgcaatga gacaaggaaa agaaaaccct 721 tatggagata ggggtgatgc agctttgcaa tatgcttatc aggttaaaag gggctgtaaa 781 gcagggtgct tggcatcacc tgtacttaac tacaaagctt tgcagtttca tagaaccatt 841 atggcagact tcaccaatcc taggattgga gaaggacatc ttgctcatgg ttaccaagca 901 gctatggaag cttatggacc tcagagagga agtaacgagg agagggtgtg gtggaatgtc 961 actagaaacc agggaaaaca aggaggagag tattacaggg aaggaggtga agaacctcat 1021 tacccaaata ctcctgctcc tcatagacgt acctgggatg agagacacaa ggttcttaaa 1081 ttgtcctcat tcgctactcc ctctgacatc caacgctgaa ctactaaagc attgccttat 1141 ggctggaaag tggtcaccga aagcggaaat gattatacta gccgcagaaa gatcagaaca 1201 ttgacagaga tgactcagga tgaaattaga aaaaggtggg aaagtggata ttgtgacccc 1261 ttcattgact caggaagtga ctcagatgga cccttctaaa agccacagac agtaaaaatg 1321 tgttagcact ttatacaata ttatatctgc ttaagctata gaagctttca catactcagt 1381 agctgtttca caatcaacaa aacaatgatg atgtaatcat aaggaagtag tttaaaatag 1441 gttaagtaag tttactgcag tagataatcc ctggggagga tctggctctg taagctggaa 1501 cagcaatgtt ttcagttcca atcctctcaa aggagaacca gagggatgat gtgttagttc 1561 aaatcccatt atcctcatgg ttcccttttc catagtttac tatattaatt taaggataag 1621 gtataaggat taaggtatga ggtgtgtggc tcaacacgta gggtgacaag aaaatctact 1681 gtaataggac acaacacctc taaagttgcc cgtgggaagg tgaagtgaga tcgaatcttt 1741 ccttaacgca gacagctttt tatccactag ggataatgtt ttaaggaata ctatagtaat 1801 agattgatag ttttaacaat gatagaaata gtatataagg atagtttcta gattgtacgg 1861 gaggctcttc actactcgct gcgtcgagag tgtacgagac tctccaggtt tggtaagaaa 1921 tattttatat tgttataatg ttactatgat ccattaacac tctgcttata gattgtaagg 1981 gtgattgcaa tgctttctgc ataaaacttt ggttttcttg ttaatcaata aaccgacttg 2041 attcgagaac caactcctat attattgtct cttttatact ttattaagta aaaggatttg 2101 tatattagcc ttgctaaggg agacatctag tgatataagt gtgaactaca cttatcttaa 2161 atgatgtaac tccttaggat aatcaatata caaaattcca tgacaa //