LOCUS       BK013462                3884 bp    RNA     linear   PHG 09-FEB-2021
DEFINITION  TPA_asm: SsRNA phage Gephyllon.3_7 genomic sequence.
ACCESSION   BK013462
VERSION     BK013462.1
KEYWORDS    Third Party Data; TPA; TPA:assembly.
SOURCE      ssRNA phage Gephyllon.3_7
  ORGANISM  ssRNA phage Gephyllon.3_7
            Viruses; Riboviria; Orthornavirae; Lenarviricota; Leviviricetes;
            Norzivirales; Solspiviridae; Intasivirus; Intasivirus
            tellurivivens.
REFERENCE   1  (bases 1 to 3884)
  AUTHORS   Stockdale,S.R., Callanan,J., Adriaenssens,E.M., Kuhn,J.H.,
            Rumnieks,J., Shkoporov,A., Draper,L.A., Ross,P. and Hill,C.
  TITLE     Leviviricetes taxonomy
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 3884)
  AUTHORS   Stockdale,S.R., Callanan,J., Adriaenssens,E.M., Kuhn,J.H.,
            Rumnieks,J., Shkoporov,A., Draper,L.A., Ross,P. and Hill,C.
  TITLE     Direct Submission
  JOURNAL   Submitted (01-SEP-2020) Gut Phageomics, APC Microbiome Ireland,
            Western Road, Cork T12YT20, Ireland
COMMENT     THIRD PARTY DATABASE: This TPA record uses data from
            DDBJ/EMBL/GenBank entry MN033114.1
            ##Assembly-Data-START##
            Assembly Method       :: metaSPAdes v. v3.11.1
            Sequencing Technology :: Illumina
            ##Assembly-Data-END##
FEATURES             Location/Qualifiers
     source          1..3884
                     /organism="ssRNA phage Gephyllon.3_7"
                     /mol_type="genomic RNA"
                     /isolation_source="soil"
                     /db_xref="taxon:2786164"
                     /geo_loc_name="USA"
                     /note="Viruses; Riboviria; Orthornavirae; Lenarviricota;
                     Leviviricetes; Norzivirales; Solspiviridae; Intasivirus;
                     Intasivirus Gephyllon.3_7"
     gene            225..1532
                     /gene="Gephyllon.3_7_1"
     CDS             225..1532
                     /gene="Gephyllon.3_7_1"
                     /codon_start=1
                     /product="maturation protein"
                     /protein_id="DAD50213.1"
                     /translation="MSFERVRTRGSLKPGPNLEAWIYNYPVCGSETFSLDFVVTSSVC
                     QVGDVATMYDTVTPDFRARQRRGEFTFSNMSSTRRVSAIDQPGNGFIRMNAAPNPPAC
                     TISGVPHYDGYMWKGAYLPQMVMNFHHVVRPAPLPVPVGIFSDSDIANMQTEVSTGVL
                     AERGQADSNLFESVAEYKETLRLFHGPISSFFRFFKKNREAMKLMGPHEAWLTYRYGI
                     RPLIQDITMVVEGLSKKVGLRRQTSRRNLTKTLTRSTQISETTFWAIVTANQLETDTV
                     TVRGMCIDEYLATLSSNIGFTAKGLITVPWELVRLSFVVDWFVTLGDFLKSYAPAPGY
                     KTLGGSLVTIRQRSYFWTATNTFQNNFETILRPVTGTCSSLYESKSRGGLSGAGVVIK
                     SDFRFASLTRVADSVALIAALVSQYFADSPVGSAIELTRLKVK"
     gene            1602..1958
                     /gene="Gephyllon.3_7_2"
     CDS             1602..1958
                     /gene="Gephyllon.3_7_2"
                     /codon_start=1
                     /product="coat protein"
                     /protein_id="DAD50214.1"
                     /translation="MTVSVNAKVYSADSFQKDIVAYNGPAKTGSVKDDLRLSRVAPKP
                     SATFSGLSRTEAKLTRTLNLTGSLTPTGDLICTISLAVPVGYTAADIDTALNDMGAFL
                     ASASFKTHVKTPQISF"
     gene            2107..>3884
                     /gene="Gephyllon.3_7_3"
     CDS             2107..>3884
                     /gene="Gephyllon.3_7_3"
                     /codon_start=1
                     /product="RNA-directed RNA polymerase"
                     /protein_id="DAD50215.1"
                     /translation="MKSKLPLSYYAARKALRLSSHEIYCEIMTELFQSYDHFGFVKSL
                     SGHFRSKRFDLALVLADSLSGAVHPDATTHFVANQFSNMIRKYPWLSDVVKTDPEGQA
                     IRTFNRFERRCKLLNRKFVLYEKLRNPISSEIRSMQDFISHVIGSEPPLETILQRSCS
                     FGAGASLGVHGNATNLRRKIHSEGWSVSPGAFTYSYWALMSDPNLRDVLLKNRSGISC
                     LDWLQSKSEYARKTRIVNYNKISFVPKTAKTHRAIAVEPLLNGFVQKGIDVFMRSCLK
                     RVGIDLSDQSLNQRLARSGSINDSEDSFATIDLSSASDSISIGLARLLLPPAWFDFLN
                     SVRSHSYELGGNVYRYHKFCSMGNGFCFPLETLIFAACCHASGCVNPGIDFSVYGDDI
                     IVRSGRSKEVLSLLKRIGFLPNVDKTFTSGPFRESCGSDWFGGVDVRPYTLDYRLDSI
                     ESLFKYLNLTRRSELTSSFFESTWSVVLNHVPVDFRFFRPFKGNADSGIDSWADQHLT
                     SPHCRYNYRQWNWTCKSLDHKAAVDSAPRDRYRSDSTDMYALLSGLSSRWTGDVEYTV
                     RRKTKTTVRLVTSSSADSGWLPPSFS"
BASE COUNT      887 a    943 c    946 g   1108 t
ORIGIN
        1 ccccccgtcg cttcttgtaa ctaagggcac ggctgctgaa ggtcaatcgt catatccgtc
       61 tactctgaaa gaggagtagc agaggatccc gatagtaata tccgggtacc tttgagactg
      121 atgatagatc tatgcaaccg tgttcctgtt acaattagct tcggggaggg tatcatctgg
      181 gctttaagag gtgcccataa aactctcaac ttaggagaca tcttttgtcg tttgaacgtg
      241 tcagaactcg cggctcgttg aaaccaggcc ccaatttgga agcttggatt tataactatc
      301 ctgtctgtgg gagcgaaact ttctccctag acttcgtagt tacctcatcc gtttgtcagg
      361 ttggggacgt cgccacgatg tacgacacgg tcactcctga ctttcgagct cgtcaacgaa
      421 gaggagaatt taccttttcg aatatgagct caactcgtcg ggtttcggca attgatcaac
      481 caggtaacgg gtttatccgc atgaacgcgg ctccaaaccc gcccgcatgt accattagtg
      541 gtgttcctca ctacgatggt tacatgtgga aaggtgccta ccttccacag atggttatga
      601 atttccatca tgtggttagg cctgcacccc tgccggttcc ggttggcatt tttagcgata
      661 gtgatattgc taacatgcaa accgaagtct ccacaggtgt tcttgctgaa cgcggacaag
      721 ctgactcaaa tttatttgag agcgtggccg aatacaaaga gaccctgcgc cttttccatg
      781 gccctataag ctcattcttc aggttcttta aaaagaaccg cgaggctatg aaacttatgg
      841 gtccccatga ggcttggcta acttatcgtt acggaatcag acctttgatt caggatatta
      901 cgatggttgt tgagggcctg agcaaaaagg tcggactaag gcgacagact tctaggcgta
      961 atctcacgaa aacgctgacg aggtctactc aaattagcga aaccactttc tgggctattg
     1021 tcacggctaa tcagcttgaa actgatactg tgacagtccg gggaatgtgt atcgacgaat
     1081 atcttgcgac tctctcatct aacattggct ttaccgccaa gggactaatt acagtcccat
     1141 gggagttagt tagactttcg ttcgttgtcg actggttcgt aactcttgga gattttctca
     1201 agagctacgc tcctgctcct ggatataaaa ccttaggggg gtctttagtg accattcgtc
     1261 aaagaagtta tttctggact gcaacaaata ctttccagaa taacttcgag acgattcttc
     1321 gtcccgttac aggcacatgt tctagtctgt acgagtcgaa gtcccgtggc ggactatcgg
     1381 gcgctggtgt ggtgattaaa agcgactttc gctttgcatc cctcactcgc gttgccgaca
     1441 gcgttgctct tatcgctgcc ttagtcagcc aatactttgc cgatagccct gtcggttctg
     1501 caatagaact tacgaggtta aaggttaagt agatccttgg ccgtgaggct gagctcgccc
     1561 tgttatgggg ttttatcaat ccttttaggg agttattccc gatgactgtt tcggtcaatg
     1621 cgaaagtgta ctcagccgat tcttttcaga aagatatcgt cgcgtataac gggccggcaa
     1681 aaaccggctc cgttaaggac gatctgaggt tgtcacgagt ggctcccaag ccatccgcga
     1741 ccttcagcgg tctgagtcgc accgaagcga agctcacccg taccttgaac ctgacgggca
     1801 gtttgacccc tacgggggat cttatctgca ctatcagtct cgcggtaccg gtgggttaca
     1861 ccgctgcgga catcgacacg gcattaaacg atatgggggc gttccttgcg tccgcctctt
     1921 tcaagactca cgtcaagact ccccagattt cgttttaagg ggagaattgg ccgtcgttct
     1981 gaagatatcc agaacttcgt cttggcagtt atcgctaagg ttacaatttc aataattgca
     2041 atcttaacgg tactacatcg tttaggcgtt atctagattc tggataacgt tttcctggag
     2101 atcgttatga aatccaagtt gcccttgtcg tattacgcag caaggaaggc acttcggctc
     2161 tcttctcacg agatatactg tgagatcatg accgagttgt tccagtccta cgatcacttt
     2221 ggcttcgtca agtccctctc aggacatttc cgttctaaga ggtttgactt agccttggtg
     2281 ctcgctgatt ctttgtcggg cgctgtgcat cccgatgcga ctacacattt tgtcgcgaat
     2341 cagttttcga acatgattag gaaatatccc tggctctctg acgtcgttaa aaccgaccca
     2401 gaaggtcagg ctattaggac ttttaatcga tttgagaggc gttgtaagct tctcaatcga
     2461 aaatttgtcc tttacgagaa acttcgtaat cctatttcct ctgaaattcg ttccatgcag
     2521 gatttcattt cccatgttat cgggagtgag cctccgcttg aaacgatact tcagaggtca
     2581 tgctcttttg gggctggcgc ctctttaggt gttcacggta atgcaaccaa ccttaggaga
     2641 aagattcact ctgaaggttg gtccgtgtcg cccggcgcgt tcacgtactc gtactgggca
     2701 cttatgtctg atccgaacct gagggatgtt ctcctcaaga ataggtcagg catatcttgc
     2761 ctggactggc tacaaagtaa atccgagtat gccagaaaaa cccgtattgt gaactacaac
     2821 aaaataagtt tcgttccgaa gacagctaaa acccataggg ctatagctgt cgagccgttg
     2881 cttaatggtt ttgtacagaa aggtatcgac gtcttcatgc gcagttgctt gaagcgcgtc
     2941 ggtatcgatc tgtcggacca aagcctgaat cagagattgg cccgctctgg gtctatcaat
     3001 gattcagaag attcgttcgc gaccattgat ctatcgtctg cttctgactc tatatcaatt
     3061 ggtctcgcac gtctcttact ccccccggcc tggttcgatt ttttgaactc ggtcaggagc
     3121 catagttacg aactaggtgg caacgtttat cgttaccata agttttgttc tatggggaac
     3181 ggcttctgtt ttccacttga gactctgatc ttcgcggcat gctgccacgc gagcggttgt
     3241 gtaaaccccg gtatcgattt ctcggtatac ggggacgaca taatcgtgcg cagtggtcgc
     3301 agtaaggagg ttctatccct tcttaagagg ataggcttcc tgcctaacgt tgataagacc
     3361 tttacttcgg gtccttttag agaatcatgt ggttcagact ggttcggcgg cgtagacgtt
     3421 cgtccgtaca cacttgatta tcgcctcgat tcaatcgagt ctttattcaa gtacctcaac
     3481 ctgacgagaa ggagtgagct taccagctct ttcttcgagt cgacgtggtc agttgttctg
     3541 aaccacgtcc ctgtcgactt cagattcttc agacccttca aggggaatgc tgattctgga
     3601 atagatagtt gggctgatca gcatcttacg agtccccact gtcggtataa ctaccggcag
     3661 tggaactgga cctgtaagtc gctggaccac aaggctgcgg ttgattctgc tcctcgtgat
     3721 cggtatcggt cggattctac cgatatgtac gctcttctct ctggcctatc gagtcgatgg
     3781 accggagatg tcgagtacac cgttcgtcgt aagacgaaga cgacagttcg acttgtaacg
     3841 agttcgtcgg ctgattctgg ctggctacct ccttccttta gttg
//