LOCUS DUZB01000104 5863 bp DNA linear ENV 04-AUG-2020 DEFINITION MAG TPA_asm: Deltaproteobacteria bacterium isolate MAG_00792_naph_016 BS_KBA_SWE02_21mDRAFT_10003351, whole genome shotgun sequence. ACCESSION DUZB01000104 DUZB01000000 VERSION DUZB01000104.1 DBLINK BioProject: PRJNA632036 BioSample: SAMN14911656 Sequence Read Archive: SRR405101, SRR405102 KEYWORDS WGS; ENV; Metagenome Assembled Genome; MAG; Third Party Data; TPA; TPA:assembly. SOURCE Deltaproteobacteria bacterium (marine metagenome) ORGANISM Deltaproteobacteria bacterium Bacteria; Myxococcota; Myxococcia. REFERENCE 1 (bases 1 to 5863) AUTHORS Uzun,M., Alekseeva,L., Krutkina,M., Koziaeva,V. and Grouzdev,D. TITLE Unravelling the diversity of magnetotactic bacteria through analysis of open genomic databases JOURNAL Sci Data 7 (1), 252 (2020) PUBMED 32737307 REMARK Publication Status: Online-Only REFERENCE 2 (bases 1 to 5863) AUTHORS Uzun,M. TITLE Direct Submission JOURNAL Submitted (15-MAY-2020) Molecular diagnostics, Research Center of Biotechnology RAS, 60 let Oktyabrya prospect 7/1, Moscow 119071, Russia COMMENT The annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP). Information about PGAP can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: Velvet v. 1.0 Genome Representation :: Full Expected Final Version :: Yes Genome Coverage :: 56.9222x Sequencing Technology :: Illumina GAIIx ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 05/19/2020 15:24:40 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline (PGAP) Annotation Method :: Best-placed reference protein set; GeneMarkS-2+ Annotation Software revision :: 4.11 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 3,002 CDSs (total) :: 2,967 Genes (coding) :: 2,915 CDSs (with protein) :: 2,915 Genes (RNA) :: 35 rRNAs :: 1 (16S) partial rRNAs :: 1 (16S) tRNAs :: 31 ncRNAs :: 3 Pseudo Genes (total) :: 52 CDSs (without protein) :: 52 Pseudo Genes (ambiguous residues) :: 1 of 52 Pseudo Genes (frameshifted) :: 24 of 52 Pseudo Genes (incomplete) :: 23 of 52 Pseudo Genes (internal stop) :: 5 of 52 Pseudo Genes (multiple problems) :: 1 of 52 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..5863 /organism="Deltaproteobacteria bacterium" /mol_type="genomic DNA" /submitter_seqid="BS_KBA_SWE02_21mDRAFT_10003351" /isolate="MAG_00792_naph_016" /isolation_source="sediment" /db_xref="taxon:2026735" /environmental_sample /geo_loc_name="Sweden: KBA site, Vaertahamnen, Baltic Sea" /lat_lon="59.363333 N 18.119167 E" /collection_date="2008-10-04" /metagenome_source="marine metagenome" /note="metagenomic" gene <1..531 /locus_tag="HPP90_07695" CDS <1..531 /locus_tag="HPP90_07695" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006419155.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="energy-coupling factor transporter transmembrane protein EcfT" /protein_id="HIJ40945.1" /translation="GEPWMLAGLLGVNLACYFFARLGFANLWRDIRFFLIQMAVILLL FFLKYGFMEGLWPGLRTGSQIFLLFLPGTILLRTTQGSQMIGSLKKVMPERLAFLLFT SLRFVPFFARESREIAIAQQLRGAPVGIRRAWNPMHWKDLFQCLMIPLMVRALKTARA AAMSAEARGFEDRSRP" gene 528..1052 /locus_tag="HPP90_07700" CDS 528..1052 /locus_tag="HPP90_07700" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006419172.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="HIJ40946.1" /translation="MKKFTLQEALCLAFCACFIVIFRAAFRLHLNISGHAMLFTMFFL ILGRGCVPRLGAATLVGILAGLLCTLLGMGKGGPLIILRFLIPGLIVDLAGLFSPGLA KSYVACAIVGALGAASRFLTIIVVESILGMDWDLILQHALISSSMGVIFGVAGALMVP PIVRKLSAHGLIGV" gene 1606..2346 /locus_tag="HPP90_07705" CDS 1606..2346 /locus_tag="HPP90_07705" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_003451697.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="AzlC family ABC transporter permease" /protein_id="HIJ40947.1" /translation="MDKRNTIFTRKSSPLSQFFEGAKDTFPLIVGAIPFGIIFGTLAT TAGLSFGATMGMSMFVFAGASQFVCLSLVVAGTAWPMIVLTTFVVNLRHMLYGATMVP FYKKLNPLWKMLLAFGLTDETFAVAVNRYNQKDGVPGKHYYNLGSMVFMYTNWNLCTI IGLTAGNAFPGISHWGLDFAMPATFIGIVIPYLVSKPMWASVITAGTVSIMAGGLPHK LGLMVAALAGVTAGVICEKVFSRKKELV" gene 2385..2699 /locus_tag="HPP90_07710" CDS 2385..2699 /locus_tag="HPP90_07710" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_020590797.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="AzlD domain-containing protein" /protein_id="HIJ40948.1" /translation="MIAGMALVTFAIRYSMFPISGRFQFPELFKQGLRYVPPAVLTAI IVPSVLMSNGETLNLKLSNPYLIGALAACVTGGLFKNLLLTIVVSMVVFMGFQWAFAV RW" gene complement(3094..3927) /locus_tag="HPP90_07715" CDS complement(3094..3927) /locus_tag="HPP90_07715" /EC_number="2.1.1.-" /inference="COORDINATES: protein motif:HMM:TIGR00027.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="SAM-dependent methyltransferase" /protein_id="HIJ40949.1" /translation="MKQGRISQTALKVALGLVTLSVKDDWAQHLPAGLVEMSERLLMA SGSPGYGHRMMRMSKRPWMIRAYEFQDLLMPGQFEGFGHRKIFVQQQVEVAIEQGARQ VLVVGAGFDTLCLRLAPQHPHVQFFEVDHPATSAAKAKGIALEGQPTNMIQIAADLGE RPLSKVLSEEGRWETSLPSVFVAEGLFQYLTDEEVQGLLVEAAACTSPGSRFVFTHAI PGYRRMVSILTVLIGEPWKSAVQSEDLPEYVEGTGWTMISDVDTDSAHGVERYAVAER R" gene complement(4033..4626) /locus_tag="HPP90_07720" CDS complement(4033..4626) /locus_tag="HPP90_07720" /inference="COORDINATES: ab initio prediction:GeneMarkS-2+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS-2+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="HIJ40950.1" /translation="MFNVLGKRIKIIQLRQICLDETLNIFCEISCVRIVSKGLEIDLS AVIGVAGETVWTATETFFYRGNFGEPDTNAENTAFESIPEAKVIARWFLPGGNGFRFA RISGDGNGIHYSKLYARFFGFERDFAQPFLILGNAINHLMDNGNINAISLDVAFKGQF YYKRNVTIKGVKTNGSHRFNIYSEGNDRPCVIGILNE" gene complement(5085..5858) /locus_tag="HPP90_07725" CDS complement(5085..5858) /locus_tag="HPP90_07725" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_014809053.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="enoyl-CoA hydratase/isomerase family protein" /protein_id="HIJ40951.1" /translation="MEIKQNGNITEVILNRSESYNAFNLEMITELANHLTQLATDNSV RGVAITGRGKAFCAGGDLKWAVGFSQKAGSSFHTLASQLHLAIVEIRRMKKPVVAAIN GTAAGAGFSLVLACDFRIIEKSAILKQAYTSNGLCIDGGGTFTLPRIVGLSRSLEIAA FDEPISSAKALEWGLVTKVVDDGMVLEEANSMLNKLIKKSTHSFGWSKKLLTDSFNNS FESHLELEREGLSDCANHSDGQEGLKAFIEKRKPVFGNK" BASE COUNT 1342 a 1481 c 1408 g 1627 t ORIGIN 1 ggggagccgt ggatgctggc cggacttctc ggcgtcaacc tggcctgcta tttttttgcg 61 cgtctcgggt ttgccaatct atggcgggat atccgattct tcctgatcca gatggcggtt 121 attctcttgc tctttttctt gaaatacgga ttcatggaag gcctctggcc gggattgcgg 181 accggttcac agatctttct tctttttttg ccgggaacga tcctgctccg aaccacccag 241 ggatctcaga tgatcgggag cctgaaaaag gtcatgccgg aacgattggc gtttctcctt 301 tttaccagcc tgcgcttcgt tcccttcttt gcccgggaat caagggagat tgccatagcc 361 cagcagcttc gcggcgcacc ggtgggaatc agacgggcct ggaacccgat gcactggaaa 421 gacctgtttc agtgcctgat gatccctctc atggtgcgag ccttgaaaac cgcccgggcc 481 gccgccatgt ccgccgaggc gagggggttt gaggatcggt cacgcccatg aagaaattca 541 ccctccaaga ggccctttgc ctggcgttct gcgcctgctt catcgttatc ttcagggcgg 601 cgttccggct gcacctgaac atatcggggc atgccatgct tttcaccatg ttcttcctga 661 tcctgggaag gggttgcgtc ccgcgattag gcgccgccac actggttgga atcctggcgg 721 ggctcctctg caccctgttg ggcatgggga agggtggacc gttgattatc ctgaggttct 781 taatccccgg cctgattgtt gacctggccg gtcttttttc tcccggattg gccaaaagct 841 acgtggcctg cgccatcgtc ggcgcccttg gtgcggcgag ccgtttcctg accattatcg 901 tggtggaatc gatcctcgga atggactggg atctcatcct tcagcacgct ctcatctcct 961 cttccatggg tgtcatcttc ggggtcgcgg gcgccctcat ggtcccgccc attgtccgca 1021 aactgtctgc ccacgggctg attggtgtct gaacgaaaaa gctgttcagg cacgattgcg 1081 gaatgcggct gtcggccgta ggggtccaaa ctccctccgg agtttggatt taaaggatat 1141 tgctacccac cacaaactgt gttccagaac cagaataccc cgccgaataa actgcttagc 1201 gcgttttttt acctcaacca tctgggtttg aagaccggtg gtgaagtagt tgcgccatac 1261 acccatatcc atacggttat cccaaacgtt ttggatcgat ttttaaaagg tcggtcacga 1321 tcataatatt cgctggtctt gaacaggcgt ttgaagtgct atccgtcctt gtagaaaaag 1381 agcataggag gcggttccag gatcgattct atctttatgg gctggccttt actttttcag 1441 atatcgcgtt caaagcggat tgctcgtcct acctctgcca tctttgtgcg aggcaattgg 1501 ggatgtccta aatgatggtt taaaatttat ggaaaccgtg ataaagagaa cggtcctgaa 1561 atgctcattc attgttccag ggaaaaaaga gagagttgaa ccacaatgga caaacgtaat 1621 accattttca ccagaaagag ctcgcccctg tcccaatttt ttgaaggggc aaaggatacc 1681 tttcctttaa ttgtaggcgc cattcccttt ggcatcatat ttggtacctt agcaacgacg 1741 gccgggcttt cctttggcgc caccatggga atgtccatgt tcgtttttgc cggcgcctca 1801 cagtttgtct gtttgagcct ggtggttgcg ggcacggcat ggcccatgat tgtgctgact 1861 acctttgtgg ttaacctgcg ccatatgctc tatggcgcca ccatggttcc tttttacaaa 1921 aagttgaacc cgctctggaa aatgttgctg gcctttgggt taaccgatga aacctttgcc 1981 gtggctgtca accgctacaa tcaaaaggat ggggtcccgg gcaaacatta ctacaatctg 2041 ggctccatgg tgtttatgta caccaattgg aatttatgta ccattattgg gctcactgcc 2101 ggaaatgctt ttccagggat atcccactgg ggacttgatt ttgccatgcc ggccacattc 2161 attggcattg tcatccccta tcttgtctca aaaccgatgt gggcttcggt gatcaccgcc 2221 ggaacggtct ccatcatggc cggaggactt ccccataagc tgggtttgat ggtggctgcc 2281 cttgccggcg taacggcggg agttatttgt gaaaaagtat tctctcgaaa aaaggaactg 2341 gtttgaatat ttttggaacg gatatccaac cgcatgaaat ttacatgatt gcggggatgg 2401 ctctggtgac ctttgccata cgctatagta tgtttccgat ttccggacga ttccaattcc 2461 ctgaactgtt taaacagggg ttgagatatg tcccaccggc ggtgctgaca gccattattg 2521 tgccttccgt actcatgtcc aatggggaaa cgctcaatct gaagctgagc aatccttatc 2581 tcattggcgc ccttgccgcc tgtgtcactg gagggttatt taaaaatctg ctgttgacca 2641 ttgtggtgag tatggtggtt tttatggggt ttcaatgggc ctttgcagtg agatggtaat 2701 gggacgatat tcaaagtcgg ctgattaatg cttttcgtac ctgtatcctc attcgtacta 2761 ctttcgacgg ggaggcatag cccggtagac agtgtcggtt tgctccgggc attttccttg 2821 tgttcttaaa atggccgttt tacgatcaaa aacctccttt taagcccaaa aaagggaggg 2881 gttcaacgta tttactctaa cgtttcaatg tgttccttgg tccgacccaa gcccctccta 2941 tctgtatcgg tgctgaaatc aaagttgtga ccggatgaca acagcaaatg gggttcgtta 3001 atggatagtt ttacccttga catccggaga ccttataagg gttcggcggc gtgctgttcg 3061 cggcactgtt cggtcagcct cgcgactgag ctttcagcgc cgctcagcta ccgcatagcg 3121 ttcgactccg tgcgctgagt cggtgtcgac gtcggagatc atcgtccaac cggttccttc 3181 gacgtactcg ggcaggtcct cgctctggac ggcggacttc cacggctctc cgatcaggac 3241 cgtcaggatc gagaccatcc tgcgataccc cggaatcgcg tgcgtgaaca cgaaccggct 3301 cccgggtgag gtgcaggcgg ccgcttccac cagcaaccct tgtacctctt cgtccgtcag 3361 gtactggaag agaccctccg ccacgaacac tgaaggcagg gaggtctccc agcgaccctc 3421 ctccgacaac accttcgaga ggggccgctc gccaagatcg gccgcaatct ggatcatgtt 3481 cgtcggttgg ccctcgaggg cgatgccttt ggccttcgcc gcggacgtcg cgggatggtc 3541 gacctcgaag aactgcacgt gggggtgttg cggcgcgagt ctcaggcaga gggtgtcgaa 3601 gccagcaccc accaccagca cctgtcgcgc tccctgttcg atcgcgacct ccacctgctg 3661 ttgcacgaag atcttgcggt ggccaaagcc ctcgaactgc cccggcatca agaggtcctg 3721 gaactcgtag gcccggatca tccacggccg cttggacata cgcatcatgc gatggccgta 3781 gccgggtgat cccgaggcca tgagtagacg ctcgctcatt tccaccaggc ctgcaggaag 3841 atgctgcgcc cagtcatctt tcacgctcaa cgtgacaaga ccgagcgcca ccttgagcgc 3901 cgtctggctg attcgtccct gcttcacgat ggatcatccc caagattcct cgatgaggag 3961 agagttcgcg gtctcgccgc ccaaaacctc aaaatcaccg gtttgatccg gtgcattttg 4021 atggttgaac ccctactcat ttaatatccc gattacgcaa ggacgatcat ttccttcact 4081 atatatattg aatcgatgac tgccgttcgt tttcacgccc ttgatcgtta catttctctt 4141 ataataaaac tgtcccttaa aagcgacatc aagagaaata gcgtttatgt ttccattatc 4201 cattaaatgg ttaatggcgt ttcccaaaat gagaaagggc tgcgcaaaat cgcgttcaaa 4261 gccgaaaaac cgcgcatata atttggaata gtgaatacca tttccatctc cagagatccg 4321 ggcaaaacga aacccatttc cgccgggcag aaaccacctg gcaatcactt tggcttcggg 4381 gatggattca aaagcggtat tttcagcatt cgtatcaggt tctccgaaat tacctcgata 4441 aaaaaatgtt tccgttgccg tccaaacagt ttctcctgcg actccaatga cagcggaaag 4501 atctatttcc aacccttttg aaacaattct aacacacgaa atttcacaaa aaatattgag 4561 ggtttcatca agacaaatct gtctgagctg gatgattttt atacgtttac ccaatacatt 4621 gaacatggat aaaggggctt gtttcaggca gagtattctc tgaaagaatg gaaacactaa 4681 ggtgattggg taaatcaatg aaattttatc ncgatctttt atattgcaga tcttattgaa 4741 ccgagataaa tgtgctctat taatctccaa tctatccaat tttacagtta taggaggaat 4801 ngattcctta ccatgaaaaa ctttggatct tccccaaaac gcntataaag gagctcctaa 4861 aaaaaagaac gatggcatct ttctgtattg cangtgnata actgctgtca taattcatat 4921 ccttggaaat gaaatttgtc cgtttcgtta ctctttattt atttagatat tccagactac 4981 tttttcaatt acattccacc atgggttctg gcgagattag acataatgcc gtcatttatt 5041 aggaaccggt tattgctttg ttttgtgttt cgcatctttt cgggctattt attgccaaat 5101 actggtttgc gcttttcaat aaatgcttta agaccttcct gtccgtcaga gtgatttgca 5161 cagtcggaga gtccttccct ttctaattct aaatgtgact caaaggaatt attgaacgag 5221 tcggtaagta atttttttga ccaaccgaaa gaatgggttg atttttttat aagtttgtta 5281 agcatgctat tcgcttcttc caataccatt ccatcatcaa ccactttggt taccaatccc 5341 cactctaagg cctttgctga agagattggt tcatcgaagg cagcaatttc caaagaacgg 5401 gaaagcccta caattcttgg taaagtgaat gtaccgccac catctatgca caatccgttt 5461 gaagtatatg cttgcttcaa aattgcggat ttttctataa tacggaaatc gcaggctaaa 5521 accaatgaaa agccagcgcc tgcagccgtc ccgtttatcg ctgctacaac aggtttcttc 5581 attcgtctaa tttccactat ggccagatgc aattgcgaag cgagtgtgtg gaaagaggaa 5641 ccggcttttt gagaaaatcc aacagcccat ttgaggtctc ctccagcaca gaaggctttt 5701 cccctgccag taattgcaac acctctcaca gaattatctg tagccaattg tgtgagatgg 5761 tttgccaatt cagtaatcat ttcaagattg aacgcattat atgattccga acggtttaaa 5821 attacttctg tgatatttcc attttgtttt atctcaatat att //