LOCUS       JAHIZD010000030         4013 bp    DNA     linear   ENV 09-JUN-2021
DEFINITION  MAG: Planctomycetota bacterium isolate Modern_marine.mb.161
            Modern_marine.mb.161_k141_123583, whole genome shotgun sequence.
ACCESSION   JAHIZD010000030 JAHIZD010000000
VERSION     JAHIZD010000030.1
DBLINK      BioProject: PRJNA627556
            BioSample: SAMN19297948
KEYWORDS    WGS; ENV; Metagenome Assembled Genome; MAG.
SOURCE      Planctomycetota bacterium (groundwater metagenome)
  ORGANISM  Planctomycetota bacterium
            Bacteria; Planctomycetota.
REFERENCE   1  (bases 1 to 4013)
  AUTHORS   Mehrshad,M., Lopez-Fernandez,M., Bell,E., Bernier-Latmani,R.,
            Bertilsson,S. and Dopson,M.
  TITLE     Energy efficiency and biological interactions define the core
            microbiome of deep oligotrophic groundwater
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 4013)
  AUTHORS   Mehrshad,M., Lopez-Fernandez,M., Bell,E., Bernier-Latmani,R.,
            Bertilsson,S. and Dopson,M.
  TITLE     Direct Submission
  JOURNAL   Submitted (22-MAY-2021) Department of Ecology and Genetics,
            Department of Ecology and Genetics, Limnology and Science for Life
            Laboratory, Uppsala University, Norbyvagen 18D, Uppsala 752 36,
            Sweden
COMMENT     The annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (PGAP). Information about PGAP can be found here:
            https://www.ncbi.nlm.nih.gov/genome/annotation_prok/

            ##Genome-Assembly-Data-START##
            Assembly Method        :: megahit v. 1.1.1
            Genome Representation  :: Full
            Expected Final Version :: Yes
            Genome Coverage        :: 10x
            Sequencing Technology  :: Illumina
            ##Genome-Assembly-Data-END##

            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 06/03/2021 15:48:33
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline (PGAP)
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS-2+
            Annotation Software revision      :: 5.2
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 3,156
            CDSs (total)                      :: 3,127
            Genes (coding)                    :: 3,101
            CDSs (with protein)               :: 3,101
            Genes (RNA)                       :: 29
            rRNAs                             :: 1, 1, 1 (5S, 16S, 23S)
            complete rRNAs                    :: 1 (5S)
            partial rRNAs                     :: 1, 1 (16S, 23S)
            tRNAs                             :: 24
            ncRNAs                            :: 2
            Pseudo Genes (total)              :: 26
            CDSs (without protein)            :: 26
            Pseudo Genes (ambiguous residues) :: 0 of 26
            Pseudo Genes (frameshifted)       :: 3 of 26
            Pseudo Genes (incomplete)         :: 17 of 26
            Pseudo Genes (internal stop)      :: 8 of 26
            Pseudo Genes (multiple problems)  :: 2 of 26
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..4013
                     /organism="Planctomycetota bacterium"
                     /mol_type="genomic DNA"
                     /submitter_seqid="Modern_marine.mb.161_k141_123583"
                     /isolate="Modern_marine.mb.161"
                     /isolation_source="Aspo HRL"
                     /db_xref="taxon:2026780"
                     /environmental_sample
                     /geo_loc_name="Sweden"
                     /lat_lon="57.264 N 16.3936 E"
                     /metagenome_source="groundwater metagenome"
                     /note="metagenomic"
     gene            complement(224..637)
                     /locus_tag="KJ749_00525"
     CDS             complement(224..637)
                     /locus_tag="KJ749_00525"
                     /inference="COORDINATES: ab initio prediction:GeneMarkS-2+"
                     /note="Derived by automated computational analysis using
                     gene prediction method: GeneMarkS-2+."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="MBU0716706.1"
                     /translation="MSVPLVHHPDYYSDIGAHVFPTQKFALALEGIRSQANDFDKCVH
                     VPSPATREQLLRVHTTEYLADLEACRRTPRTLLSELPLTRQIINAYYLMAGGTCLAAR
                     LALEHGCAMNLGGGFPPWARRTSASRSFAMICSAV"
     gene            complement(819..1274)
                     /locus_tag="KJ749_00530"
     CDS             complement(819..1274)
                     /locus_tag="KJ749_00530"
                     /inference="COORDINATES: protein motif:HMM:NF012792.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="GNAT family N-acetyltransferase"
                     /protein_id="MBU0716707.1"
                     /translation="MATAKIEVVGPAEYELISDLYNAVFKPPVDAKFFEQRLQHHNSL
                     VMVAELDKRPVGFSCGYELRPSTFYSWLYGVLPDARRLGIATQLMDAEYAWVSNRGYE
                     MIRLECYNNHRPMLVLAIKRGYDIVGIRWDSRTAENLVVLEKYVSAEAT"
     gene            complement(1362..2672)
                     /gene="mgtE"
                     /locus_tag="KJ749_00535"
     CDS             complement(1362..2672)
                     /gene="mgtE"
                     /locus_tag="KJ749_00535"
                     /inference="COORDINATES: protein motif:HMM:TIGR00400.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="magnesium transporter"
                     /protein_id="MBU0716708.1"
                     /translation="MTLVNPLTPLVEKYVAADAAAAARALEALPEEEAVEVFKALPHN
                     MNALILPHLQLNFAAVLIKDMDPASFEAVVRAMDPKRGAFLLMRLPRDALERLLPHAP
                     KKLKDEARGLLVYPEGSVGRIMHTDYLSLDEGELVRDAVRKIRKLASERYPASYAYVV
                     DVEERLVGVINMRDLLLAAEDAPLGSVVKRDVFTLDCFLDSTEAAQELGKRRYFAAPV
                     VDAEKRMLGIVKAEQLIHGAQANVVESMQRMFGGSPDERAFSPLGFSLRKRLPWLHVN
                     LATAFLAAFVVSLFEDTIARITVLAVFLPVVAGQGGNAGAQSLAIVMRGLLMREIPRN
                     RVLVLVLKEGLLGITTGAVTGIVTGLVAWLWQGNAILGLVIGLGMVVNLFFAGLFGAA
                     IPVIMKSLGWDPAQCSNIILTTVTDVVGFVAFLGFAVLFQNSLI"
     gene            complement(2762..3808)
                     /locus_tag="KJ749_00540"
     CDS             complement(2762..3808)
                     /locus_tag="KJ749_00540"
                     /inference="COORDINATES: protein motif:HMM:NF013528.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="DHH family phosphoesterase"
                     /protein_id="MBU0716709.1"
                     /translation="MASQKRKRSDRLLSVLGEFDETMVIMHNNPDPDAIATGWALLLL
                     VDRRLHKPVRLLGRGPVLRAENVQLLKLLQPPIELVEEISPDAHTATVLVDCSPASAN
                     HLLGGDNPPVAVIDHHESTGDGFRIPFRDLRPKVTASASIATEYMREQRLDPPPAVAT
                     ALLYAIRTEMIGARKALSRVDHSALRWLSGFAEYDVLFEIENPPLPRYYYEELLLGLD
                     SVLVYADSAVCFLPRITAPESVGEVADLLIRCDHLKYVLCGGRIGDDFLFSARSKGDK
                     HTALTLLNPVLSGLGNFGGHRHRAGGKISVPMSGMDSDELEQAVRTRWLDACGSTVRR
                     GQRLVGRKEIMRRL"
BASE COUNT      858 a   1244 c   1120 g    791 t
ORIGIN
        1 agccgcgcac tgctcaaagc catcggcgat gccctacgct ccgtcactct tgaagactgc
       61 cgaggtttct tccaaggatg cggatatacc gctacacaat aagtgactac gctctagccc
      121 gtatttctca cctgacctga cccccgagaa cggatccagg ccgtatctta tgatcatgtg
      181 ggttgaggat ctggaatgga cgccggatca tggcgaagaa acgtcacacg gcggaacaga
      241 tcatcgcgaa gcttcgggag gcggaagtcc tccttgccca aggcgggaag ccaccgccca
      301 gattcatggc gcagccatgc tctagcgcca atcttgcagc aaggcaggtt ccaccagcca
      361 tcaagtagta ggcattgatg atctggcgcg taagcgggag ttctgacaat agggttctcg
      421 gggtccgtcg acatgcttcg aggtcggcca ggtattccgt agtatggaca cgaaggagct
      481 gctcgcgagt agcaggtgag ggcacgtgca cgcacttgtc gaagtcgttt gcttggcttc
      541 ggattccctc gagtgccagg gcaaacttct gcgtggggaa cacgtgcgcc cctatgtccg
      601 agtaatagtc aggatgatgg accagtggaa cgctcatctc cgcctctcgt acatggggtc
      661 atcttatcgc tcctcatggt gtctgtcgcg agtgctttcg ttgattgtcg gccgggcttg
      721 gcgggcctcg tcgctcgtcg cgtcaacaac caaccgcgtg gattcgcccg ccatgaccga
      781 aacaccgatt gagcaacacc atccaacgtg gacgatgttc aggtggcctc ggccgagaca
      841 tacttctcaa gcacgaccaa gttctctgca gtgcgtgagt cccagcgtat gccgacgatg
      901 tcgtagccgc gtttaattgc cagcaccagc atcggacgat ggttgttgta acactccaga
      961 cggatcatct cgtaaccccg gttgctgacc caggcatact ccgcgtccat aagctgcgtt
     1021 gcaatcccta accggcgggc atcgggcagc accccgtaga gccaactgta aaaagttgac
     1081 ggcctgagct cgtatccgca cgagaacccc accggccgct tgtccagctc cgccaccata
     1141 acgagcgaat tgtgatgctg caggcgctgc tcgaagaatt tggcatccac cggcggcttg
     1201 aagaccgcgt tgtacagatc gctgatcagc tcatactctg caggcccaac gacttcgatc
     1261 ttggcagttg ccatgggatg gccctcctcc gccctagtta gtttcgctca agcgcccctg
     1321 gtggaccacg tcattcaccg ctcagacatt gggtatgctg ttcagatcaa ggagttctgg
     1381 aacaagacgg cgaatcccag aaacgccacg aaaccgacga cgtcggtgac ggtcgtgaga
     1441 ataatgttcg aacactgggc ggggtcccag ccgagagact tcatgatgac cgggatggcg
     1501 gcaccgaaca gtccggcgaa gaacagattg actaccattc ccaggccgat gacgagcccc
     1561 agtatcgcgt tcccttgcca aagccaggcc accaaccctg tcacaatccc ggtgaccgca
     1621 cccgtggtga ttcccaacaa tccttcctta agcacaagga ccaacacgcg gttgcgaggt
     1681 atctcgcgca tcagtagtcc gcgcatgacg attgccagag actgggctcc cgcattgcct
     1741 ccctggcccg cgacaaccgg caagaatacc gccagaacgg tgatccttgc aatcgtatcc
     1801 tcaaacagcg aaaccacgaa cgcggccaag aacgcggtcg ctagattgac gtgaagccag
     1861 ggcaaacgct tgcgcagcga gaacccaagc ggtgagaatg cccgttcgtc cgggcttccg
     1921 ccgaacatgc gctgcatact ctcgacgaca ttcgcctgcg ccccgtggat aagctgttcc
     1981 gccttcacga taccgagcat tcgtttctcg gcgtccacca ccggcgcggc gaagtaccgc
     2041 cgtttcccaa gctcttgcgc cgcctccgta ctgtcgagaa agcagtcaag ggtgaagacg
     2101 tctcgtttga ctaccgatcc gagcggcgcg tcctccgcgg ccaggagaag atcgcgcatg
     2161 ttgattaccc cgacgagacg ctcctcgacg tcgaccacgt aggcgtacga agcggggtat
     2221 cgctcggacg cgagcttgcg tattttccgt acggcgtccc gtaccagttc gccctcgtcg
     2281 agcgacagat agtcggtgtg catgatgcgg cccacgctcc cctccgggta caccagcaaa
     2341 cctcgggcct cgtccttcag cttcttggga gcgtgcggca gaagccgttc gagagcatcc
     2401 cgggggagtc tcatcagaag gaatgctccc cgcttgggat ccatggcgcg cacgacggct
     2461 tcgaaactcg cgggatccat atctttgata aggacggccg cgaaattcag ctggaggtgc
     2521 ggcagaatca gggcgttcat attatgcggc agcgccttga atacctcgac ggcctcctcc
     2581 tccggcaagg cctccagggc gcgcgccgcg gcagctgcat cagccgcaac atacttctca
     2641 actagcggag tcagcgggtt gactaaggtc attggaacca acgctccgta acaatactgt
     2701 gctcacaatc gcgagcgact ttcctcaagc gtgaaccacc tgtaagctct tgccaattga
     2761 gctacagtcg ccgcatgatc tccttgcgtc ctacaagacg ctgtccccgg cgaaccgtgc
     2821 tcccgcaggc atcaagccat ctcgtgcgga ctgcctgctc gagctcatcc gaatccatcc
     2881 cggacatcgg caccgatatc ttcccgcccg cccgatgtcg gtgcccgcca aagttcccca
     2941 acccggacag gacaggattc aataaggtca gcgcggtgtg cttatcgcct tttgatcttg
     3001 ccgagaaaag gaagtcatca ccaatcctgc ccccacaaag gacgtacttg aggtgatcgc
     3061 atcggatgag aagatcagcc acttccccga cgctctccgg ggcggtgatt cggggcagga
     3121 aacataccgc cgaatccgcg tagaccagta cgctgtccag gccgagcagc agttcctcgt
     3181 agtaatagcg agggagtggc ggattctcga tttcgaagag gacgtcatac tccgcgaatc
     3241 cggacagcca ccgcagagcg gaatgatcca ctcgggacaa cgccttccgg gcgccgatca
     3301 tctcggtacg aatggcatac agaagggcag tagcaaccgc cggcggagga tcgagccgtt
     3361 gttcacgcat gtattccgtc gcgatactcg ctgaggccgt gaccttcggt cgcaggtcgc
     3421 ggaacggaat ccggaagccg tcacccgtgg actcatgatg atcgatgacc gcaaccggcg
     3481 gattgtcccc gccaagcaga tgattggccg atgccggtga acagtccacc agcacggttg
     3541 ccgtatgcgc atcggggctg atctcctcga caagttcgat tggcggctga agtaatttga
     3601 ggagctggac attctcggct cggaggacag gtccgcgccc caacaggcgt acgggtttat
     3661 gcaaccgtcg atccacgagc agaagaaggg cccagcccgt ggcgatggca tccggatcgg
     3721 gattgttatg cataatgacc atcgtctcgt cgaattcgcc gagaacgctc agcaggcgat
     3781 cggaacgttt ccgtttctgg cttgccatca cagccacctt ggttccacaa gaccgatgtt
     3841 acccgcacct cagctacgca caactcactt ggacgagcgc ccgaagcatc gctatcgccc
     3901 tcccggaagt gcgacgcctt cctcggtgtt ttcgctaatg acgtccaact tcactttcac
     3961 ttccgtaggt tgatcgctgt cgagccgcat ttgctgtggc acgacgagct tag
//