LOCUS JAGQRG010000139 6869 bp DNA linear ENV 12-OCT-2021 DEFINITION MAG: Planctomycetota bacterium isolate HKST-UBA86 NODE_139_length_6869_cov_2.235866, whole genome shotgun sequence. ACCESSION JAGQRG010000139 JAGQRG010000000 VERSION JAGQRG010000139.1 DBLINK BioProject: PRJNA432264 BioSample: SAMN14564107 KEYWORDS WGS; ENV; Metagenome Assembled Genome; MAG. SOURCE Planctomycetota bacterium (activated sludge metagenome) ORGANISM Planctomycetota bacterium Bacteria; Planctomycetota. REFERENCE 1 (bases 1 to 6869) AUTHORS Wang,Y., Ye,J., Ju,F., Liu,L., Boyd,J.A., Deng,Y., Parks,D.H., Jiang,X., Yin,X., Woodcroft,B.J., Tyson,G.W., Hugenholtz,P., Polz,M.F. and Zhang,T. TITLE Successional dynamics and alternative stable states in a saline activated sludge microbial community over 9 years JOURNAL Microbiome 9 (1), 199 (2021) PUBMED 34615557 REMARK Publication Status: Online-Only REFERENCE 2 (bases 1 to 6869) AUTHORS Zhang,T. TITLE Direct Submission JOURNAL Submitted (14-APR-2020) Civil Engineering, The University Hong Kong, Pokfulam Road, Hong Kong 999077, Hong Kong COMMENT The annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP). Information about PGAP can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: CLC de novo assembler v. 6.04 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 49x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 04/19/2021 22:50:27 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline (PGAP) Annotation Method :: Best-placed reference protein set; GeneMarkS-2+ Annotation Software revision :: 5.1 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 2,769 CDSs (total) :: 2,722 Genes (coding) :: 2,706 CDSs (with protein) :: 2,706 Genes (RNA) :: 47 rRNAs :: 1 (16S) partial rRNAs :: 1 (16S) tRNAs :: 42 ncRNAs :: 4 Pseudo Genes (total) :: 16 CDSs (without protein) :: 16 Pseudo Genes (ambiguous residues) :: 0 of 16 Pseudo Genes (frameshifted) :: 2 of 16 Pseudo Genes (incomplete) :: 11 of 16 Pseudo Genes (internal stop) :: 4 of 16 Pseudo Genes (multiple problems) :: 1 of 16 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..6869 /organism="Planctomycetota bacterium" /mol_type="genomic DNA" /submitter_seqid="NODE_139_length_6869_cov_2.235866" /isolate="HKST-UBA86" /isolation_source="activated sludge from Shatin waste water treatment plant collected monthly from 2007 through 2015" /db_xref="taxon:2026780" /environmental_sample /geo_loc_name="China:Hong Kong SAR, Shatin waste water treatment plant" /lat_lon="22.406236 N 114.213394 E" /metagenome_source="activated sludge metagenome" /note="metagenomic" gene complement(<1..1237) /locus_tag="KDB32_07985" CDS complement(<1..1237) /locus_tag="KDB32_07985" /inference="COORDINATES: protein motif:HMM:NF014071.1,HMM:NF017215.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hydantoinase/oxoprolinase family protein" /protein_id="MCA8919003.1" /translation="MATRGTAIAAIDVGGTFTDACVLRSGGFKRCKIPSTPDEPGRAV AEALQRLGGADLLLHGTTVATNALLEGKLARVAFVTTKGFRDVLAIGRQNRAPEDLYQ LEPVQRSQLVSRDFRIECHERLEPDGSVLLKLTRQEVERVVEEVRELNVEAVAVCLLH SYANSKHELALGRALRKLKLPVVLSSELAPEFREYERSLVTAANAGLMPLLQTYIATL ERKMKPTRVVLMHSAGGWLPAEIAAAEPVKLALSGPAGGIAGVRHALDAEGFDTGVAF DIGGTSTDVSLVTKTPLLRAETPIAGLPLRTPSLDIHTIGAGGGSVAYFDAGGALHVG PESAGASPGPACYGRGGSQPTLTDALLVLGRLPVDLKLGGELSLHPNLSFEAMKHLDS STQPKRLADAIVRVALAGIE" gene complement(1227..1943) /locus_tag="KDB32_07990" CDS complement(1227..1943) /locus_tag="KDB32_07990" /inference="COORDINATES: protein motif:HMM:NF024109.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="alpha/beta hydrolase" /protein_id="MCA8919004.1" /translation="MTLRVLYLPGLDGIENVPGKLQSYLRGVELVPFTYPTGHRLTWE ELTDLVSSRLRNLKSGMLVGESFGGAVALKTTFAKPEAVKGLCLVAGFSANPEPFAAG LGSTATRVLPKPLMKPVARLLAGWKLAGTLKGEERTRFLERFSNLDYHDIAARLDLLQ KFDVEDRLGALRCPVDLIYGSEDPIASSRRQRELWTRIPDVRQHQLDSYGHLISHEVP IGVAARLQSWVERVKGRCGN" gene complement(2040..2441) /locus_tag="KDB32_07995" CDS complement(2040..2441) /locus_tag="KDB32_07995" /inference="COORDINATES: protein motif:HMM:NF012790.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="rhodanese-like domain-containing protein" /protein_id="MCA8919005.1" /translation="MQLPPRLRPTDAATILEPTTKAAQKRKRTKKTAPKKTATAVVND NLILIDVRMPQELTGGTIPGSEHIPLDNILDTVPKSIPDPQAPVVVYCKSGMRGGMAQ GALRKLGYTNVSNIVGGFDAWQKAGLPVTIS" gene 2688..3635 /gene="mdh" /locus_tag="KDB32_08000" CDS 2688..3635 /gene="mdh" /locus_tag="KDB32_08000" /EC_number="1.1.1.37" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007725335.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="malate dehydrogenase" /protein_id="MCA8919006.1" /translation="MAFVRPKISVFGAGFVGSTVAQRCADRELGDVLLIDIVEDMPQG KALDMFESSPVERFDARINGSNNPADVKDSDVVVVTSGIARKPGMSRDDLLTTNAKII QSVADSIKENAPNAIVIMVTNPLDVMSWVAHKRLGFAKNKVMGMAGVLDSARMASFVA MELNCSVKDISPMVLGGHGDTMVPLPRYTTVSGISITDLIPADRIESINDRTRKGGAE IVGLLKTGSAYYAPGASAAQMVESIVKDQRRILPTCALLEGEYGLKDTWQGVPCMLGK NGIERIVELKLTDDEKKLLEKSNEHVAGAINQAAKLLGM" gene complement(3689..4600) /locus_tag="KDB32_08005" CDS complement(3689..4600) /locus_tag="KDB32_08005" /EC_number="2.7.7.65" /inference="COORDINATES: protein motif:HMM:TIGR00254.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="diguanylate cyclase" /protein_id="MCA8919007.1" /translation="MGDTQRNQFNTILNAIPDVMVLVDSDLSILSVNAAWGHLAREQR ADEQWFRPLAKSYFAACAAVGLGDNKFAIDAEAGLAEVINGKRETFEFEYPCHTPAEP CYYLLRVSAIPAETGRAALCAHVNITRRKLAEMELHRETRKLREQSLTDPLTGLRNRR ALELLGRQYWANAQRRGGVLAVIYLDLDDFKPINDNFGHQEGDRVLCFVADYLRETFR ESDVIARVGGDEFVVLAWMAKPDDLDKIKARMNLEHRMTSSSGDVYRVGMSVGYAVHD PKTEGDLLNLISHADTAMYKNKKSRKE" gene complement(4675..5265) /locus_tag="KDB32_08010" CDS complement(4675..5265) /locus_tag="KDB32_08010" /inference="COORDINATES: ab initio prediction:GeneMarkS-2+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS-2+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="MCA8919008.1" /translation="MSTDVRKVEDTGWRRPLEALVAQTAQLRENFASHLDFTFSRSLI GSAEKQAAAIIDADRKGMIARLNGWRSFVPTRNVTWAEVVDRLTNAFGVPAQLDKTIE QRERHLQKLFHERGLPKGSVLYSVADAVHDGSVVKSQTRRMLLDPVRLVMQTLQGITS LYTTDAAPAVNCVYAILEMLHPAEVPTPFSAIPEML" gene complement(5327..6508) /locus_tag="KDB32_08015" CDS complement(5327..6508) /locus_tag="KDB32_08015" /inference="COORDINATES: protein motif:HMM:NF019606.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="inositol-3-phosphate synthase" /protein_id="MCA8919009.1" /translation="MARTGVYFIGARGNVATTAMVGAIAISQGRSPATGMVTDGPEFK GLPLIGVQDLLFGGADVSIKPLPAKAVELANARILPTELAVELKDELETLDKQLHTVK GFAFGSVPDGGYLAELTRIEERLARFRSDNNLDRVVVVNVSSTEPHFAETPDYDTPEQ LMKALTTAGGGFTSATLLNALAALRQGCAYVNFTPSQASELPALRKIARDSRLPHAGK DGKTGETLLKTVLGPMFLARNLKVMSWVGNNFLGNNDGAVLDAPASKDSKLRQKDAAL REMLGGAFVKTDIQYVPSLGDWKTAWDLIHFQGFLGTPMTMTFTWQGCDSALAAPLVL DLVRLTDLAWQREESGLLAQLAPFFKNPLDGHTHDFHQQMAGLYAWLDVVRAGQTFER K" gene complement(6517..>6869) /locus_tag="KDB32_08020" CDS complement(6517..>6869) /locus_tag="KDB32_08020" /inference="COORDINATES: ab initio prediction:GeneMarkS-2+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS-2+." /codon_start=3 /transl_table=11 /product="hypothetical protein" /protein_id="MCA8919010.1" /translation="VESVGNETVGLAPGDLPDELPADTKDQAVLADKLEGAAAGLSAD AARLKADSAATTQDSGTSGDFDVSVSPREFVTLPALKLEPGKSTAQSLATQTPARRAL AARAVQSLERIQEG" BASE COUNT 1409 a 2059 c 1945 g 1456 t ORIGIN 1 gttcaatacc ggcaagcgcg acgcggacga tggcatctgc caaacgcttc ggttgtgtgg 61 aactgtcgag gtgtttcatg gcttcgaatg aaagattcgg gtgcagcgat aactcgccgc 121 cgagtttgag atcaaccggt aaccgcccaa gcacaagcaa tgcatcggtc agtgtgggct 181 gcgagccgcc acgaccgtag caagccgggc cgggactcgc cccggcgctc tccgggccga 241 catgcaacgc gcctcctgcg tcaaaatacg cgacactgcc gccacccgcg ccgatcgtat 301 gaatgtccaa cgaaggcgta cgcagcggca agccggcaat cggtgtttct gcacgaagta 361 acggcgtctt cgtgacaagg ctgacgtctg tacttgtacc gccgatatcg aacgccacac 421 cagtgtcaaa tccttcggca tcaagtgcgt gtcggacgcc cgcaatccca ccagcagggc 481 ccgacagagc gagcttaacc ggttcggcag cagcaatctc tgcgggaagc caaccgccgg 541 ctgaatgcat gagcacgacg cgtgtcggct tcatctttcg ttccaacgtc gcaatgtagg 601 tctgtagcag cggcatcaag ccggcatttg cagcggtgac taagctgcgt tcgtactcgc 661 ggaattcagg tgcaagttcg ctggacaaca ccacggggag tttgagcttt cgcagcgcac 721 gccccagcgc aagctcgtgc tttgagttgg cgtaggaatg aagcaagcag accgccactg 781 cttcgacgtt caactcgcgc acttcttcga cgactcgctc gacttcttga cgtgtcaact 841 ttagcaatac cgacccgtct ggctccaaac gctcatgaca ctcgatacgg aaatcgcgcg 901 agaccagctg actccgttga acaggttcga gttggtagag gtcttccgga gcgcggttct 961 gccgcccgat tgccagcacg tcacgaaacc ccttggtcgt tacaaacgcg actcttgcga 1021 gctttccctc aagcagcgcg ttcgtcgcta cggtcgttcc gtgcagaagc aagtcggcac 1081 cacccagacg ttgaagcgcc tccgcaacag cgcgcccggg ttcgtccgga gtacttggga 1141 ttttgcaacg cttgaatccg ccgctacgca ggacgcaggc atccgtaaat gtgccgccca 1201 cgtcaatcgc cgcgatggct gtacctctag ttgccacatc tgcccttcac acgttcaacc 1261 cagctttgaa gccgcgccgc cacgccaatt ggtacttcgt gagaaatcag gtgtccgtac 1321 gaatcaagct gatgttgacg tacatcgggg atgcgtgtcc atagctcacg ttgccgcctt 1381 gacgaggcaa ttgggtcctc gctgccatag atcaggtcca ccggacaacg caacgcaccc 1441 agcctgtcct caacgtcaaa cttctgcagc aggtctaatc tcgccgcgat gtcgtggtag 1501 tcgaggtttg aaaagcgttc aaggaagcgg gtgcgctctt cgcctttcag agtgcctgca 1561 agcttccagc cggccagaag gcgtgcgacc ggtttcatca acggtttcgg tagaacccgc 1621 gtagccgtgc ttccaaggcc tgcggcaaac ggttcaggat tggcgctgaa gcccgccacc 1681 agacacagtc ctttcacggc ctcaggcttc gcgaacgtag tcttcaacgc aaccgctccg 1741 ccaaatgact cccctaccaa catccctgat ttcaagttgc gcaaccggga actcaccaga 1801 tctgtcagtt cctcccaagt caggcgatgt ccggttggat acgtaaaggg cacgagttcc 1861 acgccgcgca agtacgactg aagtttgccg ggcacgtttt cgatgccgtc gagtcccgga 1921 agatagagaa cgcgcagtgt catggttccg acaggtttag agtagacctg acgtcttgcc 1981 aaccgccgtt aaagcaacaa ggacggccaa ccggccgtcc ttgttcgtgc ctggattctc 2041 tagctgatgg tgacgggcaa cccagccttt tgccatgcgt cgaaaccacc gacgatgttg 2101 gagacattgg tgtagcccag cttgcgaagc gcgccctgcg ccatgccacc gcgcattcca 2161 cttttgcagt acacaacaac cggcgcctgc gggtcgggaa tcgactttgg cacggtgtcc 2221 agaatgttgt cgagcgggat gtgttctgag cctgggatcg ttccgccggt cagttcctgc 2281 ggcatccgaa cgtcaatcag gatcaggttg tcattcacaa ccgccgtggc ggttttcttc 2341 ggagcggtct ttttggtgcg tttgcgtttc tgtgctgctt ttgtcgtggg ctcaagaatc 2401 gttgctgcgt cagtgggtcg aagtctcggt ggaagctgca tcgcggttcc ttcctcaaat 2461 ggccgataag cggaagcgtc agagtgtcga actaataatt atacgtacgc tttacagaaa 2521 tagaagccta aagcacttga cgttaggcac tccaaaccta tgttccgccg cccattatgt 2581 cagaaattgg ggtcattgac cccctgacgt cccaccgtta tatgacgcta cagatgccgc 2641 gtacgggcag ctaatctgcc cgcaattgac tgcccgtgag gactaccatg gcttttgtgc 2701 gacccaagat ttccgttttc ggcgccggct ttgtcggctc aactgtggcc caacgctgtg 2761 cagaccgcga gttgggcgac gtactgctga ttgatatcgt tgaagacatg ccccagggca 2821 aagctctcga tatgttcgag tcgagccccg ttgaacgctt tgacgcacga atcaacggca 2881 gcaacaaccc ggccgacgtg aaagatagtg acgtcgttgt tgtgacttcc ggcatcgcaa 2941 gaaaaccggg catgagccgc gatgacttgc tgaccaccaa cgccaagatc atccagagcg 3001 ttgcggacag catcaaagaa aacgcgccca atgcgattgt catcatggtc acaaaccctt 3061 tggacgtaat gagttgggtc gcacacaagc gcctcggctt cgccaaaaac aaggtcatgg 3121 gcatggccgg cgtgctggat agcgcccgca tggccagctt tgtcgcgatg gaactgaact 3181 gcagcgtcaa ggacatcagc ccaatggttc tgggcgggca cggcgacacg atggtgccgc 3241 tgccgcgcta cacgactgta agcggcattt cgatcactga cctgatcccg gccgaccgca 3301 ttgagtccat caacgaccgt acgcgcaagg gtggcgctga gatcgtcggg ttgctgaaga 3361 ccggtagcgc ctattacgcg ccaggtgcca gcgccgcgca gatggtcgaa agcattgtga 3421 aggaccagcg ccgaattctg ccaacctgcg cactgcttga aggcgagtac gggctgaaag 3481 acacgtggca aggcgtaccc tgcatgctcg gcaagaacgg catcgagcgt atcgttgagc 3541 tgaagctgac tgacgacgaa aagaagctgc tcgagaagag caacgaacac gtggctgggg 3601 ccatcaacca ggccgcgaaa ctgctgggaa tgtagtctgg aatgaccaag aaagcgggcg 3661 ccccagggcg cccgcttcgt tttgttgtct actctttgcg cgatttcttg ttcttgtaca 3721 tggctgtgtc ggcgtggctt atcaggttta acaagtcgcc ttccgtcttc gggtcgtgca 3781 ccgcataccc tacgctcatc cctaccctgt aaacatcgcc gctggatgac gtcatccggt 3841 gttcaagatt catgcgggct ttgatcttgt ccagatcatc cggcttcgcc atccacgcca 3901 acacgacaaa ctcgtcgccg ccaacgcggg cgatcacgtc agactcacgg aatgtctcgc 3961 gcaggtagtc ggcgacgaag cacagtacac ggtcaccctc ctggtgtccg aagttgtcgt 4021 tgatcggctt gaagtcatcc agatcaaggt agatcacagc cagcacgcca ccacgccgtt 4081 gcgcgtttgc ccaatactga cgccccagca gctcaagcgc acgacggttt cgcaacccgg 4141 taagaggatc agtcaacgat tgttcgcgca actttcgcgt ctcgcgatgt aactccatct 4201 cggcgagctt acgacgcgtg atgtttacat gggcacacag tgctgcacgc ccggtttcgg 4261 cgggaatggc gcttacgcgg agcagataat agcagggctc agctggcgta tgacaggggt 4321 actcaaactc gaatgtctcg cgcttgccgt tgatgacttc agctaggccg gcctcggcgt 4381 caatcgcgaa cttgttgtcg cccaatccga cggccgcgca agctgcgaag taagacttcg 4441 caagtggacg aaaccattgc tcgtccgcac gctgctcacg cgcaagatgc ccccacgccg 4501 cgttgacact gaggatcgat agatcggaat ccaccagcac catgacgtca ggaatggcgt 4561 tcaaaatcgt attgaactgg ttgcgttgcg tgtcgcccat accacgagca tatcccgtga 4621 tatcgacaac cgcgagttga actgcttcga atcgttgggt gtgtggtgct gatgctacag 4681 catctcggga atcgcactga acggcgtcgg tacttcggcc ggatgaagca tttcaaggat 4741 cgcgtaaacg cagtttacgg caggcgccgc gtcggtcgtg tacagcgacg tgatgccttg 4801 cagcgtttgc atcacgaggc ggaccggatc aagcagcatg cggcgtgtct ggctcttgac 4861 caccgacccg tcgtgcacag catccgccac tgagtacaga actgagccct ttggcagtcc 4921 acgctcgtgg aagagtttct gcaggtggcg ctctcgttgc tcaatcgttt tgtctagctg 4981 cgccggaaca ccgaacgcgt tggtcaaacg atcgacaacc tcggcccatg tcacgttccg 5041 cgtcggcaca aagcttcgcc atccgtttag acgcgcaatc atacccttgc ggtctgcgtc 5101 aatgatggcg gctgcctgtt tctcggcgct gccaatgagt gaccttgaga acgtgaaatc 5161 gagatgcgac gcaaagttct cgcgcaactg ggcggtctgg gcgaccagcg cttccagcgg 5221 acggcgccag ccggtgtctt caactttgcg aacatctgtg gacatgcatt gccctgtctc 5281 gggcccacaa ggctagtttg atctgcaatg aattgccacc ctggaactat ttacgttcaa 5341 aggtttgccc agcccgcacg acatcaagcc atgcgtacaa cccggccatt tgctggtgaa 5401 aatcgtgagt gtgcccgtcg agcggatttt tgaagaacgg tgcgagttgc gctagcaagc 5461 cgctttcttc gcgctgccat gccagatcag tcaagcgcac caggtcgagg accaacggtg 5521 cagccagtgc ggaatcacaa ccttgccagg tgaaggtcat ggtcatgggt gtaccgagga 5581 agccttggaa atggatcagg tcccaggccg ttttccagtc accgagactc ggcacgtact 5641 ggatgtctgt cttcacaaac gcgccgccaa gcatctcgcg tagtgcggcg tccttctggc 5701 gcagcttact gtctttgctc gcgggcgcat ccagcaccgc tccgtcgttg ttgccgagaa 5761 agttgttgcc gacccacgac atgaccttga gattgcgcgc gaggaacatc ggccccagca 5821 cagtcttcag caatgtctca ccggtcttgc cgtccttgcc cgcgtgcggc aggcgtgagt 5881 cccttgcgat cttgcgtaat gcgggcaact cgcttgcctg gctgggagta aagttgacgt 5941 aagcgcaacc ctgacgcagt gccgccagtg cgttcagcag cgttgcactg gtgaatcccc 6001 caccggccgt ggtgagcgct ttcatcaact gctcaggcgt gtcgtagtca ggcgtttcgg 6061 cgaagtgcgg ctcggtgctg cttacgttga cgacaaccac ccggtccaga ttgttgtcgc 6121 tgcggaaacg ggcaagacgt tcttcaatgc gtgtgagttc ggcgagatag ccgccgtccg 6181 gcacactccc gaacgcgaat ccctttaccg tgtgaagctg tttgtccagc gtttccagct 6241 cgtctttcaa ctcgacggca agttctgtcg gcagaatgcg agcatttgcc aactcaacgg 6301 ctttggctgg caaaggctta atcgagacat cagcgccacc gaacaggagg tcctgtacgc 6361 cgatcagcgg taatcccttg aactcagggc cgtccgtgac catgccggtg gcgggtgacc 6421 gcccttgtga aatcgcgatc gcgccaacca tcgccgtagt ggctacgttg cctcgcgctc 6481 cgatgaagta gactcctgtg cgtgccatct gttgcctcaa ccttcctgaa ttctttccag 6541 cgactgcacc gcccgtgctg ccaaggcgcg ccgcgccggc gtttgcgttg ccagtgactg 6601 cgctgtcgac ttgcctggct caagcttcag cgccggcagc gtgacaaact cgcgcggtga 6661 aacacttacg tcaaaatccc cgctggtgcc gctgtcctgc gtcgtagcgg cgctgtcggc 6721 cttcagtcgg gcagcgtcag cgctgagtcc tgcggcggcg ccctcaagtt tatcggctaa 6781 gacagcctga tccttcgtgt cggcaggcag ctcgtccggc aaatcgcccg gcgcaagccc 6841 gactgtttcg ttgccgacgc tttctacct //