LOCUS JAGRPA010000581 6322 bp DNA linear ENV 12-OCT-2021 DEFINITION MAG: Verrucomicrobiia bacterium isolate HKST-UBA32 2012-05-30_1_(paired)_contig_475930, whole genome shotgun sequence. ACCESSION JAGRPA010000581 JAGRPA010000000 VERSION JAGRPA010000581.1 DBLINK BioProject: PRJNA432264 BioSample: SAMN14564467 KEYWORDS WGS; ENV; Metagenome Assembled Genome; MAG. SOURCE Verrucomicrobiia bacterium (activated sludge metagenome) ORGANISM Verrucomicrobiia bacterium Bacteria; Verrucomicrobiota; Verrucomicrobiia. REFERENCE 1 (bases 1 to 6322) AUTHORS Wang,Y., Ye,J., Ju,F., Liu,L., Boyd,J.A., Deng,Y., Parks,D.H., Jiang,X., Yin,X., Woodcroft,B.J., Tyson,G.W., Hugenholtz,P., Polz,M.F. and Zhang,T. TITLE Successional dynamics and alternative stable states in a saline activated sludge microbial community over 9 years JOURNAL Microbiome 9 (1), 199 (2021) PUBMED 34615557 REMARK Publication Status: Online-Only REFERENCE 2 (bases 1 to 6322) AUTHORS Zhang,T. TITLE Direct Submission JOURNAL Submitted (14-APR-2020) Civil Engineering, The University Hong Kong, Pokfulam Road, Hong Kong 999077, Hong Kong COMMENT The annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP). Information about PGAP can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: CLC de novo assembler v. 6.04 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 32x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 04/20/2021 17:10:28 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline (PGAP) Annotation Method :: Best-placed reference protein set; GeneMarkS-2+ Annotation Software revision :: 5.1 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 4,690 CDSs (total) :: 4,665 Genes (coding) :: 4,607 CDSs (with protein) :: 4,607 Genes (RNA) :: 25 tRNAs :: 23 ncRNAs :: 2 Pseudo Genes (total) :: 58 CDSs (without protein) :: 58 Pseudo Genes (ambiguous residues) :: 24 of 58 Pseudo Genes (frameshifted) :: 20 of 58 Pseudo Genes (incomplete) :: 23 of 58 Pseudo Genes (internal stop) :: 3 of 58 Pseudo Genes (multiple problems) :: 12 of 58 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..6322 /organism="Verrucomicrobiia bacterium" /mol_type="genomic DNA" /submitter_seqid="2012-05-30_1_(paired)_contig_475930" /isolate="HKST-UBA32" /isolation_source="activated sludge from Shatin waste water treatment plant collected monthly from 2007 through 2015" /db_xref="taxon:2499141" /environmental_sample /geo_loc_name="China:Hong Kong SAR, Shatin waste water treatment plant" /lat_lon="22.406236 N 114.213394 E" /metagenome_source="activated sludge metagenome" /note="metagenomic" gene <1..198 /locus_tag="KDM91_16060" CDS <1..198 /locus_tag="KDM91_16060" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015346146.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ATP-binding protein" /protein_id="MCB1236583.1" /translation="EVSDAVIDHIIAAAARRETGARGLRASLAPHLEEAAFQTFGQDG AGKVRVDLVDGEIRVEVALAA" gene 266..637 /locus_tag="KDM91_16065" CDS 266..637 /locus_tag="KDM91_16065" /EC_number="4.1.1.11" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009576177.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="aspartate 1-decarboxylase" /protein_id="MCB1236584.1" /translation="MRFLLRSKIHLATVTEANPDYVGSITIDRRLIDAVGLWPGEKVL VASATTGARLETYVLEGEAGSGIIGINGAAAHLINAGEKVIIMGFELCEKPVEPKVVL VDRDNRIVETLVERPEMAVCG" gene 856..2589 /locus_tag="KDM91_16070" CDS 856..2589 /locus_tag="KDM91_16070" /inference="COORDINATES: ab initio prediction:GeneMarkS-2+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS-2+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="MCB1236585.1" /translation="MAWHFTVVLPRSDDWNIVAPVIVEARAGNISWESLNAQVGDARM PVARIVHAMLALSSGWNQRLESMAILSFTMLSFFYFELLARHAFGRRPRAFAAMTLAG SLLTFWPAMGMYWTFPTMLCYAIGTSLAFAIIPLMRTPLPPEARAAGGAVLGFLACET FLSGWLAWLVLFGLTFLSAWEDHWSRPWRRAIALIGIGLILAIGIYIPGWHSHAGQPM RGGNPVGIGAYTLFFFRWLGSPFSFPPIGIDDAEAVFRWQMTVGLVVGILGAVCIAGI TTVALLRCRLFRGEIRRIAPWLAIIGFSLAAGILVTAGRTRFSPSFCFLGRYISFSIW AYIGAAALFFEIWPFPVGKLPRRLAGAAIPLLLALYGFGFWRGLLSIESDHFATERIR CAFELMPLYVGKSPEIEGDVLRHPFSPPPAELWRLGNRVRQADLLPVPLVDEATWRAR LREDSGGAAGKLESLSPKGPDWIVSGWAADTVHRHRAHGIFITAERPGEPEKLMGFAQ KNAKRPKYEKKYRFREFSPHAGWIFTVEKERLLRQAPPGTILRAYAVDANTATFHRLD GEIDMPEAPAR" gene complement(2599..2976) /locus_tag="KDM91_16075" CDS complement(2599..2976) /locus_tag="KDM91_16075" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017504547.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="response regulator" /protein_id="MCB1236586.1" /translation="MKGRILLVDDNVNLTTLLGKALTKCGYEAHAENDSTLAVNRVRE LRPDLIVLDVMMPVMDGGDVLAELRRDFQLRDIPVIMLTALAQEAGSLARIGGGNCPV IGKPVELGVLVNEIERGLGRAAA" gene 3127..3456 /locus_tag="KDM91_16080" CDS 3127..3456 /locus_tag="KDM91_16080" /inference="COORDINATES: protein motif:HMM:NF013868.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="STAS domain-containing protein" /protein_id="MCB1236587.1" /translation="MSVHFHPSTHDLVITIDSQRFEGSDADALESDFESLADCGPIAS IQIDLARVECIDSTGVSALVSLKQRFATGATDISLQNLHPGVERVVNLLRLNRLFAIA EPMAKAG" gene complement(3493..4563) /locus_tag="KDM91_16085" CDS complement(3493..4563) /locus_tag="KDM91_16085" /inference="COORDINATES: protein motif:HMM:NF012335.1,HMM:NF019845.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="zinc-binding alcohol dehydrogenase family protein" /protein_id="MCB1236588.1" /translation="MEHSPVRSIDPVPDTQPQIFLAEPGRFERRSGPAPEPGPGEALV RVRHIGVCGTDIHAFRGHQPFFEYPRILGHELGVEVVSVNGSAACPAPGTRCAVEPYL NDPDSPASLRGKTNCCESLRVLGVHCDGGMRPLLAVPAGKLHASPVLDTERLALVEML CIGRHAVERARIESPDEFALVLGAGPIGMSVLQFLKTATRRAAVADLDAGRLDFCRSS LGIDLTLLVPRGNPVDSGVLRDLGGDGRLPTVIFDATGSAASMMSAFELAAHGGRLVF VGLFQGSVTFDDPNFHRRELTVMSSRNATAADFRATIEAIETGRIDTAPWITHRLAFD EVPDRFAETIEIPAMRKTVIAV" gene 4688..5587 /locus_tag="KDM91_16090" CDS 4688..5587 /locus_tag="KDM91_16090" /inference="COORDINATES: protein motif:HMM:NF013088.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DMT family transporter" /protein_id="MCB1236589.1" /translation="MPWYLLLPLSSAILYALASIFLKRGLRDGATMEQSFHINNLAVA LIFAPLLLLERDPVRWELWRQPLLTTVSFFAGTWLTFAAMRSADVSLVTPLMGTKVVF VAIGIAILAGQGPPPALWVAAILTAAGVFLMGFRDFRKTAAGHGIAIGAALLSAAVFG VCDVFVRLWSKNFGAMTFLALSSMGVGVLSLLVWLAKGMRPLWPRAENAKPGVFAAAG AAIIGVQAVSMGLAVSYFDDATGINVVYASRGLWSIALIGLAGPLLGNHERHTAGAAY RWRVCGSVLVTAAVVIAVLARKN" gene complement(5663..>6322) /locus_tag="KDM91_16095" CDS complement(5663..>6322) /locus_tag="KDM91_16095" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009959339.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="PQQ-binding-like beta-propeller repeat protein" /protein_id="MCB1236590.1" /translation="GELLWKGEDDAITHATPVVANLHGVRQAIYFTQTGLVAVSVEDG KVLWRQDFPFKVSTAASPVVFDDIVYCSAGYGVGGGAYRVKKDGAAFSSEQIWRTEND NINHWSTPVVKDGYLYGMFSFKEYGKGPLACVDIRTGEKKWAEPGFGPGNVTLTADGT LIVLSDKGEIVLVEAKPDAYHEIDRADVLDGKCWSTPTLADGKIYARSTTEGGCFDFS K" BASE COUNT 987 a 2061 c 2005 g 1269 t ORIGIN 1 gaagtctccg acgccgtcat cgaccacatc atcgccgccg ccgcgcggcg cgaaacgggc 61 gcgcgcggac tgagggcctc gctcgctccg catctggagg aggcggcttt tcagaccttc 121 ggccaggacg gcgcggggaa agtccgcgtc gatctcgtcg acggcgaaat tcgagtcgag 181 gtcgccctcg cggcctgatc cggcggccgg cgacggttga ctccggccgc ttttcgccca 241 ctttacccct ccttccaccc gagccatgcg tttcctgctg cgatcgaaaa tccacctcgc 301 caccgtcacg gaggcgaatc ccgactacgt cggcagcatc accatcgacc ggcgcctgat 361 cgacgccgtc ggcctgtggc ccggcgaaaa ggtgctcgtc gccagcgcca ccaccggcgc 421 gcggctggag acctacgttc tggagggcga ggccggctcc ggcatcatcg gcatcaacgg 481 cgccgccgcc catctcatca acgcgggcga gaaagtcatc atcatgggct tcgaactgtg 541 cgaaaagccc gtcgagccca aagtcgtcct cgtggaccgc gacaaccgca tcgtcgaaac 601 cctcgtcgag cggcccgaaa tggcggtgtg cgggtgattg cgcccgacgg acgcgagcga 661 gccgcggttg acaggcactc cattgccttt taagctcgcc gttctccgct cccggcaagg 721 gagcccgacg tgaaccaatc gcaggcttcg gccactcccc gcttccttca taccgcggcg 781 cgtcacctgg ccggcatccg ctgggctcgg gtggtcgctt ggggagcgtt tcttcccccg 841 ctggtcatcg ctttcgtggc ttggcatttt acggtcgtat tgccccgaag cgacgactgg 901 aacatcgtgg ctcccgtgat cgtcgaggcg cgggccggga acatttcctg ggaaagcctg 961 aacgcccagg ttggcgatgc ccgcatgccc gtggcgcgaa tcgttcacgc gatgctggcg 1021 ctttcttcgg gctggaacca gcggctcgaa tcgatggcga tcctgtcatt cacgatgttg 1081 tctttttttt atttcgaact cctggcccgc cacgcctttg gccgccgtcc ccgggcgttc 1141 gccgccatga cgctcgccgg ctcgctcctg acattctggc cagcgatggg catgtactgg 1201 acgtttccga ccatgctctg ctatgcgatc ggcacctcgc tggcattcgc gatcatccct 1261 ctgatgcgga cgccactgcc gcccgaggcg agggcggcgg gcggcgcggt cctcggcttc 1321 ctggcctgtg aaactttcct gtccggatgg ctcgcgtggc tcgttctctt cggtctcacc 1381 ttcctctcgg cgtgggaaga tcactggagc cgcccttggc gccgcgccat cgcgctgatc 1441 ggaatcggcc tgattctggc gatcgggatc tacatccctg gatggcattc ccatgcgggc 1501 caaccgatgc gcggcgggaa tcccgtcggc atcggcgcct acacgctctt cttcttccgc 1561 tggctcggtt cgcccttttc gttcccgccg atcgggatcg acgatgccga agccgtcttc 1621 cgctggcaaa tgacggtcgg tcttgtcgtc gggatactcg gcgccgtttg catcgcggga 1681 atcacaaccg tggcgctgct tcggtgtcgg ctttttcggg gggaaatccg ccgcatcgct 1741 ccctggctgg ccataatcgg attttctctc gcggccggaa ttctggtaac ggcggggcgc 1801 acgcggtttt ccccctcgtt ttgtttcctc ggccggtata tcagtttttc gatctgggcc 1861 tatatcgggg ccgccgcgct gtttttcgaa atttggccct tccccgtcgg caaactcccg 1921 cgccgactcg ccggtgcggc gatcccgctc ctcctcgcgc tctacggttt tggattttgg 1981 cgtgggcttc tttccatcga gtcggaccat ttcgccaccg agcggatccg ctgcgcgttc 2041 gaactgatgc ccctctatgt cggcaagtct cccgaaattg aaggcgatgt gctcaggcat 2101 cctttttccc cgccgccggc ggaactttgg cggttgggaa atcgcgtgcg ccaggcggat 2161 ctcctgccgg ttccgctggt ggacgaggca acctggcgag cccgccttcg cgaggactcc 2221 ggtggcgcgg ccgggaaact cgagagcctc tctccgaaag ggcccgattg gatcgtttcc 2281 ggctgggcgg ccgataccgt ccaccgccac cgcgcgcacg gaatcttcat caccgccgag 2341 cgtccaggcg aacccgagaa actgatgggc ttcgcccaga aaaatgccaa gcgaccgaaa 2401 tacgaaaaga aataccgttt tcgcgaattc tcgccccacg cgggctggat cttcactgtg 2461 gaaaaagagc gcctgctccg gcaggcgcct cccggaacca ttcttcgcgc ctacgcggtc 2521 gacgcaaaca ccgcgacctt ccaccgcctc gacggggaga tcgacatgcc cgaggccccc 2581 gcccgctgag ttcgcgaatc aggccgccgc gcggccgagg ccgcgctcga tctcgttgac 2641 caggacgcca agttcgaccg gcttgccgat cacgggacag ttgccgccgc cgatccgggc 2701 caggctgccg gcctcctgcg ccagggcggt cagcatgatc accgggatgt cgcgaagctg 2761 aaaatcgcgg cgcagttcgg cgagcacgtc gccgccgtcc atcaccggca tcatcacatc 2821 gaggacgatc aggtcggggc gcaactcgcg gacccggttc accgcgaggg tggagtcgtt 2881 ttccgcgtgc gcctcgtaac cgcacttcgt cagcgctttg ccgagaagcg tggtcagatt 2941 gacgttgtcg tcgacgagaa ggatgcgccc tttcatggtg ccagaaatcg acggcaaaag 3001 tagtccagaa attatgaaaa ttcaacttta ttagtgattt tccaaaataa ttgcgctctt 3061 cgtttttgaa gttgcgaaaa ttatcattaa ttgctataat ttctgaaaat cacacctcgt 3121 tcctccatgt ccgttcactt ccatccttcc acccacgacc tggtcatcac catcgattcc 3181 caacgtttcg aaggctccga cgccgacgcg ctcgaatccg atttcgaatc cctcgccgat 3241 tgcggcccca tcgcttcgat tcagatcgac ctcgcccggg tcgaatgcat cgacagcacc 3301 ggtgtcagcg ccctcgtcag tctgaagcag cgtttcgcca ccggcgcgac ggatatttcc 3361 ctgcagaatc tccaccccgg cgtcgagcgg gtcgtcaatc tgcttcgtct caatcgcctt 3421 ttcgccatcg cggaaccgat ggccaaggcg ggatgagcgg aagcctttta ccgggccgat 3481 cttccccgat cgctacaccg cgatcaccgt cttgcgcatc gcgggaatct ctatggtttc 3541 ggcgaagcgg tccggcacct cgtcgaaggc caggcgatgc gtgatccacg gcgcggtgtc 3601 gatcctcccg gtctcgatgg cctcgatggt ggcccggaaa tcggcggccg tcgcgttgcg 3661 gctcgacatc acggtcagct cgcggcggtg gaaattcgga tcgtcaaagg tcacgctccc 3721 ctgaaacagg ccgacgaaga cgaggcgccc gccgtgcgcg gcgagttcga aggcgctcat 3781 catggaggcg gcgctgcccg tggcgtcgaa gatcacggtg ggcagccggc cgtctccgcc 3841 caggtcgcgc aggaccccgg agtcaacagg gttgcctcga ggaaccaaca gggttaggtc 3901 gattccgagg ctggatcggc aaaaatcgag ccgtccggcg tcgagatcgg ccacggcggc 3961 gcggcgggtc gcggttttca ggaattgcag gacgctcatc ccgatcggtc cggcgccgag 4021 gacgagggcg aactcgtcgg gcgactcgat gcgcgcgcgc tcgacggcgt ggcgcccgat 4081 gcacagcatc tcgaccagcg cgaggcgctc ggtgtcgagc acgggtgagg cgtggagttt 4141 cccggcggga acggcgagca gggggcgcat gccgccgtcg caatgcacgc ccagcacgcg 4201 gaggctttcg cagcagttgg tttttccgcg aagcgaggcc gggctgtccg ggtcgttcag 4261 ataaggctcg acggcgcagc gggtccccgg cgcgggacag gcggcggaac cgttgaccga 4321 cacgacttcg acgccgagtt cgtgcccgag gatgcgcggg tattcgaaaa agggctggtg 4381 tccgcggaag gcgtggatgt ccgtgccgca gaccccgatg tggcgcacgc gcacgagggc 4441 ttcgcccggc ccgggctcgg gagcgggtcc cgatcgcctt tcgaaacggc cgggttcggc 4501 gagaaaaatc tgtggctgcg tgtctggcac cggatcgatc gagcgaacgg gagaatgctc 4561 catttttcaa agatgacacc cggattttcg gcctcaaaac acccgggcgg tgcgggcgaa 4621 ttgatcgtgc cttttcgccg gcgcgggtga tggttggcgt cgtcgtcccc cgttccgcct 4681 ttttcgcgtg ccctggtatc tgctcctgcc cctgtcctcc gcgatcctct atgcgctggc 4741 ctcgattttc ctgaaacggg gcctgcgcga cggggcgacg atggagcagt cctttcacat 4801 caacaatctg gcggtggctc tgattttcgc gcccctgctg ctgctggagc gcgatccggt 4861 gcgctgggag ctgtggcggc agcccctgct gacgaccgtc tcgtttttcg ccggaacgtg 4921 gctgaccttt gccgccatgc gcagcgcgga tgtctcgctg gtcacgccgc tgatgggcac 4981 caaggtcgtc tttgtcgcga tcgggatcgc catactcgcc ggccaggggc cgccgccggc 5041 tctctgggtg gccgcgatcc tgacggcggc gggcgttttt ttgatgggct ttcgcgattt 5101 tcgaaagacc gcggccggtc acgggatcgc catcggcgcg gcgctgctga gcgcggcggt 5161 cttcggggtc tgcgatgtct tcgttcgcct gtggtcgaaa aacttcggcg ccatgacctt 5221 tctcgcgctc tcgtcgatgg gcgtcggcgt gttgtccctg ctggtttggc tggccaaagg 5281 catgcgcccg ctgtggccgc gggcggaaaa cgcgaagccg ggtgttttcg ccgcggccgg 5341 agcggcgatc atcggcgttc aggcggtctc gatggggctc gcggtgagct acttcgacga 5401 cgccacgggg atcaatgtcg tttacgcctc gcggggcctg tggtcgatcg cgctgatcgg 5461 gctggctggt ccgctgctgg gaaaccacga acgccacacg gccggcgccg cctaccggtg 5521 gcgggtctgc ggctcggtgc tcgtgacggc ggcggtcgtc atcgccgtgc tggcgcgaaa 5581 aaattgaggg cttcgcaagg cggagaaggg atggccgcgg agggtgtttg gtttcccgcc 5641 gacctccgct cttgagcctc aatcacttgg agaaatcgaa acacccgccc tccgtggtgc 5701 tgcgggcata gatcttgccg tcggcgagcg tcggggtgct ccagcatttt ccgtcgagga 5761 catcggcgcg gtcgatctcg tgataggcgt ccggcttggc ttcgacgagc acgatttcgc 5821 ctttgtcgga caggacaatc agcgtgccgt cggccgtcag cgtgacattt cccggtccga 5881 atccaggttc ggcccatttt ttctcgcccg tccggatgtc gacgcaggcg agcgggcctt 5941 tgccgtactc cttgaagctg aacatgccgt agagatatcc gtccttcacc acgggcgtgc 6001 tccagtggtt gatgttgtcg ttttccgtgc gccagatctg ctcggaggaa aaggccgcgc 6061 cgtccttctt gacgcggtag gcgcctccgc cgacgccgta tccggccgag caatagacga 6121 tgtcatcgaa cacgacgggc gaggccgccg tggatacttt gaagggaaaa tcctggcgcc 6181 agagcacctt tccatcctcc accgacacgg cgaccagccc cgtctgggtg aaatagatcg 6241 cctgccgcac gccgtggaga ttcgccacca ccggtgtggc gtgcgtgatg gcgtcgtcct 6301 cgcctttcca aaggagttcg cc //