LOCUS JAGRCD010000160 5876 bp DNA linear ENV 12-OCT-2021 DEFINITION MAG: Anaerolineales bacterium isolate HKST-UBA17 NODE_160_length_5876_cov_2.656148, whole genome shotgun sequence. ACCESSION JAGRCD010000160 JAGRCD010000000 VERSION JAGRCD010000160.1 DBLINK BioProject: PRJNA432264 BioSample: SAMN14563008 KEYWORDS WGS; Metagenome Assembled Genome; MAG. SOURCE Anaerolineales bacterium (activated sludge metagenome) ORGANISM Anaerolineales bacterium Bacteria; Chloroflexi; Anaerolineae; Anaerolineales. REFERENCE 1 (bases 1 to 5876) AUTHORS Wang,Y., Ye,J., Ju,F., Liu,L., Boyd,J.A., Deng,Y., Parks,D.H., Jiang,X., Yin,X., Woodcroft,B.J., Tyson,G.W., Hugenholtz,P., Polz,M.F. and Zhang,T. TITLE Successional dynamics and alternative stable states in a saline activated sludge microbial community over 9 years JOURNAL Microbiome 9 (1), 199 (2021) PUBMED 34615557 REMARK Publication Status: Online-Only REFERENCE 2 (bases 1 to 5876) AUTHORS Zhang,T. TITLE Direct Submission JOURNAL Submitted (13-APR-2021) Civil Engineering, The University Hong Kong, Pokfulam Road, Hong Kong 999077, Hong Kong COMMENT The annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP). Information about PGAP can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: CLC de novo assembler v. 6.04 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 153x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 04/20/2021 03:52:53 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline (PGAP) Annotation Method :: Best-placed reference protein set; GeneMarkS-2+ Annotation Software revision :: 5.1 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 3,246 CDSs (total) :: 3,219 Genes (coding) :: 3,208 CDSs (with protein) :: 3,208 Genes (RNA) :: 27 tRNAs :: 24 ncRNAs :: 3 Pseudo Genes (total) :: 11 CDSs (without protein) :: 11 Pseudo Genes (ambiguous residues) :: 0 of 11 Pseudo Genes (frameshifted) :: 1 of 11 Pseudo Genes (incomplete) :: 9 of 11 Pseudo Genes (internal stop) :: 1 of 11 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..5876 /organism="Anaerolineales bacterium" /mol_type="genomic DNA" /submitter_seqid="NODE_160_length_5876_cov_2.656148" /isolate="HKST-UBA17" /isolation_source="activated sludge from Shatin waste water treatment plant collected monthly from 2007 through 2015" /db_xref="taxon:2073117" /environmental_sample /geo_loc_name="China:Hong Kong SAR, Shatin waste water treatment plant" /lat_lon="22.406236 N 114.213394 E" /metagenome_source="activated sludge metagenome" /note="metagenomic" gene <1..492 /locus_tag="KDD72_07975" CDS <1..492 /locus_tag="KDD72_07975" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_014713134.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ATP-binding cassette domain-containing protein" /protein_id="MCB0118951.1" /translation="MVFARPVTLPMSIRENITYGLELAGEKRRAKLNEAVERSLKLAA IWDEVKDRLNEPAIALSGGQQQRVCLARVLALQPEIILLDEPTSGLDPISTGKVEAAL YELKKNFTVILVPHSVQQAARTADHAAFFLQGELIEYGDGKKMFTAPKQKKTEDYIMG RFG" gene complement(537..1880) /locus_tag="KDD72_07980" CDS complement(537..1880) /locus_tag="KDD72_07980" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013561174.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="nicotinate phosphoribosyltransferase" /protein_id="MCB0118952.1" /translation="MSIFDGKRLTDETFKLDIERMRQGWYSDKYFENINRMLTALSAE GYFYSGDHHNLPEDMSPEQIAVGDIEVEMQWFTRRMGKTTIVGVDKALAMLRHCTGYF EGDTFVDTSDTLEVSAVHDGTIVKYEGDPLKIQPVIKVRGKYRDFAMLETPTLGILTR SSRVATNVYETLVAARGKPVLFFPARFDVHEVQAADGYAYNIAVQRFNRDHAQDLGPF VSTDAQGDWWGGAGGGTVAHAAIASFLGDTAEAMMQFSNILPPNIPRIALVDFNNDSV RDSLRVLDVMFAKYRELMTDGYKEEAEKYKLFGVRLDTSGSLRDVSVQPLGEPMLDLG VNPRLVFNVRSALDNAWERWDLPQSWKEAAKEFCHNVKIVVSGGFNPEKIRRFEKLDV PVDIYAVGSYLFNNSNSTVTDFTADVVRVKVHGEWIDMAKVGRQVGENEKLERVW" gene complement(1925..2932) /locus_tag="KDD72_07985" CDS complement(1925..2932) /locus_tag="KDD72_07985" /inference="COORDINATES: ab initio prediction:GeneMarkS-2+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS-2+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="MCB0118953.1" /translation="MTNLPIPEFFDPTRVGEVWKVDYAARVEDARKFALQHDLKPASA SKEHISLLLIDVQNTFCMPGFELFVGGRSGTGAVDDNVRLCEFIYRNLGRITHILATM DTHTSQQIFHPIFFVDAEGNHPAPYTDIHAEELRSGKWTFNPALSPQLGVAPEYGQQM MIHYAEALEKAGKYALTVWPYHAMLGGIGHAIVPAVEEAVFFHSHARIDQPYFAIKGD KPFTENYSVIGPEVLTGPMDEVLGTRNPMFIQHLQEVDKLYIAGQAKSHCVAWTVQDL LDDIMATDPELAKKVYLLDDCSSAVVVPGVVDHTEAADEAYTRFAQAGMHIVKSTDAI E" gene complement(3025..4449) /locus_tag="KDD72_07990" CDS complement(3025..4449) /locus_tag="KDD72_07990" /inference="COORDINATES: protein motif:HMM:NF012529.1,HMM:NF013828.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ATP-binding protein" /protein_id="MCB0118954.1" /translation="MKPLGDPNCPHCGGAGYVRYDVPVGHEKFGKLETCVCRARDVAE AARSRLFAMSNLNRLSHLTFENFNSSGNDKAKFMSGQERENLRVAFDASEEFARLPKG WLLLEGGYGCGKTHLAAAIANFAVNVGMPTLFITVPDLLDSLRFAYDDPETTFEQRFE EIRNSGLLILDDFGTQNATPWAQEKLFQIINFRYINKLPTVITTNLMLDEIEARIRSR LQDDGFVKHLKILAPDYRRPEETSNPGISMLALPEMKVMTFKTFQPRHDEVGTEAVMT TTTEKNDRFGNKVKDKEITRIKISKEDVRTLDDAHNKALEFAEKPGGWLVLLGGSFCG KTHLAAAIGNYIIALGGQANMIDAAGLPDYIREPIGGRMDVTFNRRLNEIRNIPMLIL DDLKSSGSLTAWAEERMMAILSYRYNAHLPTVITSTLRQDEFALSYPNLWNKLLDPTK CQVLAINMPPYRRVAKGGRSTKKK" gene complement(4454..5104) /locus_tag="KDD72_07995" CDS complement(4454..5104) /locus_tag="KDD72_07995" /inference="COORDINATES: protein motif:HMM:TIGR01446.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DnaD domain protein" /protein_id="MCB0118955.1" /translation="MTQFPGFTSSETFTQVPDSLLRLMNDIDDIAELKVTLFAIQRIE HLEGNFRALCETDFEAEALGLTIDEIRRGLGKAVERQTLLRAENEADVFYFLNSPRGR ISAEAFAKGQWRDAMRAYVPNKSNVFKLYEENIGPLTPLLADMLKEAERNYPAAWFEE AFEIAVSRNVRNWKYIEAILSRWKENGKDERRDPKDSVKDAKRYTEGEFSEFFKRD" gene complement(5138..>5876) /locus_tag="KDD72_08000" CDS complement(5138..>5876) /locus_tag="KDD72_08000" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013559112.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=2 /transl_table=11 /product="DnaB-like helicase C-terminal domain-containing protein" /protein_id="MCB0118956.1" /translation="LHTITGRLGQGKTGFLLSIAKNAGLTHKKHVAIFSLEMSNEQVV QRLIAQETGIDSQRLRTGKLLENEWSLFTHAIEVFSDTHIYLDDTPAITPMQLRTKCR RLHMEFGLDLVIVDYLQLMGGDIRTDNRVQEVSQISRSLKVLARELNVPVLTAAQLSR AVEQRTDKRPVLSDLRESGSLEQDADIVMFIYRPDQYEKDTDKQNIAQIIVAKHRNGP VGDVELIFRGALAKFENAATKVFKPNE" BASE COUNT 1374 a 1666 c 1454 g 1382 t ORIGIN 1 atggtttttg cgcgtccggt cacattgccg atgagcatcc gcgagaatat tacatacgga 61 ttggaactcg caggcgaaaa acgccgcgca aaacttaatg aagccgtgga acggagttta 121 aaactcgctg ccatctggga tgaagtgaaa gaccgtttga atgagcctgc cattgccctc 181 tccggcggac agcagcagcg cgtctgcctt gcgcgtgtgc tggcattgca gcctgagatc 241 atcctgctcg atgaaccaac ctcggggctt gaccccatct cgaccggcaa ggttgaagcg 301 gcactgtatg aactgaagaa aaatttcacc gttatcctcg tccctcattc ggtacagcaa 361 gccgcgcgca cagcggatca cgccgctttc tttttacagg gcgaattgat cgaatacggc 421 gacggcaaaa aaatgttcac agccccaaaa caaaagaaga ccgaagatta catcatgggc 481 aggttcggtt aaaacaaagt gacggtcacc ttatgaggtg accgtcactt ttataactac 541 cagacccgtt ccaacttttc gttctcgccg acctgacgcc ccaccttcgc catatcgatc 601 cactcgccgt ggactttcac gcgcaccaca tcggcggtga aatccgtcac cgtggaatta 661 ctgttattga agagatacga ccccaccgca taaatatcca cagggacatc caatttttcg 721 aacctgcgga tcttttcagg attgaacccg cctgagacga cgatcttcac gttgtgacaa 781 aattccttcg ccgcctcttt ccacgactgc ggcaggtccc aacgttccca ggcattatcc 841 aacgccgaac ggacgttgaa caccagccgc gggttgaccc ccagatcgag catcggttcg 901 ccaagcggct ggacagacac atcgcgcaga cttccgctcg tatccaaacg cacgccgaat 961 aatttgtatt tttccgcttc ttccttgtat ccatccgtca tcagttcgcg gtatttcgcg 1021 aacatcacat ccaacacccg cagcgaatcg cggacggagt cattgttgaa atccaccagc 1081 gcaatgcgcg gaatatttgg cggcagaata tttgagaatt gcatcatcgc ttccgccgta 1141 tcgccaagga aggacgcaat cgcagcgtga gccaccgtcc cgcctccggc gccgccccac 1201 cagtcgcctt gagcgtcggt agagacaaag ggtccaagat cctgcgcatg gtcccgattg 1261 aagcgttgaa cggcgatgtt ataggcgtat ccatccgctg cctgaacttc atgcacgtca 1321 aagcgtgctg ggaagaaaag caccggcttg ccgcgcgccg caaccaacgt ttcatacaca 1381 ttcgtggcta cacgactcga acgcgttaag atgccaagtg tgggagtttc gagcatcgcg 1441 aagtcgcggt attttccgcg cactttgatg acgggctgga tcttcagcgg gtcgccttcg 1501 tacttgacga tggtgccgtc atgcaccgcc gacacctcaa gcgtatccga cgtatcgacg 1561 aaggtatcgc cctcgaaata tcccgtgcag tgacgcagca tcgccagcgc cttatccaca 1621 ccgacgatgg tggtcttgcc catgcggcgc gtgaaccact gcatctccac ttcgatatcg 1681 ccgacagcga tctgctcggg tgacatatcc tccggcaaat tgtgatgatc tcctgaatag 1741 aaatatcctt ccgcggagag cgccgtcaac atgcggttga tgttttcgaa atatttatcc 1801 gaataccacc cctgtctcat gcgttctatg tcaagcttaa aagtttcgtc tgtaagccgt 1861 ttaccgtcaa aaatggacat gcttcctctt tcataacatg cggtataaaa ccgcgcgtta 1921 tggattattc gattgcatcc gtggatttta cgatgtgcat tcccgcctgc gcgaaccgtg 1981 tgtacgcctc atccgccgcc tcggtatggt ccaccacacc aggcaccacc accgccgacg 2041 aacaatcgtc gagcaagtag accttcttcg cgagttccgg gtcggtcgcc atgatgtcat 2101 cgagcaggtc ctgcactgtc cacgcaacgc agtgactctt cgcctgcccg gcgatataca 2161 gcttatccac ctcttgaaga tgctgaatga acattgggtt acgtgtgccc agcacctcgt 2221 ccatgggacc cgtcaacact tcgggaccaa tgaccgaata attttcggtg aagggcttat 2281 cgcctttgat ggcgaaataa ggctggtcta tccttgcatg tgaatggaag aaaaccgcct 2341 cctccacggc agggacaatg gcatgaccaa tgccgccgag catggcatgg tagggccaga 2401 cggtcaacgc gtacttccct gccttctcca acgcctcggc ataatggatc atcatctgct 2461 gaccgtattc cggtgcgacg ccaagctgcg gggaaagcgc cgggttgaac gtccacttac 2521 cggagcgaag ttcctcggcg tggatgtctg tatacggcgc ggggtgattc ccttccgcat 2581 cgacaaaaaa gatgggatga aatatctgct gcgatgtgtg cgtatccatg gtggcgagga 2641 tgtgggtgat cctgccaaga ttgcggtaaa tgaactcgca caagcggacg ttatcgtcca 2701 cggcgccagt gccgctgcgc ccgccgacga acagttcgaa gccgggcata cagaatgtat 2761 tttgcacatc gatcagtaaa agggagatgt gttctttcga agccgaagca ggcttgaggt 2821 catgttgaag cgcgaacttg cgcgcatctt cgacgcgggc ggcgtagtcc actttccaaa 2881 cttcgccaac gcgggtcggg tcaaaaaatt cagggatggg taagttggtc atgtgttaag 2941 tatacatggc gttagcaaat aaaatgttaa cgttaaataa aacagacttc acgttgtatg 3001 aacatgaagt ctgttgagag cgcattattt ctttttcgtt gacctgccgc cttttgcaac 3061 gcgccgatac ggcggcatgt tgatcgccaa cacctgacat ttcgtcggat caagaagctt 3121 gttccaaagg ttgggataac tcaacgcgaa ttcatcctgc ctcagcgtgg acgtgataac 3181 cgtgggcaga tgcgcgttat aacgatagct gagtatcgcc atcattctct cttcagccca 3241 agctgtaagc gatccactag attttaaatc gtccaatatc aacattggaa tgttgcgaat 3301 ttcatttaat ctgcgattaa acgtcacatc cattcttccg ccaattggct cacgaatata 3361 atcaggtaaa cctgccgcat ctatcatgtt agcctgaccg ccaagggcaa ttatgtaatt 3421 tccgatcgcc gccgcgaggt gggtcttgcc gcagaacgaa ccgccgagta gtacaagcca 3481 tcccccaggc ttttcggcaa attccaatgc cttgttatgc gcatcatcca atgttctcac 3541 atcttccttg gatattttga tgcgcgtgat ctctttatcc ttgaccttgt taccgaagcg 3601 gtcattcttt tcggttgtgg tcgtcatcac ggcttcggtt ccaacttcat cgtggcgcgg 3661 ctggaaggtc ttgaatgtca tcaccttcat ctcgggcaaa gccagcatcg aaatgccggg 3721 gttgctcgtc tcttcgggac ggcgataatc cggcgcgaga attttgagat gtttgacaaa 3781 accatcatcc tgcaaacgcg aacggatgcg cgcctcgatc tcatccagca tcaaattggt 3841 ggtgatgaca gtcggcaatt tgttgatgta acggaaatta atgatctgaa ataatttttc 3901 ctgcgcccac ggagtggcgt tctgcgtgcc aaaatcgtcg aggatcaaca gacccgaatt 3961 gcggatctct tcaaagcgtt gttcgaacgt tgtttcgggg tcgtcatatg caaagcgcag 4021 cgaatcaagc aaatccggca cggtgatgaa cagggtcggc atgcccacat tgacggcaaa 4081 gttcgcgatc gccgccgcaa gatgcgtctt accgcagcca taaccgcctt ccaataacaa 4141 ccatcccttt ggcaggcgcg cgaattcttc gcttgcatcg aacgccacac gcaaattctc 4201 gcgttcctgc ccagacatga acttcgcctt atcgttgccg gaagaattga aattctcaaa 4261 ggtcaaatga ctgagccgat tcaaattgct catcgcaaag agacgtgaac gcgccgcctc 4321 cgccacatcc ctggcgcggc aaacacacgt ctcaagtttc ccgaattttt catgtcccac 4381 cggcacatca tagcggacat accccgcccc accgcagtgc gggcagttcg ggtcaccgag 4441 cggcttcaca acattaatct cgtttgaaga attcggagaa ctcgccttcg gtgtatcgtt 4501 ttgcatcttt gacagagtct ttcggatctc ttctttcatc ttttccgttt tccttccaac 4561 gcgaaaggat cgcttcaata tacttccagt tccgcacatt gcggctgacg gcaatctcga 4621 acgcctcctc gaaccaggcg gcgggataat tccgttcagc ctccttcaac atatccgcca 4681 gcagcggcgt cagcggaccg atattctctt catacaactt gaacacgttc gacttgttcg 4741 gcacatacgc ccgcatcgca tcacgccatt gacctttggc aaacgcctca gcggatattc 4801 ttccgcgcgg ggaattgagg aaataaaaaa catccgcctc attctccgcc cgcagcaggg 4861 tctgcctctc gacggctttt ccaagcccgc gccgaatctc atctatggtc aaacccaaag 4921 cttctgcctc gaaatcagtc tcacacaacg cgcggaagtt tccctccaaa tgctcgatgc 4981 gttggatggc aaagagcgtc accttcaact ccgcaatatc gtcgatatcg ttcatcaaac 5041 ggagcaacga atcagggact tgcgtgaacg tttcagagga ggtaaagccg gggaattgag 5101 tcatacattt tctcgtaaat gcgaggtgac cccgccctta ctcattcggt ttgaacacct 5161 tcgtcgccgc attctcgaac tttgccaacg cgccgcggaa gatcaactcc acatcgccaa 5221 cgggaccgtt acgatgcttg gcaacgatga tctgcgcaat gttctgttta tccgtatcct 5281 tctcatactg gtcgggacga taaatgaaca tgacgatgtc ggcgtcctgt tcgagtgatc 5341 cggattcgcg caagtccgaa aggacggggc gtttgtcggt gcgctgttcc acggcgcggc 5401 tcaactgcgc ggcagtgaga acgggcacat tcaattcacg cgccagcacc ttgaggctgc 5461 gcgagatctg cgaaacttcc tgcacacggt tatcggtgcg gatgtcaccc cccatcagct 5521 ggagataatc cacgatgaca agatcaagcc caaattccat gtggaggcgt cggcacttcg 5581 tccgcaattg catgggcgtg atggcaggcg tatcgtcgag gtagatgtga gtatccgaaa 5641 aaacttcgat ggcatgcgtg aaaagcgacc actcgttttc gagcagtttg ccggtgcgca 5701 gccgctgcga gtcgatgccc gtttcctgcg cgatcaaacg ctgaaccacc tgctcattgg 5761 acatttccaa cgaaaagatg gcgacgtgtt ttttatgcgt cagcccggcg ttcttggcga 5821 tcgacagcag aaaacccgtc ttgccctgac caagtctgcc cgtgatggtg tgcagg //