LOCUS       JAGQGW010000169         4684 bp    DNA     linear   ENV 12-OCT-2021
DEFINITION  MAG: Dehalococcoidia bacterium isolate HKST-UBA77
            NODE_170_length_4684_cov_3.933579, whole genome shotgun sequence.
ACCESSION   JAGQGW010000169 JAGQGW010000000
VERSION     JAGQGW010000169.1
DBLINK      BioProject: PRJNA432264
            BioSample: SAMN14563837
KEYWORDS    WGS; Metagenome Assembled Genome; MAG.
SOURCE      Dehalococcoidia bacterium (activated sludge metagenome)
  ORGANISM  Dehalococcoidia bacterium
            Bacteria; Chloroflexi; Dehalococcoidia.
REFERENCE   1  (bases 1 to 4684)
  AUTHORS   Wang,Y., Ye,J., Ju,F., Liu,L., Boyd,J.A., Deng,Y., Parks,D.H.,
            Jiang,X., Yin,X., Woodcroft,B.J., Tyson,G.W., Hugenholtz,P.,
            Polz,M.F. and Zhang,T.
  TITLE     Successional dynamics and alternative stable states in a saline
            activated sludge microbial community over 9 years
  JOURNAL   Microbiome 9 (1), 199 (2021)
   PUBMED   34615557
  REMARK    Publication Status: Online-Only
REFERENCE   2  (bases 1 to 4684)
  AUTHORS   Zhang,T.
  TITLE     Direct Submission
  JOURNAL   Submitted (14-APR-2020) Civil Engineering, The University Hong
            Kong, Pokfulam Road, Hong Kong 999077, Hong Kong
COMMENT     The annotation was added by the NCBI Prokaryotic Genome
            Annotation Pipeline (PGAP). Information about PGAP can be found
            here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/

            ##Genome-Assembly-Data-START##
            Assembly Method        :: CLC de novo assembler v. 6.04
            Genome Representation  :: Full
            Expected Final Version :: No
            Genome Coverage        :: 162x
            Sequencing Technology  :: Illumina HiSeq
            ##Genome-Assembly-Data-END##

            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 04/19/2021 13:28:49
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline (PGAP)
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS-2+
            Annotation Software revision      :: 5.1
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 2,738
            CDSs (total)                      :: 2,713
            Genes (coding)                    :: 2,695
            CDSs (with protein)               :: 2,695
            Genes (RNA)                       :: 25
            tRNAs                             :: 22
            ncRNAs                            :: 3
            Pseudo Genes (total)              :: 18
            CDSs (without protein)            :: 18
            Pseudo Genes (ambiguous residues) :: 1 of 18
            Pseudo Genes (frameshifted)       :: 6 of 18
            Pseudo Genes (incomplete)         :: 10 of 18
            Pseudo Genes (internal stop)      :: 3 of 18
            Pseudo Genes (multiple problems)  :: 2 of 18
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..4684
                     /organism="Dehalococcoidia bacterium"
                     /mol_type="genomic DNA"
                     /submitter_seqid="NODE_170_length_4684_cov_3.933579"
                     /isolate="HKST-UBA77"
                     /isolation_source="activated sludge from Shatin waste
                     water treatment plant collected monthly from 2007 through
                     2015"
                     /db_xref="taxon:2026734"
                     /environmental_sample
                     /geo_loc_name="China:Hong Kong SAR, Shatin waste water
                     treatment plant"
                     /lat_lon="22.406236 N 114.213394 E"
                     /metagenome_source="activated sludge metagenome"
                     /note="metagenomic"
     gene            complement(<1..457)
                     /locus_tag="KC472_07705"
     CDS             complement(<1..457)
                     /locus_tag="KC472_07705"
                     /inference="COORDINATES: protein motif:HMM:NF013775.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="phosphotransferase"
                     /protein_id="MCA9847842.1"
                     /translation="MAWRLPASGGDWLARVPRHEGAPRAIEDQTCLMGALAPLGFPVP
                     PEPRLVRDVRGRVVAGLYRYVQGRPAEVRGAPERERLARGLAAFVSRLHALDLDRLDA
                     CRVRRYEPWRDEFAPMVERVLPHLAPRTASWLRARAELLAELSDRLPPPV"
     gene            complement(607..1926)
                     /locus_tag="KC472_07710"
     CDS             complement(607..1926)
                     /locus_tag="KC472_07710"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_008930112.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="MFS transporter"
                     /protein_id="MCA9847843.1"
                     /translation="MTPAALGGSVGNVIEWYDFALFGYFSSTIADLFFPGSSSSSLLA
                     TFAVFAAGFTMRPVGAGVFGYIGDRIGRRPALFLSVILMAIPTAALGLLPTYGSIGIA
                     APILLVVIRLVQGFSVGGEFSGSVTYLVETADPSRRGIAGSWANVGSLVGMLLGSASA
                     TVVTTVLSADAASAWGWRIPFVLGGVIGAFALWLRTNLPEESGREHDEAHREDSPLHE
                     ALTNDRMQTLKAVAFAGGYGVVFYLPLVYLPTYVSRRGNVPLDEALRVNTIATAALIL
                     VIPLAAMASDQWLRRRTLILVAFIGMAVASVPLFALMNGGGWPELLIAQTVFALLIAI
                     PLGSAPAFFAEMFPREDRATGYSIAYNLGLGIVGGTAPMIATGLIDASGNDLAPAFYL
                     LALAVVAILSVWTIRDRSREPLRSRSGEVEAPRATSARQHDRTVAAS"
     gene            complement(2254..2634)
                     /locus_tag="KC472_07715"
     CDS             complement(2254..2634)
                     /locus_tag="KC472_07715"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_005799367.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="RNA-binding protein"
                     /protein_id="MCA9847844.1"
                     /translation="MRFLNQRIYVGNLPFSARQEDVEQLFGEFGEVISVALPNDRETG
                     RPRGFGFVEMSKDDATAAIKALDGKDFDGRNLRVNEAEPREERRGGGGGYGGGNRGGG
                     GYGGGNRGGGYGGGGYGGGGRDRY"
     gene            complement(2835..3521)
                     /locus_tag="KC472_07720"
     CDS             complement(2835..3521)
                     /locus_tag="KC472_07720"
                     /inference="COORDINATES: protein motif:HMM:NF013323.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="MCA9847845.1"
                     /translation="MADEEKQPLTRDQMAAVLAQHIGSGWIVNLGIGIPTLVSNFLYP
                     EQEITLHSENGVIGYGRVAGEGEEDPDTVNASAVTAVTLDPGAAIVHHADSFAVIRRG
                     MVDVTCLGAYEVAPDGSFANWKTTDDEWAHLGGIGGAMDLAACAKQVYLAMEHTTRDG
                     QPRLLEKCNLPVTAPSGVTLVVTNLAVVAVRDGKFVLEQHAPGYSAEEIQAVTGAPLE
                     VSPDFRRVSV"
     gene            complement(3556..4305)
                     /locus_tag="KC472_07725"
     CDS             complement(3556..4305)
                     /locus_tag="KC472_07725"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_006895719.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="CoA transferase subunit A"
                     /protein_id="MCA9847846.1"
                     /translation="MDEAVADVADGSSIMIPGFGPGAPINLLAALWRQGATNLTTISN
                     GVGFGGSSEELRGQGDLVEAGRVKKVIAAFTASTRPSRVGTAEGLIRSGEVEAELVPQ
                     GTLAERIRAGGAGIPAFYTPAAVGTRLAEGRETRDFDGRTYLMETALFADYSFIRAYK
                     ADTAGNLVFRRSARNFNPIMAMAAKCTIVEVEQPIVEAGEIDPDQVHTPGIFVHRLVH
                     IPTGGVLRVARASGAVVHQTDTPETFPRVAE"
     gene            complement(4438..>4684)
                     /gene="nadA"
                     /locus_tag="KC472_07730"
     CDS             complement(4438..>4684)
                     /gene="nadA"
                     /locus_tag="KC472_07730"
                     /EC_number="2.5.1.72"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_013389688.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=2
                     /transl_table=11
                     /product="quinolinate synthase NadA"
                     /protein_id="MCA9847847.1"
                     /translation="VIGVGTEINLVSRLAKENPDKTVFCLDPVVCPCSTMYRVHPAYL
                     AWVMESLVAGEVVNEIHVPEQHRRDAKVALDRMLALK"
BASE COUNT      725 a   1633 c   1682 g    644 t
ORIGIN
        1 gcaccggtgg cggcagccga tcggagagct ccgccaggag ctcggcccgc gcccgcagcc
       61 acgaggccgt cctcggtgcg aggtgtggaa ggacgcgctc gaccatcggc gcgaactcgt
      121 cccgccacgg ctcgtagcgg cgcacccggc acgcatcgag acgatcgagg tccagggcgt
      181 gcagccgaga gacgaacgct gcgaggccgc gcgccaaccg ctcccgctca ggcgcgccac
      241 gcacctcggc agggcggccc tgcacgtagc ggtagaggcc ggcgacgact cggccccgaa
      301 cgtcgcgcac gagccgcggc tccggcggca ccgggaaccc gagcggcgca agcgcgccca
      361 tcaggcaggt ctggtcctcg atcgcccgtg gtgcgccttc gtgccggggc acgcgggcga
      421 gccagtcgcc gcccgacgcg ggcaggcgcc acgccaccac gtcccagccg acgcccgcga
      481 tggtggcgcg cgccgggtcg gcgccgggga gggcgtcgcg gtcgcgatcg cgacgtccgt
      541 gggcagggac tgctcggtgc tctcgttcac ccgcggagta tcgcgcggtg gcgcgaccgc
      601 caccgcctag gaggcggcga ccgtgcggtc gtgctggcga gcactcgtgg cgcgcggcgc
      661 ctccacctcg cccgatcgcg agcgaagcgg ctcacggctg cggtcgcgga tcgtccacac
      721 gctcaggatc gcgacgaccg ccagtgccag gaggtagaac gccggcgcga ggtcgttccc
      781 cgacgcatcg atgaggccgg tggcgatcat cggagcggtg ccgccgacga tgccgagccc
      841 gaggttgtac gcgatcgagt acccggtggc ccgatcctcg cgggggaaca tctcggcgaa
      901 gaacgccggc gccgatccca gcgggatcgc gatgagcagc gcgaagaccg tctgcgcgat
      961 cagcaactcg ggccagcctc caccgttcat cagcgcgaac agcggcacgg atgcgaccgc
     1021 catcccgatg aacgcgacga ggatcaacgt ccggcgtcgg agccactggt cggacgccat
     1081 cgccgccagc gggatcacga ggatcagtgc tgccgtcgcg atcgtgttca cgcgcagggc
     1141 ctcatcgagc gggacgttgc cgcgacggct cacatacgtc ggcaggtaca ccagcggcag
     1201 gtagaacacg acgccgtagc cgccggcgaa tgcgaccgcc ttcagcgtct gcatccggtc
     1261 gttggtgagc gcctcgtgca gcggcgagtc ctcgcggtgg gcttcgtcgt gctcgcgccc
     1321 ggactcctcc gggaggttgg tccgcagcca gagggcgaac gccccgatca cgccgccgag
     1381 gacgaacggg atgcgccagc cccaggcgct cgcggcgtcc gcgctcagga ccgtggtgac
     1441 gacggtcgcg ctcgccgagc cgaggagcat gcccacgagg ctgccgacgt tcgcccacga
     1501 gccggcgatg ccacgcctcg acggatccgc cgtctcgacg aggtatgtga cggagcccga
     1561 gaactcgccg ccgaccgaga acccctgcac gagccggatg accacgagca ggatcggcgc
     1621 cgcgatgccg atcgagccgt acgtcggcag gagaccaagc gccgccgtcg ggatcgccat
     1681 gaggatgacc gagaggaaga gggcggggcg gcgcccgatg cggtctccga tgtagccgaa
     1741 cacgcccgcg cccacgggcc gcatcgtgaa gcctgctgcg aacaccgcga acgtcgcgag
     1801 gagggaggag gagctgctac cggggaagaa gaggtctgcg atcgtcgacg agaagtaccc
     1861 gaagagggcg aagtcgtacc actcgatgac gttgccgacg ctgccgccga gggccgcggg
     1921 ggtgatctgg aagcgctgag agagccgctc gccttcgcct ggcacgcgcg ccgctcccct
     1981 ccatgcttcg gaccaccgac ccgctcagcg tacgccccgg cgtcgagtcc gccggcgagt
     2041 cctccggaga agcgttccgt gcgcgggcgg gcgactcggc gggaggcccg cgggtggccc
     2101 gacaggagac ccggcgggag acgcagagag agcccggtct cccgggctct ctcgatctgt
     2161 gcactcaact tcggcagtgc cggcatccga accctcgagg tcgagagtcg cggcatcact
     2221 caggctgggg gctcagtctg ctaggagtga tgcctagtag cggtcgcgac cgccgccgcc
     2281 gtacccgccg ccgccgtagc caccaccgcg gttcccaccg ccgtagccgc caccgccgcg
     2341 gttgccgccg ccgtaaccac caccgccgcc gcggcgctcc tcgcgaggct cagcctcgtt
     2401 cacgcgcagg ttgcggccgt cgaagtcctt gccgtcgagg gccttgatgg ccgccgtggc
     2461 gtcatccttc gacatctcga cgaagccgaa gccgcgcggg cggccggtct cacggtcgtt
     2521 cggaagggcg accgagatca cctcgccgaa ctcgccgaac agctgctcca cgtcctcctg
     2581 ccgagcggag aaggggaggt tgccaacata aattcgctga ttcaagaaac gcatgctcca
     2641 cgccggcaca tgccggcaac aggtcgcccg agtgggtgac cgatcaacta gacatccaac
     2701 aagacgcgac gtggaggctc ggatgtaacg accaggaagc ggtctagagg agtgcggtgg
     2761 cttcaccgcg gatccaacgt agcatccccg gcgcgtttgc gccggggatg gatgtcgccg
     2821 ggcgggagcg aggactagac gctgacgcgg cggaagtccg ggctgacctc cagcggcgcg
     2881 cccgtgaccg cctggatctc ctcggcgctg tagccgggag cgtgctgctc gaggacgaac
     2941 ttgccgtcgc ggaccgcgac caccgcgagg ttcgtcacga ccagcgtgac acccgagggc
     3001 gccgtgaccg ggaggttgca cttctccagc aggcgcggct ggccgtcccg ggtggtgtgc
     3061 tccatcgcga ggtagacctg cttcgcacag gcggcgaggt ccatcgcgcc gccgatgccg
     3121 ccgaggtgcg cccactcgtc gtcggtggtc ttccagttcg cgaaggagcc gtccggcgcc
     3181 acctcgtacg ccccgagaca ggtcacgtcg accatgccgc ggcggatgac ggcgaacgag
     3241 tcggcgtggt gcacgatcgc cgcgcccggg tcgagcgtca cggccgtgac cgcggaggcg
     3301 ttcaccgtgt ccgggtcttc ctcgccttcg ccggcgacgc ggccgtagcc gatcacgccg
     3361 ttctcggagt ggagcgtgat ctcctgctcc gggtagagga agttcgagac gagcgtcggg
     3421 atgccgatgc cgaggttcac gatccagccg gagccgatgt gctgcgcgag cacggcggcc
     3481 atctggtcac gggtgagggg ctgcttctct tcgtctgcca tcgtcgtgct cctctgaacg
     3541 cctgctgctc gcgtgctact cggcgacgcg cgggaaggtc tcgggggtgt cggtctgatg
     3601 caccaccgcg cccgaggcgc gcgcgacgcg gaggacgccg ccggtgggga tgtgcacgag
     3661 gcggtggacg aagatgccgg gcgtgtgcac ctggtccggg tcgatctcac cggcctcgac
     3721 gatcggctgc tcgacctcga cgatcgtgca cttcgccgcc atcgccatga tcgggttgaa
     3781 gttgcgcgcc gaccgccgga acacgaggtt gcccgccgtg tccgccttgt aggcgcggat
     3841 gaacgagtag tcggcgaaga gggccgtctc catgaggtac gtgcgaccgt cgaagtcgcg
     3901 ggtctcacgg ccctcggcca ggcgcgtgcc gaccgcagcc ggcgtgtaga acgccgggat
     3961 gccggcgccc cctgcgcgga tgcgctcggc gagcgtcccc tgcgggacga gctccgcctc
     4021 gacctcgccc gagcgaatga ggccctcggc ggtgccgacg cgggacggtc gcgtcgaggc
     4081 ggtgaacgcc gcgatgacct tcttcacgcg cccggcctcg acgaggtcgc cctgcccgcg
     4141 gagttcctcc gaggagccgc cgaagccgac gccgttggag atcgtcgtga ggttggtggc
     4201 gccctgccgc cagagggcgg ccaggaggtt gatcggcgcg cccggcccga agccggggat
     4261 catgatcgag gacccgtccg ccacatcggc gacggcctcg tccatcgtct ggaagatctt
     4321 cggtcgcggc acgctcgtcc cctcggtcta gcgccttcca cgaagcacgg aagcccctcg
     4381 gcgcgtgcgc gttatacgac acgcggaggg gcttcgccta gagccgaggg ctgggtgcta
     4441 cttgagggcg agcatccggt cgagggcgac cttcgcatcg cggcggtgct gctccggcac
     4501 gtgaatctcg ttgaccacct cgccggcgac gagcgactcc atcacccacg cgaggtaggc
     4561 cgggtgtacg cggtacatcg tgctgcacgg gcagacgacc gggtcgaggc agaacacggt
     4621 cttgtccggg ttctccttgg cgagccgcga gaccaggttg atctcggtcc cgacgccgat
     4681 cacc
//