LOCUS       JAGQHC010000359         4814 bp    DNA     linear   ENV 12-OCT-2021
DEFINITION  MAG: Dehalococcoidia bacterium isolate HKST-UBA83
            2015-01-06_1_(paired)_contig_316087, whole genome shotgun sequence.
ACCESSION   JAGQHC010000359 JAGQHC010000000
VERSION     JAGQHC010000359.1
DBLINK      BioProject: PRJNA432264
            BioSample: SAMN14563843
KEYWORDS    WGS; Metagenome Assembled Genome; MAG.
SOURCE      Dehalococcoidia bacterium (activated sludge metagenome)
  ORGANISM  Dehalococcoidia bacterium
            Bacteria; Chloroflexi; Dehalococcoidia.
REFERENCE   1  (bases 1 to 4814)
  AUTHORS   Wang,Y., Ye,J., Ju,F., Liu,L., Boyd,J.A., Deng,Y., Parks,D.H.,
            Jiang,X., Yin,X., Woodcroft,B.J., Tyson,G.W., Hugenholtz,P.,
            Polz,M.F. and Zhang,T.
  TITLE     Successional dynamics and alternative stable states in a saline
            activated sludge microbial community over 9 years
  JOURNAL   Microbiome 9 (1), 199 (2021)
   PUBMED   34615557
  REMARK    Publication Status: Online-Only
REFERENCE   2  (bases 1 to 4814)
  AUTHORS   Zhang,T.
  TITLE     Direct Submission
  JOURNAL   Submitted (14-APR-2020) Civil Engineering, The University Hong
            Kong, Pokfulam Road, Hong Kong 999077, Hong Kong
COMMENT     The annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (PGAP). Information about PGAP can be found here:
            https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method        :: CLC de novo assembler v. 6.04
            Genome Representation  :: Full
            Expected Final Version :: No
            Genome Coverage        :: 75x
            Sequencing Technology  :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 04/19/2021 13:27:16
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline (PGAP)
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS-2+
            Annotation Software revision      :: 5.1
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 2,869
            CDSs (total)                      :: 2,855
            Genes (coding)                    :: 2,841
            CDSs (with protein)               :: 2,841
            Genes (RNA)                       :: 14
            rRNAs                             :: 1 (5S)
            complete rRNAs                    :: 1 (5S)
            tRNAs                             :: 12
            ncRNAs                            :: 1
            Pseudo Genes (total)              :: 14
            CDSs (without protein)            :: 14
            Pseudo Genes (ambiguous residues) :: 4 of 14
            Pseudo Genes (frameshifted)       :: 4 of 14
            Pseudo Genes (incomplete)         :: 5 of 14
            Pseudo Genes (internal stop)      :: 2 of 14
            Pseudo Genes (multiple problems)  :: 1 of 14
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..4814
                     /organism="Dehalococcoidia bacterium"
                     /mol_type="genomic DNA"
                     /submitter_seqid="2015-01-06_1_(paired)_contig_316087"
                     /isolate="HKST-UBA83"
                     /isolation_source="activated sludge from Shatin waste
                     water treatment plant collected monthly from 2007 through
                     2015"
                     /db_xref="taxon:2026734"
                     /environmental_sample
                     /geo_loc_name="China:Hong Kong SAR, Shatin waste water
                     treatment plant"
                     /lat_lon="22.406236 N 114.213394 E"
                     /metagenome_source="activated sludge metagenome"
                     /note="metagenomic"
     gene            complement(123..476)
                     /locus_tag="KC470_07905"
     CDS             complement(123..476)
                     /locus_tag="KC470_07905"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_013676038.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="YciI family protein"
                     /protein_id="MCA9822513.1"
                     /translation="MRYMLLIYSDESTDLGPGDPGFEEMMNGYMNFTNEIRANGVFEA
                     GDPLQPVATATTVRVRDGRASHTDGPFAETREQLGGYYIVDCKDLDQALEYAARIPGA
                     ARGCVEVRPVMDLEG"
     assembly_gap    521..652
                     /estimated_length=132
                     /gap_type="within scaffold"
                     /linkage_evidence="paired-ends"
     gene            <653..1280
                     /locus_tag="KC470_07910"
     CDS             <653..1280
                     /locus_tag="KC470_07910"
                     /inference="COORDINATES: ab initio
                     prediction:GeneMarkS-2+"
                     /note="Derived by automated computational analysis using
                     gene prediction method: GeneMarkS-2+."
                     /codon_start=2
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="MCA9822514.1"
                     /translation="RFHEALNAGMPRLGWKVALSDASAMQRLGIDEPAVAWLDGNRLL
                     RPGDAYLAPAGARTAVEAEVAIRIDESGTIEAVAPAIEFIDLSRPGGAIDIILSHAVF
                     HDAVLLGQEQPFGDWVNSNWPPNGWPTISKNGVVAEILQPSMAPGDFAALAAAKSTQL
                     RTAGEKLEADDWIITGSLTTPVPAGEGDEITIEYGVLGTLTVRIGNAG"
     gene            1353..1622
                     /locus_tag="KC470_07915"
     CDS             1353..1622
                     /locus_tag="KC470_07915"
                     /inference="COORDINATES: ab initio
                     prediction:GeneMarkS-2+"
                     /note="Derived by automated computational analysis using
                     gene prediction method: GeneMarkS-2+."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="MCA9822515.1"
                     /translation="MAELNETRERHAKDVVANNMASLMGDFTPAAMTKVMGIAANPIQ
                     ATSYEIKDLGNNEAEITYIGSTRRTIWSKWEQNGEKWQIADLSER"
     gene            1627..1980
                     /locus_tag="KC470_07920"
     CDS             1627..1980
                     /locus_tag="KC470_07920"
                     /inference="COORDINATES: ab initio
                     prediction:GeneMarkS-2+"
                     /note="Derived by automated computational analysis using
                     gene prediction method: GeneMarkS-2+."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="MCA9822516.1"
                     /translation="MTNPGPQTAREAAELNAKAVAEGNLAVVMGQLTPEAMAQMMQLG
                     AQGGGLTPQQMPAITGYTIEEAGSDGESETFNVTFASAIGTATVAARWKQVLGQWKIA
                     GIALVSAEQTGEAGS"
     gene            1982..2464
                     /locus_tag="KC470_07925"
     CDS             1982..2464
                     /locus_tag="KC470_07925"
                     /inference="COORDINATES: protein motif:HMM:NF012521.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="histidine phosphatase family protein"
                     /protein_id="MCA9822517.1"
                     /translation="MRLCLVRHGIAEDRGNYADDAERPLTERGRDRMEAAAQGLATLI
                     QPDVILSSPLVRAHQTAELIAAATGATVEECEALANGDHESLLAAACLETVVAVGHEP
                     HISGFLSWALGAHHLPVEIKKGSAALVSFDGVPEPGSGRLDWFMPPRALRQLRKVPIG
                     "
     gene            complement(2432..3976)
                     /locus_tag="KC470_07930"
     CDS             complement(2432..3976)
                     /locus_tag="KC470_07930"
                     /inference="COORDINATES: protein motif:HMM:NF014587.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="Ppx/GppA family phosphatase"
                     /protein_id="MCA9822518.1"
                     /translation="MSGSSQRPIYAVVDVGSKSIRLLVARRLSSSAFEVVDEERFDAR
                     LGEGLVAGAISPEAFERGLLAMRIIAQVAAGYAPRATIAVGTEVLRTASNAGALISRI
                     HAETGISVRVISAQEEAHASYLGIINATRLADGAIIDIGGGSLEVIRVAGRRFTAARS
                     VPLGAIYSTERYLASDPPLPREVRALRKAVRRQINGQQEQEVPVETLWATGGACRNIA
                     RMVRLRRSYPLRRLHGFTFDRRELKAVLKEMLRQPAGERRHIAGLNSARAATLPAAAI
                     VLDEVMAVLDVETVLVSGQGLREGLVWQQLRGQRPLLPDVRAASIAGLAAANGVDPGA
                     SAPEVRLASELFEATVPLHGLGHAELELLVSATRLSEIGMHVDFYNRDRHAEYLVHSG
                     DLHGFSHREIVLLAAIVRYSSGGTVDLSSYAPVVYERDGRLVATLAAMLGVARAVARR
                     PGSPVVEADLQMGKHLDLVLRSQVPLDAELYALERPLRRLENALGIGIDVAVEALPDR
                     HLSELA"
     gene            4134..>4814
                     /locus_tag="KC470_07935"
     CDS             4134..>4814
                     /locus_tag="KC470_07935"
                     /inference="COORDINATES: protein
                     motif:HMM:NF012798.1,HMM:NF014898.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="tyrosine-type recombinase/integrase"
                     /protein_id="MCA9822519.1"
                     /translation="MEDCIAQFLNFLRVEKNASDNTIQAYKNDLGQFAKQFAGNPEPP
                     LSEWKAVSRDKVIRFVEWMKETRGYKDATVARKVAAVKSFFAFLAAEAIIENDPTENL
                     KSPQVGKSLPGALTLEEVDALLEQPARKSTPEGRRDKAMLELLYATGVRVTELVSLNL
                     EDVALESDPVTVRCVGKGDHDRIRPLPQRAVDEIRQYIFHVRPRLVRNKKERALFVNR
                     RGERLTRQG"
BASE COUNT      896 a   1434 c   1557 g    795 t
ORIGIN
        1 gccaggattc ggctgtggtc gtcgcggaac gcgcggtcga gtgcatcacg cgctgccttc
       61 gccgactcgc tcatcacggc gccagtgtgc accagcgcgt caaccgcgcc catggctact
      121 gactagcctt cgaggtccat gaccggccgc acttcgacgc aaccgcgggc cgccccgggg
      181 atcctggcgg catactcaag ggcctggtcg aggtctttgc agtcgacaat gtagtacccg
      241 ccgagctgtt ctctggtctc agcgaatgga ccgtcagtgt gagacgcccg gccatccctc
      301 acgcgaacgg tcgttgccgt tgccaccggc tggagcgggt cgccggcctc aaaaacgccg
      361 tttgcgcgaa tctcattggt gaagttcatg tagccattca tcatctcttc gaaaccggga
      421 tcgccggggc cgaggtcggt ggactcgtcg ctgtagatca gcagcatgta gcgcatggtg
      481 tttctcctct gtgagctggc gaccgagccg gccgcctcta nnnnnnnnnn nnnnnnnnnn
      541 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      601 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnacgcttcc
      661 acgaggcgct caatgccggt atgccgcggc tcgggtggaa ggttgcactc agcgacgcgt
      721 ctgccatgca acgactggga atcgacgagc ctgctgtcgc ctggctcgat ggcaacaggc
      781 tcctgcggcc cggggacgcc taccttgccc cggctggcgc ccgtacggca gtggaagcgg
      841 aggttgcgat ccgcatcgat gagagcggaa ctatcgaagc ggtagcgccg gcgatcgagt
      901 tcatcgacct ctccaggccc ggtggggcga tcgacatcat cctcagccat gccgtcttcc
      961 acgatgcagt cctgctgggg caggagcagc cgtttgggga ctgggtgaac agcaattggc
     1021 ctcccaacgg ctggccgacc atctcgaaga acggtgtggt cgcggaaatc ctccagccgt
     1081 cgatggcacc cggcgacttt gcggcactgg cggccgcgaa gtccactcag ctgaggactg
     1141 caggcgagaa gctggaggcg gacgattgga tcatcacggg ctcgctgacc acgcccgtcc
     1201 ctgccggcga gggcgatgaa atcaccattg agtacggggt gctggggact ctcacggtgc
     1261 gaattggcaa cgccggttaa cgaagcattg ccgccggttc gtcctgccgg gtatgctccc
     1321 ggcgaaaacc tagcgatcgg aaagcaggaa tcatggctga attgaacgaa actcgcgaac
     1381 gccacgccaa ggacgtcgtg gccaacaata tggcatcgct gatgggcgac tttacgccgg
     1441 ccgcgatgac caaggtcatg gggatcgcgg cgaacccgat ccaggcgacg agctacgaga
     1501 tcaaggacct tggcaacaac gaggccgaga tcacctacat tggctcgacc cggcgcacga
     1561 tctggagcaa gtgggaacag aacggcgaga agtggcagat cgccgacctc agcgaacggt
     1621 aggcggatga ccaatccggg tccccagacg gcccgcgagg cagccgaact gaacgcgaag
     1681 gcagtagccg agggcaacct tgccgttgtg atgggccagc ttacgcccga ggcgatggcg
     1741 cagatgatgc agcttggcgc ccagggcggg ggactcacgc cccagcagat gccggccatc
     1801 accggttaca ccattgagga agccggcagc gatggggaat ctgagacgtt caacgtgacg
     1861 tttgcctctg cgattgggac cgccacggtg gctgcccgct ggaagcaggt cctggggcag
     1921 tggaagatcg ccgggattgc gttggtgtcg gcggagcaga cgggcgaggc cgggagctga
     1981 agtgcgtctc tgccttgtcc gccatggaat cgccgaagac cgggggaact atgccgacga
     2041 tgcagaacgc cccctcaccg agcgaggccg cgaccggatg gaggcagccg cccagggcct
     2101 cgcgacgctg atccaacccg acgtgattct cagcagcccg ctggtgaggg cgcatcagac
     2161 cgcagagctg attgccgcgg caacgggtgc gaccgttgaa gaatgcgagg cgctggcgaa
     2221 cggcgaccac gaatcgctgc ttgcggccgc ctgcctggag accgtggtcg cagtgggcca
     2281 cgagccgcac atttccggct ttctcagctg ggcacttggg gctcaccacc tccctgtgga
     2341 aatcaagaag gggtcggctg ccctcgtttc cttcgatggg gtgccggagc cggggagcgg
     2401 ccggttggac tggttcatgc caccgcgggc cctacgccaa ctccgaaagg tgccgatcgg
     2461 gtagggcttc tactgcgacg tcgatgccta tcccgagcgc attctcgagg cggcgcagcg
     2521 gccgttccag ggcatagagt tcagcatcga gcggaacctg cgagcggagc accaggtcga
     2581 ggtgcttgcc catctggagg tctgcctcga ccaccggtga gccaggcctc cgggccacgg
     2641 ccctggcaac tcccaacatc gctgcaaggg tagccaccag gcggccgtca cgctcataga
     2701 caaccggcgc gtaggaagag aggtcaacgg taccgcccga ggagtagcgg acgatagctg
     2761 cgagcaagac gatctcgcgg tggctgaacc cgtgaagatc gccgctgtgc acgaggtatt
     2821 cggcatgacg atcgcggttg tagaagtcca catgcatacc gatctccgag agacgagtag
     2881 cggacaccag gagttcgagt tcggcatggc caagcccgtg taagggcacg gtcgcctcga
     2941 agagttcgct ggcaagccgt acttctggtg ccgaagcgcc agggtccacg ccgttggctg
     3001 cggcaagtcc agcgatggag gcggcacgga catccgggag gaggggccgc tgaccccgga
     3061 gttgctgcca gaccaatcct tcgcggagcc cctggccgga gacgagaacg gtctcgacgt
     3121 ccagtaccgc catcacctca tcaagcacaa ttgctgccgc aggaagtgtt gccgcgcgag
     3181 ctgagttgag cccggcgatg tggcggcgct caccggcggg ctgccgcaac atctccttga
     3241 ggacagcttt gagctctcgg cgatcgaaag tgaagccgtg caatctgcgg agtggatagc
     3301 tacgcctgag ccgaaccatc cgggcgatgt tgcggcaggc gccaccggtc gcccagaggg
     3361 tctccaccgg tacttcctgt tcctgttggc cgttgatctg tcgccgcaca gccttgcgca
     3421 aggctcgcac ttcccgcggc agcgggggat cgctggccag gtagcgttcg gtggaataga
     3481 tggcacccag gggtacgctt cgtgccgcgg taaaccggcg gccggcgacc cggatgactt
     3541 cgagtgaacc accgccgata tcgataatcg ctccatccgc caagcgcgtc gcgttgatga
     3601 tgccgagata gcttgcgtgg gcctcttcct gggctgaaat gacacgcacc gagatacccg
     3661 tttccgcgtg gatgcgggag atgagggccc cggcgttcga agcggtgcgg aggacctcgg
     3721 tgccgaccgc gattgttgcc cggggtgcgt aaccggcggc aacctgggcg atgattcgca
     3781 tcgcaagcag gccgcgttca aatgcttccg gactgatggc gccggccacc agtccctcgc
     3841 ccagacgggc atcgaagcgc tcttcgtcaa cgacttcgaa ggctgacgaa ctcaagcggc
     3901 gggcgacgag aaggcggata gacttcgatc cgacgtcgac gacggcgtag attggtcgct
     3961 gtgaagagcc cgacaccatt accgaggata gtaccgatgg cgaagcgtgg cgattgttgg
     4021 atcttcacgt tgggggggct ggatccggtg gacactcgaa agcggctgaa catatgatac
     4081 cgcggttccc ggaggcctcc cggcatgact cttgacttct ccagagggtg ttgatggaag
     4141 attgcatcgc ccagtttctt aatttccttc gcgtcgaaaa gaacgcttcc gataacacca
     4201 tccaggccta caagaatgac ctgggccagt ttgcaaagca gtttgcaggc aacccggagc
     4261 cgccgctttc cgagtggaag gccgtctccc gcgacaaggt catccgtttt gtcgaatgga
     4321 tgaaggagac gcgtgggtac aaggacgcca cggtggcccg caaggtggcg gccgtgaagt
     4381 cgttcttcgc ctttctggcg gccgaagcga tcattgaaaa cgaccccact gagaacctga
     4441 agtcccccca ggtcgggaag tcgttgccgg gtgccctgac actcgaagag gtcgatgccc
     4501 tgcttgaaca gccggcccgc aagagcacgc cggaggggcg ccgcgacaag gccatgctcg
     4561 aactgctcta cgccacgggc gtccgcgtca cggagctcgt ctcgctgaac cttgaggatg
     4621 tggcactgga gagcgatccc gttaccgtcc gctgtgtggg caaaggagac catgatcgga
     4681 tccggccgct cccgcagcgg gccgtcgacg agattcgcca gtacatcttc catgttcgcc
     4741 cccggctggt tcgcaacaag aaagagcgcg cgctcttcgt gaaccggcgc ggtgagcgcc
     4801 tgacgcgcca gggg
//