LOCUS       JAGRHI010000115         5907 bp    DNA     linear   ENV 13-OCT-2021
DEFINITION  MAG: Candidatus Competibacteraceae bacterium isolate HKST-UBA195
            2015-01-06_1_(paired)_contig_56539, whole genome shotgun sequence.
ACCESSION   JAGRHI010000115 JAGRHI010000000
VERSION     JAGRHI010000115.1
DBLINK      BioProject: PRJNA432264
            BioSample: SAMN14564267
KEYWORDS    WGS; Metagenome Assembled Genome; MAG.
SOURCE      Candidatus Competibacteraceae bacterium (activated sludge
            metagenome)
  ORGANISM  Candidatus Competibacteraceae bacterium
            Bacteria; Proteobacteria; Gammaproteobacteria; Candidatus
            Competibacteraceae.
REFERENCE   1  (bases 1 to 5907)
  AUTHORS   Wang,Y., Ye,J., Ju,F., Liu,L., Boyd,J.A., Deng,Y., Parks,D.H.,
            Jiang,X., Yin,X., Woodcroft,B.J., Tyson,G.W., Hugenholtz,P.,
            Polz,M.F. and Zhang,T.
  TITLE     Successional dynamics and alternative stable states in a saline
            activated sludge microbial community over 9 years
  JOURNAL   Microbiome 9 (1), 199 (2021)
   PUBMED   34615557
  REMARK    Publication Status: Online-Only
REFERENCE   2  (bases 1 to 5907)
  AUTHORS   Zhang,T.
  TITLE     Direct Submission
  JOURNAL   Submitted (09-APR-2020) Civil Engineering, The University Hong
            Kong, Pokfulam Road, Hong Kong 999077, Hong Kong
COMMENT     The annotation was added by the NCBI Prokaryotic Genome
            Annotation Pipeline (PGAP). Information about PGAP can be found
            here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method        :: CLC de novo assembler v. 6.04
            Genome Representation  :: Full
            Expected Final Version :: No
            Genome Coverage        :: 184x
            Sequencing Technology  :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 04/20/2021 10:13:29
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline (PGAP)
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS-2+
            Annotation Software revision      :: 5.1
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 4,037
            CDSs (total)                      :: 3,991
            Genes (coding)                    :: 3,727
            CDSs (with protein)               :: 3,727
            Genes (RNA)                       :: 46
            rRNAs                             :: 1 (16S)
            partial rRNAs                     :: 1 (16S)
            tRNAs                             :: 41
            ncRNAs                            :: 4
            Pseudo Genes (total)              :: 264
            CDSs (without protein)            :: 264
            Pseudo Genes (ambiguous residues) :: 182 of 264
            Pseudo Genes (frameshifted)       :: 155 of 264
            Pseudo Genes (incomplete)         :: 34 of 264
            Pseudo Genes (internal stop)      :: 8 of 264
            Pseudo Genes (multiple problems)  :: 112 of 264
            CRISPR Arrays                     :: 6
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..5907
                     /organism="Candidatus Competibacteraceae bacterium"
                     /mol_type="genomic DNA"
                     /submitter_seqid="2015-01-06_1_(paired)_contig_56539"
                     /isolate="HKST-UBA195"
                     /isolation_source="activated sludge from Shatin waste water
                     treatment plant collected monthly from 2007 through 2015"
                     /db_xref="taxon:2053538"
                     /environmental_sample
                     /geo_loc_name="China:Hong Kong SAR, Shatin waste water
                     treatment plant"
                     /lat_lon="22.406236 N 114.213394 E"
                     /metagenome_source="activated sludge metagenome"
                     /note="metagenomic"
     assembly_gap    125..361
                     /estimated_length=237
                     /gap_type="within scaffold"
                     /linkage_evidence="paired-ends"
     gene            complement(<362..2426)
                     /locus_tag="KDJ54_07710"
     CDS             complement(<362..2426)
                     /locus_tag="KDJ54_07710"
                     /inference="COORDINATES: protein
                     motif:HMM:NF012722.1,HMM:TIGR00229.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="PAS domain S-box protein"
                     /protein_id="MCB1824455.1"
                     /translation="MWICLLVVLVAIGAISPVDGIAEPSPPTSVSTATPPPSDMAATL
                     PDEASTHRRNDDAPSPGAAPRQSSLLAAVLRYAIWILGSLAAIAIATLLWSSLLQRQV
                     TQRTRQLQRELNERRQAETALRESEARFRDVTEAASDWIWEMNDNLRFTYLSERFYAL
                     TGIAPQHVLGRARWELAAGDLAGNPQKWRRHRRLLEQRQPFRDFVYQTRLSTHGEGRY
                     LKVSGKPIHDAGGQFRGYRGTGTDITEQVKAEMALHESELRLRRIIDLVPHMIFAKDR
                     YGHFLLANRATAAAYDMTIEALTNHSHRLVHGADEEVERMLADDLEVIESGVSKLIAE
                     EAFTDHTGHTRTMQTIKIPFVEPGLNETAVLGISIDITEQKHAKEEVARMRLYLRNII
                     DSMPSILIGVDPWGYITVLNQPAEEVSGLSWETAQGRFFGDVFPQLESQFEQVRQAIH
                     LGQPIKTPRMTFDVGGELHYADVMVYPLMAESAGGAVIRMDDVTARVRIESMMVQTEK
                     MLSVGGLAAGMAHEINNPLGVIMQGSQNILRRIDPDMPQNREAAAAIDADLHRINRYL
                     AERGILHFLEGIREAGARAAKIVADMLSFSRRSESHFALVDLEDMLETVLRLAASDYD
                     LKKKYDFRRIAIERDYDPALRLMYCDKTEIEQVILNLLKNAAQAMADDGTPSPTIVLR
                     TRREPD"
     gene            2567..3613
                     /gene="nagZ"
                     /locus_tag="KDJ54_07715"
     CDS             2567..3613
                     /gene="nagZ"
                     /locus_tag="KDJ54_07715"
                     /EC_number="3.2.1.52"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_012638318.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="beta-N-acetylhexosaminidase"
                     /protein_id="MCB1824456.1"
                     /translation="MALGPVMLGLDGLELSAEEREILCHPRVGGVILFARNYRSPEQV
                     AALTAAIHALRQPRLLVAVDHEGGRVQRFRDGFTRLPPVRRLGEIYDRDRMRAKQLAR
                     VTGWLMAAELRAVGVDLSFAPVLDLDHGISGVIGDRAFHRDPEAVADLAHAYVSGMQK
                     AGMEAVGKHFPGHGGIAADSHLELPVDPRAYADLEHADLLAFERMIHYGLAAIMPAHV
                     LYPQVDDRPAGFSARWLRDILRRRLEFQGTIFSDDLDMAGAGEAGAAPERAEAALTAG
                     CDMVLACNDRRAASAILERLRHVSEPVSQLRLIRLHGRGHPTLERLRRGPVWRRAVRL
                     VHDYDAFPLLDMDI"
     gene            3656..4210
                     /locus_tag="KDJ54_07720"
     CDS             3656..4210
                     /locus_tag="KDJ54_07720"
                     /EC_number="2.4.2.8"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_005959084.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="hypoxanthine-guanine phosphoribosyltransferase"
                     /protein_id="MCB1824457.1"
                     /translation="MSSITAEQALVVLREAELLCAPEQVEIALDRLAAAITDRLGDCD
                     PLVLVVMNGAFIPAALILSRLRFPLQVGYLHATRYRGGIRGGAIDWIAPPRPAVAGRT
                     VLVVDDIFDEGNTLKAILEEVRRQGAAAVHSAVLVNKRHPRKVPGLRVDFIGLEVPDR
                     YVFGCGMDYQEYWRQLPAIYAARD"
     gene            4275..5026
                     /locus_tag="KDJ54_07725"
                     /pseudo
     CDS             4275..5026
                     /locus_tag="KDJ54_07725"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_018231720.1"
                     /note="frameshifted; too many ambiguous residues; Derived by
                     automated computational analysis using gene prediction
                     method: Protein Homology."
                     /pseudo
                     /codon_start=1
                     /transl_table=11
                     /product="S-methyl-5'-thioinosine phosphorylase"
     assembly_gap    4819..4845
                     /estimated_length=27
                     /gap_type="within scaffold"
                     /linkage_evidence="paired-ends"
     gene            complement(5029..5586)
                     /locus_tag="KDJ54_07730"
     CDS             complement(5029..5586)
                     /locus_tag="KDJ54_07730"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_015280198.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="Sua5/YciO/YrdC/YwlC family protein"
                     /protein_id="MCB1824458.1"
                     /translation="MNPPRLRAAARAVRSGGLIAYPTEAVYGLGCDPRNERAVRRLLA
                     LKRRSAHKGLILIAADFAQLAPFLRPLTPTELGRLRASWPGPHTWLVPARRGTPRWLR
                     GRHDTLAVRVTAHPLAAALCRVCGHPLVSTSANLSGRPPARGALAARRQLGRHLDALL
                     PGPTGGAAQPTAIRSLRTGRIVRGG"
     gene            complement(5602..>5907)
                     /locus_tag="KDJ54_07735"
     CDS             complement(5602..>5907)
                     /locus_tag="KDJ54_07735"
                     /inference="COORDINATES: protein motif:HMM:NF024761.1"
                     /note="decatenates replicating daughter chromosomes; Derived
                     by automated computational analysis using gene prediction
                     method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="DNA topoisomerase I"
                     /protein_id="MCB1824459.1"
                     /translation="RYGKSYVSLKQEDPYTVSRERALELIAAHRQAVANKVLREFPDS
                     SIRILNGRFGPYITDGKKNVRLPKDRDPASLALAECEALVREAPDKKPTRRRATRSS"
BASE COUNT      894 a   1721 c   1942 g   1086 t
ORIGIN
        1 aacgccgcca cccgccagcc ttcgtcctcc aggtaggcct tcaggttttc ccgaatcatt
       61 tcctcgtcgt cgacgatcaa gacccaagcg ctcatggtag cgcctcccgg ttcgaaggca
      121 ggcgnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      181 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      241 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      301 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      361 ntgtccggtt cgcggcgggt gcgcaagacg atggtcggcg acggcgtgcc gtcgtcggcc
      421 atggcttggg cggcgttctt gagcaaattc aggatcactt gctcgatttc cgtcttgtcg
      481 caatacatca agcgcagcgc cgggtcataa tcgcgttcga tggcgatgcg tctgaagtcg
      541 tatttttttt tcaggtcgta gtcgctggcg gccaggcgca gcacggtttc cagcatgtct
      601 tccaggtcca ccagggcgaa gtgcgactcg ctgcgtcggc tgaacgatag catgtcggcg
      661 acgatcttgg cggcgcgcgc gccggcttcg cgaatgcctt cgaggaaatg caggatgcct
      721 cgttccgcca gataacggtt gatccgatgc aaatcggcgt cgatggcggc ggcggcctcg
      781 cggttttgcg gcatgtccgg gtcgatgcgc cgcaggatat tctggctacc ctgcatgatc
      841 acgccgagtg ggttgttgat ttcatgcgcc atgccggccg ccagccctcc gaccgacaac
      901 attttttcag tctgcaccat catggattcg attcggaccc gggcggtgac gtcatccatc
      961 cggatcaccg ccccaccggc gctttccgcc atcaaaggat agaccatcac gtcggcgtag
     1021 tgcagttcgc cgccgacatc gaaggtcatg cgcggcgttt tgatcggttg accgagatgg
     1081 atcgcctgcc gcacctgttc gaattggctt tccagttgtg ggaacacgtc accgaaaaag
     1141 cggccttgcg cggtttccca cgacaggccg ctcacttcct cggccggttg attcagcacg
     1201 gtgatgtaac cccaggggtc cacgccgatc aggatcgagg gcatcgaatc gatgatgttt
     1261 ctgagataga gccgcatgcg ggccacttct tccttggcgt gtttctgttc ggtgatgtca
     1321 atcgatatgc ccagtaccgc ggtttcgttc aagcccggtt cgacgaaggg gattttgatg
     1381 gtctgcatgg tgcgggtgtg gccggtgtgg tcggtgaagg cttcctcggc gatgagcttg
     1441 gacaccccac tttcgatgac ctccaggtcg tcggccaaca ttcgttccac ctcctcgtcg
     1501 gcgccgtgaa cgaggcgatg ggagtgattg gtcagtgctt cgatggtcat gtcgtaggcc
     1561 gcggcggtgg cgcgattggc cagcaggaag tgcccatagc gatccttggc gaaaatcatg
     1621 tgtggcacca gatcgatgat gcgtcgcagt cgcagctcgc tttcatgcag cgccatctcc
     1681 gccttgacct gttcggtgat gtcggtgccg gtgccgcggt agcctcggaa ctggcctccc
     1741 gcgtcatgaa tcggcttgcc gctgactttg aggtaacggc cctcgccgtg agtactcagc
     1801 cgggtttggt agacgaagtc gcggaatggc tggcgttgtt ccagcaatcg ccggtgccgc
     1861 cgccactttt gcgggttgcc ggccaggtcc ccggccgcca gttcccagcg ggcgcggccg
     1921 agcacatgct gcggagcgat cccggtcagc gcgtagaaac gttccgaaag ataagtgaag
     1981 cgcaggttgt cgttcatctc ccagatccag tcgctggcgg cctcggtcac gtcgcggaag
     2041 cgcgcctcgc tttcgcgtag cgccgtttcg gcttgccggc gctcgttcaa ttcccgctgc
     2101 agttgccggg ttcgctgcgt cacttgccgt tgcagcagcg aggaccacag tagggtcgcg
     2161 atcgcgatcg ccgctagtga tcccagaatc cagatggcgt agcgcagcac cgccgccagc
     2221 agactggatt gccggggcgc cgcgcccgga gatggtgcat cgtcgttccg acgatgggtg
     2281 gatgcctcgt ccgggagcgt cgccgccatg tcggacggtg gtggggtggc ggtgctgacc
     2341 gacgtcggtg gcgacggttc ggcgataccg tcgacagggg agatggcccc gattgcgacc
     2401 agtacgacca gcaggcaaat ccacaacctc cggcatcggt gggggacagg tgaccacggg
     2461 cttgagagca tggcgaaagg tgcggatata aaggatggcg cgggtgagat taatcgctat
     2521 gataaaacca tttgggcggt aacaggcacg aataaaggtg cggacgatgg cgttgggacc
     2581 ggtgatgctg gggttggacg gcctggaact gagcgcggaa gagcgcgaaa tcctgtgcca
     2641 tccacgggtg ggcggcgtca ttctgtttgc ccgcaactat cggtcgccgg aacaggtggc
     2701 ggcgctgacc gccgccattc acgccctgcg ccaaccgcgg ctgctggtgg cggtggatca
     2761 cgagggcggc cgggtccaac gattccgcga cggttttacc cgcttgccgc cggtgcgtcg
     2821 gctgggcgag atttacgacc gggaccggat gcgcgccaag caactggcgc gcgtcaccgg
     2881 ctggctcatg gccgccgaac tgcgggcggt gggtgtggat ttgagtttcg cgccggtgct
     2941 ggatctggat cacggcatca gcggcgtgat cggtgaccgg gcgttccacc gcgatcccga
     3001 agcggtggcg gatttggcgc atgcctatgt cagcggcatg cagaaggcgg gcatggaggc
     3061 ggtcggcaag cattttccgg gacatggcgg catcgccgcc gactcgcatc tggagttgcc
     3121 ggtggacccg cgcgcctacg ccgatctgga acacgccgat ctgttggcct tcgagcgcat
     3181 gattcattac gggttggccg ccatcatgcc ggcgcacgtg ctctatcctc aggtggacga
     3241 ccggccggcc ggtttttcgg cgcgctggct gcgggacatc ctgcggcggc gtctggaatt
     3301 ccaggggacg atcttcagcg acgatctgga catggctggc gcgggcgagg ctggcgcggc
     3361 gcccgagcgg gccgaggcgg cgctgacggc cggttgcgac atggtgctgg cctgcaacga
     3421 ccgtcgtgcc gccagcgcca ttctggagcg cctgcgccat gtttcggagc cggtcagcca
     3481 gttgcggttg atccgcttgc acgggcgtgg ccatcccacc ctggaacggc tgcgccgtgg
     3541 cccggtctgg cggcgggcgg ttcgcttggt gcacgactac gatgcgttcc cgctgctgga
     3601 catggatatc tgaaatcgcc gggagcgttt ttttccacct caaccgaaat tcgccatgtc
     3661 ctcgatcacc gccgaacaag ctctcgtcgt gttacgcgag gccgaattgc tctgcgcgcc
     3721 ggagcaagtt gagatcgcgc tggaccgctt ggcggccgcc ataaccgacc gattgggaga
     3781 ctgcgatccg ctggtgctgg tggtcatgaa tggcgcgttt attcccgctg cgctgatctt
     3841 gtcccgcttg cgctttccgt tgcaagtcgg ttacttgcat gccacccgtt atcgcggcgg
     3901 catccgtggc ggcgccatcg actggatcgc gccgccccgg ccggcggtcg ctggccgaac
     3961 cgtgctggtg gtcgacgata tcttcgacga aggcaacacc ctcaaggcga tcctggagga
     4021 agtgcgccgg cagggcgcgg cggcggtcca tagcgcggtg ttggtcaaca agcggcaccc
     4081 tcgcaaggta ccgggactgc gggtggactt tatcgggttg gaggtgccgg atcgctatgt
     4141 gttcggctgc ggcatggact accaggagta ttggcgccag ttgcccgcca tctacgccgc
     4201 ccgcgattga gaaagttgtc ggcccggctc gttaccgtct ttccatgatc gttttcgacc
     4261 gcgaggttac cgtcatgaca cccaccgtcg ccatcatcgg gggcaccggc ctcactaccc
     4321 tggacgctct gcggaccgcg catcgggaaa cgctgtccac gccctacggc gagccgtcca
     4381 gtcccgtgat tcacggtgaa ctgggaggac gggccgtggt ttttctggct cgccatggcc
     4441 agcatcacac cttgccgccg cacaagatca actaccgggc caacctctgg gcgctacacc
     4501 ggctcggcgt cgagcaggtc atcgcggtgg cggcggtggg cggcattcgc gccgatatgg
     4561 aacctggggt actggcgttt cccgatcaga tcgtggatta cacctggggc cggcactgta
     4621 ccttcttcga ggacaacctg agccacgtca cccatatcga ttttaccgag ccttactgcc
     4681 tcgaactgcg ggaacggctg atccaggcgg ctcgcgagct gggccttgag gcgcgcgagt
     4741 catgcactta cgcggcgatg tcggggccac ggctggagac ggcggcggaa gtccgccggc
     4801 tggagcggga cggttgcgnn nnnnnnnnnn nnnnnnnnnn nnnnnagccg ccctggctcg
     4861 cgagctgggc ttgcgctatg ccgcgtgcgc ggtggtggcc aactgggcgg cgggtaaggt
     4921 tgccggtacg attagcatgg ccgagatcga gagcaatctg gccggcagca tggagaaggt
     4981 caaggcgctg ctggcgcaag tcatcccgac cctggtaccg cgctgacctc acccgccgcg
     5041 cacgatccga ccggttcgca agctgcggat ggcggtcggt tgggccgcgc cgccggtcgg
     5101 gccgggtagc agggcatcga gatgtcgccc cagttgccgg cgcgccgcca gcgcgccacg
     5161 agccggcgga cggccgctga ggttggcgct ggtggatacc agcggatggc cgcaaacgcg
     5221 acacagcgcc gccgccagcg gatgagcggt cacccgcacc gccagggtgt cgtgccggcc
     5281 ccgcagccag cgtggggttc cgcgccgggc gggaaccaac caagtgtgag gtcccggcca
     5341 gctagcgcgc aatcgcccca gctcggttgg ggtcagggga cgcagaaaag gggcaagttg
     5401 ggcaaaatcg gccgcgatca ggatcaggcc cttatgcgct gaccggcgct tgagcgccag
     5461 caagcgccgt accgcccgtt cgttgcgagg atcgcagccg agtccgtaca ccgcctcggt
     5521 ggggtaggcg atcaggccac cagagcgaac cgcgcgggcg gcggcgcgca agcgtggcgg
     5581 gttgatggtg gacacgccgg gtcaggacga cctggtggcg cggcggcggg tgggcttctt
     5641 gtcgggggcc tcgcgtacca gcgcttcgca ctcggccagc gccaggctgg ctggatcgcg
     5701 gtccttgggc agacggacgt ttttcttgcc gtcggtgatg taggggccga agcggccgtt
     5761 caggatgcgg atggaagaat ccgggaactc gcgcagaacc ttgttggcca ccgcctgccg
     5821 gtgcgcggcg atcagttcca gcgcccgctc gcggctgacg gtgtagggat cttcctgttt
     5881 cagcgaaacg tagctcttgc cgtagcg
//