LOCUS       JAGRDC010000198         5873 bp    DNA     linear   ENV 12-OCT-2021
DEFINITION  MAG: Anaerolineales bacterium isolate HKST-UBA42
            NODE_198_length_5873_cov_1.679089, whole genome shotgun sequence.
ACCESSION   JAGRDC010000198 JAGRDC010000000
VERSION     JAGRDC010000198.1
DBLINK      BioProject: PRJNA432264
            BioSample: SAMN14563033
KEYWORDS    WGS; Metagenome Assembled Genome; MAG.
SOURCE      Anaerolineales bacterium (activated sludge metagenome)
  ORGANISM  Anaerolineales bacterium
            Bacteria; Chloroflexi; Anaerolineae; Anaerolineales.
REFERENCE   1  (bases 1 to 5873)
  AUTHORS   Wang,Y., Ye,J., Ju,F., Liu,L., Boyd,J.A., Deng,Y., Parks,D.H.,
            Jiang,X., Yin,X., Woodcroft,B.J., Tyson,G.W., Hugenholtz,P.,
            Polz,M.F. and Zhang,T.
  TITLE     Successional dynamics and alternative stable states in a saline
            activated sludge microbial community over 9 years
  JOURNAL   Microbiome 9 (1), 199 (2021)
   PUBMED   34615557
  REMARK    Publication Status: Online-Only
REFERENCE   2  (bases 1 to 5873)
  AUTHORS   Zhang,T.
  TITLE     Direct Submission
  JOURNAL   Submitted (09-APR-2020) Civil Engineering, The University Hong
            Kong, Pokfulam Road, Hong Kong 999077, Hong Kong
COMMENT     The annotation was added by the NCBI Prokaryotic Genome
            Annotation Pipeline (PGAP). Information about PGAP can be found
            here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method        :: CLC de novo assembler v. 6.04
            Genome Representation  :: Full
            Expected Final Version :: No
            Genome Coverage        :: 22x
            Sequencing Technology  :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 04/20/2021 04:37:49
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline (PGAP)
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS-2+
            Annotation Software revision      :: 5.1
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 5,483
            CDSs (total)                      :: 5,457
            Genes (coding)                    :: 5,445
            CDSs (with protein)               :: 5,445
            Genes (RNA)                       :: 26
            tRNAs                             :: 23
            ncRNAs                            :: 3
            Pseudo Genes (total)              :: 12
            CDSs (without protein)            :: 12
            Pseudo Genes (ambiguous residues) :: 0 of 12
            Pseudo Genes (frameshifted)       :: 1 of 12
            Pseudo Genes (incomplete)         :: 11 of 12
            Pseudo Genes (internal stop)      :: 1 of 12
            Pseudo Genes (multiple problems)  :: 1 of 12
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..5873
                     /organism="Anaerolineales bacterium"
                     /mol_type="genomic DNA"
                     /submitter_seqid="NODE_198_length_5873_cov_1.679089"
                     /isolate="HKST-UBA42"
                     /isolation_source="activated sludge from Shatin waste
                     water treatment plant collected monthly from 2007 through
                     2015"
                     /db_xref="taxon:2073117"
                     /environmental_sample
                     /geo_loc_name="China:Hong Kong SAR, Shatin waste water
                     treatment plant"
                     /lat_lon="22.406236 N 114.213394 E"
                     /metagenome_source="activated sludge metagenome"
                     /note="metagenomic"
     gene            <1..703
                     /locus_tag="KDE04_07790"
     CDS             <1..703
                     /locus_tag="KDE04_07790"
                     /inference="COORDINATES: protein motif:HMM:NF013593.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=2
                     /transl_table=11
                     /product="cell division protein FtsH"
                     /protein_id="MCB0006341.1"
                     /translation="NKQALVHMHDFDRALDKITLGGERPLLLNEQDRRVVAYHEAGHA
                     LVAWLLPAADTVHKVTIIPRGRSLGVTEQRPQADQYNLSNRYLLARLAVLLGGRTAEE
                     IAIGDITTGAENDLVEATRLARRMVTRWGMSEVGLATFDLDEAQPFLGYELAQGRPYA
                     DATAARIDAAIQDLLTEQHELVHDLLAAHQEQLDALAAMLLSQETVAEAGLAQLLGER
                     SDEQIDFQLAELVPG"
     gene            complement(711..1733)
                     /locus_tag="KDE04_07795"
     CDS             complement(711..1733)
                     /locus_tag="KDE04_07795"
                     /inference="COORDINATES: protein motif:HMM:NF015651.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="flippase-like domain-containing protein"
                     /protein_id="MCB0006342.1"
                     /translation="MLRRLLFLGLVVAFVWVVHSRFDEIRDLTATLLAGRWPWILSAI
                     LLQIVYWVSYALLFRASFAVVGVHSPLRSLIPLVLASVFVNSATPSGGTGGIALFVDD
                     ARRRGQSAARAAAGMLLVNIADFGSFLVVLTVGLVILLLLHDLVVYEIVTAFLMYTYV
                     GVMVAVLLLGLWRPDRLRRLLQRVQQGVNRVGAWLRYPTILDENWSDNNSREFEAAAN
                     QIAAHPDRLLEMSGIGLVAHVINICTLYSIFLAFNIKGSLGIIIAGYAMTRLFWIISP
                     TPNGIGIVEALMPVVYTSLGLETAGATIITVAYRGISFWFPFVLGFFLLRRSLLRNSG
                     DEALPI"
     gene            1833..3077
                     /locus_tag="KDE04_07800"
     CDS             1833..3077
                     /locus_tag="KDE04_07800"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_013560669.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="MFS transporter"
                     /protein_id="MCB0006343.1"
                     /translation="MTPGENQASSQSIWGQLRALPRNIWVLTVSSLLTDISSEMIVHL
                     LPLFLANVLGVRTVTIGLIEGMAETTASLLKLWSGRFSDKLGQRKSLTVAGYALSTFA
                     KPFLVLAHSWPAVLTIRFLERSGKGLRAAPRDALVADSIPAERRGFAFGVHRTGDTAG
                     AALGLVVAILVIWRLQGQALLLQQATFTTVVWLSVIPAFLAVILLAFGIQEIRPAAGA
                     AQNRPAATPLPANFRRFLLIVLVFTLGNSADAFLVLRAQERGLSLLGVLGMLLAFNVV
                     YALVAGPAGSLSDRIGRRRLLLAGWLLYALIYLGFALARNSSEVIGLMLLYGLYNALT
                     VGAAKSFVADLVPVQQRGTAYGWYNGAIGLAALPASIIAGLLWQGAGSWPGFGAPAPF
                     LFGASMSLIALVLLLWWLPQTE"
     gene            complement(3105..3596)
                     /locus_tag="KDE04_07805"
     CDS             complement(3105..3596)
                     /locus_tag="KDE04_07805"
                     /inference="COORDINATES: protein motif:HMM:NF012301.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="response regulator transcription factor"
                     /protein_id="MCB0006344.1"
                     /translation="MTPSFVAGRCRLYADEGGAAASYNVAMANERIRVFILERHPAIR
                     AGLRALLESSPEIVVVGEAGDSAGILAHIMATQPDIVLLDPDLPDDGGLPLLRELSAA
                     LPAASLLVHTDDGASAKMQAAFAAGVQGYLLKGQSSQKLVQTIRLLYASRVDLTANRD
                     LPD"
     gene            complement(3593..4261)
                     /locus_tag="KDE04_07810"
     CDS             complement(3593..4261)
                     /locus_tag="KDE04_07810"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_013559671.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="response regulator transcription factor"
                     /protein_id="MCB0006345.1"
                     /translation="MSEKIRIFLADDHAVVRRGLEALLATEMDMEVVGTAANGGEAVE
                     RIIQLQPGVVLLDLQMPQKSGVDVLNEIRQGARDTRVLVLTSFSDRETIYEAIKAGAI
                     GYLLKDSSPEELVQAIRNTHVGKSTLSPDVALKLIEEISKPPRSNVPLTEEPLTEREV
                     EILKYVARGMNNQEIADLLFLSERTVRTHISNILSKLHLANRTQATLYALREGIARLD
                     DDAI"
     gene            complement(4265..5398)
                     /locus_tag="KDE04_07815"
     CDS             complement(4265..5398)
                     /locus_tag="KDE04_07815"
                     /inference="COORDINATES: protein
                     motif:HMM:NF014567.1,HMM:NF019350.1,HMM:NF024583.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
/codon_start=1 /transl_table=11 /product="GAF domain-containing sensor histidine kinase" /protein_id="MCB0006346.1" /translation="MDGWTEQLAALYDVLSLTSDVVSDLTLVLHRALGRILAVTVTRN GAIHILNEAATQLELVTEIALPPTLVAQWRQIAADQPPFKAALASEHYITLDLSQEPA LAGLTGLAGVQTLLSFPIRKGARNLGTLTAVLGRETVDEGEMRLFVSLADQLAIIIEN AQLRRRAEQLAVVEERNRLARELHDSVTQSLYSTTLFAEAAQRQARAGHIEMALQHLA EVAETSQQALKEMRLLVHKLRPSVLEKEGLLPALRGRLKAVEGRAGIQHELLAEGELH LSRDLEDTSYYIVQEALNNALKHSRAANVTVEIRQTDHELWLIARDDGLGFDQTAALA GGGQGLNNIRERVEQLAGTLQICSEIGHGTSVTVMLPAKCPQD" gene 5644..>5873 /locus_tag="KDE04_07820" CDS 5644..>5873 /locus_tag="KDE04_07820" /inference="COORDINATES: ab initio prediction:GeneMarkS-2+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS-2+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="MCB0006347.1" /translation="MEIRDSQVYRQAIHDFVQARRRASLHELLSGLTGRSNQLLPYND IARDLQITNVHSAGLEEVPLEAIVGSVGRYGD" BASE COUNT 1214 a 1838 c 1700 g 1121 t ORIGIN 1 caacaagcag gcgctggtcc acatgcatga ttttgaccgc gctctggaca agatcaccct 61 cggcggcgaa cggccgctgc tgctaaacga gcaggatcgc cgggtagtcg cctaccacga 121 agctggccac gcgctcgtcg cctggctcct gccggcggcc gacaccgttc acaaggtcac 181 catcatcccc agaggccgct ccctgggcgt gacggaacaa cggccccagg cagaccagta 241 caacttgagc aatcgttacc tgctggcccg gctggcagtg ctcctcggcg ggcgtacggc 301 tgaggagatc gccatcggcg acatcaccac cggggccgaa aatgacctgg tcgaggcgac 361 gcggctggcc cgccgcatgg tcacccgctg gggcatgagc gaggtcgggc tggccacatt 421 cgacctggat gaggcccaac cgttcctggg ctatgagctg gcccagggac gaccctatgc 481 cgacgccacc gccgcccgca tcgatgccgc cattcaggat ctgctaactg agcaacatga 541 attggtccat gatttgttgg ctgcccacca ggagcagctc gatgccctgg ccgctatgct 601 cctgagccag gagaccgtgg cggaagccgg cctggcccaa ctcctgggcg agcgcagtga 661 tgagcagatc gattttcaac ttgcggagtt ggtgccgggc tgaatcaggt ctagatcggc 721 aacgcctcat ccccggagtt ccgcaacagg gagcggcgca gcaggaagaa gcccagcaca 781 aacgggaacc agaaggagat accccgatag gcgacggtga ttatcgtggc gccggccgtt 841 tccagcccca gcgaggtgta gacaaccggc 
atcagcgcct ccacaatgcc aatgccattc 901 ggcgtcggcg agatgatcca gaacagccgc gtcatcgcgt acccggcgat gatgataccc 961 aacgatccct tgatgttgaa cgccagaaag atcgagtaca aggtacagat gttgataacg 1021 tgggcaacca gaccgatgcc gctcatttcc agcaaccgat cagggtgggc cgcaatctga 1081 ttggcagcag cctcaaactc ccggctgttg ttgtccgacc aattctcatc caggatggtg 1141 gggtagcgca gccaggcgcc gacacgattg accccttgct gcacccgctg caacagccgt 1201 ctaagccgat caggtcgcca tagtcccagc aacaaaacgg ccaccatcac acctacatag 1261 gtgtacatga gaaacgccgt cacaatttcg tagacgacga gatcgtgcag gagcagcagg 1321 atgaccaggc caaccgtcaa gacaaccaga aacgagccaa aatcagcgat attgaccagc 1381 aacatgccag ccgcagcccg ggcagccgat tgaccacgcc ggcgggcgtc atccacgaag 1441 agggcaatgc cgccggtgcc gcccgatggg gtagcggagt tgacaaagac gctggccaga 1501 accagcggaa tcagactacg cagcggacta tgaacaccga caacggcaaa cgaggcccgg 1561 aacagcaggg cgtaactgac ccagtaaaca atctgcagca aaatcgccga caagatccag 1621 ggccagcgac cggccagcag agtagcggtc aaatcccgta tctcgtcaaa gcggctgtgt 1681 acaacccaga caaaggccac gaccaggccc aaaaagagaa gacgacgcaa catgaaaaat 1741 gaatcctgat aattgcaaag ttaaagcatt gcggcggcaa tgctttgtgc taactttacc 1801 cgaagattga cggcgacgcg aaaggaaagc aaatgacacc tggagaaaat caggcatctt 1861 ctcagtcaat ttggggccag ttacgggccc tgccacgcaa catctgggtc ctgacggtct 1921 cctcgctgct gaccgacatc tccagcgaga tgatcgtcca tttactaccg ctttttctgg 1981 caaatgttct gggtgtgcgt acggtgacca tcggcctgat tgaggggatg gcggagacga 2041 cggccagcct gctgaagctg tggtccggcc ggttctccga caaactcggc cagcgcaaat 2101 cgctcaccgt agccggctac gccctctcca cattcgccaa gccttttctg gtcctggcgc 2161 acagctggcc cgccgtcctg accatccgct ttctggaacg gagcggcaag gggctccggg 2221 ccgccccgcg ggacgcgctg gtggctgaca gtatcccggc cgaacggcgc ggtttcgcct 2281 tcggcgtcca tcgcaccggc gacactgccg gcgcggccct gggcctggtg gtcgccattc 2341 tggtgatctg gcggctgcaa ggtcaggcgt tgctcctgca gcaagccacc ttcacaaccg 2401 ttgtctggtt aagcgtcatc ccggctttcc tggcggtcat cctgctggcc ttcggcatcc 2461 aggagatccg accggcagcc ggggcagccc aaaaccggcc ggcggccacg cccctgccgg 2521 ccaacttccg ccgtttcctg ctgatcgtcc tggtttttac 
cctgggcaac tcggcggatg 2581 cctttctggt gctgcgggcc caggaacggg ggctatcctt gctgggtgtt ctggggatgt 2641 tgctggcatt taacgtggtt tatgcgctgg tggcgggacc ggccggttcc ctatcggaca 2701 gaatcggccg gcgccggctg ctgctggccg gctggctgct ttatgcgctc atctacctcg 2761 gctttgccct ggctcgcaac agcagtgagg tgattgggct gatgctgctt tacggcctgt 2821 acaatgcgtt gacggtcggc gcggccaaat cctttgtggc cgacctggta ccagtccagc 2881 agcgcggcac cgcctacggc tggtacaacg gcgccatcgg cctggctgcc ctaccggcca 2941 gcatcatcgc cggcctgctc tggcagggtg ccggctcatg gcctggattt ggggcgcccg 3001 ccccatttct attcggcgcc agcatgtcgc tgattgccct ggttcttttg ctgtggtggc 3061 tgccgcaaac agaatgagtc ttcaggggcg gcaatggcgc ctggtcaatc aggcaaatcc 3121 cgattggccg tcagatcaac ccggctcgca tataacaaac gaatggtctg caccagcttc 3181 tgagagctct gccccttgag cagatagccc tgtaccccgg ccgcaaaagc tgcctgcatt 3241 ttagcggaag caccgtcatc ggtatggacc aggagagaag cggctggcag cgctgcactt 3301 aattctcgga gtagcggcag ccccccatca tcgggcaaat ccgggtccag cagaacaata 3361 tcgggctgtg ttgccatgat atgggccagg atcccggcag aatccccggc ttcaccgaca 3421 acaacgatct ccggggacga ctcaagcagg gctcgcagcc cggcccgaat ggcgggatgc 3481 ctttccagga taaagactct tatgcgctca ttggccatgg ccacattgta gctagcggca 3541 gcgcctccct catcggcgta aaggcggcaa cgacctgcga caaatgacgg tgtcaaatag 3601 cgtcatcgtc cagacgggca atcccttcgc gcagcgcata aagcgtcgcc tgcgtgcggt 3661 tggccaggtg gagtttgctg agaatgttgc tgatatgcgt tcgcacggtc cgctcactga 3721 gaaagagcag atcggcaatt tcctgattat tcatgccccg ggccacatat ttcaggatct 3781 caacctcgcg ctccgtgagc ggctcttctg ttagcggcac attgctccgg ggcggcttgc 3841 tgatctcttc gatgagtttc agcgccacat ccggcgacag cgttgatttg ccaacatggg 3901 tattacgaat cgcctgaacc agctcttccg ggctgctatc cttcagcaga tagccaatgg 3961 cgccagcctt gatcgcctcg taaatcgtct cccgatcgct aaagctggtc aacaccaaaa 4021 cacgcgtatc ccgagcgccc tggcggatct cgttgagcac atccacgccg cttttttgcg 4081 gcatttgcag gtccagcagg accacgcctg gctgtagctg aatgatgcgt tccactgcct 4141 ctcccccatt ggcggcggtc cccacaactt ccatgtccat ctcggtggcc agtagagctt 4201 ccaggccccg gcgcaccaca gcgtggtcat cagccagaaa aatacgaatt 
ttttcactca 4261 tcatttagtc ctgggggcat ttggcgggca gcataacggt cactgacgtc ccatggccaa 4321 tttccgaaca gatctgcaag gtaccggcca actgctcgac ccgttcgcga atattgttca 4381 gcccctggcc gccaccggcc agggcggcag tctgatcaaa acccaggcca tcgtctcgcg 4441 ctatcaacca caactcgtga tccgtttgcc gtatttccac agtcacattg gcagcccgcg 4501 aatgttttag ggcattattc agcgcttcct ggacaatgta atacgatgtg tcttccaggt 4561 cccgggacag gtgcagctcg ccctcggcca gcaactcatg ctggatgccg gcgcgccctt 4621 caactgcctt aaggcggcct cgcaacgccg gcaacagccc ttctttttcc aggacagagg 4681 gtcggagctt atgaaccaac agccgcatct ccttgagcgc ctgctggctc gtctccgcga 4741 cctcagccag gtgctgcagc gccatctcaa tgtggccggc gcgggcctgg cgctgggccg 4801 cttcggcaaa tagggtggtg ctgtacaacg attgggtcac cgaatcatgc agctcacgcg 4861 ccagccggtt gcgttcctcc accaccgcca gctgctcagc ccgccggcgc aactgcgcat 4921 tttcaatgat gatagccagc tggtccgcca gggagacaaa caggcgcatc tcgccctcgt 4981 cgactgtttc acggcccagc acagcggtca gcgtacccag attgcgggcg cctttgcgaa 5041 tggggaagct cagcagcgtt tgcacaccag ccaggccggt caggccagcc agcgctggct 5101 cctgtgacag gtcgagggtg atgtaatgct cgctggccaa cgctgccttg aagggtggtt 5161 gatcggccgc tatctgacgc cattgcgcga caagggtcgg cggcagcgcg atctcggtca 5221 ccagctccag ctgggtggcc gcttcattca gaatgtggat ggcgccattg cgggtgacgg 5281 tcaccgccag gatgcggccc agcgcccggt gcaacacaag ggtcagatcg ctgaccacat 5341 cactggtcag cgaaagcacg tcatagagcg cggccagctg ttctgtccat ccatccatcg 5401 ctaatgtggg tctcgctttt gcctagagca tcttgaagcc gaagcagatt gccgtgacaa 5461 ggtttttaca atcgacaggc tacattctaa tgggaatagc aactgcaatc aatgcggccc 5521 tggcgggcaa tcaaatcaga ccttgagttc acagtccaac agtgataaaa aagcagatgt 5581 ctgagcacac atttttcacc cgaccaggct cgggccgcat gcatttataa cgattggtgg 5641 cctatggaga tacgggatag ccaggtttat cgacaggcaa ttcatgattt tgtgcaggca 5701 cggcggcggg caagcctgca tgagttgctg agtgggttga ccggccggtc caaccaactc 5761 ctgccctaca acgatatcgc ccgggacctg caaatcacca acgtccacag cgccggcctc 5821 gaggaagtgc ccctggaagc catcgtcggc agcgtgggcc gctacggcga ttt //