LOCUS       DUVC01000185            2006 bp    DNA     linear   ENV 25-APR-2020
DEFINITION  TPA_asm: Thermoanaerobacterales bacterium isolate AS08sgBPME_326
            192793_AS08, whole genome shotgun sequence.
ACCESSION   DUVC01000185 DUVC01000000
VERSION     DUVC01000185.1
DBLINK      BioProject: PRJNA602310
            BioSample: SAMN13894581
            Sequence Read Archive: SRR2917896, SRR2917897, SRR2917898
KEYWORDS    WGS; Third Party Data; TPA; TPA:assembly.
SOURCE      Thermoanaerobacterales bacterium (anaerobic digester metagenome)
  ORGANISM  Thermoanaerobacterales bacterium
            Bacteria; Firmicutes; Clostridia; Thermoanaerobacterales.
REFERENCE   1  (bases 1 to 2006)
  AUTHORS   Campanaro,S., Treu,L., Rodriguez-R,L.M., Kovalovszki,A.,
            Ziels,R.M., Maus,I., Zhu,X., Kougias,P.G., Basile,A., Luo,G.,
            Schluter,A., Konstantinidis,K.T. and Angelidaki,I.
  TITLE     New insights from the biogas microbiome by comprehensive
            genome-resolved metagenomics of nearly 1600 species originating
            from multiple anaerobic digesters
  JOURNAL   Biotechnol Biofuels 13, 25 (2020)
   PUBMED   32123542
  REMARK    Publication Status: Online-Only
REFERENCE   2  (bases 1 to 2006)
  AUTHORS   Campanaro,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (28-JAN-2020) Department of Environmental Engineering,
            Technical University of Denmark, Bygningstorvet, Bygning 115,
            Kgs. Lyngby 2800, Denmark
COMMENT     The annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (PGAP). Information about PGAP can be found here:
            https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Date          :: JUN-2018
            Assembly Method        :: Megahit v. 1.1.1
            Genome Representation  :: Full
            Expected Final Version :: No
            Genome Coverage        :: 156.595703221x
            Sequencing Technology  :: Illumina HiSeq 2500
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 04/10/2020 23:27:55
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline (PGAP)
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS-2+
            Annotation Software revision      :: 4.11
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 2,733
            CDSs (total)                      :: 2,679
            Genes (coding)                    :: 2,639
            CDSs (with protein)               :: 2,639
            Genes (RNA)                       :: 53
            tRNAs                             :: 48
            ncRNAs                            :: 5
            Pseudo Genes (total)              :: 40
            CDSs (without protein)            :: 40
            Pseudo Genes (ambiguous residues) :: 0 of 40
            Pseudo Genes (frameshifted)       :: 17 of 40
            Pseudo Genes (incomplete)         :: 21 of 40
            Pseudo Genes (internal stop)      :: 10 of 40
            Pseudo Genes (multiple problems)  :: 7 of 40
            CRISPR Arrays                     :: 2
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..2006
                     /organism="Thermoanaerobacterales bacterium"
                     /mol_type="genomic DNA"
                     /submitter_seqid="192793_AS08"
                     /isolate="AS08sgBPME_326"
                     /isolation_source="anaerobic digestion of organic wastes
                     under variable temperature conditions and feedstocks"
                     /db_xref="taxon:2304039"
                     /environmental_sample
                     /metagenome_source="anaerobic digester metagenome"
                     /note="metagenomic"
     gene            <1..116
                     /locus_tag="GX723_07720"
     CDS             <1..116
                     /locus_tag="GX723_07720"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_011459575.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=3
                     /transl_table=11
                     /product="rubrerythrin family protein"
                     /protein_id="HHX23876.1"
                     /translation="NTEEVFYYLCPVCGNIEKFQPEKCSICGVPGDKFIKY"
     gene            complement(207..827)
                     /locus_tag="GX723_07725"
     CDS             complement(207..827)
                     /locus_tag="GX723_07725"
                     /inference="COORDINATES: protein motif:HMM:NF014566.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="CPBP family intramembrane metalloprotease"
                     /protein_id="HHX23877.1"
                     /translation="MAAMAIVSFTNLFAANLSSAAIILGIVIFFVCQAVEKQPMQGSG
                     LDFKAIGKGLKDKKIWIWLIMPIVMDAVCVLLSVLFLPEYIEYEPARAGSFVVIEISA
                     KSLLMFLVFALGEEIAWRAFSQNRLSKILPIIPAVLVTSFLFTLGHYKQGDMTIVLFG
                     LVFTFINSILYGIIFHKAKNAWISTTSHFAANIFEVLLFILIAKVN"
     gene            complement(1146..1775)
                     /locus_tag="GX723_07730"
     CDS             complement(1146..1775)
                     /locus_tag="GX723_07730"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_003454289.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="GntR family transcriptional regulator"
                     /protein_id="HHX23878.1"
                     /translation="MSQRIDNPRYMNIALDIAGRICKGEYKEGDRIFGRSVLASEYNV
                     SPETIRRAINLLEDMQVVIAKQGSGINVSSLSNAYSFVRKFQNKDTIRSLKNEIRALI
                     SQKNDVENSIKKKIDMIVDYSDRLKNSNVFTPLEIEIDDNSHLIGRTISETKFWQNTG
                     ATIIAIKRNDALVLSPGPYGGFEKGDVIFVIGGEDVIERIEKFMKEFKA"
BASE COUNT      610 a    409 c    370 g    617 t
ORIGIN
        1 ataatacaga agaagtcttt tattatcttt gccctgtctg tggaaatatc gagaagtttc
       61 agcctgaaaa atgcagtatt tgtggcgttc cgggggataa atttattaag tattaaatct
      121 gagttaatga ataaaaaact tattgatatc caatgttaat aagagcaagg cactctatgg
      181 aatcaaagga tcctctcgta gttcatttaa ttaacttttg ctattaatat gaataaaagg
      241 acttcaaata tattagcagc aaaatgtgat gttgtgctaa tccatgcatt ctttgcctta
      301 tggaatatga ttccatataa aatactgtta atgaaagtaa agacaagccc gaataaaacg
      361 atagtcatat ccccttgctt atagtgacca agggtaaata agaaagaagt tacaagtaca
      421 gctgggatta tcggcagtat cttgctgagc ctgttttggg agaatgccct ccacgcaatt
      481 tcctcaccca aggcaaaaac taaaaacata agtaaagatt tagcgctgat ttctatcacc
      541 acaaatgacc cggctcttgc aggttcatat tcgatatatt ccggcaaaaa caataccgac
      601 agtaacacgc atacggcgtc catgacgatt ggcatgataa gccatatcca tatttttttg
      661 tccttcaagc ccttgccgat tgccttaaaa tcaaggccgc ttccttgcat tggctgcttt
      721 tcaacagctt ggcaaacaaa gaaaatgact ataccaagaa taattgccgc agacgataag
      781 tttgccgcaa ataaatttgt aaaagagacg attgccatgg cagccaaaac gattataccc
      841 accacagatt gttttttgtg tattcgttca tttttctcct taattcttga aaggataccg
      901 ctgcacagta agcaaggggg tcaccccaga ccgccccctt gcttctccgg tcgttatggc
      961 tgttatatga ctccaaccga agaaagatac cctaagatgc tcataatcgc aattaaagta
     1021 attgcgatga taaagatgat attccaatat ttacgggggt ttttgtacca ttgaaatagg
     1081 ccaaacgaga aaagaaaggt tctctcaatt cgcactgtgt ttcccatatc acgatattat
     1141 cctattcaag ccttaaattc cttcatgaac ttttctatcc tttcgataac atcttcaccg
     1201 ccgataacga aaatgacgtc tcctttttca aatccgccat atggtcccgg ggaaaggaca
     1261 agggcgtcgt ttcttttaat tgcgataata gttgcgccgg tattctgcca gaacttagtt
     1321 tcggatatgg ttcggccaat aagatgagaa ttgtcgtcta tttcgatttc aaggggggta
     1381 aagacattcg aatttttcag gcggtcagaa taatccacaa tcatgtctat tttctttttt
     1441 atagagtttt caacatcgtt cttttgagaa atcagtgctc ttatttcgtt cttaagagag
     1501 cggatagtat ctttgttctg aaatttcctg acaaaagaat atgcgttgct aagggaggat
     1561 acattgatac cgcttccctg cttggctatc accacttgca tgtcctccaa cagatttatc
     1621 gcccgtctga tggtttccgg cgagacatta tattcgctgg ctaataccga ccggccgaaa
     1681 atcctgtcgc cttccttata ttcaccttta catattctgc cggcaatatc taaagcaata
     1741 ttcatatacc ttgggttatc aattctttga ctcatatgat atctcctcaa aataataaaa
     1801 atatctgttt ttacagttat tataccgttt tttaaaatga tttggcaata aattcataaa
     1861 aacacaggca cctaagattt gaaccgctcc ctgtcaagta gacagacaaa ataataaaat
     1921 ttacagattg gattccgtct aatacgggat ccagttttct tatgcggcgt tttcaagcaa
     1981 ctggcggtat gatgtggggg acaggc
//