LOCUS       DPWX01000108            7165 bp    DNA     linear   ENV 10-SEP-2018
DEFINITION  TPA_asm: Clostridiaceae bacterium isolate UBA10592 contig_2172,
            whole genome shotgun sequence.
ACCESSION   DPWX01000108 DPWX01000000
VERSION     DPWX01000108.1
DBLINK      BioProject: PRJNA417962
            BioSample: SAMN08020198
            Sequence Read Archive: SRR6482904
KEYWORDS    WGS; Third Party Data; TPA; TPA:assembly.
SOURCE      Clostridiaceae bacterium (fermentation metagenome)
  ORGANISM  Clostridiaceae bacterium
            Bacteria; Firmicutes; Clostridia; Clostridiales; Clostridiaceae.
REFERENCE   1  (bases 1 to 7165)
  AUTHORS   Parks,D.H., Chuvochina,M., Waite,D.W., Rinke,C., Skarshewski,A.,
            Chaumeil,P.A. and Hugenholtz,P.
  TITLE     A standardized bacterial taxonomy based on genome phylogeny
            substantially revises the tree of life
  JOURNAL   Nat. Biotechnol. (2018) In press
   PUBMED   30148503
  REMARK    Publication Status: Available-Online prior to print
REFERENCE   2  (bases 1 to 7165)
  AUTHORS   Parks,D.H.
  TITLE     Direct Submission
  JOURNAL   Submitted (04-APR-2018) School of Chemistry and Molecular
            Biosciences, University of Queensland, Chemistry Bld, Cooper Road,
            St Lucia, Brisbane, Queensland 4072, Australia
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method        :: CLC de novo assembler v. 4.4.1
            Expected Final Version :: yes
            Genome Coverage        :: 379.02x
            Sequencing Technology  :: Illumina HiSeq 2000
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 05/08/2018 23:22:34
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS+
            Annotation Software revision      :: 4.5
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 2,237
            CDS (total)                       :: 2,197
            Genes (coding)                    :: 2,168
            CDS (coding)                      :: 2,168
            Genes (RNA)                       :: 40
            tRNAs                             :: 37
            ncRNAs                            :: 3
            Pseudo Genes (total)              :: 29
            Pseudo Genes (ambiguous residues) :: 3 of 29
            Pseudo Genes (frameshifted)       :: 9 of 29
            Pseudo Genes (incomplete)         :: 16 of 29
            Pseudo Genes (internal stop)      :: 4 of 29
            Pseudo Genes (multiple problems)  :: 3 of 29
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..7165
                     /organism="Clostridiaceae bacterium"
                     /mol_type="genomic DNA"
                     /isolate="UBA10592"
                     /isolation_source="fermentation"
                     /db_xref="taxon:1898204"
                     /environmental_sample
                     /note="metagenomic; derived from metagenome: fermentation
                     metagenome"
     gene            complement(<1..1015)
                     /locus_tag="DHW76_06830"
     CDS             complement(<1..1015)
                     /locus_tag="DHW76_06830"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_016763417.1"
                     /note="catalyzes the transamination of the aromatic amino
                     acid forming a ketoacid; first step in aromatic amino acid
                     degradation in lactococci; Derived by automated
                     computational analysis using gene prediction method:
                     Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="pyridoxal phosphate-dependent aminotransferase"
                     /protein_id="HCL50700.1"
                     /translation="MEELINKRVKEIQISGIRKFYNMTLKYPDVISLTLGQPDFNTPE
                     HVKEAGIKAINANKTKYTENPGLLELRKEISKYIKLKYGMDYDPYNEIIVTNGASEGI
                     DSTLRTILDEGSEVILPGPVYPGYEPIIKMCGAVPKYIDTRKNNFKITAEDIGKNITS
                     KTRCIILPYPSNPTGAVLTKDEVQKIADVLRDREIFVLSDEIYSELVYDRNHFSIASI
                     PGMKEKTIVINGLSKSHSMTGWRIGFILGPSYLTKHIVKVHQYNATCASSISQYAALE
                     AVKNGSDDPLAMKNEYKKRRDYIFNRLINMGFEVAKPDGAFYIFPSISKINLISFEFA
                     SKLL"
     gene            complement(1131..1880)
                     /locus_tag="DHW76_06835"
     CDS             complement(1131..1880)
                     /locus_tag="DHW76_06835"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_008907844.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="4-hydroxy-tetrahydrodipicolinate reductase"
                     /protein_id="HCL50701.1"
                     /translation="MNILLHGCNGRMGQVLTRLIPEEDDMNIAAGVDSDVNKFKNSYP
                     VYNSLKQVKEKFDLLIDFSNHTAIENILNFGLSKKVPLVICTTGFTDEEKNKMLEASK
                     QIPIFNSSNMSLGVNLLISLSKQAAQVLGSDYDIEIIEKHHNQKLDSPSGTALMIADA
                     INKTLGNNSNYVYGRHSKTSKRDKNEIGIHSVRGGGVVGEHEVLFLGSGDVIEIKHSA
                     ISRDVFGYGAIKAARFIIDKNPKLYSMDDLI"
     gene            complement(1891..2772)
                     /locus_tag="DHW76_06840"
     CDS             complement(1891..2772)
                     /locus_tag="DHW76_06840"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_008907845.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="4-hydroxy-tetrahydrodipicolinate synthase"
                     /protein_id="HCL50702.1"
                     /translation="MSVFTGSGVAIVTPFNKEGVDFEKLGELLEWHIKEKTDAIIICG
                     TTGESSTMSLDEKKETIKYTVNKVNGRIPVIAGTGSNNTKAAVEMSVWAESIGVDALL
                     LITPYYNKTTQRGCIEHFKAIANSVTKPIIIYNVPGRTGLNLLPQTLYELSKIKNIVA
                     VKEASSNIDQISEIARLCGDNLDIYSGNDNETIPIMSLGGKGVISVLANILPRDTHDL
                     CQKFLDGDLKGARDMQLKLLPLMNALFIETNPIPVKTAMNLMGMNVGHLRLPLVDMSE
                     KNLEALKKEMINYGIKL"
     gene            complement(2847..3830)
                     /locus_tag="DHW76_06845"
     CDS             complement(2847..3830)
                     /locus_tag="DHW76_06845"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_012103547.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="aspartate-semialdehyde dehydrogenase"
                     /protein_id="HCL50703.1"
                     /translation="MGVKVAVVGATGLVGRKILEVLQEKNFPIDKLYLFASQKSAGKT
                     MMFKDKEYTVEELKEDSFDRGIDIALFSAGASTSLKFAPIASQKGCIVVDNSSAWRMD
                     KNVPLVVPEVNPEDISWNKGIIANPNCSTIQAVVALKPLNDKFNIRRIVYSTYQAVSG
                     AGQVGYSDLVNGVKGEQPKKFPYPIAYNVLPHIDSFMENGYTKEEIKMVNETRKILHN
                     DSLRITATTARVPVFYGHCESINIEFEKSFDMKDIFSTLNSAPGVIVYDDVKNQIYPM
                     PITVEGKDEVYVGRIRRDESVDSGINIWVVADNIRKGAATNAVQIAQLLLK"
     gene            4075..4305
                     /locus_tag="DHW76_06850"
     CDS             4075..4305
                     /locus_tag="DHW76_06850"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_008907851.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="spore protein"
                     /protein_id="HCL50704.1"
                     /translation="MTTPFKKVVKAKIKAKKELSPVEKERELIKYEIAEELGLKDRVE
                     KYGWSGLTAEETGRIGGIMTKMNKDAGRRWND"
     gene            complement(4306..5259)
                     /locus_tag="DHW76_06855"
     CDS             complement(4306..5259)
                     /locus_tag="DHW76_06855"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_006316463.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="mannose-6-phosphate isomerase"
                     /protein_id="HCL50705.1"
                     /translation="MEPLFFDPIYKSIIWGGRNLERIFNRSLPEGKVAESWEISNHGK
                     DNSIITNGEFKGKSLNELLNKFGSKLVGKKCSSERFPLLIKLIDANDKLSVQVHPDNE
                     YAKIHDNDLGKTEMWYIIDAKPNAKLICGVKSGTTKKQFEEAIKNNTLNEYLNYVDVK
                     KGDTIFIPSGTVHAILDGIVIAEIQQNSDTTYRVYDWGRVGKDGKPRELHVEKALDVI
                     NFEYKGSVEKVSTKKEEGFEHTSLINCKFFNVDKIHVFNSYKDKTDGSTFFAYTCVEG
                     NGKLKYKDNYFDIKGGTSFLIPADNLEFEIAGNLQLLKSYI"
     gene            complement(5407..5664)
                     /locus_tag="DHW76_06860"
     CDS             complement(5407..5664)
                     /locus_tag="DHW76_06860"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_009242210.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="HPr family phosphocarrier protein"
                     /protein_id="HCL50706.1"
                     /translation="MQKFIYTITDPQGLHARPAGLLVKCAQKCTSSVQMAANGHSADA
                     KSILAVMGLGVTKNNEITFSVEGASEAKDAAALKDFCEKNL"
     gene            complement(5722..7161)
                     /locus_tag="DHW76_06865"
     CDS             complement(5722..7161)
                     /locus_tag="DHW76_06865"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_013484517.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
/codon_start=1 /transl_table=11 /product="PTS fructose transporter subunit IIBC" /protein_id="HCL50707.1" /translation="MKIVGVTKCPVGVAHTYLAAEKLEKAAKELQYEAKIEAQGSQGN ENELTPEDIASADYVIVAADVAIEGKERFNGKKVLVLPIKDVIKDANGILQSLPTRAH TYSSSKEVTKETEKEKKESSGDSAGKTAMKQLMNGVSYMIPFVVVGGLFIAISISIGG KPTAKGMVIPPGAWDKVNQIGGIGFNLMIPILAGYIAYAISGRAALAPAMISAVVANS KEILGTSAGTGFLGAIFVGYLTGYLVKWMNSWKIPRSLKPIMPIFVIPLLGTAAVSAV LILFLGAPISWLMTALNSALTFLSKDPVTAIPLGLLLGAMVAFDMGGPVNKVAFLFGT ASIVGGTPQIMGAVACAIPVPPLAMGLATLIDKKCFNEEERAAGIPALLMGLIGITEG AIPYAACDPKHVMPSIIVGSSVASALGMVLGITDIVPHGGPIVGFLGATNSLPLFLLT IAAGTVISTLMVIALKGIKQKKEFASKVV" BASE COUNT 2122 a 1415 c 1165 g 2463 t ORIGIN 1 ctaaaagttt acttgcaaac tcaaaagaga ttagattgat ttttgaaata gatgggaata 61 tataaaaagc tccatcaggc tttgcaactt caaagcccat gtttataagc ctattaaaaa 121 tataatccct tcttttttta tattcatttt tcatagctaa gggatcatca gagccatttt 181 tcactgcttc taaagcggca tactgactta tagatgaagc acaagtagca ttatactgat 241 gtactttaac aatgtgtttg gttaaataag aaggaccaag tataaagcca attctccagc 301 ctgtcattga atgagacttt gaaagcccat ttattacaat ggttttttct ttcattcctg 361 gaatcgatgc aatagaaaag tgatttctat catatacaag ttcactgtat atttcatctg 421 atagtacaaa aatttcccta tcccttaata catctgcaat tttttgaacc tcatcctttg 481 ttagtactgc tcctgttggg tttgaagggt aggggagtat tatacaccta gtttttgatg 541 ttatattttt tcctatatcc tcagctgtta ttttaaaatt gttttttctt gtatcaatat 601 atttaggaac agcaccacac attttaatta taggttcata tccaggataa actggacctg 661 gaagaattac ttccgagcct tcatccaata tagttcttag ggtactgtca atgccttcac 721 tggcaccatt agttactatt atttcattat aaggatcata atccatgcca tactttaatt 781 ttatatactt ggatatttcc ttccttaatt caagcaaacc agggttttca gtatattttg 841 ttttatttgc attaattgct ttaattcctg cctctttaac atgttcaggg gtattaaaat 901 caggttgacc taaggttaag gaaataacat caggatattt taatgtcata ttgtaaaatt 961 ttcttattcc cgaaatttga atttccttta ctcttttgtt tataagttcc tccatataaa 1021 caatcctttc ttttatccaa tggctacatc cgtaataccc tgcctaaggc gaggtattac 1081 ggatactaca cccatgggtt cttctaagat aaaagaccca caatggggtc ttaaatcaaa 1141 tcatccatac 
tatataattt agggttctta tcaataataa accttgcagc ttttatagca 1201 ccataaccaa aaacatctct tgaaatagca ctatgcttga tttcaattac atctcctgat 1261 cctaaaaata atacttcatg ttctccaaca actccgccgc ctcttacaga atgtattcca 1321 atttcatttt tatctctttt cgaagttttt gaatgtcttc cataaacata atttgaattg 1381 tttcctaaag ttttatttat agcgtcagca atcatgaggg cagttccgct tggagaatca 1441 agcttttgat tgtgatgctt ctcaattatt tctatatcat aatcagagcc taaaacttgt 1501 gcagcctgtt ttgatagaga aattaaaaga tttactccaa gagacatatt tgaggaatta 1561 aaaattggaa tttgttttga agcctctaac attttatttt tttcttcatc tgtaaaacca 1621 gtcgtacaaa tgactaaagg gactttttta cttaatccga aatttagtat attttcaatt 1681 gctgtatgat ttgaaaagtc aattaataaa tcaaattttt ccttaacctg ttttaaacta 1741 ttataaacag gataggaatt cttaaattta tttacatcgc tgtcaactcc tgcagcaata 1801 ttcatatcat cctcttcagg gatgagtctt gttaatacct gccccattct accattacat 1861 ccatgaagta aaatattcat actttcaccc ctaaagcttg attccataat taatcatttc 1921 ctttttaaga gcctcaaggt ttttctcaga catatccaca agcgggagtc taagatgtcc 1981 aacattcatt cccataaggt tcatggcagt tttaactggt attggattgg tttcaatgaa 2041 cagggcattc ataagcggta aaagtttaag ctgcatatca cgtgctcctt ttaagtcccc 2101 atctaaaaac ttttggcaca agtcgtgggt atcccttggc aatatatttg caagtactga 2161 aataacacct tttccgccta atgacattat tggaattgtt tcattgtcat tgccagagta 2221 tatatctaag ttatcaccgc aaagccttgc aatttcggaa atctgatcga tattagagct 2281 tgcttcctta acagcaacaa tatttttaat ttttgaaagc tcatacaagg tttgaggtaa 2341 aagattaagg cctgttcttc ctggtacatt gtaaattatg attggctttg tcacgctgtt 2401 tgcaattgcc ttaaagtgtt ctatgcagcc cctttgagta gtcttgttgt agtaaggcgt 2461 tattaatagg agggcgtcga ctccaatact ttcagcccac acactcattt caacagctgc 2521 ctttgtattg ttagagcctg ttcctgcaat aacaggaatt cttccattga ccttattaac 2581 tgtatattta atggtttctt ttttttcatc aagggacatt gtagaagatt ctcctgtagt 2641 accgcaaatt attatagcat cagttttttc ttttatatgc cattctaaaa gttccccaag 2701 cttttcaaag tcaactcctt ctttattaaa aggagtaaca attgcaacac cagaaccagt 2761 aaatacactc atatataacc atcctttctt aaattatatt tgttttatta tatgctttga 2821 gattaaatat ataaaactgc 
aagcaattat tttaatagta actgagctat ttgaactgca 2881 tttgtagcag cgccttttct tatgttatct gccactaccc aaatattaat tccactatca 2941 acgctttcat cacgtcttat tcttccaaca taaacttcat cttttccttc aacagtaatt 3001 ggcataggat aaatttgatt ttttacatca tcatatacaa taactccagg tgctgaattt 3061 aatgttgaaa aaatatcctt catatcaaag cttttttcaa attcaatatt gatggattca 3121 cagtgcccat aaaatacagg taccctggca gtggttgctg taattcttaa actatcatta 3181 tgaagtattt ttcttgtttc attaaccatt ttaatttctt ccttggtgta tccattttcc 3241 ataaatgaat ctatgtgtgg aagtacatta taggcaattg ggtaaggaaa ctttttgggc 3301 tgttctcctt ttacaccatt tacaaggtct gaatatccaa cttggcctgc tcctgaaaca 3361 gcttggtatg tggagtatac aatccttcta atattaaatt tatcatttag cggctttaag 3421 gctaccacag cttgaatagt agagcagtta ggatttgcaa ttataccctt attccagctt 3481 atatcctcag ggttaacttc aggtacaaca aggggaacat ttttatccat tctccatgca 3541 ctgctgttgt ctacaacgat acatcccttt tgtgatgcta ttggtgcaaa ttttaaactt 3601 gttgatgcgc ctgctgaaaa tagtgcaata tctattcctc tatcaaaaga atcctctttt 3661 agttcttcta ctgtatattc cttatcttta aacatcatag tttttcctgc ggatttttgt 3721 gaggcaaata agtaaagctt atcaatagga aagttctttt cttgtaaaac ttcaagaatc 3781 tttcttccaa ctaatccagt ggcgcctact actgcaactt taactcccat tttaactcct 3841 ccttattaat ataatttatt aataaatgtt taaaaaattt acattataat gagattttca 3901 agtatttatt tattgtaatg tattatagta ttatattgta ttatagcata aaaattttaa 3961 tttttataaa tgaatttgca ggactaaaac ttatcattaa gaaatattaa attactaaat 4021 aatcaatgaa taaaaaacat atatttacag ataataattt ttggtgatag taatatgaca 4081 actccattta aaaaagtggt aaaagcaaaa ataaaagcaa agaaggagct ctcacctgta 4141 gaaaaggaaa gagagctcat aaaatatgaa attgcagaag aattagggct taaggacagg 4201 gtggaaaaat acggctggag tggacttaca gcagaggaaa cgggcaggat tggcggaata 4261 atgacgaaaa tgaataagga tgcaggcaga aggtggaatg actagttaaa tataagattt 4321 taaaagctgt aagtttccag caatttcaaa ttctaaattg tccgctggaa ttaaaaagct 4381 tgttccacct tttatatcaa aataattatc cttatatttc agttttccat tgccttctac 4441 gcatgtatag gcaaaaaatg tgctgccatc tgttttatcc ttgtagctat taaatacatg 4501 tattttgtca acattaaaga atttacagtt 
tatgaggctt gtatgttcaa agccttcttc 4561 tttttttgtt gaaacttttt caactgagcc tttgtattca aaatttatta catccagtgc 4621 cttttctaca tgaagctccc tgggcttgcc atctttgcct acacgtcccc agtcataaac 4681 tctataggta gtatcactgt tttgctgtat ttctgctata actattccat ctaaaatagc 4741 atgaacagta ccagatggaa taaaaattgt atctcccttt ttaacatcca cataatttag 4801 atattcattt agtgtattgt tttttattgc ttcttcaaat tgtttttttg tagttccact 4861 ttttacacca caaataagtt tggcattagg tttagcatca atgatgtacc acatttctgt 4921 ttttcctaaa tcattgtcgt gtatttttgc atattcatta tcaggatgta cttgaacaga 4981 aagcttatcg tttgcatcaa ttaatttaat taaaagagga aatctttctg atgagcattt 5041 tttgccaaca agtttactgc caaatttatt aagaagttca ttgagacttt tacccttaaa 5101 ctcaccattt gtgattatgc tgttatcttt gccgtgattg cttatttccc agctttctgc 5161 taccttccct tcaggtaaac ttctattaaa tatcctttct aaatttctgc caccccatat 5221 tatactttta tatattggat caaaaaataa tggttccatg tttatctcct ttcaaaactt 5281 atgctttaat tataatctat cacattatca taagaataca aaaaagctac gcaaaaaatg 5341 cgcagctttt tcttcttttt attttagccg ttttcttttt tgttctgcaa aggggaaagc 5401 agttccttac aggttctttt cgcagaagtc cttcaatgct gcagcatcct tggcctcact 5461 ggcgccttcc acagaaaagg taatttcatt gtttttcgta acaccgaggc ccatgactgc 5521 taaaatactc tttgcgtctg cactgtgacc attggcagcc atctgtacag aagatgtaca 5581 cttctgtgcg catttaacta aaagtccagc tggacgagcg tgaaggccct gcggatcggt 5641 aatggtatag atgaatttct gcatatcaac cacttctatt ctataaatat aaaatttgtt 5701 tattaaaaat tgagctttca attagactac tttgcttgcg aattctttct tctgttttat 5761 gccttttagg gctataacca tcagcgtgga aataactgtg cccgcagcaa tcgttagtag 5821 gaacagtggt agactgttgg ttgcacccaa gaaaccgaca attggaccgc catgcggaac 5881 tatatctgta atgccaagaa ccatgcctag tgccgaagca acactgctgc caacgataat 5941 gctgggcatg acatgctttg ggtcacaggc agcatacgga atggcacctt cagtaatgcc 6001 aataagtccc atcagcagtg ctggaatgcc agcagcacgt tcttcttcat taaaacattt 6061 cttatcgatt agcgttgcca atcccattgc caatggaggc actggaatag cacacgcaac 6121 tgcacccata atctgcggtg tacctccaac aatggaggct gtaccaaata ggaatgcaac 6181 cttgttgact gggccaccca tatcgaaagc aaccatagca 
ccaagcagca gaccaagcgg 6241 aatggctgtt actggatctt tagacaggaa agtcagtgca ctgttcaggg cagtcatcag 6301 ccaggaaatt ggtgcaccaa ggaacagaat caatacggcc gaaactgctg ctgttcccaa 6361 aaggggaatt acaaaaattg gcataattgg tttcaggctc cttggtatct tccaactgtt 6421 catccatttg actagataac cggtcaggta gccaacaaaa atagcgccga ggaatccggt 6481 gccggcggat gtgccaagaa tttccttgct gtttgcaacc actgcagaaa tcatggccgg 6541 tgctagtgct gcacgaccag agattgcata agcaatatag cccgccagaa tcggaatcat 6601 cagattaaag ccaatgccac ctatctgatt tactttatcc caagcccccg gtggaatgac 6661 catgcccttt gctgtgggct taccgccgat ggaaatggat atagcgataa acaggccacc 6721 gacaacgacg aatggaatca tgtaggaaac accgttcatc agctgcttca tggctgtttt 6781 gcctgcactg tcgccagagg attctttctt ctccttttcg gtctccttgg tgacttcctt 6841 actgctgctg taagtatgtg cacgtgttgg cagcgattgc aaaattccat ttgcatcttt 6901 gataacatct ttaataggta aaaccagcac ttttttgccg ttaaaacgtt cttttccctc 6961 aatggcaaca tcggcggcaa caatgacgta atccgccgat gcaatatctt ccggtgtcag 7021 ttcgttttca tttccctgtg agccctgtgc ttcaattttc gcttcatact gcagttcctt 7081 tgctgctttc tccagctttt ctgctgccag ataggtatgg gcaacaccaa cagggcactt 7141 ggttacgcca acgattttca ctttg //