LOCUS DOON01000179 3963 bp DNA linear ENV 07-SEP-2018 DEFINITION TPA_asm: Clostridiales bacterium isolate UBA8797 contig_6088, whole genome shotgun sequence. ACCESSION DOON01000179 DOON01000000 VERSION DOON01000179.1 DBLINK BioProject: PRJNA417962 BioSample: SAMN08019306 Sequence Read Archive: SRR6486589 KEYWORDS WGS; Third Party Data; TPA; TPA:assembly. SOURCE Clostridiales bacterium (terrestrial metagenome) ORGANISM Clostridiales bacterium Bacteria; Firmicutes; Clostridia; Clostridiales. REFERENCE 1 (bases 1 to 3963) AUTHORS Parks,D.H., Chuvochina,M., Waite,D.W., Rinke,C., Skarshewski,A., Chaumeil,P.A. and Hugenholtz,P. TITLE A standardized bacterial taxonomy based on genome phylogeny substantially revises the tree of life JOURNAL Nat. Biotechnol. (2018) In press PUBMED 30148503 REMARK Publication Status: Available-Online prior to print REFERENCE 2 (bases 1 to 3963) AUTHORS Parks,D.H. TITLE Direct Submission JOURNAL Submitted (04-APR-2018) School of Chemistry and Molecular Biosciences, University of Queensland, Chemistry Bld, Cooper Road, St Lucia, Brisbane, Queensland 4072, Australia COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: CLC de novo assembler v. 4.4.1 Expected Final Version :: yes Genome Coverage :: 5.45x Sequencing Technology :: Illumina HiSeq 2500 ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 05/01/2018 07:07:15 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 2,848 CDS (total) :: 2,838 Genes (coding) :: 2,465 CDS (coding) :: 2,465 Genes (RNA) :: 10 tRNAs :: 8 ncRNAs :: 2 Pseudo Genes (total) :: 373 Pseudo Genes (ambiguous residues) :: 68 of 373 Pseudo Genes (frameshifted) :: 285 of 373 Pseudo Genes (incomplete) :: 30 of 373 Pseudo Genes (internal stop) :: 42 of 373 Pseudo Genes (multiple problems) :: 50 of 373 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..3963 /organism="Clostridiales bacterium" /mol_type="genomic DNA" /isolate="UBA8797" /isolation_source="terrestrial" /db_xref="taxon:1898207" /environmental_sample /note="metagenomic; derived from metagenome: terrestrial metagenome" gene complement(<1..703) /locus_tag="DEF04_07765" CDS complement(<1..703) /locus_tag="DEF04_07765" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019227055.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="D-alanyl-D-alanine carboxypeptidase" /protein_id="HBV68078.1" /translation="MILFCAALLCLCPFLDVAEAASIPDVPAQSYIVVDSDSGRIIGS RNPKEKLPIASTTKILTTILAIENIDDFERKIEVPDSCVGIEGSSIYLRAKQRVSIKD LLYGTMLRSGNDAAETLAAYAGRNSDEGFVFMMNEKAAELGAYDSNFMNPHGLSDENH YSTAYDLAIISRHAMKNKMFKEISSAEKYIAESMNTIFYNKNKVVYQYEYGNGIKIGY TKAAGRCLVASAEKDG" gene complement(714..1094) /locus_tag="DEF04_07770" CDS complement(714..1094) /locus_tag="DEF04_07770" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019227056.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="HBV68079.1" /translation="MSRDIESIIKTTLDNINNLTENEKLISSRISYDSTNYLAISKLN MNFVIGGSDIDKKFRTVDLKPFAGAAYVNLQLNPQVLIYEDRNRDVRTININNETVAS DITSSILNIMSFIKEVREKKDCDY" gene complement(1222..1443) /locus_tag="DEF04_07775" CDS complement(1222..1443) /locus_tag="DEF04_07775" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="HBV68080.1" /translation="MGVLIILFIVGGTIFIYFNARIICKINLSXNAGQIQQLRETEKQ KILSLLEASKKDCTLFYGQKHILISRMYR" gene complement(1447..1977) /gene="scpB" /locus_tag="DEF04_07780" CDS complement(1447..1977) /gene="scpB" /locus_tag="DEF04_07780" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019227058.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="SMC-Scp complex subunit ScpB" /protein_id="HBV68081.1" /translation="MNLKSVLESLLFAWGEPLNISEISRILNMPAHNLTAVLDEMAEK FDEDKERGLIIQKFGSSYQITTKKENYEFIQSLLQTTINKSLSTAAMETLSIIAYKQP VTRVEIELIRGVKCSNVVKGLLDKQLIKEVGKLDKPGRPTLYATTDEFLRHFGLNSIN DLPALNVKTEEEQKVI" gene complement(1987..2703) /locus_tag="DEF04_07785" CDS complement(1987..2703) /locus_tag="DEF04_07785" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019227059.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="HBV68082.1" /translation="MYSISLEIFQGPFDLLYHLIEKKEIDIYDIPIAEITDQYMEYLD QMIQFNMNVASEFILMASTLIEIKSQMLLPLKEKEEDPRQELVNKLLEYKLFKEVSEQ LKKYEDECCYYYSKPKEEVALDSDVKTEQLSLNEINIYELYNVFMSLIKNQNLKIVNE EKLKVYRENYSVKDCVDELVKKLKSRGRVSLFETFREKDKITKEYVITTFLAVLELSN KQGMKIYQTDIYSDIIIIAA" gene complement(2714..3817) /locus_tag="DEF04_07790" CDS complement(2714..3817) /locus_tag="DEF04_07790" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019227060.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="D-alanyl-D-alanine carboxypeptidase" /protein_id="HBV68083.1" /translation="MLIVNSMFVPVFGIEAESLQSRAAILINGEDGQVLFEKNANDKM QPASITKIMLLLLISEKMANGEIALDNELTVSEHASSMGGSQIYLEANETQTVENMLK AISMRSANDASVAMAXSCVKAMNDRAKELGMNDTHFVNVTGLPEPEHLTTAHDIGIMT QELLKYNYVNQYMLTWMDSVYVGKKKDSEQVLVNTNRLINNYDGLIGGKTGYTTEAKY CLSAAAKRNDTMLIAVVLGCDDTKIRFNEITKLLNEGFANYKNLVFHKKGDVITEAPV YCGKNDTFNVVSKENIAYFTESNCKTEDFNLEYIIDKNLKAPLNADVSIGKAIMSKNG EILGEFELFPESNVEKEGLFKFYFGNIVESIIK" BASE COUNT 1084 a 818 c 610 g 1445 t ORIGIN 1 ttccgtcttt ttctgcagag gctacaaggc atctaccggc cgcctttgta tagccgattt 61 ttattccatt cccgtattca tactggtaaa caaccttatt tttattgtaa aatattgtgt 121 tcatactttc ggcaatgtat ttttctgctg aggaaatttc tttaaacatt ttatttttca 181 ttgcatggcg gcttataata gctaaatcat aagctgttga gtagtgattt tcatcactca 241 ggccatgagg attcatgaaa ttagagtcat atgctccaag ctctgccgcc ttttcattca 301 tcataaatac aaatccttca tcgctgtttc tgcctgcgta tgcggcaagt gtttctgcag 361 catcgtttcc tgacctgagc attgtaccgt acagcaaatc cttaatcgac actctttgct 421 ttgctctcaa atagatgctt gagccttcta ttcccacgca tgaatcagga acctctattt 481 ttctttcgaa atcatctatg ttttctatgg ccaaaattgt agtcagtatt tttgttgtgc 541 ttgctattgg cagcttttct tttggatttc ttgaaccgat tattctgccg gaatcagaat 601 caacaacaat ataagactgt gccggcacat ccggaatgga agctgcctcc gcaacatcaa 661 ggaatggaca aaggcacagc aatgccgcac aaaataaaat aattttagag tatttaataa 721 tcacagtcct ttttctccct tacttccttt atgaatgaca ttatgttcag aatggatgat 781 gttatatcac ttgcaacggt ttcgttgtta atgttgattg tccttacgtc tctgtttctg 841 tcttcataaa tgagcacttg aggattcagc tgaagattta catatgccgc tccggcgaaa 901 ggcttcaagt caactgttct gaattttttg tctatgtcgc tgcctcctat tacaaagttc 961 atgttcaatt ttgatatggc gagatagttg gtgctgtcat aggaaattct gctgctgatg 1021 agcttttcgt tttcggttaa attattgata ttatccaagg ttgtcttaat gatgctttct 1081 atgtctctgc tcataatatc ccctttctta agttttcaca tagtatgccc tgaaaaagga 1141 taaatataac actatttatt tatgagaggc cgcttaataa tattgtttac aaccatgaat 1201 tcaaccgcaa aagaggaata gctaccgata cattcgggat ataaggatat gtttttgacc 1261 ataaaaaagc gtgcaatctt tcttagatgc ctcaagtaag gatagtatct tttgtttctc 1321 agtttcacgt agctgctgta tctgtccggc gttgnntgac aaattaattt tacatataat 1381 tcgtgcattg aaatatatga atatggtccc cccgacgata aataatataa tcaatactcc 1441 catttgctat attacttttt gttcctcctc agtctttaca ttcaaggcag gaaggtcatt 1501 tattgagttt aagccgaaat gccttaaaaa ttcatccgtt gtggcataaa gtgtaggtct 1561 tccgggttta tccagcttgc ctacctcctt gataagctgc ttgtctaaga ggcccttaac 1621 cacatttgag catttcactc ctcgtatcag ttctatttca acccttgtaa caggttgctt 1681 gtacgctatg attgacagtg tctccattgc agctgtggac agacttttat ttattgtggt 1741 ctgcaagaga ctttgaataa attcataatt ttctttttta gttgtaattt gatatgagct 1801 gccaaacttt tgaataatca gccctctttc cttgtcttca tcaaactttt cagccatctc 1861 atccagcact gcagtcagat tatgtgcagg catatttaaa attctggaaa tttcgcttat 1921 gttcaaaggc tcaccccagg caaacagcag gctttccaat acacttttca aattcattat 1981 ttactcctac gcagcaatta ttattatatc gctgtatata tctgtctgat agattttcat 2041 gccctgctta ttagagagct ccaatacagc taaaaatgtt gttatgacat attcctttgt 2101 aatcttgtct ttttctctga atgtttcaaa cagtgataca cggccccttg acttcagttt 2161 cttaacaagc tcatcaacac agtccttgac gctgtaattc tctctgtaca ccttaagttt 2221 ttcttcattg acaatcttta aattttgatt tttaataagc gacatgaaga cattgtacaa 2281 ttcatatata tttatttcat tcaatgaaag ctgctctgtc ttaacgtctg agtcaagggc 2341 cacttcttcc tttggcttgg aataatagta acagcattca tcctcgtatt tttttaactg 2401 ttcggatact tctttaaata atttgtattc caaaagctta ttgaccagtt cctgccttgg 2461 atcttcttcc ttctccttaa gtggcagcag catttgagat ttaatttcaa tcagcgttga 2521 tgccatcaaa ataaattcgc ttgccacatt catgttgaat tgaatcatct gatccagata 2581 ctccatatac tgatctgtta tttcagctat gggtatatca tatatgtcaa tttccttttt 2641 ctcaatcaga tgatacaata agtcaaaagg cccctggaaa atttctaaac ttatggaata 2701 catttttatc ctcttatttt attatacttt ctactatgtt accaaaataa aacttgaaaa 2761 gtccttcttt ttcaacgtta ctttcaggaa agagctcaaa ttctccaaga atttcaccgt 2821 tttttgacat gattgctttt cctattgaaa catctgcatt taaaggagct tttaaatttt 2881 tatctattat atattctaaa ttaaaatcct cagtcttgca attactttct gtgaaatatg 2941 caatattttc tttacttacc acattaaagg tatcattttt gccgcaatat acgggagcct 3001 cagtaataac atcacctttt ttatggaata caaggttttt ataatttgca aagccttcat 3061 tcaacagctt tgtaatctca ttaaatctta tctttgtatc atcgcatccc aacacaacag 3121 ctatcagcat tgtgtcattt ctttttgctg cagctgaaag gcagtatttc gcttctgttg 3181 tgtatcctgt ttttcctcct atcagaccgt cgtaattatt tatcagcctg ttggtgttta 3241 ccagcacctg ttcggagtct tttttcttac ctacatacac tgaatccatc catgtaagca 3301 tatattggtt tacataatta tatttcagta actcttgtgt cattatccca atatcatgtg 3361 ccgttgtcag atgttcaggc tcaggaagac ccgtaacatt tacaaagtgt gtatcattca 3421 tccccagttc ctttgctctg tcattcattg cttttacaca actnncagcc attgcaactg 3481 aagcgtcatt tgctgaacgc atagaaattg ctttaagcat attttccact gtctgggttt 3541 cattagcctc aagatatatc tggcttcctc ccatgcttga tgcatgttct gacacagtca 3601 gttcattgtc aagagctatt tcaccgtttg ccatcttttc tgaaatcaac aaaagcaaca 3661 taattttcgt tatgcttgca ggctgcattt tgtcgtttgc atttttttca aaaagaacct 3721 gaccatcctc accatttatc aatattgctg cccttgactg cagactttcg gcctctattc 3781 caaaaaccgg aacaaacatt gaatttacaa ttaacatcac taaaaaaaat atnncatgaa 3841 atgcaagaca actgttatac tatatagaca atagcttcaa attttatggt gatttttccc 3901 gtgaaatgtc gaaaaaatta ataattagtc tatttccaag aaaattatac aattatattt 3961 ccc //