LOCUS DMQT01000184 3799 bp DNA linear ENV 05-SEP-2018 DEFINITION TPA_asm: Betaproteobacteria bacterium isolate UBA11351 contig_35832, whole genome shotgun sequence. ACCESSION DMQT01000184 DMQT01000000 VERSION DMQT01000184.1 DBLINK BioProject: PRJNA417962 BioSample: SAMN08018000 Sequence Read Archive: SRR6487983 KEYWORDS WGS; Third Party Data; TPA; TPA:assembly. SOURCE Betaproteobacteria bacterium (soil metagenome) ORGANISM Betaproteobacteria bacterium Bacteria; Proteobacteria; Betaproteobacteria. REFERENCE 1 (bases 1 to 3799) AUTHORS Parks,D.H., Chuvochina,M., Waite,D.W., Rinke,C., Skarshewski,A., Chaumeil,P.A. and Hugenholtz,P. TITLE A standardized bacterial taxonomy based on genome phylogeny substantially revises the tree of life JOURNAL Nat. Biotechnol. (2018) In press PUBMED 30148503 REMARK Publication Status: Available-Online prior to print REFERENCE 2 (bases 1 to 3799) AUTHORS Parks,D.H. TITLE Direct Submission JOURNAL Submitted (04-APR-2018) School of Chemistry and Molecular Biosciences, University of Queensland, Chemistry Bld, Cooper Road, St Lucia, Brisbane, Queensland 4072, Australia COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: CLC de novo assembler v. 4.4.1 Expected Final Version :: yes Genome Coverage :: 11.04x Sequencing Technology :: Illumina HiSeq 2000 ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 04/18/2018 21:02:13 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 2,938 CDS (total) :: 2,903 Genes (coding) :: 2,680 CDS (coding) :: 2,680 Genes (RNA) :: 35 tRNAs :: 32 ncRNAs :: 3 Pseudo Genes (total) :: 223 Pseudo Genes (ambiguous residues) :: 120 of 223 Pseudo Genes (frameshifted) :: 120 of 223 Pseudo Genes (incomplete) :: 43 of 223 Pseudo Genes (internal stop) :: 21 of 223 Pseudo Genes (multiple problems) :: 78 of 223 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..3799 /organism="Betaproteobacteria bacterium" /mol_type="genomic DNA" /isolate="UBA11351" /isolation_source="soil" /db_xref="taxon:1891241" /environmental_sample /note="metagenomic; derived from metagenome: soil metagenome" gene 230..1546 /locus_tag="DCQ77_07975" CDS 230..1546 /locus_tag="DCQ77_07975" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011872388.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="preprotein translocase" /protein_id="HAN56113.1" /translation="MARVRLTAGRIAGFSSEKGQAFLWDSEVKGLAVRAVAGSDKKAF VWQAKLAGNAVRITIGDVFKWDIDQARARARDFHVMVDKGIDPREVKRETIKALGQRR IDEARVKGTVLDAFEAYIENRKPRWGERHLGHHLKFIDAGGKKITRGRRTGQGEFTRP GVLYPLMNIPLSELDADAIAAWLKKESARAPTLAAQAYRALKTFMTWCAESKDYAFVD GTVCRTKKIKEMVPKVGTKEGDXXRPWFETVRIIGNPVISAYLQALLLIGSRREELAL LKWEDVDFQWNGMTIRDKVDGERTIPLTPYVSSLLSALPRRNEWVFSSTSHTTAPDGS GRKNGWHITEPRIAHVKALTAAGLPHVTLHGLRRSFGTLAEWVEVPVGIVAQIQGHKP SALAEKHYRRRPLDLLRMWHTKIEEWILFQAGIEFKPKKSGLHAVA" gene 1599..1877 /locus_tag="DCQ77_07980" CDS 1599..1877 /locus_tag="DCQ77_07980" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006746770.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="peptidase" /protein_id="HAN56114.1" /translation="MIKHFKHKGLEVFFFEGSKAGIQPTHAKRLRLQLAMLHSAKAPA DMGLPSWKLHPLEGKLKDHWAVWVNGNWRMTFTFDNGDAVLVDYQDYH" assembly_gap 1973..2008 /estimated_length=36 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene <2009..2241 /gene="higA" /locus_tag="DCQ77_07985" CDS <2009..2241 /gene="higA" /locus_tag="DCQ77_07985" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015834748.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=3 /transl_table=11 /product="addiction module antidote protein, HigA family" /protein_id="HAN56115.1" /translation="ITVTDFAARIGVTRVALSRVLNGRCGISADMAVRLVAALGGSAE SWLHMQANYELAQAEKALKREVAKIEPLNMAA" gene 2347..2625 /locus_tag="DCQ77_07990" CDS 2347..2625 /locus_tag="DCQ77_07990" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="HAN56116.1" /translation="MAICNSIPADVADQICGRLDAVANMLQSMRTLAIHAETECERDA ACVAVAGLCERTYLIIDSCIIKMGMAGLGNFRDDEWESDDLAAEGSAL" gene 2622..3026 /locus_tag="DCQ77_07995" CDS 2622..3026 /locus_tag="DCQ77_07995" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="HAN56117.1" /translation="MTTTINNATAVEKNPHFAHGNGIEGLCHNLQAYPQEWAGAIFTN AYNPQRLDAIASQASNVKMQALMGVRTVGAMLAVAAQSGELENINAMRAGFLVQFLGD VAEFMDTLEAHAQHDRDHGSLVNAELNCNPMD" gene 3035..3373 /locus_tag="DCQ77_08000" CDS 3035..3373 /locus_tag="DCQ77_08000" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015067360.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="addiction module toxin RelE" /protein_id="HAN56118.1" /translation="MLITVAETPEYIRLADKLLSPDERRDLISYLAAHPRAGDLIEGT GGVRKLRWARGGRGKSGGVRVIYYVHSEAMPLYLLTLFAKSERANLSKAERNDLAGLV DILVSRWLED" assembly_gap 3501..3547 /estimated_length=47 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene <3548..3696 /locus_tag="DCQ77_08005" CDS <3548..3696 /locus_tag="DCQ77_08005" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017019613.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=3 /transl_table=11 /product="transcriptional regulator" /protein_id="HAN56119.1" /translation="QEAFAARFGFSVATLRHWERGDRAPGGPALVLLNVIAKDHKAVM RALS" BASE COUNT 897 a 952 c 1088 g 777 t ORIGIN 1 cggataaagt cgtgcattta gatcaattca gatcaactag acgaattcac ttaattaaaa 61 aaaatgttgt aaatcaatga aatagtagaa atatctaaag atatgcgtgt tgcgttaagt 121 tgatccatat tctataatct ggttacatta ggggttacat tttaacctga cgccaagttt 181 gaagctgtca tggaaggatg taacccctgt aaccaggagg ctgatccgta tggccagagt 241 gagacttacc gctgggagaa tcgctggatt ttcgtctgaa aaagggcagg ccttcctgtg 301 ggattccgaa gtcaagggac tcgccgtgcg tgccgttgcc gggtctgaca agaaagcttt 361 tgtctggcag gcgaaattgg ctggcaatgc ggtacgtatc accattggcg atgtgttcaa 421 gtgggacatt gaccaggcgc gcgcccgggc gcgggatttc catgtgatgg tcgacaaagg 481 catcgatccg cgtgaggtca aacgggaaac catcaaagca ctggggcaaa ggcgcatcga 541 tgaagcccga gtcaagggca ccgtgctgga tgccttcgag gcgtatatcg agaatcgaaa 601 gccccgctgg ggcgagaggc atctggggca ccacctgaag tttatcgatg ccggcggtaa 661 aaagatcacg cgcggacgcc gcacgggaca gggcgaattc acccggcccg gtgtgctcta 721 cccgctgatg aatataccgc tctccgaact ggatgccgac gctattgccg cctggctgaa 781 gaaggaatcc gcgcgggcgc cgacattggc agcgcaagcc tatcgggcac tgaaaacatt 841 catgacctgg tgcgctgaat caaaggacta tgcctttgtc gacggcacgg tctgccgcac 901 caagaaaatc aaggaaatgg tgccgaaggt tggcaccaag gaaggtgact gnntgcgacc 961 gtggtttgaa accgtgagaa taattggcaa tcctgtaatc agcgcctacc tgcaggcgct 1021 gctgttgatc ggttcgcgtc gcgaggagtt ggccctgctg aaatgggaag acgttgattt 1081 ccaatggaac ggcatgacca tccgtgacaa agttgacggc gagcgcacca tcccgctaac 1141 gccctacgtt tcgagtttgc tgtcggcgct gccgcgccgc aatgaatggg tgttctcctc 1201 caccagccat accactgcac cggatggaag cggccgcaag aatggatggc acatcaccga 1261 accgcgaata gcccatgtca aagcactgac tgccgctggc ctgccgcatg tcaccctgca 1321 cggcctgcgt cggtcattcg gcaccttggc cgaatgggtt gaggttccgg tcggcatcgt 1381 ggcacagata caaggccaca agcccagtgc tctcgctgag aagcactacc ggcggcgtcc 1441 gcttgatctg ctgcgcatgt ggcacacgaa gatcgaggaa tggattttgt tccaggctgg 1501 cattgaattc aagccgaaga aatccggcct gcatgctgtg gcttaagctc tcattaaaat 1561 aaattgacaa gcgtaaatat acgctataca ttaacaacgt gatcaagcat ttcaaacaca 1621 aaggactgga agtgttcttc ttcgaaggaa gcaaggctgg tattcaaccg acacatgcca 1681 agcgcctgcg gctgcaactg gccatgttgc attcggccaa ggcacctgcg gacatgggac 1741 ttccgagctg gaaattgcac ccgcttgaag gaaagctaaa ggatcactgg gcggtgtggg 1801 tcaatggcaa ctggcgtatg accttcactt tcgataatgg cgacgcagtt ctggttgatt 1861 atcaggacta ccactaaaag aggactcact ccatggccag aatgtataac catccccacc 1921 cgggcgaagt cctacgggac ggtgttttca ctgacaccgg tatcaccgtt acnnnnnnnn 1981 nnnnnnnnnn nnnnnnnnnn nnnnnnnngt atcaccgtta cagactttgc tgcgcggatc 2041 ggcgtcactc gtgtagcgct ctcgcgcgtg ctgaatggca ggtgcggcat cagcgcagac 2101 atggcggtgc ggctcgttgc cgctcttggc ggtagcgcag aatcctggct gcatatgcag 2161 gcgaattatg agttggcaca ggccgaaaaa gcgctaaaac gagaagttgc aaagatcgag 2221 ccgttgaata tggcagcata gtttttaccc gtaccggacg cgggctatcg tcgggaaagc 2281 ggcgggcaag gtgttacaag caccaaaccc gcctgaccaa gaagcaccta ccaaggagat 2341 gcaatcatgg ctatttgtaa ttctatcccc gccgatgtcg cagatcagat ttgcggaaga 2401 ctcgatgccg tggcgaacat gctgcagtca atgcggaccc tcgccatcca tgcagaaaca 2461 gagtgcgaga gagatgctgc ctgcgttgcg gttgcaggcc tgtgcgagcg cacgtacctg 2521 attattgatt cctgcatcat aaaaatgggg atggctgggc tcggtaactt ccgtgacgac 2581 gagtgggaga gtgatgatct agcggcagaa gggagcgcat tatgaccaca acgatcaaca 2641 acgcgactgc agtcgaaaag aatccgcact tcgcacatgg caatggtatc gaaggccttt 2701 gccacaatct gcaggcctat cctcaggaat gggccggggc catattcact aacgcctata 2761 acccgcaaag attggacgcg atcgcatcgc aggcatcgaa tgtaaagatg caggcattga 2821 tgggagtgcg tacggttggt gccatgcttg ccgtggcagc gcagtctggc gaacttgaga 2881 atatcaacgc gatgagggcc ggctttctcg tgcaatttct tggagatgta gccgagttta 2941 tggatacatt ggaggcccat gctcagcacg accgtgatca tggctccctc gtcaatgctg 3001 agctgaattg taatccaatg gattagacgc tatcatgctc attaccgtcg ccgaaacccc 3061 ggaatacatt cgcctggcgg acaagttgct gtcacccgac gagcgccgcg atctgatttc 3121 ctacctcgcg gcacatccaa gggccggcga tctgatcgaa ggcaccggag gcgtccgcaa 3181 gctgcgctgg gcgcgtggag gccggggcaa aagcggcggt gtgagagtga tctactacgt 3241 ccatagcgag gcaatgccgc tctatctgct gacactcttt gccaagagtg aacgggcgaa 3301 tctcagcaag gcagagcgca acgatctggc gggtctggtg gatatcctgg tatcgagatg 3361 gttggaggat tgaaacatga gcaaggcatt cgagagtatc aagcaagggc tgaccgaggc 3421 gatagcacat gcccgtggtg acaagcacac gggcgttttc gtctatcgtc ccgaatcggt 3481 cgatgtgaag gcgctgcggc nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3541 nnnnnnncac aggaggcgtt cgccgctcgc ttcgggtttt ccgtggccac gctgcggcat 3601 tgggagcgcg gtgatcgcgc ccccggcggc ccggcgctgg tgttgctgaa tgtgattgcc 3661 aaggatcaca aggcagtcat gcgggctttg tcatagagag cgcggtacac caaacacggg 3721 atgacttaaa caatgacgac taaaccacac ccgtatctgt catgcgctga atgggtgcca 3781 atatctgcca agctggttt //