LOCUS       DOXH01000173            3950 bp    DNA     linear   ENV 10-SEP-2018
DEFINITION  TPA_asm: Acidimicrobiaceae bacterium isolate UBA11731 contig_90134,
            whole genome shotgun sequence.
ACCESSION   DOXH01000173 DOXH01000000
VERSION     DOXH01000173.1
DBLINK      BioProject: PRJNA417962
            BioSample: SAMN08019719
            Sequence Read Archive: SRR6488171
KEYWORDS    WGS; Third Party Data; TPA; TPA:assembly.
SOURCE      Acidimicrobiaceae bacterium (Neamphius huxleyi metagenome)
  ORGANISM  Acidimicrobiaceae bacterium
            Bacteria; Actinobacteria; Acidimicrobiia; Acidimicrobiales;
            Acidimicrobiaceae.
REFERENCE   1  (bases 1 to 3950)
  AUTHORS   Parks,D.H., Chuvochina,M., Waite,D.W., Rinke,C., Skarshewski,A.,
            Chaumeil,P.A. and Hugenholtz,P.
  TITLE     A standardized bacterial taxonomy based on genome phylogeny
            substantially revises the tree of life
  JOURNAL   Nat. Biotechnol. (2018) In press
   PUBMED   30148503
  REMARK    Publication Status: Available-Online prior to print
REFERENCE   2  (bases 1 to 3950)
  AUTHORS   Parks,D.H.
  TITLE     Direct Submission
  JOURNAL   Submitted (04-APR-2018) School of Chemistry and Molecular
            Biosciences, University of Queensland, Chemistry Bld, Cooper Road,
            St Lucia, Brisbane, Queensland 4072, Australia
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method        :: CLC de novo assembler v. 4.4.1
            Expected Final Version :: yes
            Genome Coverage        :: 47.07x
            Sequencing Technology  :: Illumina HiSeq 2000
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 05/03/2018 14:27:06
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS+
            Annotation Software revision      :: 4.5
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 1,540
            CDS (total)                       :: 1,517
            Genes (coding)                    :: 1,398
            CDS (coding)                      :: 1,398
            Genes (RNA)                       :: 23
            tRNAs                             :: 21
            ncRNAs                            :: 2
            Pseudo Genes (total)              :: 119
            Pseudo Genes (ambiguous residues) :: 71 of 119
            Pseudo Genes (frameshifted)       :: 45 of 119
            Pseudo Genes (incomplete)         :: 30 of 119
            Pseudo Genes (internal stop)      :: 2 of 119
            Pseudo Genes (multiple problems)  :: 29 of 119
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..3950
                     /organism="Acidimicrobiaceae bacterium"
                     /mol_type="genomic DNA"
                     /isolate="UBA11731"
                     /isolation_source="metagenome"
                     /db_xref="taxon:2024894"
                     /environmental_sample
                     /note="metagenomic; derived from metagenome: Neamphius
                     huxleyi metagenome"
     gene            5..>375
                     /locus_tag="DEP69_06020"
     CDS             5..>375
                     /locus_tag="DEP69_06020"
                     /inference="COORDINATES: protein motif:HMM:PF11716.6"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="TIGR03084 family protein"
                     /protein_id="HCB34697.1"
                     /translation="MDAICDDLLAETDALAHVLADRTDDEWRAPTPAQGWDSRDTVVH
                     LGMTDWVATLATADPDEFEATKAGMAAGEADLHTAAGFDFESMSGADLWAWFDSRRTT
                     MVAAFRRVGPRDRIPWFGPDMG"
     assembly_gap    376..422
                     /estimated_length=47
                     /gap_type="within scaffold"
                     /linkage_evidence="paired-ends"
     gene            426..>693
                     /locus_tag="DEP69_06025"
     CDS             426..>693
                     /locus_tag="DEP69_06025"
                     /inference="COORDINATES: protein motif:HMM:TIGR03083"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="HCB34698.1"
                     /translation="METWSHGHDVADTFGSPYPRTARLRGVAHIGVGTRGWSYVNHGM
                     AVPDGEVAVALTAPDGDTWTWGDQSAADRVSGSAYDFCLAVTQRR"
     assembly_gap    694..792
                     /estimated_length=99
                     /gap_type="within scaffold"
                     /linkage_evidence="paired-ends"
     gene            822..1445
                     /locus_tag="DEP69_06030"
     CDS             822..1445
                     /locus_tag="DEP69_06030"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_007492289.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="dephospho-CoA kinase"
                     /protein_id="HCB34699.1"
                     /translation="MLVVGLTGGIGAGKSTVAALLARHGAAVVDVDALGREIVAAGGP
                     ATEAVVARFGEGVRAADGGVDRPALAAIVFDDPAALADLNAVSHPVLDRLIDERLDEI
                     AAAGRVAVLDMAVLAESTLGRGNRRPYEVVVVVEAPADVRLARLVERGMTAADATARM
                     GAQATDEQRRALADFVLGNGGTPADLEPTVARMWEQLRALALGREGA"
     gene            1400..>2354
                     /locus_tag="DEP69_06035"
     CDS             1400..>2354
                     /locus_tag="DEP69_06035"
                     /inference="COORDINATES: protein motif:HMM:TIGR00369"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="HCB34700.1"
                     /translation="MGAAAGAGPGPGGSVSAGGREYPPQDHVLRHLRLSHRHDADGSV
                     TAAMPVLPDLCDADGRLRLGAVATLVDSVAGRHSVARVAPDWVATMHLGIVLTARAEG
                     DFVTAACTPVRVGRNHVVAEVAVSDAAGTPVARSTCTYVRLGRRADTPEAGTSTRSRA
                     VDYREEKEVSPRPPLDEYLRISPVPGRPLIELPLHRRIVNSFGSLQGGVSVVVAEVMA
                     EHMAAGNLDGAGPAGRPLSTAAGRPRCTAADVHYLAPARIGPIRATGETLTAGAHSTT
                     VRVRCFDVGNADRLIQLATATAEFPAVGFPAAAALAPNLRPR"
     assembly_gap    2355..2386
                     /estimated_length=32
                     /gap_type="within scaffold"
                     /linkage_evidence="paired-ends"
     gene            2392..3267
                     /locus_tag="DEP69_06040"
     CDS             2392..3267
                     /locus_tag="DEP69_06040"
                     /inference="COORDINATES: protein motif:HMM:PF08282.10"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="HCB34701.1"
                     /translation="MLAPRLLALDIDGTLLRDDGTISARTVRALGAARDRGLLLSAAT
                     GRPWPATSRIVAEVGGMDYAVCLNGTVVMDARTDTIVDTNEMTVAQACDTARLAREHL
                     PTVRLAADLADGRHLWDHDFDTQMPMAIDVERVDCAEEAVAACGVPVLTWLLELEVAG
                     GRRRPFTAELRDAERIIDALDGRLDPALDVHHSGIGPAEVSLLGISKATGLASVAERH
                     GISPAEVMAIGDGYNDVEMLGWAGISVAMGNAPAEVQGAAEFVALSNADDGAAVFVEG
                     LLARSAMGDTPAAAG"
     gene            3264..3632
                     /locus_tag="DEP69_06045"
     CDS             3264..3632
                     /locus_tag="DEP69_06045"
                     /inference="COORDINATES: ab initio prediction:GeneMarkS+"
                     /note="Derived by automated computational analysis using
                     gene prediction method: GeneMarkS+."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="HCB34702.1"
                     /translation="MTEDEIPPEGRSDLVLTRVKHYRDTIRSYTVLVDGEFVGRIREG
                     QEETFTLRPGRCTVRLKLLWIYSPKVEVDLPPAGQTRMVCGPNGGILQAWRLFVAPFT
                     AIFLRRETAGADGEQPHPRH"
     gene            3687..>3950
                     /locus_tag="DEP69_06050"
     CDS             3687..>3950
                     /locus_tag="DEP69_06050"
                     /inference="COORDINATES: ab initio prediction:GeneMarkS+"
                     /note="Derived by automated computational analysis using
                     gene prediction method: GeneMarkS+."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="HCB34703.1"
                     /translation="MTPDDPAVVEPDPGALGFGTEQQALLERESSLQPRRWLTFALVV
                     VALAVVALLIGLARGSDEEGETAGADAAAPDPAAGDLTGDSAAA"
BASE COUNT      473 a   1501 c   1348 g    450 t
ORIGIN
        1 ccccgtggac gccatctgcg acgacctcct cgccgagacc gacgcactcg cccacgtcct
       61 cgccgaccgc accgacgacg agtggcgcgc acccaccccg gcccagggct gggacagccg
      121 cgacaccgtc gtccacctcg gcatgaccga ctgggtcgcc accctcgcca ccgccgaccc
      181 cgacgagttc gaggcgacca aggccggcat ggccgccggc gaggccgacc tgcacaccgc
      241 cgccgggttc gacttcgagt cgatgtccgg cgccgacctc tgggcctggt tcgacagccg
      301 gcgcacgacg atggtcgccg cgttccgccg ggtcggcccc cgcgaccgga tcccctggtt
      361 cggccccgac atgggnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      421 nnctgatgga gacctggtcg cacggccacg acgtcgccga caccttcggc tcgccctacc
      481 cgcgcaccgc ccgcctgcgc ggcgtcgccc acatcggcgt cggcaccagg ggatggagct
      541 acgtcaacca cggcatggcc gttcccgacg gcgaggtcgc cgtggccctc accgcgcccg
      601 acggcgacac ctggacgtgg ggcgaccagt cggccgccga ccgcgtcagc ggcagcgcct
      661 acgacttctg cctggccgtg acacagcgcc gccnnnnnnn nnnnnnnnnn nnnnnnnnnn
      721 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
      781 nnnnnnnnnn nncggctgac ggcgcccgcc gcagccctgc gatgctggtc gtcggtctga
      841 ccggcggcat cggcgcgggc aagtcgacgg tcgcggcgct gctggcgcgc cacggcgcgg
      901 ccgtcgtcga cgtggacgcg ctgggccggg agatcgtcgc cgccggcggg ccggcgaccg
      961 aagcggtggt ggcgcgcttc ggcgagggcg tccgcgccgc cgacggcggc gtcgaccggc
     1021 cggcgctggc ggcgatcgtg ttcgacgacc cggcggcgct ggccgacctc aacgccgtca
     1081 gccaccccgt actcgaccgg ctgatcgacg agcgcctgga cgagatcgcg gccgccggcc
     1141 gcgtcgccgt gctcgacatg gccgtcctgg ccgagagcac cctcggccgc ggcaaccggc
     1201 gtccctacga ggtggtggtc gtcgtagagg cacccgcgga cgtccgtctc gcccgcctgg
     1261 tcgagcgtgg catgaccgcc gccgacgcca cggcgcgcat gggcgcccag gccaccgacg
     1321 agcagcgccg cgccctggcc gacttcgtgc tcggcaacgg cggcacgccc gccgacctcg
     1381 agccgacggt cgcccggatg tgggagcagc tgcgggcgct ggccctgggc cgggagggag
     1441 cgtgagcgcc ggcggccgcg agtacccgcc gcaggaccac gtcctgcgcc acctgcgcct
     1501 gagccaccgc cacgacgccg acggctcggt caccgcggcc atgcccgtcc tgcccgacct
     1561 ctgcgacgcc gacggacgcc tgcgcctcgg cgccgtcgcc accctcgtcg actccgtcgc
     1621 cggccggcac agcgtcgccc gggtggcgcc cgactgggtc gcgacgatgc acctcggcat
     1681 cgtgctcacc gcccgcgccg agggcgactt cgtgacggct gcgtgcacgc ccgtccgcgt
     1741 cggccgcaac cacgtcgtcg ccgaggtcgc cgtcagcgac gccgccggca cgccggtcgc
     1801 ccgctcgacg tgcacctacg tccgcctggg gcgccgcgcc gacacgccgg aagccgggac
     1861 gtcgacgcgc agccgggcgg tcgactaccg cgaggagaaa gaggtcagcc cgcgcccgcc
     1921 gctcgacgag tacctgcgga tatcgcccgt gcccggccgg ccgctgatcg agctgccgct
     1981 gcaccgccgc atcgtgaact cgttcggctc gctgcagggc ggcgtctcgg tcgtcgtcgc
     2041 cgaggtgatg gccgagcaca tggccgccgg gaacctcgac ggagccggcc ccgccggccg
     2101 gccgctcagc acggccgccg gccggccgcg ctgcacggcg gccgacgtcc actacctcgc
     2161 gccggcacgg atcggcccca tccgggccac cggcgagacg ctgaccgccg gcgcgcactc
     2221 gacgacggtg cgcgtgcggt gcttcgacgt cggcaacgcc gaccggctca tccagctggc
     2281 gaccgccacc gctgagttcc ccgccgtcgg gttccccgcc gccgcggcac tcgcgccgaa
     2341 tctccggccg cgtgnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnncgcg cctgcttgcc
     2401 ccgcgcctgc ttgcgctcga catcgacggc accctgctgc gcgacgacgg gacgatctcc
     2461 gcgcgcaccg tccgggcgct cggcgcagca cgcgaccgcg gcctgctcct gtccgccgcc
     2521 accggccggc cgtggccggc cacgtcccgg atcgtcgccg aagtcggcgg catggactac
     2581 gccgtctgcc tcaacggcac cgtggtgatg gacgccagga ccgacacgat cgtcgacacg
     2641 aacgagatga ccgtcgcgca ggcctgcgac accgcccgtc tggcccgcga gcatctgccg
     2701 acggtgcgcc tggccgccga cctggccgac ggccgccacc tgtgggacca cgacttcgac
     2761 acgcagatgc cgatggcgat cgacgtcgag cgcgtcgact gcgccgagga ggcggtcgcc
     2821 gcctgcggcg tgcccgtgct gacgtggctg ctggagctcg aggtcgccgg cggccgccgc
     2881 cggccgttca cggcggagct gcgcgacgcc gagcgcatca tcgacgccct cgacggccgg
     2941 ctcgaccccg cgctcgacgt ccaccactcc ggcatcggcc cggccgaggt ctcgctgctc
     3001 ggcatctcca aggcgacggg cctggcctcg gtcgccgagc gccacggcat ctccccggcg
     3061 gaggtgatgg cgatcggcga cggctacaac gacgtggaga tgctgggctg ggcaggcatc
     3121 tccgtcgcca tgggcaacgc gccggcggag gttcagggcg cggcggagtt cgtcgcgctc
     3181 tccaacgcgg acgacggcgc ggccgtgttc gtggaagggt tgctggcccg ctccgccatg
     3241 ggcgacacgc cggccgcggc ggggtgacgg aagacgaaat accccccgag ggacgcagcg
     3301 acctggtgct gacccgcgtc aagcactacc gcgacaccat ccgcagctac accgtgctgg
     3361 tcgacggcga gttcgtcggg cgaatccgcg aaggccagga ggagacgttc accctgcggc
     3421 ccggccgctg caccgtccgg ctgaagctgc tgtggatata cagccccaag gtggaggtcg
     3481 acctgccgcc ggccgggcag acccggatgg tctgcggccc caacggcggg atcctccagg
     3541 cgtggcggct gttcgtcgcc ccgttcacgg cgatcttcct gcgccgggag accgcgggtg
     3601 ccgacggcga gcagcctcac ccccgccact agcgttcccg tgcgatgacg accgcccggg
     3661 gccgacctcg cggccgccgc cgggcagtga cgcccgacga ccctgccgtc gtcgagccgg
     3721 accccggcgc gctgggcttc ggcaccgagc agcaggcgct gctggagcgc gaatcgtcgc
     3781 tgcagccgcg ccgctggctg acgttcgcgc tcgtcgtggt cgcgctggcg gtcgtggcgc
     3841 tgctgatcgg gctcgcccgc ggctccgacg aagaagggga gacggccggt gcggacgccg
     3901 ccgcgcccga cccggcggcc ggcgacctga ccggcgactc ggccgccgcc
//