LOCUS       DNMS01000222            3628 bp    DNA     linear   ENV 06-SEP-2018
DEFINITION  TPA_asm: Rhodospirillaceae bacterium isolate UBA9186 contig_1990,
            whole genome shotgun sequence.
ACCESSION   DNMS01000222 DNMS01000000
VERSION     DNMS01000222.1
DBLINK      BioProject: PRJNA417962
            BioSample: SAMN08018576
            Sequence Read Archive: SRR6486097
KEYWORDS    WGS; Third Party Data; TPA; TPA:assembly.
SOURCE      Rhodospirillaceae bacterium (marine metagenome)
  ORGANISM  Rhodospirillaceae bacterium
            Bacteria; Proteobacteria; Alphaproteobacteria; Rhodospirillales;
            Rhodospirillaceae.
REFERENCE   1  (bases 1 to 3628)
  AUTHORS   Parks,D.H., Chuvochina,M., Waite,D.W., Rinke,C., Skarshewski,A.,
            Chaumeil,P.A. and Hugenholtz,P.
  TITLE     A standardized bacterial taxonomy based on genome phylogeny
            substantially revises the tree of life
  JOURNAL   Nat. Biotechnol. (2018) In press
   PUBMED   30148503
  REMARK    Publication Status: Available-Online prior to print
REFERENCE   2  (bases 1 to 3628)
  AUTHORS   Parks,D.H.
  TITLE     Direct Submission
  JOURNAL   Submitted (04-APR-2018) School of Chemistry and Molecular
            Biosciences, University of Queensland, Chemistry Bld, Cooper Road,
            St Lucia, Brisbane, Queensland 4072, Australia
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method :: CLC de novo assembler v. 4.4.1
            Expected Final Version :: yes
            Genome Coverage :: 13.40x
            Sequencing Technology :: Illumina HiSeq 2000
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider :: NCBI
            Annotation Date :: 04/19/2018 23:05:43
            Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline
            Annotation Method :: Best-placed reference protein set; GeneMarkS+
            Annotation Software revision :: 4.5
            Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total) :: 3,871
            CDS (total) :: 3,839
            Genes (coding) :: 3,770
            CDS (coding) :: 3,770
            Genes (RNA) :: 32
            rRNAs :: 1 (16S)
            complete rRNAs :: 1 (16S)
            tRNAs :: 29
            ncRNAs :: 2
            Pseudo Genes (total) :: 69
            Pseudo Genes (ambiguous residues) :: 21 of 69
            Pseudo Genes (frameshifted) :: 19 of 69
            Pseudo Genes (incomplete) :: 37 of 69
            Pseudo Genes (internal stop) :: 1 of 69
            Pseudo Genes (multiple problems) :: 9 of 69
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..3628
                     /organism="Rhodospirillaceae bacterium"
                     /mol_type="genomic DNA"
                     /isolate="UBA9186"
                     /isolation_source="marine"
                     /db_xref="taxon:1898112"
                     /environmental_sample
                     /note="metagenomic; derived from metagenome: marine
                     metagenome"
     gene            complement(38..736)
                     /locus_tag="DC046_10025"
     CDS             complement(38..736)
                     /locus_tag="DC046_10025"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_017451752.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="glutathione S-transferase"
                     /protein_id="HBC07902.1"
                     /translation="MIDLYSWPTPNGHKVHIMLEETGLAYNVHPINIQKGDQFQPEFL
                     KVSPNNRIPAIIDQDGPGGKPYSLFESGAILIYLAEKTGKFLPTDPTAKYDTLQWLMW
                     QMGGIGPMFGQAHHFRGYAPVDIPYAVDRYTKEAGRLYGILDKRLGESAYLAGPNYTI
                     ADIATFPWTRSIDRQGHSLDDFPNVKRWSDAINARPGVAKGVTVLEDVRGERKPLSDE
                     ERAVMFGDKQFAKR"
     gene            963..1193
                     /locus_tag="DC046_10030"
     CDS             963..1193
                     /locus_tag="DC046_10030"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_010163956.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="DUF4170 domain-containing protein"
                     /protein_id="HBC07903.1"
                     /translation="MSNALLHLVFGGKVTDPQGQDFVDPDNLEIVGIYPSYAKALTAW
                     RGASQAHVDDANMKYVIVHLHRLLEPEETHNH"
     gene            complement(1218..1418)
                     /locus_tag="DC046_10035"
     CDS             complement(1218..1418)
                     /locus_tag="DC046_10035"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_008190160.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="DUF4169 domain-containing protein"
                     /protein_id="HBC07904.1"
                     /translation="MAAEIVNLRQYRKEKQRQERERTAEQNRARFGRDKTQRRKDETD
                     RRRRDQDLDGKQIDPESDKPAG"
     gene            complement(1528..2172)
                     /locus_tag="DC046_10040"
     CDS             complement(1528..2172)
                     /locus_tag="DC046_10040"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_019643884.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="uracil-DNA glycosylase"
                     /protein_id="HBC07905.1"
                     /translation="MPAAFQNPPRDCAKCPRLAGFRADNRAAFPDWHNAPVPAFGPLD
                     ARLLIVGLAPGLRGANRTARPFTGDYAGDLLYPTLGEFGWTEGTYGAAPDDGLRLKGC
                     RITNAVRCVPPENKPTGAECKACRPYLADEIAAMPNLRVILALGGVAHANVLTTLDER
                     KKDFPFGHGNRHVLTSGRRLIDSYHCSRYNTNTGRLTPDMFRDVFAAIADLMAA"
     gene            complement(2178..2804)
                     /locus_tag="DC046_10045"
     CDS             complement(2178..2804)
                     /locus_tag="DC046_10045"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_008944763.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="NYN domain-containing protein"
                     /protein_id="HBC07906.1"
                     /translation="MNFYPEERVALFIDGSNLYAAARALNFDIDYKRLLAVFSGKCRL
                     VRAFYYTAMVEDQEYSPIRPLVDWLDYNGYTMVTKPTKEFTDSTGRRKIKGNMDIELA
                     IDVLEMADKLDHVVLFSGDGDFRRLVDAVQRKGCRVTVVSTVRSQPPMVADELRRQAD
                     IFIELQSLQSAIQRAGGPSNNAQPDDDDDYDDEDDYDDDEGEAEVEVV"
     gene            2927..3469
                     /gene="folK"
                     /locus_tag="DC046_10050"
     CDS             2927..3469
                     /gene="folK"
                     /locus_tag="DC046_10050"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_012553091.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="2-amino-4-hydroxy-6-
                     hydroxymethyldihydropteridine diphosphokinase"
                     /protein_id="HBC07907.1"
                     /translation="MSGPVVIGVGGNLPTAEFGPPRATCGAALQVLSQHPGVAITAHA
                     GWYETAPVPVSDQPWFVNGAVAVATDLTPDALMAVLLDIETRFGRQRSERNAARILDL
                     DLLTFGDRVLMGDLDVPHPRLHTRAFALLPIRDVAPGWRHPVLGRTIEALCAELPADQ
                     EIRPLADAGGYLGTEWQTPE"
BASE COUNT      662 a   1099 c   1148 g    719 t
ORIGIN
        1 ggaacgggcg gtgatgttcg gcctcgaatc cggataatca gcgtttcgcg aactgcttgt
       61 cgccgaacat caccgcccgt tcctcgtcgg ataggggctt gcgctcgccg cgcacgtctt
      121 ccagcacagt gacacccttg gcgaccccgg gccgcgcgtt gatggcatcg gaccagcgct
      181 tcacgttggg gaaatcgtca agggagtggc cctggcggtc gatggaccgg gtccagggga
      241 aggttgcgat atcggcgatc gtgtagttgg gaccggcaag gtaagccgat tcccccaaac
      301 gcttgtcgag aatgccgtac agacgcccgg cttccttggt gtaacgatcg acggcgtagg
      361 ggatatcgac cggcgcatag ccgcgaaaat ggtgggcttg cccgaacatc gggccgatgc
      421 cgcccatctg ccacatcagc cattgcaggg tgtcgtactt ggccgtgggg tcggtcggca
      481 ggaacttgcc ggttttctcg gccagataga tcaggatggc gccgctttcg aacagggagt
      541 agggctttcc gccggggccg tcctggtcga tgatcgccgg gatgcggttg ttcggtgaaa
      601 ccttcaggaa ttccggctgg aactggtcac ccttttggat gttgatggga tgcacgttgt
      661 acgcaagccc ggtttcctcg agcatgatat gaactttgtg cccgttgggc gtgggccagg
      721 aatagaggtc gatcattgtc gttgcagacc tttcagtggt ttcggtagcc gcggtgcccg
      781 gcgcgcaaac gcggagcggc gggactggac gtagggtgta ttgcccgtcc gcgggattgg
      841 ccctagttta gggcaaaatc caccgaccgc cagtgaatgc ggcggaataa cagttggtcc
      901 gcggatgaac aatccgtgag gagacattca gaggcagccg aaacaacgaa aagggactaa
      961 ccatgtcgaa tgcgcttctg catctcgtgt tcgggggcaa ggtaaccgac ccccagggcc
     1021 aggatttcgt cgatccggat aatctggaaa tcgtcggcat ctatcccagc tatgccaagg
     1081 cattgaccgc ttggcgcggc gcgtcgcagg cccatgtcga tgacgccaac atgaaatacg
     1141 tgatcgttca cctgcaccgg ctgttggaac cggaagaaac ccacaaccac taggtcgttg
     1201 cggccacctc gtgatcgtca gcccgcgggc ttgtcgcttt cgggatcgat ttgcttgccg
     1261 tccaggtctt ggtcgcggcg gcggcggtcc gtctcgtctt ttcgccgctg cgtcttatcg
     1321 cggccgaaac gggcccggtt ctgttccgcg gtccgttcac gctcctggcg ctgcttttcc
     1381 ttacggtatt ggcgtaagtt gacgatctcc gcggccatcc cgaccctccg ttcgcaccgg
     1441 cgttcccgct gacggggccg cctaaaaaac gataacccga acagatgctt gccgccaagc
     1501 ccttgttccg gggggatgaa tgacgggtca ggccgccatc agatccgcga tggcggcaaa
     1561 gacatcgcgg aacatgtccg gggtcaggcg gccggtgttg gtgttgtagc gcgagcaatg
     1621 gtagctgtcg atcaaccgcc gacccgaggt caggacatgg cggttgccgt ggccgaaggg
     1681 gaaatccttt ttccgctcgt ccagtgtcgt cagcacgttg gcatgggcaa cgccgcccag
     1741 ggccaggatc acccgcaggt tgggcatggc cgcgatttca tccgcgagat aggggcggca
     1801 ggctttgcat tcggcccccg ttggcttgtt ttccggcggt acgcagcgca cggcgttggt
     1861 gatgcggcag cccttgaggc gcagcccgtc gtccggcgcg gcgccatagg tgccttcggt
     1921 ccagccgaat tcgcccagcg tggggtacag cagatcgccg gcgtaatccc cggtgaacgg
     1981 ccgcgccgtg cggttggcgc cgcgcagccc gggggccaat ccgacgatca gaagccgcgc
     2041 atcaagcggc ccgaaagcag ggacgggggc attgtgccaa tcgggaaagg cggcgcggtt
     2101 atcggcgcga aagccggcca atctggggca tttcgcacag tcgcggggcg gattctggaa
     2161 cgcggcgggc atgcggctta gacgacttcg acctcggcct cgccttcatc gtcgtcgtaa
     2221 tcgtcctcgt catcataatc gtcatcatcg tcgggctggg cgttgttgga cggaccgccg
     2281 gcgcgctgga tggccgattg caggctttgc agttcgatga agatatcggc ctggcggcgc
     2341 agttcatcgg caaccatggg cggctgagaa cgcacggtgg acacgacggt gacccgacag
     2401 cccttgcgct gcacggcatc gaccagccga cgaaaatcac cgtcgccgga aaacaacacc
     2461 acatggtcca gtttgtcggc catttccaac acgtcgatgg ccaattcgat gtccatattg
     2521 cctttgatct tgcgccgacc ggtggaatcc gtgaattcct tcgtcggttt ggtaaccatg
     2581 gtgtagccgt tgtaatccag ccagtcgacg agagggcgga tcggtgaata ctcctggtcc
     2641 tcgaccatgg ccgtgtagta gaaggcccgc accaagcgac attttcccga gaacacggcc
     2701 aacaggcgct tgtaatcgat gtcgaagttg agggcccttg cggcggcgta aaggttggag
     2761 ccgtcgatga acagggcgac gcgttcttca ggataaaaat tcatttccat caatcccttg
     2821 gtgaaatcaa agcgttaaat ttttacacag tgccaatcat gaaacactta agccccattg
     2881 cccccgagca caaggattca acgcccgccg gcccagggtt tccggcgtga gcgggcccgt
     2941 cgtcatcggc gtcggcggca acctgccgac tgcggaattc gggccgccgc gcgccacctg
     3001 cggggccgca ttgcaggttc tttcacaaca ccccggggtc gcgatcacgg cccatgccgg
     3061 atggtatgaa accgcccccg tgcccgtttc cgaccagccc tggttcgtca acggcgccgt
     3121 cgccgtggct acggacctga cgccggatgc cctgatggcg gttttgctgg acatcgaaac
     3181 ccggttcggg cggcaacgca gcgaacgcaa cgccgcgcgc attctggatc tcgatcttct
     3241 gaccttcggt gaccgggtgt tgatgggcga tctggacgtg ccgcatccgc gtctacatac
     3301 gcgcgccttc gccttgctgc cgatccgcga cgtggccccc ggctggcgcc atcccgtgct
     3361 gggccgcacg atcgaagcgc tctgcgccga actgccggcg gaccaggaaa ttcgcccctt
     3421 ggccgatgcc gggggctatc tgggcaccga atggcagacc cccgaataat tcgccgaaag
     3481 gcttgaaaat cccggcacct cggcatataa cccacagcct tgccaatgat tggcgcccca
     3541 tattcctgac attccgaccc cggaggttcc gcatggcccg cgttacggtc gaagattgcg
     3601 ttctgaaaat ccccaaccgc ttcgagct
//