LOCUS RFSP01000167 4593 bp DNA linear ENV 23-JAN-2020 DEFINITION Betaproteobacteria bacterium isolate WB4_1_0408 contig_100_166, whole genome shotgun sequence. ACCESSION RFSP01000167 RFSP01000000 VERSION RFSP01000167.1 DBLINK BioProject: PRJNA495371 BioSample: SAMN10222888 KEYWORDS WGS. SOURCE Betaproteobacteria bacterium (freshwater metagenome) ORGANISM Betaproteobacteria bacterium Bacteria; Proteobacteria; Betaproteobacteria. REFERENCE 1 (bases 1 to 4593) AUTHORS Rodriguez-R,L.M., Tsementzi,D., Luo,C. and Konstantinidis,K.T. TITLE Iterative Subtractive Binning of Freshwater Chronoseries Metagenomes Recovers Nearly Complete Genomes from over Four Hundred Novel Species JOURNAL Unpublished REFERENCE 2 (bases 1 to 4593) AUTHORS Rodriguez-R,L.M., Tsementzi,D. and Konstantinidis,K.T. TITLE Direct Submission JOURNAL Submitted (10-OCT-2018) Civil & Environmental Engineering, Georgia Institute of Technology, 311 Ferst drive, Atlanta, GA 30332, USA COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Date :: 2017 Assembly Method :: IDBA-UD v. JUN-2017 Genome Representation :: Full Expected Final Version :: Yes Genome Coverage :: 9.68x Sequencing Technology :: Illumina ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 10/25/2018 14:05:36 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.6 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 2,307 CDS (total) :: 2,279 Genes (coding) :: 2,226 CDS (coding) :: 2,226 Genes (RNA) :: 28 rRNAs :: 1, 1, 1 (5S, 16S, 23S) complete rRNAs :: 1 (5S) partial rRNAs :: 1, 1 (16S, 23S) tRNAs :: 22 ncRNAs :: 3 Pseudo Genes (total) :: 53 Pseudo Genes (ambiguous residues) :: 0 of 53 Pseudo Genes (frameshifted) :: 22 of 53 Pseudo Genes (incomplete) :: 30 of 53 Pseudo Genes (internal stop) :: 2 of 53 Pseudo Genes (multiple problems) :: 1 of 53 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..4593 /organism="Betaproteobacteria bacterium" /mol_type="genomic DNA" /submitter_seqid="contig_100_166" /isolate="WB4_1_0408" /isolation_source="water samples from lakes and estuaries along the Chattahoochee River collected between June/2010 and December/2014" /db_xref="taxon:1891241" /environmental_sample /geo_loc_name="USA: Chattahoochee River" /metagenome_source="freshwater metagenome" /note="metagenomic" gene <1..292 /locus_tag="EBS16_07740" CDS <1..292 /locus_tag="EBS16_07740" /inference="COORDINATES: protein motif:HMM:PF03895.13" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=2 /transl_table=11 /product="hypothetical protein" /protein_id="NBU89079.1" /translation="ANGTAATDAANYGQLMDSEKIMSRGIASATAIANIPMLGDGKSV SVGIGIGNYNGQTAIALGGNFRVSAAAQIRASLATGSSGGKTAVGLGASASW" gene complement(317..2089) /locus_tag="EBS16_07745" CDS complement(317..2089) /locus_tag="EBS16_07745" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016534759.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="adenine deaminase" /protein_id="NBU89080.1" /translation="MDTRRRAIAAAQGQVPFDVLLTGATVVDVATLELREADVGIVGE MIASVHARGRFHEAAQVHALDGHYIAPSLIDGHVHFESSHMLPHHYASVVVPQGTTTI FCDPHELANVLGVDGVKFAVEASRGLPLRFIVQASSCVPATPGLELSGADFQANEIAD LLALPEVAGLAEVMDMRAVLDATPRMVGILTAALNSGKIIEGHARGLTGDRLQAYIAA GITSDHEMTSAADALEKLRAGMTVEIRGSHDYLLPELVDMIKTLPVVPTSLTVCTDDV FPDHLVRQGGVIDVLRRLIKYGMDPLQAIRCATVNNAYRLKRDDLGWVAAGRRADLMV VKDLQSLQVTSVFANGRHVAQDGKLLAPLRSRPLSLPTRTMKVASLSASDFQVRVPDV DQADGPRPPRAVVRVIKGARFTEWSEVEVDVVDGVAQLPEHLSLMTVIHRHGRSQAGP QTSFIDGWGRLKGAYATSYAHDSHNLVVYGADPDEMALAANTVVNMGGGSAVVRDGQV VGQIAFPVAGLLSALEPEAVAREHMALVEAAGTVIEWEGRYRTFKALSGQCLACNAGP HLTDLGLTDGGTREVHRPFLRWAS" gene complement(2166..3113) /locus_tag="EBS16_07750" CDS complement(2166..3113) /locus_tag="EBS16_07750" /inference="COORDINATES: protein motif:HMM:PF03401.12" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NBU89081.1" /translation="MAAPTEPLWPNKSPCSKAIDEGVGPMPLRLKFTRRCSALAAVAG LATFSAFAMFTAAWAQSSGATGGAAYPSRPITLIVPYAAGGNVDAVARWAPSVVKYDG LKDFAPVTLLSSSPLVLVGRPGLPANNLDELLKLMRKEPGKLNYATSGIGTSLHVAGE MFNQLGKVQMVHVPYKIGSQIVTDLMGNQIDLAMLPIPLTSEQVKAGRIKAFGVTEVA RSPGLPDVPSMAEHPQLKGLEVTVWFGLFAPAKTDPAVVARLAKESAEMLKDEALRQK LAAIGMRPLGLPPEAFAKFLDAEQRKFGDIVRVGNIRAE" gene complement(3055..3885) /locus_tag="EBS16_07755" CDS complement(3055..3885) /locus_tag="EBS16_07755" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017525705.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="enoyl-CoA hydratase family protein" /protein_id="NBU89082.1" /translation="MKKLLAANYTARHFHWSVQGKVGTVTLHRPERKNPLTFDSYGEL RDFFIDLKYATDVKVVVITGAGGNFCSGGDVHEIIGPLVQMQESGDMQGLLDFTRMTG DAVKAMRHCPQPIVAAVDGICAGAGAILAMASDLRVGTPRAKTAFLFTRVGLAGADMG ACNILPRIIGSGRAAELLYTGRVMTADEGERWGFFNRLVDPEQVLQEAQTWAAELADG PTFAHGMTKACLHQEWSMGIDEAIDNEAQAQAICMQTRDYGRAYRAFVAKQKPVFEGN " gene 4171..>4593 /locus_tag="EBS16_07760" CDS 4171..>4593 /locus_tag="EBS16_07760" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_018906210.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cation transporter" /protein_id="NBU89083.1" /translation="MHTHGFPDNLSEWVHDHVFHVSHEAAERGTRWVMWITAAMMVAE ILAGWWFNSMALLADGWHMSSHAVAIGLSAMAYATARRYASDPRFAFGTWKVEVLGGF ASAVFLLGVAALMVVGSLERLLSPQTIAYQEAVVVALIG" BASE COUNT 841 a 1396 c 1471 g 885 t ORIGIN 1 ggccaacggg accgctgcga cggatgcggc caactatggg cagctgatgg actccgagaa 61 gataatgtcc cgtggcatcg catcggccac tgccatcgcc aacattccga tgttgggtga 121 cggcaagagt gtgtccgtgg gcatcggcat tggcaattac aacgggcaga ccgccattgc 181 cttgggaggt aacttccgcg tgtccgctgc ggcgcaaatc cgcgccagct tggccacggg 241 ctcgagcggt ggcaagacgg cggtggggct gggggcttca gcctcttggt aacagcgggt 301 gacgaagaac ccggggctaa gacgcccacc gtaggaacgg acggtgtact tcgcgtgtgc 361 caccgtccgt cagccccagg tcggtcaggt gcgggcctgc gttgcaggcc aggcactgac 421 ccgacagggc tttgaaggtc cggtagcgac cctcccattc gatgaccgtg cctgcggcct 481 ccacgagggc catgtgttct cgggccacgg cttcgggctc tagcgcggac aacaagcccg 541 cgacagggaa ggcgatttgg cccaccacct ggccatcgcg gaccacggcg ctgccgcctc 601 ccatgttcac gaccgtgttg gctgcgaggg ccatctcatc agggtcggcg ccatacacga 661 ccaggttgtg ggagtcatgg gcgtaggacg tcgcgtaagc gcctttcaat cggccccagc 721 catcgatgaa cgaagtctga gggccggctt gcgagcggcc gtgtcgatga atgactgtca 781 tcaatgacag gtgttcgggc aactgagcca caccatccac cacgtcaact tctacctcgc 841 tccattcggt aaaacgcgct cccttgatga cccgaaccac cgcgcggggt ggacgcggtc 901 cgtcggcctg gtccacatcg ggcacgcgca cctgaaagtc gctggccgac agcgatgcaa 961 ctttcatcgt gcgggtgggt aaagacaggg gcctggagcg cagcggtgcc aacagcttcc 1021 cgtcttgcgc cacatgccgt ccgttggcaa acacgctcgt tacttgcagc gattgcaggt 1081 ctttcacgac catgaggtcc gcccgccgac cggcggccac ccacccaagg tcatctcgct 1141 tgagacggta ggcgttgttg acggtcgcgc agcgaatggc ctgcaagggg tccatgccgt 1201 acttgatgag ccgtcgcaac acgtcgatga ctccgccttg ccttaccaag tgatcgggga 1261 agacgtcatc ggtacagacg gtgaggctgg tgggcaccac aggcaaggtc ttgatcatgt 1321 ccaccagctc gggcagcagg tagtcgtgtg agccgcgaat ttctaccgtc atgccagcgc 1381 ggagcttttc caaagcgtcg gctgcggacg tcatctcgtg gtctgacgtg atgccggccg 1441 cgatgtaggc ttgcagccga tcaccggtca aaccacgggc atgtccttcg atgatcttgc 1501 cgctgttgag cgctgcggtc aggatgccga ccatgcgcgg tgtggcgtcc aacacggccc 1561 gcatgtccat gacttcggcc aggcccgcca cctcgggcag ggcgagcagg tcggcgattt 1621 cgttggcttg aaagtcagca cccgacagct ccagccccgg tgtggcaggc acgcaggacg 1681 aggcctgcac gatgaaccga agcggcaagc cccggctcgc ctccacggca aatttgacgc 1741 catcgacgcc caacacattg gccaactcgt gcgggtcgca aaagatggtg gtggtgcctt 1801 gtgggaccac caccgaggcg tagtggtgag gcagcatgtg cgaggattca aagtgcacat 1861 ggccgtcgat cagactgggc gctatgtagt gcccgtccaa ggcatgcact tgggccgcct 1921 cgtgaaaccg accccgagcg tgcacgctgg caatcatctc gccgacgatg cccacatcgg 1981 cttcgcgcag ctccaaagtg gccacatcca ccaccgtggc tcccgtgagc agcacgtcaa 2041 acggcacttg accctgcgcg gcagcgatgg cgcgacggcg agtgtccatg gaaatggctt 2101 cgttcatggg agggcctcgt gggggatgtc gggctgtctc attatttcgc tattcggcta 2161 ttcggctatt cagcacggat gttgcctacc cgaacaatgt cgccaaactt gcgctgttcg 2221 gcgtcaagaa acttggcgaa ggcctcgggt ggcagcccca acgggcgcat gccgatggcg 2281 gccaactttt gtctcagggc ttcgtctttg agcatctcgg ccgactcttt ggccaagcgg 2341 gcgaccaccg ccgggtcggt tttggcgggt gcaaacaagc cgaaccacac ggtgacctcc 2401 agccccttga gctgcgggtg ttcggccatg ctgggcacgt cgggcaggcc cggggaccgt 2461 gcgacttctg tcaccccaaa ggccttgatg cgacctgcct tgacctgttc gctcgtgagc 2521 ggaatgggca acatggccag gtcaatttga ttgcccatga ggtcggtgac gatttgtgag 2581 ccgatcttgt aaggcacgtg caccatctgc accttgccca actgattgaa catctcaccg 2641 gccacgtgca gcgaggtgcc aatgccggag gtggcgtaat tgagcttgcc cggctccttg 2701 cgcatgagct tgagcagctc gtccaggttg ttggcgggca gacccggtcg acccaccagc 2761 accaacggcg agctggaaag caaggtgaca ggggcaaagt ctttgagccc gtcgtacttc 2821 accaccgagg gtgcccaacg ggccacggca tcgacgttgc cgcccgcggc gtacggcacg 2881 atgagggtga tgggccggga ggggtaggcc gcgccacctg tggcgccact ggactgggcc 2941 catgcggcgg tgaacatggc aaaggctgaa aaggtggcga ggcctgccac tgcggccagg 3001 gcactgcagc ggcgagtgaa tttcaggcgc aagggcatgg ggccgactcc ttcgtcaatt 3061 gccttcgaac acgggctttt gtttggccac aaaggctcgg taggcgcggc catagtcgcg 3121 cgtttgcatg caaatggctt gggcttgagc ctcgttgtcg atggcctcgt caatgcccat 3181 gctccattct tggtgcaagc aggctttggt catgccgtgc gcaaaggtgg ggccgtcggc 3241 caattcggcc gcccaggttt gggcctcttg cagcacctgc tcggggtcga ccaggcggtt 3301 gaagaagccc caacgctcac cttcatcggc tgtcatcacc cgtccggtgt agagcagttc 3361 ggcggcgcgg cccgagccaa tgatgcgcgg caggatgttg caagcgccca tgtcggcacc 3421 ggccaggccc acccgtgtga acaaaaaggc agttttggcg cgcggagtgc ccacccgcaa 3481 atcgctggcc atggccaaga tggcacctgc cccggcgcaa atgccgtcga cagcggccac 3541 gatgggttgc ggacagtggc gcatcgcctt gacggcgtca ccggtcatgc gggtgaagtc 3601 cagcaggcct tgcatgtcgc cactttcttg catttgcacg agcggaccaa tgatttcgtg 3661 gacgtctccg cccgagcaaa agtttcctcc ggcgcccgtg atcaccacca ctttgacgtc 3721 ggtggcgtac ttcaggtcaa tgaaaaagtc tcgcagttcg ccgtaggagt caaaggtgag 3781 ggggtttttg cgctcaggtc ggtgcagggt gacggtaccc accttgccct gtaccgacca 3841 gtggaagtgg cgggcggtgt agttggcggc gaggagcttt ttcatggaga cctttctgaa 3901 gacctgagcc ccaaggccag gttcgaaccc attttaggga ggacctggca atcacccgtt 3961 ggggggcttg acaaacgtga tactgttgtt atgatcagct gtatacaaca acagtatagt 4021 tggtcgaaag catcccatgc ccagcaaaat cagacatcac taccagccca aaaccgtgcg 4081 actgcccgag tggttgcttc gcgtgtggtc atttttctga ctctgccggg tggggtgtga 4141 caatgggggt caccgcttgg actcgccccc atgcacacac acggatttcc agacaacctc 4201 tctgagtggg ttcatgacca cgtctttcat gtcagccacg aggcagccga acgaggcacc 4261 cgctgggtga tgtggatcac ggccgccatg atggtggccg agatcttggc gggatggtgg 4321 ttcaactcca tggccttgtt ggccgacggc tggcacatga gctcccacgc cgtggccatt 4381 ggcctgtcgg ccatggccta cgccacggca cgccggtatg cctcagaccc ccgcttcgcc 4441 ttcggcacct ggaaggtcga ggtgctgggc ggcttcgcca gtgcggtgtt tttgttgggc 4501 gtggccgcct tgatggtggt cggttcgctc gaacggttgc tctcccccca aaccattgcc 4561 taccaagaag cggtggtggt ggccctcatc ggc //