LOCUS PCRA01000284 6406 bp DNA linear ENV 14-NOV-2017 DEFINITION Syntrophobacteraceae bacterium CG23_combo_of_CG06-09_8_20_14_all_50_8 CG23_scaffold_10715_c, whole genome shotgun sequence. ACCESSION PCRA01000284 PCRA01000000 VERSION PCRA01000284.1 DBLINK BioProject: PRJNA362739 BioSample: SAMN06659701 KEYWORDS WGS. SOURCE Syntrophobacteraceae bacterium CG23_combo_of_CG06-09_8_20_14_all_50_8 (groundwater metagenome) ORGANISM Syntrophobacteraceae bacterium CG23_combo_of_CG06-09_8_20_14_all_50_8 Bacteria; Proteobacteria; Deltaproteobacteria; Syntrophobacterales; Syntrophobacteraceae. REFERENCE 1 (bases 1 to 6406) AUTHORS Probst,A.J., Ladd,B., Jarett,J.K., Geller-McGrath,D.E., Sieber,C.M., Emerson,J.B., Anantharaman,K., Thomas,B.C., Malmstrom,R.R., Stieglmeier,M., Klingl,A., Woyke,T., Ryan,M.C. and Banfield,J.F. TITLE Differential depth distribution of microbial function and putative symbionts through sediment-hosted aquifers in the deep terrestrial subsurface JOURNAL Nat Microbiol 3 (3), 328-336 (2018) PUBMED 29379208 REFERENCE 2 (bases 1 to 6406) AUTHORS Probst,A.J., Ladd,B., Jarett,J.K., Geller-Mcgrath,D.E., Sieber,C.M., Emerson,J.B., Anantharaman,K., Thomas,B.C., Malmstrom,R., Stieglmeier,M., Klingl,A., Woyke,T., Ryan,C.M. and Banfield,J.F. TITLE Direct Submission JOURNAL Submitted (15-SEP-2017) Department of Earth and Planetary Science, University of California, Berkeley, 307 McCone Hall, Berkeley, CA 94709, USA COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: IDBA-UD v. 02.2016 Genome Coverage :: 10x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 10/05/2017 15:37:08 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.2 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 1,945 CDS (total) :: 1,925 Genes (coding) :: 1,595 CDS (coding) :: 1,595 Genes (RNA) :: 20 rRNAs :: 1 (16S) partial rRNAs :: 1 (16S) tRNAs :: 16 ncRNAs :: 3 Pseudo Genes (total) :: 330 Pseudo Genes (ambiguous residues) :: 271 of 330 Pseudo Genes (frameshifted) :: 41 of 330 Pseudo Genes (incomplete) :: 50 of 330 Pseudo Genes (internal stop) :: 7 of 330 Pseudo Genes (multiple problems) :: 37 of 330 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..6406 /organism="Syntrophobacteraceae bacterium CG23_combo_of_CG06-09_8_20_14_all_50_8" /mol_type="genomic DNA" /isolate="CG23_combo_of_CG06-09_8_20_14_all_50_8" /isolation_source="groundwater" /db_xref="taxon:1974096" /environmental_sample /geo_loc_name="USA: Crystal Geyser near Green River, Utah" /lat_lon="38.56 N 110.8 W" /collection_date="20-Aug-2014" /note="metagenomic; derived from metagenome: groundwater metagenome" gene 3..896 /locus_tag="COX51_05870" CDS 3..896 /locus_tag="COX51_05870" /inference="COORDINATES: protein motif:HMM:PF14532.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PIP07763.1" /translation="MRGASGTGKELLARAIHKQSNRSSKPFVDVNCAAVPITLLESEL FGYEAGAFTDAKKRKIGLLEYAKGGTVLLDEIGEMSIHLQAKFLRVLEDGYVRRLGGM ENIPIDVRFIFSTNEDLNRMVAEGXXREDLYYRISVVPIFVPPLRERSEDIIILAQYF VEEFNKKFGKKVMGFSREAEMILQTYPWPGNVRELRNIIERIMIVQDIGTIITPERIP GEIKTTGHQEENKILPDIFLPPLPAEGLDFQVVIEKITREVKEKIIANTLDISKGNKT KAAKLLGISRYKLLREQKRNA" assembly_gap 380..386 /estimated_length=7 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(1814..2443) /locus_tag="COX51_05875" CDS complement(1814..2443) /locus_tag="COX51_05875" /inference="COORDINATES: protein motif:HMM:PF02525.15" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NADPH-dependent FMN reductase" /protein_id="PIP07764.1" /translation="MKITAILGSPRLNGNSTTLAESFLQEAERLGAEVERFRLNTMNY QGCVGCNLCKTKTDHCVLQDDLSFVLEAVREAETLLIATPVYALDMPSQLKAFIDRCY SFFKPHYYKRKDRSRLPSGKKIIFVLAQRAPETMFIDFVQRYDYMFNLLGLKIAYLIR GCELGDDLDAAAKRGDLIEQARETARRVMAGEPVDTAIPPYIFARVWNR" gene complement(2531..4171) /gene="groL" /locus_tag="COX51_05880" CDS complement(2531..4171) /gene="groL" /locus_tag="COX51_05880" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011415963.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="chaperonin GroEL" /protein_id="PIP07765.1" /translation="MSVKEIKYNVLAREQIMKGVDALANAVEVTLGPRGRNVLIEKSW GSPIITKDGVMVAKEIELENKFENMGAQMVKEVASKTSDKAGDGTTTATVLAQAIYHE GFKLVTAGMNPMSLKRGIDKGVQIIVDELKKISKPIKGKKEITQVGAIAANNDKTIGE IISEAMDKVGKEGVITVEEAKSMETSLEIVEGMQFDRGYISPYFATDLEKMEVHLEDP YILLFEKKISSMKDMVPILEQIAKMGRPLLIIAEDVEGEALATLVVNRIRGTLRCAAV KAPGFGDRRKAMLDDIAILTGGNVISEDIGVKLENITISDLGNCKRLSIDKDNTTIVD GAGKKAAIEGRVRQIRAQIEETKSDYDREKLQERLAKLVGGVAVIKVGAAPETAMKEK KARVEDALHATRAAVEEGIVPGGGVALLQTLNELNKANVPEDEQHGLNILKRAIEEPL RQIANNAGHEGSIVVEKVKSKKGAYGFNADSGQYVDMMDSGIIDPTKVVRFAIQNAAS VASLLLTTEAMVAEKPKKKSPSTGMPGGGMPSDMDYDY" gene complement(4247..5158) /locus_tag="COX51_05885" CDS complement(4247..5158) /locus_tag="COX51_05885" /inference="COORDINATES: protein motif:HMM:PF00570.21" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PIP07766.1" /translation="MQHNKWLLVDTPVKADEALKDIKSSSVISMDTEYDSFHYFRDKL CLIQIKTKRKTYLFDPLGDIDLSFLGEHFAASSLCKIMHAGDNDVRILKRDYGFFFNN IFDTHRAALLLGCSHLSLAALVSQYLGIEFEKKKKIQRSKWDLRPLTEEQLAYAVMDT AYLPDLHRRLEDEILKAGLEAEAAKIFTDMAKVVWREKELDQGGHKKIWGYWSLPDGC KERLKRLFRWRYLKAKAINRAFFMILSDKDLFSLSWAEIGNLEDLGDKGLLSRDKIRL LGPELVEILSGEDKSIKAHSSQNSRHS" gene complement(5309..5818) /locus_tag="COX51_05890" CDS complement(5309..5818) /locus_tag="COX51_05890" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_003539959.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="nitroreductase" /protein_id="PIP07767.1" /translation="MDFQELISKRYSVRAYKSDPVEEGKLQKVLDAARLAPTAANRQP IQFVVIHTAGREEDLKLMYKKDWLSQAPLVVVACAVYDGAWVRMDNKNYCEVDATIAM DHLILAATDLGLGTCWIAAFDPQAVRKLLKLPEGVEPIALTPIGYPADQPKEKKRKPL SELVRYDHW" gene complement(5837..6125) /locus_tag="COX51_05895" /pseudo CDS complement(5837..6125) /locus_tag="COX51_05895" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_020722023.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" BASE COUNT 1552 a 1630 c 1434 g 1783 t ORIGIN 1 tgatccgcgg cgccagcggt acgggaaaag aattgctggc acgggcgatt cataaacaaa 61 gcaatagaag ttccaagccc ttcgttgatg tcaattgcgc cgccgttccc ataacactgc 121 tggaaagcga actcttcggc tatgaggcgg gcgcctttac agatgcgaaa aaacgtaaaa 181 tcggtctgct ggaatacgct aaagggggaa cggtattgct tgatgaaata ggggaaatgt 241 ccatacatct tcaggcaaaa ttccttcgtg tcttggaaga cggctacgtg cgacgtctgg 301 ggggcatgga gaacatcccc attgacgtcc ggttcatctt ttccacgaac gaagatttga 361 accgcatggt tgcggagggn nnnnnncgcg aggaccttta ttaccgcatc agcgttgtac 421 cgatctttgt tccccccctg cgggaaagaa gcgaagacat catcattctg gcgcaatatt 481 ttgttgagga attcaataaa aaattcggaa aaaaggtaat gggtttcagc agggaagcgg 541 agatgattct tcagacctat ccctggccgg gaaacgtccg ggaactgagg aacattatag 601 aacgtattat gatcgtccag gatataggca ctatcataac gccggaacgc attcccgggg 661 aaatcaagac aaccggacat caggaagaaa ataaaattct gcctgacatt ttcctgcccc 721 ctttgccggc tgaaggctta gactttcagg tggttatcga aaagataaca cgagaagtga 781 aggaaaaaat catcgccaat accctggata ttagtaaagg caataaaaca aaggctgcaa 841 aactgctggg catttcccgc tacaagcttc tgagggaaca gaaacgaaac gcttgatatc 901 tctccccttt gcaaagagcc acttcttatt ctcctccttt cgtaaagggg gcagggggga 961 tttttcctaa caggtcgtcc gttcatgatc tgccatctct tccataaaat atgtgcgctc 1021 ttcgcacagt aagtgcgttt attgcacatg ggccatgttt catctattgc cctgtttaga 1081 gtatctcttg ctgtttttgt gcatatcccg cacggttaat gtggcattgc catccagctc 1141 tgtaaagaac gaaacaccac cctgaaataa tttccctttc ctgtcctata acatgtggta 1201 tttgttatga ttataaaaaa ctttctacaa ggttagtcaa tggaaggttt ttcttgaaaa 1261 tggcattgat attgcttaat acaattcgga gtgaagacat acctgaaaag aattaaagaa 1321 cactcggaga taagtcagtt tttaggaaac atctctctaa gttttaagtc ggtgggtggt 1381 tttttttacc ctcctctcca cccaccgacc ccccttcagc taatctgaag ggtgggtttt 1441 ggcgccaccc ttcagtctac atagcggtgg atggtaagaa atggacgccc tcctttcccc 1501 atccaccttt ttttgttaaa gccccggcag tttaatctct atgggtataa ttcccagccg 1561 cttgcggcga taatgaaaaa ttgttccggc aagatacccc ggctgcttgc ggcggggtag 1621 ttcattgccg tatgccgtac tccactcgct atcggaaggg gcttttttta gcgtcctgaa 1681 caaatactgc cggtatgagc aagaataaaa cgaggatgca cagggtaacg agcatctcgc 1741 gtatgattac acaacttttt ctggcctttg gcaaaacctt gagtacacac agaaagagaa 1801 gagtattttc agtttaccgg ttccagaccc gggcaaagat atagggtgga atagccgtat 1861 ctaccggttc gccggccata actctccgtg ccgtctcccg agcttgctcg attaaatccc 1921 cgcgcttggc agcggcatcc aagtcatcac cgagttcaca gccgcggatc aaatacgcga 1981 ttttgaggcc taaaagatta aacatgtagt catagcgctg gacaaaatcg ataaacattg 2041 tttccggcgc tcgctgagca agcacgaaaa ttatcttctt ccctgacggc agcctgcttc 2101 tatctttcct tttgtagtaa tgcggcttga aaaaggagta acacctgtct atgaaggcct 2161 ttagttgtga aggcatatcc agagcataga cgggcgttgc gataagtagt gtctcggcct 2221 cacgcaccgc ctccagtaca aaggacaaat catcttgcaa cacacaatgg tcggtcttcg 2281 tcttgcataa gttgcagcca acgcaaccct ggtaattcat tgtattcaag cgaaatcgtt 2341 ctacctccgc cccgagtcgc tctgcttcct gcagaaacga ttcggctaag gtagtgctat 2401 ttccatttag ccttggactt cccaaaatag ctgtaatttt catctgcaga tcaacaattg 2461 ttgtttaacc ggtaatcggt ccgaaaataa aaacccccgc accctttaaa ggatgcatgg 2521 ggttaggcaa ttaataatca tagtccatat cagaaggcat gccgccgcct ggcattcctg 2581 tcgatgggga ttttttcttc ggtttttcgg ctaccatcgc ttctgttgtc aagaggagcg 2641 atgcgacgga agcggcattc tggatggcaa aacgaaccac cttagtggga tcaatgatgc 2701 cgctatccat catgtccacg tactgaccgg aatccgcatt gaagccatag gcgccttttt 2761 tgctcttgac tttttccacc acaatggatc cttcatgacc ggcattattg gcaatttggc 2821 ggagcggttc ttcaatagcc cgtttcagaa tgttcaaacc atgctgttca tcctcgggta 2881 cattagcctt atttagttca tttagggttt gcagcaaagc tacgccgcct ccgggaacga 2941 tgccttcttc gaccgccgca cgtgtggcat gcagggcatc ctccacccgg gccttttttt 3001 ccttcatggc ggtctctggg gcagcgccta ccttaattac ggctacaccg ccgaccaatt 3061 tggcaagacg ttcctgtaat ttttcccggt cataatcgga cttggtttct tcgatctgcg 3121 ccctgatctg cctgacgcgg ccctcgatag cggctttttt gccggcgcca tcaacgatgg 3181 tggtattgtc cttgtcaatg ctcaaacgtt tgcagtttcc aagatcgctg atagttatat 3241 tttcgagttt gacgccaata tcttcggaga tgacattacc gcccgtcagg atggcaatgt 3301 catccaacat ggcctttcgc ctgtcaccaa aaccgggggc cttcacggcc gcacacctga 3361 gtgtgccacg aatcctgttt accacgagcg ttgccagcgc ttcgccttca acatcttcgg 3421 caataatcaa cagcggccgt cccatcttgg cgatttgttc caggattggt accatgtcct 3481 tcatactgct gatcttcttc tcaaaaagga ggatgtaagg atcctcaaga tgtacttcca 3541 tcttctcaag atcggttgca aaatagggag atatatagcc gcgatcgaat tgcattccct 3601 caacaatttc gagagatgtt tccatgctct tggcttcttc aacggtaatg acgccctctt 3661 tgccgacttt atccatagcc tcggagatga tttcgccaat ggttttgtca ttattggccg 3721 ctatcgcgcc gacctgagta atttcttttt tacctttgat cggcttagat atctttttca 3781 gctcatcaac gatgatttgg actcccttat cgatgcctct tttgagcgac atgggattca 3841 tccctgctgt caccagctta aaaccttcgt gataaattgc ctgggccagt acggtcgccg 3901 tggtggttcc gtcccccgct ttgtcggatg tcttggaggc cacctcttta accatctggg 3961 cgcccatgtt ttcaaacttg ttttccaact cgatttcttt agccaccata acgccatcct 4021 tggtgatgat tggtgaaccc catgattttt caatcaggac attacggcct ctaggaccaa 4081 gcgtgacttc cacggcattg gctaaggcat cgaccccttt cataatttgt tcacgggcta 4141 aaacgttata ttttatttct tttaccgaca tttttctttc ctcctttttt actcaataat 4201 accgagcaca tcgttttctc tcatgatcaa gtgctctatg ccatcttcag gaatgccgtg 4261 agttttgcga ggaatgcgct ttgatgcttt tgtcctctcc acttaatatt tccacaagct 4321 ccgggcctaa aagacgaatt ttgtcccgtg aaagcagtcc tttatctcca aggtcttcga 4381 gattgccgat ctccgcccat gatagagaga acaaatcctt atcggataaa atcatgaaaa 4441 aagccctgtt gatcgccttg gcctttaaat agcgccaccg gaaaaggcgc ttcaagcgtt 4501 ccttgcagcc atcgggtaat gaccagtatc cccatatctt tttgtgccct ccctgatcga 4561 gttccttttc tcgccatacc acctttgcca tgtccgtaaa aatcttcgct gcttccgcct 4621 ccagacctgc tttaaggatt tcatcttcca gtctgcggtg caaatcaggt agataggccg 4681 tgtccataac agcatatgct aactgctctt ccgtcaaagg acggaggtcc catttcgagc 4741 gctgtatttt cttctttttt tcgaattcga tgccgagata ctggcttact aaagcggcaa 4801 gagaaaggtg tgagcatccc aacagcaagg cggcgcggtg ggtatcgaaa atgttgttga 4861 aaaagaaacc gtaatcccgc ttcaggatgc ggacatcatt gtctccggca tgcatgatct 4921 tacacaggga cgaagcggca aaatgctctc ccagaaagga taaatcaata tcccccagtg 4981 gatcaaagag ataagttttt ctttttgttt ttatctgaat taagcagagt ttgtcccgaa 5041 agtagtggaa agaatcatat tccgtgtcca tactgatgac ggaggagctt tttatatctt 5101 taagggcttc atcggccttt acaggggtgt ccaccaaaag ccacttatta tgctgcattt 5161 attatcctca tttctcgcgt attaaaatca caagcggcaa aaaccgtcaa gatattttcc 5221 ccagatattt tctccatcgg caatatgaaa atcgggccgg aagaggtaat ctacatcctg 5281 cataagacag cctgtctcaa gcagtagttt accagtgatc gtaccgcacc aattcagata 5341 gaggtttccg ttttttctcc ttcggctggt cagcgggata tccgatgggc gttagagcga 5401 taggctccac tccttcaggc agcttcagca atttccgcac cgcctgagga tcaaaggcgg 5461 cgatccagca agtgcccaga cccaggtcgg tggctgccag gatcagatga tccatggcga 5521 tcgttgcatc cacctcgcag tagttcttgt tgtccattcg tacccatgcc ccatcgtaaa 5581 ccgcgcaagc aacaacgacc aggggtgcct gactgagcca atctttcttg tacatcagct 5641 tcaggtcttc ttcccggccg gcagtgtgta tgacgacgaa ctgaatcggc tgccggttgg 5701 cagccgtggg ggcaagccgt gcggcatcaa gcaccttttg gagttttccc tcttccaccg 5761 gatccgattt atacgcccgc acactgtatc tcttgctgat caattcctga aaatccattg 5821 tctccctcct ttctaattac tccctgttga tcttcaccag cacccgtttt cggcggcggc 5881 cgtcggactc cccataaaag atctgttccc gaggaccgaa atcgagccag ccttccgtca 5941 ccgctatgac cgcttcccgg cccatgatct gccgcttgag gtgggcgtcg ccgttgactt 6001 cccccgtctg gttgtagcaa caatatgaaa ttggttcatg aagtgccatt tgctccagcc 6061 atcgttcgta atcctgacgt tgcccgctcg tcatcgttga tgaaagccga agccgtgata 6121 tgcatggcgt ttaccagaca tattcaggca atcttccgaa gttagatatt accatggttc 6181 cggcagttcg tttgataaat gttctgctat attataatgg atcccaaaat ttggacaata 6241 gatatattgt tcaagagtgg tcatttttaa ctggctttta cttaacctgc tgcgagggag 6301 cttaggtatt ccctgaggag aaatgctttg cttaccccat gatcgataag atcccagggc 6361 aagatttccg atatttcctt tcgtcgatag acgtaaaaat ctgcgt //