LOCUS VBLK01000341 4107 bp DNA linear ENV 28-MAY-2019 DEFINITION Deltaproteobacteria bacterium isolate DP_33 14_1009_16_20cm_scaffold_24739, whole genome shotgun sequence. ACCESSION VBLK01000341 VBLK01000000 VERSION VBLK01000341.1 DBLINK BioProject: PRJNA449266 BioSample: SAMN11380576 KEYWORDS WGS. SOURCE Deltaproteobacteria bacterium (soil metagenome) ORGANISM Deltaproteobacteria bacterium Bacteria; Proteobacteria; Deltaproteobacteria. REFERENCE 1 (bases 1 to 4107) AUTHORS Diamond,S., Andeer,P.F., Li,Z., Crits-Christoph,A., Burstein,D., Anantharaman,K., Lane,K.R., Thomas,B.C., Pan,C., Northen,T.R. and Banfield,J.F. TITLE Mediterranean grassland soil C-N compound turnover is dependent on rainfall and depth, and is mediated by genomically divergent microorganisms JOURNAL Nat Microbiol (2019) In press PUBMED 31110364 REMARK Publication Status: Available-Online prior to print REFERENCE 2 (bases 1 to 4107) AUTHORS Diamond,S. and Banfield,J.F. TITLE Direct Submission JOURNAL Submitted (01-MAY-2019) Earth and Planetary Science, Jill Banfield's Lab at Berkeley, University of California, Berkeley, CA 94720, USA COMMENT The annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP). Information about PGAP can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: IDBA_UD v. 1.1.1 Genome Representation :: Full Expected Final Version :: Yes Genome Coverage :: 7x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 05/16/2019 09:55:51 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline (PGAP) Annotation Method :: Best-placed reference protein set; GeneMarkS-2+ Annotation Software revision :: 4.8 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 3,938 CDSs (total) :: 3,903 Genes (coding) :: 3,740 CDSs (with protein) :: 3,740 Genes (RNA) :: 35 tRNAs :: 31 ncRNAs :: 4 Pseudo Genes (total) :: 163 CDSs (without protein) :: 163 Pseudo Genes (ambiguous residues) :: 8 of 163 Pseudo Genes (frameshifted) :: 96 of 163 Pseudo Genes (incomplete) :: 61 of 163 Pseudo Genes (internal stop) :: 6 of 163 Pseudo Genes (multiple problems) :: 8 of 163 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..4107 /organism="Deltaproteobacteria bacterium" /mol_type="genomic DNA" /isolate="DP_33" /isolation_source="temperate grassland biome" /db_xref="taxon:2026735" /environmental_sample /geo_loc_name="USA: Angelo Coast Range Reserve, CA" /lat_lon="39.74 N 123.63 W" /collection_date="2014-09-03" /metagenome_source="soil metagenome" /note="metagenomic" gene complement(115..450) /locus_tag="E6J86_13830" CDS complement(115..450) /locus_tag="E6J86_13830" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012632365.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="phage holin family protein" /protein_id="TMA11368.1" /translation="MAEEKELAKKEIELAKSELRADVQKEVAMVKGLGVAGLCALWAV SLMLVACALALGTVIAEWAAALIVAGVVLAVGTVAGLLGWGKRVKTPLEATRRTLKED VLWAKERLA" gene complement(1570..2100) /locus_tag="E6J86_13835" CDS complement(1570..2100) /locus_tag="E6J86_13835" /inference="COORDINATES: ab initio prediction:GeneMarkS-2+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS-2+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="TMA11364.1" /translation="MGRRIVRRLALAAVCLAALAACVVNLSFDMDQPGIPLVTSAAGA ASTSTLVDLGNSNDIRTHRNDIRSLDLESVDVTITEVKPGNVAATLSGTLTLRKDLGD PSTEVKVGDLPNFPVLVQSTKRIKGNPAVDAFLLERLHDGGKFYAIISGTTDGATDIV LDVLLHASMGYETGLF" gene complement(2121..3161) /locus_tag="E6J86_13840" CDS complement(2121..3161) /locus_tag="E6J86_13840" /inference="COORDINATES: protein motif:HMM:PF14249.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="TMA11365.1" /translation="MDAESLNSPRWPRVSKGGFHEAWFVSGSDAKAGYGLWLRYGVDL DRAGAPDSSIWGSWFERDASERTFVIRNMVDDATISRTGVAFGAAGLTESGCTGEVEA GGHSLRWRLSFGQGAPAEDVIPAWLSPIARLRLSGYVLPHAKTTLSGAVEVDGRMSEL QRMPALQGHLWGRSYWPAWAWARCSGFTEDPDASVELLDVEGPRNVRLPLFVLRFRGG THRFGELPWMLLSVSRPAAPAWHFSAQDARLSIDGVVRATVEQMVQVQYPDADGSLRH CVHTAVASMEVQVRSRAVPGAPWRPEGTLTSKSGASLEFCGREADARVTRGLVSAPAR RRNPSGAGSVAS" gene 3291..3743 /locus_tag="E6J86_13845" CDS 3291..3743 /locus_tag="E6J86_13845" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012241669.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NADH-quinone oxidoreductase subunit A" /protein_id="TMA11366.1" /translation="MPFHFANVLLFSATAILFVFGSLVAGHFLRPHAPTREKSLIYEC GEKPIGQAWFNFNPRFYLVALVFVIFEVEIAFMFPVAAVYKAFLDKGQGLLAFAEIFV FVAILAVGLAYVWAMGDLEWVKGLKQDVLQSLRERQQKPVETAGKKAA" gene 3744..>4107 /locus_tag="E6J86_13850" CDS 3744..>4107 /locus_tag="E6J86_13850" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012241670.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NADH-quinone oxidoreductase subunit B" /protein_id="TMA11367.1" /translation="MPKDYQQDFGSIITSKTSFIDEIINLARKSSIFYLLFGLACCGI ELMQTGGPRADIDRFGSVPRATPRQSDLMIVAGTLTYKMAERCRLLYEQMAEPKYVIS MGSCASCGGLFQYAYSVCK" BASE COUNT 691 a 1438 c 1315 g 663 t ORIGIN 1 cgtggcgcct gcggtccagc tccgccagcg atcggtccag acgcgttcgc agatgctcga 61 tgtcgctctc gatctcccga ggcgaccgcg gctcggccgg agcgttcccg tccttcacgc 121 gagacgttcc ttcgcccaca gtacatcctc cttcagggtg cgccgcgtcg cctccagcgg 181 cgtcttgacg cgcttgcccc atcccaagag cccggccacc gtcccgaccg ccagcaccac 241 gccggcgacg atcagcgccg cggcccattc cgcgatcacc gtgccgagcg ccagggcgca 301 ggccaccagc atcaagctga ccgcccagag agcgcagagc cccgccacgc ccaggccctt 361 taccatcgcc acttctttct ggacgtcggc gcgcagctcg gacttcgcca gctcgatctc 421 cttcttcgcg agctccttct cctccgccat gtcgctctcc tccaatcgct gtgctcgctc 481 gagcagcggc tcgtctccgc gcagcgcgcg gtcaagctgc aaaagcagcc ggtagatgat 541 ctgccactcg tcccagggca ggttcaggcg cgagagctgc gcggacagct cgatcaggcg 601 ccgctcgaga atgccggaac gccggtgcgc atgcggctgc tcgagataga tggcccagac 661 gcgcttggcg tgcgcctcca gctcccgagc tccggggtcg aaggtctgga cgatgggccg 721 ggggccgtac acctcggcgc agagggcgcg ggcgtggccg acgagcacgg gcacgccgcg 781 caggatggtc tcgttgggaa ggacggtcac ctcgagctcg tcgttgcgcc ggtactcgac 841 gtgctcgctc gccatcgccg caaacggccg gccgccgatg cgcaggagga tcttcgccgg 901 cgccgtctgg taccagggcg catccgccgg gcgcagcgcg atgccgatgc gctccagcac 961 ttccgccatc gccgccatcg cccctgcggt gatcgtcttg tccatcaccg agggaatccg 1021 caccacttcc cgcctgcgga tcacggtcac gatgcggagg atcggggcga ccaccaccat 1081 gaggaggaag gcggaggcga gcgcgagcgt gacggggaaa ccgcgaagga acttcttcca 1141 gcccggctcc ggcatcgtgt cccgggaacc gcgcttcgcc accaccgtcc cgacggcgat 1201 cggcaccgcg accgcgagcg cgatccagat caggcgcagc cagaaaccgg gcaccgattt 1261 cgagagcgga acgaaagcga tcaccagcaa ggcgatcttc gggacgatca cgccgagaag 1321 caggatcggc catgccgccg ccgcggcgac caccaccgag agcatcgtct gctctttggg 1381 agaggtgctg ccgaagagcg cgatcaccgc ccagccgaag atggcgttca gtgccttgcc 1441 cgccgacttg gccagatagc tgatcagcgc ttgcaggatg gccacgcgcc aaaactaggg 1501 cctgacgcag gacttgggca ggctgctcgg ccggtgggcg gcgctcagga gaggccgtcc 1561 tcgtaacgct tagaagaggc cggtctcgta acccatcgag gcgtgcagga ggacgtcgag 1621 aacgatgtcc gtcgccccgt ccgtggtccc gctgatgatc gcgtagaact tgccgccgtc 1681 gtgcaggcgc tcgagcaaga aggcatccac cgccgggttg cctttgatcc gcttggtgct 1741 ctggaccagc accgggaaat tcggcaggtc gccgaccttc acttcagtgg aggggtcccc 1801 caggtctttg cgcaacgtga gcgtaccgga cagggtcgcc gccacgttgc ccggcttgac 1861 ctccgtgatg gtcacatcca cggactcgag atcgaggctg cggatgtcgt tgcgatgcgt 1921 ccggatgtcg ttggagttgc ccaggtcgac cagcgtgctc gtggaagcgg cgccggctgc 1981 cgaggtcacg agagggatcc ccggctggtc catatcgaag ctgaggttca ccacgcacgc 2041 ggcgagcgcg gccaggcaaa ctgcggcgag cgcaagtcgt cgaacgattc ggcggcccat 2101 gcttctcccc tctcggaggg tcagcttgcc acagatcccg cgccggacgg atttctccgg 2161 cgcgccggag cggaaaccag tccccgagtg actcgcgcat ccgcctctcg gccacaaaac 2221 tccagcgatg ctcccgattt ggaggtcaac gtcccctcgg gccgccaggg cgccccgggc 2281 accgcacgcg agcgcacctg aacctccatc gacgcgaccg ccgtgtggac gcagtgccgc 2341 agcgacccgt cggcatcggg gtactgcacc tgcaccatct gctccaccgt ggcgcgcacc 2401 acgccgtcga tcgaaagccg cgcatcctgc gcggagaagt gccaggccgg cgcggcgggc 2461 cgcgagaccg agagcaacat ccagggcagc tccccgaatc ggtgcgtgcc gccgcggaag 2521 cgcagcacga agagcgggag ccggacgttt cgcggtcctt cgacgtcgag cagctcgaca 2581 gaggcgtccg ggtcctccgt gaatccggag cagcgggccc aggcccaggc cggccagtac 2641 gaccgccccc acagatgccc ttgcaacgca ggcatccgct gcagctcgct catccgcccg 2701 tccacttcca ccgctccact gagcgtggtt ttggcgtgcg ggagaacgta cccgctgagg 2761 cgcagccgcg cgatcggcga cagccaggcc ggaatcacat cctccgccgg agcgccctgg 2821 ccgaacgaga gccgccagcg gagcgaatga ccgcccgcct ccacttcgcc cgtgcagccc 2881 gattccgtca gccccgccgc cccgaaggcg acgcccgtgc ggctgattgt cgcatcatcc 2941 accatgttcc ggatgacgaa cgtgcgctcg gacgcatcgc gctcgaacca gcttccccag 3001 atggaggaat cgggagcacc cgctctgtcg aggtcgacgc cgtaccggag ccacaaaccg 3061 taaccggctt tcgcatcgct tccggaaaca aaccacgcct cgtggaagcc gcccttggaa 3121 acccgcggcc agcggggcga attgagcgat tcggcatcca cgcgacaccg ctacgccccc 3181 gcgaaaacac cgtcaagcga gcgggcgatg gagccgtgcc gcgcttgact tggaccccct 3241 cccccgtaga ctgtgggctc tcgatttttc cagagttttc gagggagcgg atgccctttc 3301 atttcgccaa tgtcttgctc ttttcggcga ccgccatcct gttcgttttc gggtcgctcg 3361 tcgccgggca tttcctccgt ccccacgccc ccacccgcga gaaatcgctg atctacgagt 3421 gcggggagaa gccgatcggg caggcctggt tcaatttcaa cccgcgcttc tacctggtgg 3481 cgctggtctt cgtgatcttc gaggtggaga tcgcgttcat gttcccggtc gccgcggtct 3541 acaaggcctt cctcgacaag ggccagggct tgctcgcttt cgccgagatc ttcgtgttcg 3601 tggcgatcct cgccgtcggc ctcgcctacg tctgggcgat gggcgatctc gagtgggtga 3661 agggactgaa gcaggacgtg ctgcagtcgc tccgggagcg gcagcagaag cccgtcgaga 3721 ccgctggaaa gaaggcagcg tagatgccga aggattacca gcaggatttc gggtcgatca 3781 tcaccagcaa gacctcgttc atcgacgaga tcatcaacct ggcgcgcaag agcagcatct 3841 tctatctgct cttcgggctc gcctgttgcg gcatcgagct gatgcagacc ggcggaccgc 3901 gcgcggacat cgatcggttc gggtccgtgc cgcgcgcgac gccgcgccag agcgacttga 3961 tgatcgtggc cgggacgttg acctacaaga tggccgagcg ctgccggctg ctctacgagc 4021 agatggcgga gccgaagtat gtgatctcga tgggctcctg cgcctcttgc ggcggcctct 4081 tccagtacgc ctactcggtg tgcaagg //