LOCUS VBIR01000024 6280 bp DNA linear ENV 28-MAY-2019 DEFINITION Chloroflexi bacterium isolate CF_135 14_0929_05_40cm_scaffold_16592_e:1378, whole genome shotgun sequence. ACCESSION VBIR01000024 VBIR01000000 VERSION VBIR01000024.1 DBLINK BioProject: PRJNA449266 BioSample: SAMN11380505 KEYWORDS WGS. SOURCE Chloroflexi bacterium (soil metagenome) ORGANISM Chloroflexi bacterium Bacteria; Chloroflexi. REFERENCE 1 (bases 1 to 6280) AUTHORS Diamond,S., Andeer,P.F., Li,Z., Crits-Christoph,A., Burstein,D., Anantharaman,K., Lane,K.R., Thomas,B.C., Pan,C., Northen,T.R. and Banfield,J.F. TITLE Mediterranean grassland soil C-N compound turnover is dependent on rainfall and depth, and is mediated by genomically divergent microorganisms JOURNAL Nat Microbiol (2019) In press PUBMED 31110364 REMARK Publication Status: Available-Online prior to print REFERENCE 2 (bases 1 to 6280) AUTHORS Diamond,S. and Banfield,J.F. TITLE Direct Submission JOURNAL Submitted (01-MAY-2019) Earth and Planetary Science, Jill Banfield's Lab at Berkeley, University of California, Berkeley, CA 94720, USA COMMENT The annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP). Information about PGAP can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: IDBA_UD v. 1.1.1 Genome Representation :: Full Expected Final Version :: Yes Genome Coverage :: 7x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 05/16/2019 04:00:12 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline (PGAP) Annotation Method :: Best-placed reference protein set; GeneMarkS-2+ Annotation Software revision :: 4.8 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 2,967 CDSs (total) :: 2,941 Genes (coding) :: 2,876 CDSs (with protein) :: 2,876 Genes (RNA) :: 26 tRNAs :: 24 ncRNAs :: 2 Pseudo Genes (total) :: 65 CDSs (without protein) :: 65 Pseudo Genes (ambiguous residues) :: 0 of 65 Pseudo Genes (frameshifted) :: 31 of 65 Pseudo Genes (incomplete) :: 32 of 65 Pseudo Genes (internal stop) :: 2 of 65 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..6280 /organism="Chloroflexi bacterium" /mol_type="genomic DNA" /isolate="CF_135" /isolation_source="temperate grassland biome" /db_xref="taxon:2026724" /environmental_sample /geo_loc_name="USA: Angelo Coast Range Reserve, CA" /lat_lon="39.74 N 123.63 W" /collection_date="2014-09-03" /metagenome_source="soil metagenome" /note="metagenomic" gene complement(<1..535) /locus_tag="E6J15_01270" CDS complement(<1..535) /locus_tag="E6J15_01270" /inference="COORDINATES: protein motif:HMM:PF00994.22" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="molybdopterin molybdotransferase MoeA" /protein_id="TMC79180.1" /translation="MRIMTGAPMPDGADTVVRVEDTDNASDLVTITAATPKGISTRQA GEDLRKGETVLTKGTVLRAAEIGLLASVGRGRVQVRKRPRVAVFSTGDEIVDLDRPLG RGQIRDSNRYTLASAIRAAGAEPWVRGIVRDSPDALRAAFREVVAADAIVTSGGVSVG DHDHLKPVLSELGTIDFW" gene complement(706..1755) /locus_tag="E6J15_01275" CDS complement(706..1755) /locus_tag="E6J15_01275" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006134305.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter ATP-binding protein" /protein_id="TMC79181.1" /translation="MLTAHVAVRIGTLDLDVAIDAAPGEIVAVLGPNGAGKSTFLRAV AGLVRLDAGRVELDEIVLEDAARGIQLPPEHRPIGVVFQDYLLFPHLSALENVAFGLR ARGVAAREARTRARSWLDRLGLGDHADAKPRALSGGQAQRVALARALAIDPRLLLLDE PLAALDASARGAVRRDLRRHLASFAGIRIVITHDPLEAVALADRLVILERGRVVQTGS PADVTQRPRSRYVADLVGVNLLRGTATGGQVVVSGGASLQSADGADGEVFAVIHPRAV ALHRARPEGSPRNVWRGRASALDFQGDRVRVGIEGEMPIVAEVTPAAVRELDLAEGGE VWVSVKATEITVYPA" gene complement(1755..2540) /gene="modB" /locus_tag="E6J15_01280" CDS complement(1755..2540) /gene="modB" /locus_tag="E6J15_01280" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012853150.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="molybdate ABC transporter permease subunit" /protein_id="TMC79182.1" /translation="MRSERPPILLAGLAALGIAFFVLPLVGLLLRAPWQTALSDLTAP EAVTALRLSLVVSLAATAVALVLGVPLAWVYARVPFPGRDVVRALTTLPMILPPVVGG VALLFAFGRRGLFGQTLDAMFGIRLPFTTAGAVLAATFVAMPFLVLTVEAGLRSMDRR YEDAARTLGAGRWHVFRRVTLPLIAPSVFAGAVLCWARALGEFGATITFAGNLPGTTQ TLPLAVYIALETRPEVAIMLSLVLLAVSLAILIVMRDRYLRAF" gene complement(2537..3295) /gene="modA" /locus_tag="E6J15_01285" CDS complement(2537..3295) /gene="modA" /locus_tag="E6J15_01285" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017934502.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="molybdate ABC transporter substrate-binding protein" /protein_id="TMC79183.1" /translation="MRAPIPILIAVALLLGACASPTATESPQIKELTVFAAASLTDGF TKAGIEFAKSAVRVRVTFNFGSSSTLATQITNGAPADVFASADDANVQKIVDAKLADG VPTAFATNRLEIAVAAGNPRRIGSLADLAQSGVVLVLAAPTVPAGKYALDALTKAGVN AEPVSQEVDVRAVLNKVSLGEADAGIVYVTDVKSAGSRVTGVEIPEQQQVVARYPIAV VKGSKNAALAHRFVDYLVSPAGQSVLAEFGFSKP" gene complement(3301..3735) /locus_tag="E6J15_01290" CDS complement(3301..3735) /locus_tag="E6J15_01290" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019735567.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="helix-turn-helix domain-containing protein" /protein_id="TMC79184.1" /translation="MPQKAVKPIDQAVRIGEAAELLGVSVDTLRRWTASGRLRVRRSA GGQRLVALADIRRLQEDRRKRARPIVAQSARNRFPGVVTRVDKDRVAAVVEVQAGPHR LVSLMTAEAATELELQPGAVVVCVVKATNVVVEIARRRRGST" gene complement(3764..4501) /locus_tag="E6J15_01295" CDS complement(3764..4501) /locus_tag="E6J15_01295" /inference="COORDINATES: ab initio prediction:GeneMarkS-2+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS-2+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="TMC79185.1" /translation="MLAAIAAERLPFAAGAFGRALQSLVEGGYVAPAAAATVCLTALG AAERMAERERWSEMLPTLVRLLGDTEPRPFPVTPEIPPPRARPVAEAYLDRVLVASVR ERIAAARDGGPAFAVVLAQLDVDGVGEATRRAMVHRAIRATLGATATLLGGDVTAYRY GDSGVAVLAPAGRDSTRGERLCTLVRARLDELMRTMTSTVRAFGAARWSVRAGQATWS DDIATTTVLLRRAEEALSTDGDRRDAA" gene 4453..5538 /locus_tag="E6J15_01300" CDS 4453..5538 /locus_tag="E6J15_01300" /inference="COORDINATES: protein motif:HMM:PF00781.22" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="TMC79186.1" /translation="MRPPRTAAAPPRSRRATLPGEPDLRRGSPEDRGGRMRSRDRSRS GGPSPLRWQKKERPDLARGARRALERVTMGERALLIVNPAAGQRHAEDGDLAECVRLL IAAGFEIDRRETGADGPTSADLAKAGVAEGFPVVIVAGGDGTVAPAAAALLETNATLG ILPFGSYMNIANGLGIPLKPIDAARVIAERRVKRADAGEVAGKVFFETCGIGLDADAF GAARLAERRRWRPAFRRVVRWATASPQRVDITVDGKTERHRVLQILIVNSPYYGWAFP VVPKADMTDGLLDVAIFPRKGRLDLIRWMIEIWRHGRPGKPPRLLQGKVIDIASPESV PVHADGQIAGRLPVTVRCCEGALRVFA" gene complement(5495..6184) /locus_tag="E6J15_01305" CDS complement(5495..6184) /locus_tag="E6J15_01305" /inference="COORDINATES: protein motif:HMM:PF13263.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="PHP domain-containing protein" /protein_id="TMC79187.1" /translation="MDRIRLDMHMHTEYSRDSRVALADFAELARKAQLGAVCVTDHDT IEGGLRLREMSTGLQVIVGEEITTADGELVGLYLETKVTPGQTAEHTIDLIHDQGGLA YVPHPFSRNRRRHLHRSTLERLAAKLDIVEVFNAREVASSSNVRALEFARAHDLPGGV GSDSHRAIEIGRAYVDVAPFATPQELLIALREGQVTGTLSGLAIHMRTWVDIGRKFMR TRVARLRSTAR" BASE COUNT 1022 a 2158 c 2225 g 875 t ORIGIN 1 cccagaagtc gatcgtcccg agctcggaga gcaccggctt caggtgatcg tggtcgccga 61 ccgacacacc gccgctcgtc acgatcgcgt ccgccgcgac gacctcgcga aaagccgcgc 121 gcagcgcatc cggcgagtca cgcacgatcc cgcggaccca cggctcggcg cccgccgcgc 181 ggatcgccga cgcgagggtg tagcggttcg agtcgcggat ctgcccgcgc cctaggggtc 241 tgtcgagatc gacaatctcg tcaccggtcg agaacaccgc gacgcgcggg cgcttccgga 301 cttgcacgcg gccgcgaccc accgatgcga gaagaccgat ctcggcggcg cgaaggacgg 361 tgcccttcgt cagcacggtc tcgcccttgc gcaggtcctc gccggcctgc ctcgtcgaga 421 tacccttcgg cgttgccgcg gtgatcgtca cgaggtcgct cgcgttgtcg gtgtcctcga 481 cccggacgac ggtgtcggcc ccatcgggca tcggcgcgcc ggtcatgatc cgcatcgcct 541 cgcctagacc gaccgctcga tcggggaatt gagcgcgtcg aggatcggcg cctccaccgc 601 ggggagggcc gggacgcccg cgagaattcg cgcgagcgcg tcgtccgcgc tcatgagctt 661 ctgcgcgggc ttggccgagg tcgcgctgac cggcatggcg cgagtttagg ccgggtagac 721 ggtgatctcg gtcgccttca cgctcaccca gacctcgccg ccctcggcga gatcgagctc 781 cctaacggcg gccggcgtga cctcggcgac gatcggcatc tctccttcga tcccgacacg 841 gacgcgatcg ccctggaagt cgagcgccga cgcgcgaccg cgccacacgt tccgcggcga 901 gccctccggc cgcgcgcggt gcaatgcgac ggcgcgcgga tggatcacgg cgaacacctc 961 gccgtcggcg ccgtcggccg attgaagcga cgcgccgccc gacaccacga cctgcccccc 1021 ggtcgcggtg cctcgcagaa gattcacgcc gacgagatcc gcgacgtatc gcgaccgcgg 1081 ccgctgcgtg acgtcggcgg gcgagccggt ctgcacgacc cggccgcgtt cgaggatcac 1141 cagacgatcc gcgagcgcga ccgcctcgag cggatcgtgc gtgatgacga tgcggatgcc 1201 ggcgaacgag gcgagatgcc gccggaggtc gcgccggacc gcgccgcgcg cggacgcatc 1261 gagcgcggcg agcggctcgt cgagcaggag caggcgtggg tcgatcgcga gcgcgcgtgc 1321 gagggccacg cgctgcgcct gcccacccga gagcgcgcgc ggtttcgcgt ccgcgtggtc 1381 gccgaggccg agccgatcga gccaggagcg agcccgggtc cgcgcttcgc gtgccgcgac 1441 gccgcgcgcg cgaaggccga aagccacgtt ctcgagcgcc gagagatgcg ggaagagcag 1501 gtagtcctga aaaacgacgc cgatcgggcg gtgctccggc gggagctgga tgccgcgcgc 1561 ggcgtcctcc aggacgattt cgtcgagctc gacgcgtccc gcgtccagcc ggacgagccc 1621 ggcgacggcg cgcaggaacg tcgacttgcc ggccccgttc ggcccaagca ccgcgacgat 1681 ctcgccaggg gcggcgtcga tcgcgacgtc gagatcgagg gtgccgatgc ggaccgcgac 1741 gtgcgccgtg agcatcagaa ggcgcgtagg tagcgatcgc gcatcacgat caggatggcg 1801 agcgatacgg caagaaggac gaggctcaac atgatcgcga cctctggacg cgtctccaga 1861 gcgatgtaga cggcaagggg cagcgtctgc gtcgtccccg ggaggttccc cgcgaacgtg 1921 atcgtcgccc cgaactctcc gagagcacgc gcccagcaga gcaccgcgcc ggcgaaaacc 1981 gacggcgcga tgagcggaag cgtcacgcgc cggaacacgt gccatcgtcc ggcgccgagc 2041 gttcgcgcgg catcctcgta gcgacgatcc atcgatcgca gcccggcctc gaccgtgagc 2101 acgaggaagg gcatcgccac gaacgtcgcc gcgaggaccg cgcccgcggt cgtgaacggg 2161 aggcggatcc cgaacatcgc gtcgagtgtt tgaccgaaca ggccccgccg tccgaaggcg 2221 aagaggagcg cgacgccacc gacgaccggc ggcaggatca tcggcagagt cgtcagcgcg 2281 cggacgacgt cccgcccggg gaacggaaca cgcgcgtaaa cccacgcgag cgggacgccg 2341 agcacaagcg ctaccgcggt ggcggcgagc gagaccacga gcgaaagccg gagcgcggtg 2401 accgcctcgg gtgcggtcag gtcgctgagc gcagtctgcc acggcgcgcg gagcagcagc 2461 ccgacgagag gcagaacgaa gaaggcgata ccgagggccg caagcccggc gaggaggatc 2521 ggcggacgtt cggatctcac ggcttcgaga aaccgaactc cgcgagcacg gactgtcccg 2581 cgggcgagac gaggtagtcg acgaaccgat gcgcgagcgc cgcgttcttc gagcccttga 2641 ccacggcgat ggggtagcgc gctacgacct gttgctgctc cggtatctcg acgccggtca 2701 cgcggctgcc cgcggatttc acatcagtga cgtagacgat gccggcgtcg gcttcgccga 2761 ggctcacctt gttgagcacg gcgcgaacgt cgacctcctg gcttactggc tccgcgttga 2821 cgcccgcttt cgtcagcgca tcgagcgcgt acttgcctgc ggggaccgtt ggcgccgcga 2881 gcacgaggac cacgccggac tgtgcgagat cggcgagaga gccgatccgt ctggggttgc 2941 ctgcggcgac cgcgatctca agccgattgg tcgcgaacgc cgtgggcact ccgtcggcga 3001 gcttcgcgtc cacaatcttc tgcacgttcg catcatcggc cgacgcgaac acatcggcgg 3061 gtgcgccgtt ggtgatctgc gtggccagcg tcgacgacga tccaaagttg aaagtcacac 3121 gaacgcgaac ggcggacttc gcgaactcga tgcctgcttt ggtgaagccg tcggtaagcg 3181 acgcggccgc gaataccgtg agctctttga tttgcggcga ctcggtcgcg gtgggtgacg 3241 cacatgcgcc gagaagcaac gcaacggcga tgaggatcgg gatgggggcc cgcatggcct 3301 ctaggtcgag ccgcggcggc gacgcgcgat ctcgaccacg acgttcgtcg ccttgaccac 3361 gcacaccacc acggcgccgg gctgaagctc gagctcggtc gcggcctcgg ccgtcatcag 3421 gctcaccaag cggtgcggcc ccgcctgcac ctccacgacc gcggcaacgc ggtctttgtc 3481 aacacgggtc acgacaccgg ggaacctgtt tcgcgccgac tgcgcgacga tcggccgtgc 3541 acgcttgcgc cgatcctcct ggaggcggcg gatgtcggcg agggccacga gacgctggcc 3601 acccgcggac cgccgcacgc gcagacgtcc ggatgcagtc cagcgacgaa gcgtatccac 3661 cgagacaccg aggagctcag cggcctcacc gattcgcacc gcttgatcta taggctttac 3721 ggccttctgt ggcatgaaag cctatgtatc atacgacgcg ctgtcaggcg gcatcgcgcc 3781 ggtcgccgtc agtcgagagg gcctcctcgg cgcggcggag cagcaccgtc gtcgtcgcga 3841 tgtcgtcgct ccacgtcgcc tgccccgcgc ggacgctcca tcgcgcggcg ccgaacgcgc 3901 ggacggtaga ggtcatcgtg cgcatcagct cgtcgagccg agcgcgcacg agcgtgcaca 3961 ggcgctctcc acgcgtggag tctcggcctg ctggagcgag cacggccacg cccgaatccc 4021 cgtagcggta ggccgtgacg tcgcctccga gcaacgtcgc ggtcgctcca agagtcgcgc 4081 gtatcgctcg gtggaccatg gcgcgccgcg tcgcctcacc gacgccgtcg acatcgagct 4141 gagcgaggac caccgcgaag gcgggcccgc cgtcgcgcgc cgcggcgatc cgctcgcgca 4201 cggacgcgac gagcacgcga tcgaggtagg cctcggcgac cgggcgcgct cgcggcggag 4261 gtatctccgg agtgacggga aacggacgcg gttcggtgtc gccgaggagg cggacgagcg 4321 tcggcagcat ctcggaccat cgctcccgct cagccatccg ctcggccgcg ccgagcgccg 4381 tgagacacac ggtcgcggcg gccgcgggcg cgacatagcc gccttcgacc agcgactgca 4441 gcgctcggcc gaatgcgccc gccgcgaacg gcagccgctc cgccgcgatc gcggcgagca 4501 acgctgccgg gcgagccgga cctgcggcga ggctctccag aagatagagg cggaagaatg 4561 cggtcgcgag accggtcgcg ctcgggcgga ccatctcccc tccgatggca gaagaaagag 4621 cgacccgacc ttgcccgtgg cgcgcgacgt gccctagagc gcgtcacgat gggagaaaga 4681 gcgctgctga tcgtgaatcc cgcggccggc cagcggcacg ccgaggatgg cgacctggcc 4741 gagtgcgtgc ggcttctcat tgctgccggg ttcgagatcg atcgccgtga gacgggcgcc 4801 gatggtccga cctcggcgga tctggcgaag gcgggagtgg ccgagggatt cccggtcgtg 4861 atcgtggcgg gtggagacgg caccgtcgct ccggcggcgg ccgcccttct cgagacgaac 4921 gccacgcttg ggatcctccc gttcggcagt tacatgaaca tcgcgaacgg acttgggatc 4981 ccgctcaagc cgatcgacgc ggcgcgcgtc atcgccgaac gaagggtcaa gcgcgcggat 5041 gcgggtgagg ttgccggcaa ggtcttcttc gaaacgtgcg gcatcggcct cgacgccgac 5101 gcgttcgggg cggcgcgact ggcggagcgc cggcgctgga gacccgcgtt ccgccgtgtc 5161 gtgcggtggg cgaccgcgag tccgcaacgg gtggacatca ctgtcgacgg aaagaccgag 5221 cgccaccgcg ttctccagat cctcatcgtg aacagtccgt actacggttg ggccttccca 5281 gtcgtcccga aggccgacat gaccgatggc ttactcgatg tggcgatctt cccgcgcaag 5341 ggaaggctcg acctcatccg gtggatgatc gagatctggc gccacggccg gccggggaag 5401 cccccgcgac tcttgcaggg gaaggtcatc gacatcgcct caccagagag cgttccggtc 5461 cacgccgacg gtcagatcgc gggccggctc ccagtcaccg tgcggtgctg cgaaggcgcg 5521 ctacgcgtgt tcgcatgaac ttgcggccga tgtcgaccca cgtgcgcatg tggatggcga 5581 ggccggagag cgttccggtc acctggccct cgcggagcgc gatcaggagc tcctgcggag 5641 tcgcaaacgg cgcgacgtcc acgtaggcgc ggccgatctc gatcgcgcgg tgcgaatcgc 5701 tgccgacccc gcccggaaga tcgtgcgcgc gggcgaattc gagtgcgcgg acgttgctcg 5761 acgatgccac ctcgcgggcg ttgaacacct ccacgatgtc cagctttgcc gcgaggcgct 5821 cgagcgtcga ccggtggagg tgccggcggc ggttccggct gaagggatgc gggacgtacg 5881 cgagcccgcc ctggtcgtgg atgagatcga tggtgtgctc tgcggtctgc cccggcgtca 5941 ctttcgtctc gagatagaga ccgacgagct cgccgtcggc ggttgtgatc tcctcgccga 6001 cgatcacctg aagacccgtg ctcatctcgc ggagccgcag cccaccctcg atggtgtcgt 6061 gatcggtgac gcagaccgca ccgagctgtg ccttgcgggc gagctcggcg aagtcggcca 6121 gggcgacgcg gctgtcacgc gagtactccg tgtgcatgtg catgtcgagc ctgatgcggt 6181 ccacgccgcg agtgtgcggc agttcgcgtg ccgcccgcca aaatcagggt cggacccgct 6241 ccgcgttgtc atcctccgcg aaggcgagaa cgccgatgac //