LOCUS PFFY01000006 4547 bp DNA linear ENV 16-NOV-2017 DEFINITION Bacterium (Candidatus Ratteibacteria) CG15_BIG_FIL_POST_REV_8_21_14_020_41_12 CG15_8_21_14_0.20_scaffold_8255_c, whole genome shotgun sequence. ACCESSION PFFY01000006 PFFY01000000 VERSION PFFY01000006.1 DBLINK BioProject: PRJNA362739 BioSample: SAMN06659577 KEYWORDS WGS. SOURCE bacterium (Candidatus Ratteibacteria) CG15_BIG_FIL_POST_REV_8_21_14_020_41_12 (groundwater metagenome) ORGANISM bacterium (Candidatus Ratteibacteria) CG15_BIG_FIL_POST_REV_8_21_14_020_41_12 Bacteria. REFERENCE 1 (bases 1 to 4547) AUTHORS Probst,A.J., Ladd,B., Jarett,J.K., Geller-McGrath,D.E., Sieber,C.M., Emerson,J.B., Anantharaman,K., Thomas,B.C., Malmstrom,R.R., Stieglmeier,M., Klingl,A., Woyke,T., Ryan,M.C. and Banfield,J.F. TITLE Differential depth distribution of microbial function and putative symbionts through sediment-hosted aquifers in the deep terrestrial subsurface JOURNAL Nat Microbiol 3 (3), 328-336 (2018) PUBMED 29379208 REFERENCE 2 (bases 1 to 4547) AUTHORS Probst,A.J., Ladd,B., Jarett,J.K., Geller-McGrath,D.E., Sieber,C.M., Emerson,J.B., Anantharaman,K., Thomas,B.C., Malmstrom,R., Stieglmeier,M., Klingl,A., Woyke,T., Ryan,C.M. and Banfield,J.F. TITLE Direct Submission JOURNAL Submitted (15-SEP-2017) Department of Earth and Planetary Science, University of California, Berkeley, 307 McCone Hall, Berkeley, CA 94709, USA COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: IDBA-UD v. 02.2016 Genome Coverage :: 10x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 11/07/2017 18:58:56 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.2 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 1,601 CDS (total) :: 1,565 Genes (coding) :: 1,486 CDS (coding) :: 1,486 Genes (RNA) :: 36 rRNAs :: 1, 1 (16S, 23S) complete rRNAs :: 1 (16S) partial rRNAs :: 1 (23S) tRNAs :: 32 ncRNAs :: 2 Pseudo Genes (total) :: 79 Pseudo Genes (ambiguous residues) :: 60 of 79 Pseudo Genes (frameshifted) :: 5 of 79 Pseudo Genes (incomplete) :: 14 of 79 Pseudo Genes (internal stop) :: 1 of 79 Pseudo Genes (multiple problems) :: 1 of 79 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..4547 /organism="bacterium (Candidatus Ratteibacteria) CG15_BIG_FIL_POST_REV_8_21_14_020_41_12" /mol_type="genomic DNA" /isolate="CG15_BIG_FIL_POST_REV_8_21_14_020_41_12" /isolation_source="groundwater" /db_xref="taxon:2014291" /environmental_sample /geo_loc_name="USA: Crystal Geyser near Green River, Utah" /lat_lon="38.56 N 110.8 W" /collection_date="21-Aug-2014" /note="metagenomic; derived from metagenome: groundwater metagenome" gene complement(83..883) /gene="lgt" /locus_tag="COW28_00105" CDS complement(83..883) /gene="lgt" /locus_tag="COW28_00105" /inference="COORDINATES: protein motif:HMM:TIGR00544" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="prolipoprotein diacylglyceryl transferase" /protein_id="PIW34275.1" /translation="MHPILFKIGNFSIYSYGVVIACSIFVVSSLILRQGKKEGFSEEE FFNLIFLVVIIGLLGARLLHIMVHLAYYLRHPIEIIAIRHGGLAIQGGVIFGLLAAIL FLRRRKLPILKTLDIFALYFPLAQAIGRIGCFLNGCCYGKEINFFLSVRFPFDNASRH PTQLYYSVSNFSIFLILFFLYRQRKNLSSASENSADLQSWIKDGDIMLFYFMFYAVSR YSLDFLRDNLEPVFFSLYPTQVISFFTFLIAGGLLILRIIRRNNKSIK" gene complement(967..1401) /gene="lspA" /locus_tag="COW28_00110" CDS complement(967..1401) /gene="lspA" /locus_tag="COW28_00110" /inference="COORDINATES: protein motif:HMM:TIGR00077" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="signal peptidase II" /protein_id="PIW34276.1" /translation="MMVIWVGLIVLFFDQLSKYIISFFLGQGQSVLLIPHFLYLTYTK NSGISFGLFKDKIPFPFYLGLSFLAMVMLIILLRRAKRNWPIRLATGLIAGGIFGNLL DRARLGAVIDFLDFRVWPIFNLADSAITIGIVILLICEIKKQ" gene complement(1459..1824) /locus_tag="COW28_00115" CDS complement(1459..1824) /locus_tag="COW28_00115" /inference="COORDINATES: protein motif:HMM:PF01258.15" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="PIW34277.1" /translation="MNKQEKEKYKKLLIKEKIRILEAIGALQKGNLESSQTGGRSPGV PNHLAELASDNFEKNLDLDLASSEGKLLAKINNALAKLDKNLFGVCEKCHKKIDQKRL RALAYAELCITCQKKKEET" gene complement(2017..4476) /locus_tag="COW28_00120" CDS complement(2017..4476) /locus_tag="COW28_00120" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_014406804.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA gyrase subunit A" /protein_id="PIW34278.1" /translation="MKKEKEIAVKERITPIEIEKEMKSSYIDYAMSVIVGRALPDVRD GLKPVHRRILYAMSELGVGPAKAYKKSARIVGEVMGKYHPHGDVAIYDTIARMCQDFS LRYPLIDGQGNFGSVDGDRPAAMRYTEARLSAVATYLLSDIEKQTINFVPNFDGTLPE PTILPSALPNLLLNGSSGIAVGMATNIPPHNLNEVIDALLYLIDQPQTSLEELMKFLP GPDFPTGGFICGKKGILEAYRTGRGIITLQGRVSTEELERGRSALIVTELPYEVNKSI LIENIANLAQEKKIEGISNIRDESDKKGMRICIELKAGENSEVVLNQLYKHTALRTSF GIINLALVKNRPRVLSLNQLLSYYLEHRQEIVRRRTQFDLKKAENRAHILAGLKIALL HIEDVIKLIRKSPAVEAAKSSLMKNFKLSSLQADSILAMPLSRLTKLERDKIDNEHKE LTKEIERLKVILSDEKKILGVIKNELKETRKRFGDKRRTKIIAPIGELFDLDFIKKED AVVTVSAAGYVKRVPVETYHRQHRGGRGIIGAGIKEEDYIKHFFVASTHDTILFFTNK GRVYWFPTYQIPEASRQSKGKAVINLLKISTDETTTAIIPLKEYSDNLFLLMATKKGI VKKTPVSAYSHQRRSGIIALTLKENDELIKVKLTDGKKNIILSTKEGKSIHFAESDIR SMGRTASGVRGIRLAKEDVVRGVEIAEENHFLLTITANGYGKRSKIKLYRKQKRGGKG IIDIKTIGRNGEVVSVKSVQDDDEIMLITAKGILIRVPIKDIRAIGRNTQGVKLINLE SGDHIADSALVLKGKEGIPNL" BASE COUNT 1257 a 972 c 955 g 1363 t ORIGIN 1 ctaattcgta aaattcgata aattcgtggt tttataactt attgttttta aatcctgtca 61 atctgcgtat atctgcgtcc tattatttta tgcttttgtt gtttctgcgg ataatcctta 121 aaattagcaa tcctccggca attagaaagg taaaaaagga aatgacctgg gtgggataaa 181 gggagaaaaa gaccggttct aagttatccc ggagaaagtc taggctatac cgggagacag 241 cgtagaacat aaagtaaaaa agcataatat ccccatcttt tatccaagac tgcaagtccg 301 ccgaattctc actggcggaa gaaagatttt tcctttgccg gtaaagaaag aacagaatca 361 agaaaatgga aaaattgctc acactatagt aaagttgagt gggatgtcgg ctggcattat 421 caaaaggaaa tctgacgctt aagaaaaaat ttatttcttt accataacag caaccgttaa 481 gaaaacagcc aattcttccg attgcctgag ccaaaggaaa gtaaagggcg aaaatatcca 541 atgtctttaa aatcggtaat tttcttcggc gcaggaacaa tattgccgcc aaaagaccga 601 agataacacc cccctggatg gccagtcccc catggcggat agcgataatt tcgatcggat 661 ggcgaagata ataagcaaga tgaaccatga tatgcagtaa ccttgctcct aaaagtccaa 721 taataaccac caaaaagatg aggttgaaga attcttcttc gctaaatccc tcctttttcc 781 cctggcggag aatcagggaa ctaaccacaa aaattgaaca ggcaataacc acgccatagg 841 aatagatgga aaaattacca atcttaaata aaataggatg catttttctt tttaaaccac 901 gaattatacg aattaatcga atttatatta ttctttaatt cgcgtaatta gagaaattcg 961 tggttgctat tgtttcttta tttcgcagat taacagaatt acaattccaa tagtgatggc 1021 gctatcggct aaattaaaaa tcggccagac tctaaaatca agaaaatcaa tcactgcccc 1081 taatctggcc cggtcaagca ggttaccaaa gattcctccg gcgataagcc cggtcgccag 1141 ccttattggc caattcctct tggcccggcg caagagaatt ataagcataa ccatagctaa 1201 gaaagataat cccagataga acggaaaagg aattttgtct ttaaagagac cgaaagatat 1261 cccggaattt ttagtatagg taagataaag aaaatgagga attaacaaaa ccgattgacc 1321 ctggcccaga aaaaagctga ttatgtattt gcttaattgg tcaaaaaaga gaacaataag 1381 cccgacccag ataaccatca ttaaccacag gaataagaat taagtaataa gaaataagta 1441 acttaatcct taattctatt atgtttcttc tttttttttc tggcaggtga tgcacaattc 1501 cgcataagcg agggcgcgca gtctcttttg gtcaattttt ttatggcatt tttcgcagac 1561 cccgaatagg tttttatcaa gttttgccag agcattatta atcttagcca gtagtttgcc 1621 ttcggaggag gcaaggtcca ggtccaaatt tttctcaaag ttatcgctgg ctaattcagc 1681 taaatggttg ggaactcccg ggcttcgtcc accggtttga cttgattcca aattgccctt 1741 ctgcaaagct ccaatcgctt ccaggattct gattttttcc ttgattaata attttttgta 1801 tttctctttt tcctgcttat tcatttttct cctctgtagc ggtcggagtt gctgtcagta 1861 tcatcaagaa gttaatctac tcgcatattg ctactgaaga tagcagaaag tatagtagcg 1921 caatttatcc gactattttg tgcgataaat cgcaccgcta caacaagtgc gataaatcgc 1981 accgctacaa caagtgcgat aaatcgcacc gctaccttaa agattgggaa taccttcttt 2041 tccttttaag acgagggcgc tatcagcaat atgatccccg gattccaggt taatcaattt 2101 tacgccttga gtatttcttc caatggcgcg aatatccttg atgggaacgc gaatcaaaat 2161 tcccttcgcg gtaatcagca taatttcatc gtcgtcctgc acgcttttga cgctgaccac 2221 ttctccgttc cgcccgatag ttttaatatc aataatgccc ttgccgcctc ttttttgctt 2281 ccggtaaagt ttgattttgc tgcgcttccc gtaaccgtta gcggtgatgg tgagaaggaa 2341 atgattttct tcggctattt ctacccctct gaccacgtct tcctttgcca gcctgatgcc 2401 gcgcacgcca gaggcggttc ggcccataga acggatgtcg ctttcagcaa aatgaatgga 2461 tttaccctct ttagtggaga ggatgatatt cttcttcccg tcggtaagtt ttaccttaat 2521 caattcatca ttctctttta gggtgagggc aataattccg cttcgtcttt ggtggctata 2581 agcggaaacc ggagtttttt tcactattcc tttcttggtg gccatcagga gaaagaggtt 2641 gtctgagtat tccttaaggg ggatgatggc ggtagtggtt tcatccgtac tgattttaag 2701 gaggttgatc acggcctttc cctttgactg ccggctggct tccggaatct ggtaggtggg 2761 aaaccaatag accctgccct tattagtaaa gaataaaatg gtatcgtgag tagaagcgac 2821 aaagaagtgt ttaatataat cctcttcctt tattccggcg ccaatgatgc ctcttcctcc 2881 ccgatgttgc cggtggtagg tttctacggg aacgcgtttc acgtaacctg ccgccgatac 2941 cgtaaccaca gcatcttctt tcttgatgaa gtcaagatca aaaagttccc ctatcggagc 3001 aataatcttg gtcctgcgtt tatcgccaaa tcgttttctg gtttctttca actcgttttt 3061 aattacccct aaaattttct tttcatcaga taggatgact ttaaggcgtt caatttcttt 3121 ggttagttct ttgtgttcgt tatctatttt atcccgttcc aatttagtaa ggcgggaaag 3181 gggcattgct aagatgctgt cagcctggag ggaagagagt ttaaagtttt tcatcagaga 3241 ggatttggcg gcttctacgg ccggagactt cctgatgagc ttgattacat cttcaatatg 3301 cagaagagca atctttaaac cggcaagaat gtgagcgcga ttttccgctt tcttcaagtc 3361 aaattgggtg cgccgcctta ctatctcttg gcgatgttct aagtagtaac tgagtagttg 3421 gttgagagaa agaacgcgcg gccggttttt gactaaagct aaattgataa tgccaaaact 3481 ggtccgcaga gcggtatgtt tgtaaagttg gtttagaacc acctcggaat tttcgccggc 3541 ttttaattca atgcaaatcc tcatcccttt tttatctgat tcatcccgga tgttgctgat 3601 tccttcaatc ttcttctcct gagcaaggtt ggcgatattt tcaataagga tgcttttatt 3661 tacttcgtaa ggaagttcgg taacgattaa tgcgctcctg cccctttcta attcttctgt 3721 gcttaccctg ccttgaaggg tgataatacc tctcccggtg cggtaagctt ccagaatacc 3781 ttttttgccg cagataaaac ctccggtggg aaagtccggt ccgggcagaa acttcattaa 3841 ttcttccaga gaggtttggg gttggtcaat cagatagagt agagcatcaa ttacttcgtt 3901 aaggttgtgg gggggaatgt tagtggccat acctacggca atacccgagg agccgttaag 3961 caaaagattg ggaagagccg aggggagaat tgtcggttca ggtaaggttc cgtcaaagtt 4021 agggacaaaa ttgatcgttt gtttctcaat atcggaaagg aggtaagttg caacggcaga 4081 gagccttgct tcggtatatc tcattgctgc gggacggtct ccgtcaaccg aaccaaaatt 4141 tccttgtccg tcgataaggg ggtatcttaa agagaaatcc tggcacatcc gggcaatggt 4201 atcatagatt gcgacatccc catggggatg atatttcccc atcacctctc cgacgattct 4261 ggcgctcttc ttgtaagctt tggctgggcc tactcctaat tcagacattg cgtaaaggat 4321 tctgcgatgg accggcttta atccatcccg aacatcggga agagctcttc ccacgatgac 4381 gctcatcgca tagtcaatat aggaggactt catctccttt tcaatctcta tcggcgttat 4441 tctttcctta acggctattt ctttttcctt tttcatagtc tattttcttc tatttagcca 4501 cgaattaaca cgaataaaaa aacaatattc accacagagg cacaaag //