LOCUS       PGXL01000028            4410 bp    DNA     linear   ENV 13-DEC-2017
DEFINITION  Tenericutes bacterium HGW-Tenericutes-2 bjp_ig2102_scaffold_7791,
            whole genome shotgun sequence.
ACCESSION   PGXL01000028 PGXL01000000
VERSION     PGXL01000028.1
DBLINK      BioProject: PRJNA321556
            BioSample: SAMN06767762
KEYWORDS    WGS.
SOURCE      Tenericutes bacterium HGW-Tenericutes-2 (groundwater metagenome)
  ORGANISM  Tenericutes bacterium HGW-Tenericutes-2
            Bacteria; Tenericutes; unclassified Tenericutes.
REFERENCE   1  (bases 1 to 4410)
  AUTHORS   Hernsdorf,A.W., Amano,Y., Miyakawa,K., Ise,K., Suzuki,Y.,
            Anantharaman,K., Probst,A., Burstein,D., Thomas,B.C. and
            Banfield,J.F.
  TITLE     Potential for microbial H2 and metal transformations associated with
            novel bacteria and archaea in deep terrestrial subsurface sediments
  JOURNAL   ISME J 11 (8), 1915-1929 (2017)
   PUBMED   28350393
REFERENCE   2  (bases 1 to 4410)
  AUTHORS   Probst,A.J., Ladd,B., Jarett,J.K., Geller-McGrath,D.E., Sieber,C.M.,
            Emerson,J.B., Anantharaman,K., Thomas,B.C., Malmstrom,R.,
            Stieglmeier,M., Klingl,A., Woyke,T., Ryan,C.M. and Banfield,J.F.
  TITLE     Direct Submission
  JOURNAL   Submitted (01-NOV-2017) Department of Earth and Planetary Science,
            University of California, Berkeley, 307 McCone Hall, Berkeley, CA
            94709, USA
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            
            ##Genome-Assembly-Data-START##
            Assembly Method       :: IDBA-UD v. 03.2016
            Genome Coverage       :: 10x
            Sequencing Technology :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 11/21/2017 08:25:02
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS+
            Annotation Software revision      :: 4.3
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 2,214
            CDS (total)                       :: 2,166
            Genes (coding)                    :: 2,154
            CDS (coding)                      :: 2,154
            Genes (RNA)                       :: 48
            rRNAs                             :: 2, 1 (5S, 23S)
            complete rRNAs                    :: 1 (5S)
            partial rRNAs                     :: 1, 1 (5S, 23S)
            tRNAs                             :: 42
            ncRNAs                            :: 3
            Pseudo Genes (total)              :: 12
            Pseudo Genes (ambiguous residues) :: 1 of 12
            Pseudo Genes (frameshifted)       :: 1 of 12
            Pseudo Genes (incomplete)         :: 7 of 12
            Pseudo Genes (internal stop)      :: 4 of 12
            Pseudo Genes (multiple problems)  :: 1 of 12
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..4410
                     /organism="Tenericutes bacterium HGW-Tenericutes-2"
                     /mol_type="genomic DNA"
                     /isolate="HGW-Tenericutes-2"
                     /isolation_source="groundwater"
                     /db_xref="taxon:2013845"
                     /environmental_sample
                     /geo_loc_name="Japan: Horonobe URL"
                     /lat_lon="45.045278 N 141.859444 E"
                     /collection_date="2014"
                     /note="metagenomic; derived from metagenome: groundwater
                     metagenome"
     gene            356..1501
                     /locus_tag="CVV57_10720"
     CDS             356..1501
                     /locus_tag="CVV57_10720"
                     /inference="COORDINATES: protein motif:HMM:PF13534.4"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="PKK97745.1"
                     /translation="MKVTHKKFNGGYRFKKFEGQPLTKVVDFLPSSKVETATEDITNN
                     DDIMEFLKSFNLTIKKGINEALTDEDVHIQKESIGKIIISMTQVEPYDFPIDIFFEKD
                     YANDFIQGIKILKDKLPNVQIHLVLCGDKLKKHETVISEIMNIDGVQVSTVVNKYPIN
                     KKELLIPTLLDMKYPIGYPSANIGVIMMEPYRIIQLYRYYKEKKPTTHYMVALAGLAW
                     KDNLVLNLPLGTPIKEITNEYLNENIQVRLISNSLMTGLTLSMEDKIDKDTSLLIALP
                     EGVKGGHGIDTGLHGTVRACISCGQCQNVCPVGLIPHLLHKHVEQGLVNESLANLRIF
                     DCIECNLCNYGCPSKIDLVGSIKKGKEQLEEMEISHKDYQLKDIKEV"
     gene            1506..2510
                     /locus_tag="CVV57_10725"
     CDS             1506..2510
                     /locus_tag="CVV57_10725"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_013277356.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="RnfABCDGE type electron transport complex
                     subunit D"
                     /protein_id="PKK97746.1"
                     /translation="MGKNDKSLIFRPIKQTIESHFKGSNKVSSGNPHIRSNINLKHFM
                     MIAVVALLPSTFAAIYYYGIRMLYLIGFCYLVAFLVEWAFAIFRKEDINEGILVTGLI
                     FPLTLPPTIPFWIAGVGMAFGIFFGKEVFGGTGRNIFNPALVGRLFITIAFPTYMSSM
                     WINPSSPDAVTSATPLIFLRANETLPFSLWDLMIGTAPGSIGETFRIGIIIAGLFLIL
                     TRVIHWRVPIIYLGTVVVASYLGHLLLPSEVVQPVYQLLTGGLLFGAFFMATDPVSSP
                     YTKAGQAVYGIGLGILTIVIRSFSGFAEGVMFAIILMNAFTPMIDAYVIDKKFKPLSE
                     "
     gene            2523..3098
                     /locus_tag="CVV57_10730"
     CDS             2523..3098
                     /locus_tag="CVV57_10730"
                     /inference="COORDINATES: protein motif:HMM:PF04205.12"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="PKK97747.1"
                     /translation="MKEQLKMILFTIILGLASSGILMGMDAYTSDKIIENEELAFKKS
                     VLKAFEIDYEASEIVDVFSNQIERIEKDGYNFYESSSGAIGFEFEGSGLWGPIAGFLT
                     LEEDLVTIQDIQIMEQSETPGLGGIIAEADYLAKYKGKVFDPDIVVVKANDQENAINE
                     VDAITGATGTSRAFETLLNAAYKTKKEVLVN"
     gene            3100..3729
                     /locus_tag="CVV57_10735"
     CDS             3100..3729
                     /locus_tag="CVV57_10735"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_020257847.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="NADH:ubiquinone reductase (Na(+)-transporting)
                     subunit D"
                     /protein_id="PKK97748.1"
                     /translation="MRLIRLKYTKWYQLLSKGIFKDNPIYAMALGICSALAVTNRVEN
                     AIAMGLGVTFVLMASSMSTSLIRKFIPAKVRMVTYMVLISTFVIAFQGFLQAYFFDLS
                     KSLGAYTGLIITNCIVMGRAEAFAIKNPIHFSGLDALANGLGYMFTLIAISVVREILA
                     FGTLLGIQVVGDGFVTWTVMAMAPGAFFVMAIYMWIMRTIAKLDTTSAS"
     gene            3745..4326
                     /locus_tag="CVV57_10740"
     CDS             3745..4326
                     /locus_tag="CVV57_10740"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_013277359.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="NADH:ubiquinone reductase (Na(+)-transporting)
                     subunit E"
                     /protein_id="PKK97749.1"
                     /translation="MEGLINIFMASVLTHNIALIYILGMCPLIAISKNLKTAKGMGAS
                     VILVITLTAAINWPIYQLLKNNNAESISLLVFIITIAATVQFLEIFLDKYLPALYNAF
                     GIFLPLITVNCAVLAVSLFMVNRTFGFFETVFFGFGSGTGWALAITIIAAIREKMALV
                     SKVPDGLQGAGIVMIIAGIISLGFMGFIGVVGF"
BASE COUNT     1367 a    718 c    909 g   1416 t
ORIGIN      
        1 attctctagg aaagtcatac tctagctttc tgccttgaaa cgtagcagaa agcgtcaagg
       61 acctttcctg ccggaacaat ttgctttgca aaacgaggca acgcctcttt tcttgattcc
      121 aagaacttca tagtcgactt gggcaaaagc taaatgaata ataatcatac tcaacaacta
      181 cttattaact ttatcttgat aatagtgtat aattatggtt aaagtgtcat aattaagtga
      241 actaatttaa cgatatattg tagataaaac ttcaactata tagttttgac acttaaccat
      301 taaaaccata aaaatttaaa gatagtattt agaatttata ttagggaggc caaacatgaa
      361 ggtcacacat aaaaaattta acggcgggta ccggtttaag aagtttgaag gtcaacctct
      421 aacaaaagtc gtagattttc tgccaagttc taaggtagag actgctacag aagatattac
      481 caataatgat gatattatgg agtttcttaa gtcttttaat ttgacaatca aaaagggtat
      541 aaatgaggca ttaacagatg aggatgttca tattcaaaaa gaaagtattg gtaagattat
      601 tatcagtatg acccaggtcg aaccgtatga ctttccaata gatatttttt ttgagaaaga
      661 ttatgcaaat gactttatcc aaggtattaa aatcttaaaa gataaattac cgaatgttca
      721 aatccatctt gtgctatgtg gggataagct taaaaagcat gagacagtaa tatctgaaat
      781 catgaacata gatggggtac aagtctctac agttgtaaat aagtatccta ttaataaaaa
      841 ggaattactt atacctacct tattggatat gaaataccca atcggttatc cttcagctaa
      901 tattggcgtt attatgatgg aaccatatag aatcattcag ctatatcgtt attataaaga
      961 aaaaaagcca acaactcatt atatggttgc attggctggt ttggcatgga aggataatct
     1021 ggtacttaat ctgcccctag gaaccccgat aaaagaaata accaatgagt atctgaatga
     1081 aaatatacaa gtccgtctaa ttagtaatag ccttatgaca gggcttaccc tctctatgga
     1141 ggataagatt gataaggata catctctttt aattgctttg cccgaagggg ttaaaggtgg
     1201 ccatggaatc gatacaggcc tacatggtac tgttagagcg tgcatatctt gtggtcaatg
     1261 tcaaaatgta tgtcctgtag gtttgattcc acatttgctc cataaacatg ttgaacaagg
     1321 gttggtgaat gaaagcttgg caaacttgag aatttttgat tgtattgaat gcaatctttg
     1381 taattatgga tgtccttcaa aaatagactt ggttggtagt attaaaaaag gtaaagaaca
     1441 acttgaagaa atggaaatat ctcataaaga ctatcaatta aaagacataa aggaggtgta
     1501 atcagatggg aaaaaatgat aaaagcctca tatttagacc gataaaacaa acaattgaat
     1561 ctcatttcaa aggatcgaac aaggtttcat cagggaaccc tcatatacga agtaatatca
     1621 acttaaagca tttcatgatg atagctgtgg ttgcgctact gccttcgacc tttgcagcaa
     1681 tttactatta tggcataagg atgctttacc tcattggatt ctgttatctc gtcgctttcc
     1741 ttgttgaatg ggcctttgca atctttagaa aagaggatat caatgaaggg attttggtga
     1801 caggtttgat attccctttg actttaccac caacgattcc attttggatt gccggtgttg
     1861 gtatggcatt tggtatattt tttggcaagg aagttttcgg tggcacaggt cgaaacattt
     1921 tcaatccggc attagtgggc agactgttta ttacaatagc ttttcccacg tacatgtcat
     1981 ctatgtggat caatccaagc agtccagatg cggttacgtc tgccacacca cttatttttt
     2041 taagagctaa tgaaaccctt cctttctcac tttgggattt aatgataggc accgctccgg
     2101 gatctattgg agaaaccttt agaatcggta ttatcattgc ggggctattc cttatactaa
     2161 ccagagtcat tcattggaga gtaccgatta tttatttagg cacagtagtc gttgcttctt
     2221 atttaggaca tcttctatta ccgagtgaag tggttcaacc cgtctatcag cttctaacag
     2281 gtggtctgct atttggtgcc tttttcatgg ctacggatcc ggtttcatct ccctatacaa
     2341 aggcgggaca agctgtttat gggataggtc ttggaatttt gaccatagtc ataagaagct
     2401 tttcgggctt tgcagaaggt gtcatgtttg ccattatact catgaatgct tttacaccta
     2461 tgatagatgc atatgttatt gataagaaat tcaaaccctt atcggaatag gcaggtgatg
     2521 atatgaaaga acagttaaag atgatactat ttacgattat tcttggtctt gcatcttcag
     2581 ggatactcat gggaatggat gcatatacaa gtgataaaat tattgaaaat gaggagcttg
     2641 cttttaaaaa atccgtttta aaagcatttg agatagatta tgaagcctca gaaatcgtgg
     2701 atgtctttag taatcaaata gagaggattg aaaaagacgg ttataatttc tatgagtcaa
     2761 gttcaggggc tatcggtttt gagtttgaag gatcaggcct ttggggccca atagccggat
     2821 ttttgacact tgaagaagac cttgttacca tacaagacat tcagattatg gaacaatcag
     2881 aaactccggg gcttggaggt atcattgccg aggctgatta tcttgcgaag tataaaggca
     2941 aagtttttga ccctgatata gtggtggtta aggcaaacga tcaagaaaat gccattaacg
     3001 aagtggatgc catcactggt gctactggaa ccagtagagc atttgaaact cttctaaacg
     3061 ctgcttataa gacaaaaaag gaggttctgg tgaactagta tgagactaat aagattaaaa
     3121 tatacgaaat ggtatcagtt actaagtaaa ggcatattta aagataatcc tatttatgct
     3181 atggccttag gtatatgctc cgcacttgcg gtcaccaata gggttgaaaa cgccattgct
     3241 atgggactag gtgtaacctt tgttcttatg gcaagttcta tgtcaacgtc tcttatacgt
     3301 aagtttatac ctgctaaagt acggatggtt acctatatgg ttctgatttc aacgtttgtt
     3361 atagcgtttc agggatttct acaagcctat ttttttgatt taagtaagtc cctaggtgcc
     3421 tatacaggat taattatcac caattgtatt gttatgggaa gagccgaggc ctttgcgatt
     3481 aaaaacccca tacacttttc tggtcttgat gcacttgcca atggtcttgg atatatgttt
     3541 acattgattg caatatccgt tgttagagag atactggcat ttggtacgtt actgggcata
     3601 caagtggttg gcgatggttt tgtcacatgg acggttatgg ctatggcacc tggagcattt
     3661 ttcgttatgg ccatttatat gtggatcatg aggacgattg caaagcttga tacaacatca
     3721 gcatcataga aaggggatag gcttatggaa ggattaatca atatttttat ggcatcagtt
     3781 ttaacccata atattgctct tatatatata ctgggtatgt gtcccttaat tgcgatttca
     3841 aagaatctaa aaacagcaaa aggcatgggc gcttcggtta ttctagtcat aacactgacg
     3901 gcagccatca attggcctat atatcaattg cttaagaaca acaatgccga gagcatcagt
     3961 ttattggtat ttattattac aattgcggct acagtacagt ttcttgagat tttcttagac
     4021 aaatacttgc cggcgttgta caacgctttt ggcatatttc ttcctttaat tacagtaaac
     4081 tgtgccgtac ttgccgtttc tttgttcatg gttaacagaa cttttggatt ttttgaaaca
     4141 gtattttttg gttttggatc aggaacaggt tgggcccttg ctattaccat catagcagca
     4201 ataagagaga agatggcact ggttagcaaa gtaccagacg gattacaagg tgctggtata
     4261 gtcatgatta ttgcaggcat tatatctctt ggatttatgg gttttattgg cgttgttggt
     4321 ttttagagat aaataagcat tggtataatg gttaaggtgt caaaactata tagttgaact
     4381 ttcgcataca atatatagtt atgaatgtat
//