LOCUS       QMPR01000913            2188 bp    DNA     linear   ENV 15-OCT-2018
DEFINITION  Bacteroidetes bacterium isolate B1_G1 B1_Guay1_scaffold_69215,
            whole genome shotgun sequence.
ACCESSION   QMPR01000913 QMPR01000000
VERSION     QMPR01000913.1
DBLINK      BioProject: PRJNA362212
            BioSample: SAMN09214769
KEYWORDS    WGS.
SOURCE      Bacteroidetes bacterium (marine sediment metagenome)
  ORGANISM  Bacteroidetes bacterium
            Bacteria; Bacteroidetes.
REFERENCE   1  (bases 1 to 2188)
  AUTHORS   Dombrowski,N., Teske,A. and Baker,B.J.
  TITLE     Extensive metabolic versatility and redundancy in microbially
            diverse, dynamic hydrothermal sediments
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 2188)
  AUTHORS   Dombrowski,N., Teske,A., Baker,B.J. and Seitz,K.W.
  TITLE     Direct Submission
  JOURNAL   Submitted (08-JUN-2018) Marine Science Institute, The University
            of Texas at Austin, 750 Channel View Dr, Port Aransas, TX 78373,
            USA
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            
            ##Genome-Assembly-Data-START##
            Assembly Date          :: APR-2017
            Assembly Method        :: megahit v. 1.0.6
            Genome Representation  :: Full
            Expected Final Version :: Yes
            Genome Coverage        :: 17.2871198568873x
            Sequencing Technology  :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 06/30/2018 04:51:14
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline
            Annotation Method                 :: Best-placed reference
                                                 protein set; GeneMarkS+
            Annotation Software revision      :: 4.5
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 3,671
            CDS (total)                       :: 3,653
            Genes (coding)                    :: 3,595
            CDS (coding)                      :: 3,595
            Genes (RNA)                       :: 18
            rRNAs                             :: 1 (5S)
            partial rRNAs                     :: 1 (5S)
            tRNAs                             :: 17
            ncRNAs                            :: 0
            Pseudo Genes (total)              :: 58
            Pseudo Genes (ambiguous residues) :: 0 of 58
            Pseudo Genes (frameshifted)       :: 21 of 58
            Pseudo Genes (incomplete)         :: 28 of 58
            Pseudo Genes (internal stop)      :: 13 of 58
            Pseudo Genes (multiple problems)  :: 4 of 58
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..2188
                     /organism="Bacteroidetes bacterium"
                     /mol_type="genomic DNA"
                     /isolate="B1_G1"
                     /isolation_source="deep-sea hydrothermal vent sediments
                     from dive 4569_9 depth 0-3 cm"
                     /db_xref="taxon:1898104"
                     /environmental_sample
                     /geo_loc_name="Mexico: Guaymas Basin, Gulf of California"
                     /lat_lon="27.015 N 111.379 W"
                     /collection_date="30-Nov-2009"
                     /note="metagenomic; derived from metagenome: marine
                     sediment metagenome"
     gene            <1..271
                     /locus_tag="DRJ13_15840"
     CDS             <1..271
                     /locus_tag="DRJ13_15840"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_016360860.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=2
                     /transl_table=11
                     /product="amidohydrolase"
                     /protein_id="RLD93203.1"
                     /translation="IWSAANRITSSGKVLGENQCVPVLEAVKSVTTYAAFQAFAENHK
                     GSLEVGKLADLVVLDANPLKVDQIKIKDIGVLATIVGGNVVYGEI"
     gene            453..653
                     /locus_tag="DRJ13_15845"
     CDS             453..653
                     /locus_tag="DRJ13_15845"
                     /inference="COORDINATES: protein motif:HMM:PF07969.9"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="RLD93204.1"
                     /translation="MKAYSTDAAYCSYEEDRKGSICSGKFADFIVLSDNPTTIDPSGI
                     KDIQVLKTYLGGNVIHDVEGSA"
     gene            849..1112
                     /locus_tag="DRJ13_15850"
     CDS             849..1112
                     /locus_tag="DRJ13_15850"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_012788562.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="DNA-binding protein"
                     /protein_id="RLD93205.1"
                     /translation="MTKAEMIEKIAKDAGISKAAAAKAYDSFLDGIKSGLKKRGSKVT
                     VFGFGTFKKVYRKTRQGRNPQTGEQIKIKGRNAVTFKASKNLA"
     gene            1199..1597
                     /locus_tag="DRJ13_15855"
     CDS             1199..1597
                     /locus_tag="DRJ13_15855"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_007224405.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="RLD93206.1"
                     /translation="MHNNSSIAALRNNQGFYLYSAAKAAVTHLTKVAGNELGCFGIRV
                     NSISPGAVATPIFWGGSEVANMLDDEVNAKKLEKLKGSLSKANALGISGLPEDIAKAA
                     LYLASEDGRYVTCVDLVVDGGRIWQYHEAS"
     gene            2096..>2188
                     /locus_tag="DRJ13_15860"
     CDS             2096..>2188
                     /locus_tag="DRJ13_15860"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_019412485.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="Hg(II)-responsive transcriptional regulator"
                     /protein_id="RLD93207.1"
                     /translation="MEKFTIGQLAKKVNVNLETIRYYERRGLLPE"
BASE COUNT      631 a    452 c    515 g    590 t
ORIGIN
        1 gatctggtcc gcggcgaatc ggatcacctc ttcaggaaaa gttctcggag aaaaccagtg
       61 cgtgcccgtg ctggaagcgg taaaaagtgt aactacctat gcggcgtttc aggcatttgc
      121 agaaaatcac aaaggatctc tggaggtcgg caaactagct gatttggtgg tacttgacgc
      181 caacccgtta aaagtagacc aaatcaaaat caaggacatt ggcgttttgg caactattgt
      241 aggtggcaat gttgtctatg gggaaatata aaggaagagg taaacgatag acaaagacga
      301 aaatggcgaa cctacaggtt tgttaatgga accggcggcc cagaatatga tcgcccgttt
      361 tctccccaaa cccgatgttt ctgtatttat agatatgatc ccccaggcag tgggacactt
      421 caaccaggaa ggggtgacca gtatccacga cggtgaaagc gtatagcaca gatgccgcat
      481 actgtagcta tgaggaagac cgcaaaggct cgatctgctc aggtaaattt gctgatttta
      541 tcgtgctttc tgataatcct acaacgatcg atccgtcggg aatcaaggat attcaagttt
      601 tgaaaaccta ccttggcggc aacgttattc acgatgtgga gggatctgcc taatccataa
      661 aaaggacacg agcaccttta catcgatggt aatttagccg tccaaaactt ccagaagaaa
      721 cctatggaaa tttcaaagga atttccaatc cttgacacaa gccgatctca cgtatatcgt
      781 gttttaaaaa tgttcatgaa ttagggcggt aattcaatgt tcacctttaa cctgaaggag
      841 gaggtttcat gacaaaagcg gaaatgattg aaaagattgc aaaggatgcg ggtatttcaa
      901 aggcggcagc agccaaggcc tacgattcat tccttgatgg aatcaaaagt ggcctgaaaa
      961 agcgcggcag caaggtaact gtcttcggtt ttggcacttt caagaaggtt tatcggaaaa
     1021 cccgccaggg ccggaatcct cagacaggtg aacagatcaa aattaagggc agaaatgcgg
     1081 taacatttaa ggccagcaaa aatttggctt aggattatat ttaaagacaa tttaagggca
     1141 gggcgactct attttcgaat tcgccctgcc tttattatat tagccataca ggttttcaat
     1201 ccataacaat tccagcatcg ccgctcttag gaacaatcag ggtttttatc tgtacagtgc
     1261 ggccaaggcc gctgttaccc acctgaccaa ggtcgccggc aatgaactcg gctgcttcgg
     1321 cattagggtc aacagcattt caccaggtgc cgtggccacg cccatcttct ggggcggctc
     1381 tgaggttgcc aatatgttgg acgatgaagt aaatgctaaa aaactagaaa aacttaaagg
     1441 tagtctgagc aaggccaacg ctctgggaat ttcaggcttg cccgaagaca tagctaaggc
     1501 cgctctgtac ttggccagcg aagacgggcg gtacgtaaca tgcgtggatt tggtcgttga
     1561 cggcggacgc atctggcagt atcacgaggc atcctgatac aggcctatgg ccatgcaggt
     1621 ttccttccgg tctgtctgct ggcaggtctc tccctgcaag gcctattggc tgacggttca
     1681 ggtgtttctg aagctttctg taaggcatgt cgtagcaggg atcatggatg aatttttcat
     1741 tttcatgtgg taggtcattt tccacatatc gtcaatgtaa aatgccccct gataattatt
     1801 gtaacctttt atggttacac cgtttaatca tattaatcaa aatcttaata aggtctccaa
     1861 aattgatcaa atttttattc tgttcgtgaa tagtcccatt gatgtttgta tttgcaagct
     1921 ttaataaagc acacggttct catatgatga gttttgactt aggataaaat ttttcaccta
     1981 cccgtaagta agctgccttt attcatctca atttaattat tatagaaaaa aatttaattt
     2041 ttatccttga cctgtacctt ggtacgggtc ttatgttgtt tcaggaggat gaaaaatgga
     2101 aaaatttact attgggcaat tggcaaaaaa ggtaaatgta aacctggaga ctatccgata
     2161 ttatgagcgc agaggtttac tcccggaa
//