LOCUS       RFKP01000194            1573 bp    DNA     linear   ENV 29-OCT-2018
DEFINITION  Aquificae bacterium isolate J131 k99_598320, whole genome shotgun
            sequence.
ACCESSION   RFKP01000194 RFKP01000000
VERSION     RFKP01000194.1
DBLINK      BioProject: PRJNA392119
            BioSample: SAMN10120031
KEYWORDS    WGS.
SOURCE      Aquificae bacterium (hot springs metagenome)
  ORGANISM  Aquificae bacterium
            Bacteria; Aquificae.
REFERENCE   1  (bases 1 to 1573)
  AUTHORS   Ward,L.M., Idei,A., Nakagawa,M., Ueno,Y., Fischer,W. and
            Mcglynn,S.E.
  TITLE     Thermophilic Lithotrophy and Phototrophy in an Intertidal,
            Iron-rich, Geothermal Spring
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 1573)
  AUTHORS   Ward,L.M., Mcglynn,S.E. and Fischer,W.
  TITLE     Direct Submission
  JOURNAL   Submitted (02-OCT-2018) Earth and Planetary Sciences, Harvard
            University, 24 Oxford Street, Cambridge, MA 02138, USA
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method        :: MegaHit v. 1.1.2
            Genome Representation  :: Full
            Expected Final Version :: Yes
            Genome Coverage        :: 101.8960239x
            Sequencing Technology  :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 10/24/2018 19:38:35
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS+
            Annotation Software revision      :: 4.6
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 1,720
            CDS (total)                       :: 1,692
            Genes (coding)                    :: 1,671
            CDS (coding)                      :: 1,671
            Genes (RNA)                       :: 28
            tRNAs                             :: 27
            ncRNAs                            :: 1
            Pseudo Genes (total)              :: 21
            Pseudo Genes (ambiguous residues) :: 0 of 21
            Pseudo Genes (frameshifted)       :: 10 of 21
            Pseudo Genes (incomplete)         :: 5 of 21
            Pseudo Genes (internal stop)      :: 6 of 21
            CRISPR Arrays                     :: 1
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..1573
                     /organism="Aquificae bacterium"
                     /mol_type="genomic DNA"
                     /isolate="J131"
                     /isolation_source="iron-rich hot spring"
                     /db_xref="taxon:2202151"
                     /environmental_sample
                     /geo_loc_name="Japan: Tokyo Prefecture, Shikinejima
                     Island, Jinata Onsen"
                     /lat_lon="34.318 N 139.216 E"
                     /collection_date="2016"
                     /note="metagenomic; derived from metagenome: hot springs
                     metagenome"
     gene            complement(<1..615)
                     /locus_tag="D6804_02955"
     CDS             complement(<1..615)
                     /locus_tag="D6804_02955"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_012992168.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="flagellar biosynthetic protein FliP"
                     /protein_id="RME11533.1"
                     /translation="MKLIILSLLLSGSVLAQQIIPNIDLRVGTGQLDTSIRLLILLTV
                     LSLAPSILIMTTSFVRIVIVLSLLRQALGVPTVPPNQVIVSLALFLTFFVMKPVFDRI
                     NSDALQPLLKNQITDSTFFERTSSIMKEFMAKNTRKESLKVFLDMANLPKEEKDNIKN
                     PQDVPLSVLIPAFMVSEITTAFQIVFLLYLPFLIIDLVMASILIS"
     gene            complement(596..871)
                     /locus_tag="D6804_02960"
     CDS             complement(596..871)
                     /locus_tag="D6804_02960"
                     /inference="COORDINATES: ab initio prediction:GeneMarkS+"
                     /note="Derived by automated computational analysis using
                     gene prediction method: GeneMarkS+."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="RME11534.1"
                     /translation="MTDFFLNLLRVLVALGIVIVLILITLPYLLPLLQRLRWTREERG
                     SEVRLRRVIPLGKSMLLVELEIRGKLFVLALTDGAVEVVYRDEADNT"
     gene            complement(868..1344)
                     /locus_tag="D6804_02965"
     CDS             complement(868..1344)
                     /locus_tag="D6804_02965"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_012992166.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="flagellar basal body protein FliL"
                     /protein_id="RME11535.1"
                     /translation="MAEEAKEAKEEKRGGSKKILFLVPVLIVLLAGGGGAYLFLFSKK
                     GKKEETAPLPSQVGVMMDLGTFTVNLADRDVDAYARVSITLELSNEKVRQEVDRRLPI
                     IKDAVIDVISSKASSFVRTPEGRENLRLELIKRINTILFEGGVRNIYFTEFVVQTT"
     gene            complement(1360..>1573)
                     /gene="flgG"
                     /locus_tag="D6804_02970"
     CDS             complement(1360..>1573)
                     /gene="flgG"
                     /locus_tag="D6804_02970"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:NP_213567.1"
                     /note="makes up the distal portion of the flagellar basal
                     body rod; Bradyrhizobium has one thick flagellum and
                     several thin flagella; the Bradyrhizobium protein in this
                     cluster is associated with the thick flagella; Derived by
                     automated computational analysis using gene prediction
                     method: Protein Homology."
                     /codon_start=2
                     /transl_table=11
                     /product="flagellar basal body rod protein FlgG"
                     /protein_id="RME11536.1"
                     /translation="IQTDASGEPVVDNPGNQGIGTLLQGYLESSNVNIVEEMVSLIIA
                     QRAFEFNTKGITAADEMLGQTANLRR"
BASE COUNT      412 a    363 c    375 g    423 t
ORIGIN
        1 ggatatgaga attgaagcca tcacgaggtc tattataagg aagggtagat atagcagaaa
       61 aactatctga aaggcggtgg ttatctcgct taccataaag gcagggatta gaacgctgag
      121 aggaacatcc tgtggattct ttatgttatc tttctcctcc ttagggaggt ttgccatgtc
      181 aaggaagact ttgaggcttt cctttcttgt gttctttgcc atgaactcct tcattatgga
      241 agaggttctt tcaaagaagg tgctgtctgt aatctggttt ttaagaagtg gctggagtgc
      301 atcagagttt atcctgtcaa agacgggctt catcacaaaa aaggtaagaa agagggcaag
      361 ggagactatg acctgattgg gaggaactgt gggaacacca agcgcctgcc tgagaaggga
      421 aaggactatg acaattctca caaaggatgt ggtcattatc aggatagatg gggcaagaga
      481 taggacagtg agaagtatga ggagccttat ggaagtatcc agctgaccag ttcccaccct
      541 aaggtctatg ttggggatta tctgctgggc gaggacagag ccagagagaa gaagactaag
      601 tattatcagc ttcatccctg taaaccacct ccacagcacc atccgtcaat gccagaacaa
      661 agagcttgcc ccttatctcc agctccacaa gcagcatgct tttcccgaga ggtatgaccc
      721 ttcggagcct gacctcagaa cctctttcct cccttgtcca tctcagtctc tgtaaaagag
      781 gaaggaggta tggaagggtg ataagtatga ggactataac tatgcccagg gcaacaagaa
      841 cacggagaag gttaaggaaa aaatccgtca tgtggtctgc accacaaact ctgtaaagta
      901 tatgtttctg accccaccct caaagagtat agtgtttatc ctttttatta gttcaagcct
      961 taggttttct cttccctctg gtgttctcac gaaggaggag gccttgctgc ttataacgtc
     1021 aattacggca tcctttatta tgggaagtct tctgtccact tcctgtctca ccttttcgtt
     1081 ggaaagctca agggttatgg aaactctcgc gtaggcatcc acatctctgt ctgcaaggtt
     1141 tacggtgaag gtgccaaggt ccatcataac gcctacctga gatggaagtg gtgccgtctc
     1201 ctccttcttg cctttttttg aaaagaggaa taggtacgct ccaccacccc ctgcaaggag
     1261 aactataaga accggaacca gaaaaagtat ctttttggac ccaccccttt tctcttcctt
     1321 agcttccttc gcctcctcag ccatgagtta aattatagct taccttctga ggttggcggt
     1381 ctgaccgagc atctcatcag cggcggttat gccctttgtg ttgaactcaa aagccctctg
     1441 ggctattata aggctcacca tctcctccac tatgtttaca ttggaagact caaggtatcc
     1501 ctgaagaagg gtccctatgc cctggtttcc agggttgtcc accacaggct cgccggaggc
     1561 atctgtctgt ata
//