LOCUS       DQUG01000093            2899 bp    DNA     linear   ENV 23-NOV-2020
DEFINITION  MAG TPA_asm: Thermococcus paralvinellae isolate SZUA-1451
            k95_525199, whole genome shotgun sequence.
ACCESSION   DQUG01000093 DQUG01000000
VERSION     DQUG01000093.1
DBLINK      BioProject: PRJNA488180
            BioSample: SAMN09926499
            Sequence Read Archive: ERS1370016
KEYWORDS    WGS; ENV; Metagenome Assembled Genome; MAG; Third Party Data; TPA;
            TPA:assembly.
SOURCE      Thermococcus paralvinellae (hydrothermal vent metagenome)
  ORGANISM  Thermococcus paralvinellae
            Archaea; Methanobacteriati; Methanobacteriota; Thermococci;
            Thermococcales; Thermococcaceae; Thermococcus.
REFERENCE   1  (bases 1 to 2899)
  AUTHORS   Zhou,Z., Liu,Y., Pan,J., Cron,B.R., Toner,B.M., Anantharaman,K.,
            Breier,J.A., Dick,G.J. and Li,M.
  TITLE     Gammaproteobacteria mediating utilization of methyl-, sulfur- and
            petroleum organic compounds in deep ocean hydrothermal plumes
  JOURNAL   ISME J 14 (12), 3136-3148 (2020)
   PUBMED   32820229
REFERENCE   2  (bases 1 to 2899)
  AUTHORS   Zhou,Z.
  TITLE     Direct Submission
  JOURNAL   Submitted (18-OCT-2018) School of Biological Sciences, The
            University of Hong Kong, School of Biological Sciences, Kadoorie
            Biological Sciences Building, The University of Hong Kong, P, Hong
            Kong 000000, Hong Kong
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method        :: MEGAHIT v. v1.1.2
            Genome Representation  :: Full
            Expected Final Version :: Yes
            Genome Coverage        :: 14.55x
            Sequencing Technology  :: Illumina HiSeq 2000
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 02/20/2019 14:25:04
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS-2+
            Annotation Software revision      :: 4.7
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 1,087
            CDSs (total)                      :: 1,066
            Genes (coding)                    :: 992
            CDSs (with protein)               :: 992
            Genes (RNA)                       :: 21
            tRNAs                             :: 20
            ncRNAs                            :: 1
            Pseudo Genes (total)              :: 74
            CDSs (without protein)            :: 74
            Pseudo Genes (ambiguous residues) :: 0 of 74
            Pseudo Genes (frameshifted)       :: 51 of 74
            Pseudo Genes (incomplete)         :: 29 of 74
            Pseudo Genes (internal stop)      :: 14 of 74
            Pseudo Genes (multiple problems)  :: 19 of 74
            CRISPR Arrays                     :: 1
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..2899
                     /organism="Thermococcus paralvinellae"
                     /mol_type="genomic DNA"
                     /submitter_seqid="k95_525199"
                     /isolate="SZUA-1451"
                     /isolation_source="Mid-Cayman Rise Vent Fluids"
                     /db_xref="taxon:582419"
                     /environmental_sample
                     /geo_loc_name="Atlantic Ocean: Mid Cayman Rise"
                     /lat_lon="18.3769 N 81.7979 W"
                     /collection_date="2017-04-15"
                     /metagenome_source="hydrothermal vent metagenome"
                     /note="metagenomic"
     gene            complement(116..781)
                     /locus_tag="EYH13_02265"
     CDS             complement(116..781)
                     /locus_tag="EYH13_02265"
                     /EC_number="3.1.26.4"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_015848699.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="ribonuclease HII"
                     /protein_id="HIP74978.1"
                     /translation="MKLGGIDEAGRGPVIGPLVIAAVVIDEENLGRLEALGVRDSKAL
                     TPERRKKLFNEITALLDDYAIIELSPEQIDGRKGTMNELEIENFIKALNSLKVKPDVV
                     YVDAADVDAGRFGEIIKKRLNFSPKIIAEHKADAKYLPVSAASILAKVTRDRAIEKLK
                     EQYGEIGSGYPSDPRTRKFLEDYYKEHGKFPPMVRKSWKTLKKIEEKVRRKGQLNLLE
                     FLR"
     gene            complement(778..1614)
                     /locus_tag="EYH13_02270"
     CDS             complement(778..1614)
                     /locus_tag="EYH13_02270"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_014789737.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="metal ABC transporter permease"
                     /protein_id="HIP74979.1"
                     /translation="MISEFFLRALLASIMVSILLGMLSPLINMKGLAFLTHATFHSLL
                     FGAVLGMILGLLVGNLGIIVWTALIVTILVVIVIAEIENRGFTSDTAIGIISSFVAGA
                     TVLGFGILYKVMASRPYFALSESIVAYLTGEIFLITLSDLEMLIFGGFLLFLIMLFLY
                     RDFLYVSFDPEGVESYGGNVRAYLMILYIIVGTIGALIVRTVGLITLQVIAVLPGAIA
                     MMLSDDLRKIVGISLFLTLGIEILSILLAYATNIPPSGIATILLGMIYGTLVFRKWGA
                     GP"
     gene            complement(1611..2399)
                     /locus_tag="EYH13_02275"
     CDS             complement(1611..2399)
                     /locus_tag="EYH13_02275"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_013466948.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="metal ABC transporter ATP-binding protein"
                     /protein_id="HIP74980.1"
                     /translation="MEAVIAENVAIYYGNYKAVVGLTFKLNEAETLLLMGPNGAGKTT
                     LLRTLAGFHREYTGKLLIFGKPPHRSKDLISYVPQSHLLNERVPLTVLEVVAMGGIYK
                     KGFIHFKIPKEILKAAEEALRFVGLEGMKNKPFKELSGGQKQRVLLARALLSNPRLLL
                     LDEPLSALDPSARVEVTNVLDKIKRERDITMIITTHDVNPLIEIGDKVMLLNRRLVAF
                     GTPDEVLRDEIIGKVYGAQSKAVRIGDKLYCIIGDVHIHRGERR"
     gene            2711..>2899
                     /gene="cas2"
                     /locus_tag="EYH13_02280"
     CDS             2711..>2899
                     /gene="cas2"
                     /locus_tag="EYH13_02280"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_007043774.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="CRISPR-associated endonuclease Cas2"
                     /protein_id="HIP74981.1"
                     /translation="MYVVIVYDVNVSRVNKVKKFLRRYLHWVQNSVFEGEVTLAEFER
                     IKEGLLNLIDENEDSVIIY"
BASE COUNT      777 a    684 c    589 g    849 t
ORIGIN
        1 ggcatatgaa ctgataaaat gggagtactt tgagggttca ctcagatttt actggggctg
       61 gatctggagg gttatagact caattaaaag gcgatggttg aaaaatccta aagatttacc
      121 tcagaaactc gaggaggttt agctggcctt tcctcctaac cttttcctcg attttcttta
      181 aagttttcca gctctttcta accatgggcg gaaatttacc gtgttctttg taatagtcct
      241 ctaaaaactt tctcgtcctt gggtcgctcg gatagccaga accaatttca ccatactgct
      301 ctttaagctt ctcaattgcc ctgtcccttg taaccttcgc caaaattgat gcggcagaga
      361 ctggaaggta tttggcatcg gccttatgct cggctattat ttttggagag aaattcaacc
      421 tctttttaat tatctcgcca aaacgcccag catctacatc agcagcatca acgtaaacca
      481 catcgggctt tactttcagg gaattaaggg cttttataaa gttctcaatc tcgagctcat
      541 tcattgttcc ctttcttcca tctatctgtt cgggacttag ctcaatgatt gcataatcat
      601 ccaaaagggc agttatctca ttgaacagct ttttccttct ctctggcgtt agggcctttg
      661 agtccctaac tccaagggcc tcaagcctgc caaggttttc ttcatcgatt acaaccgccg
      721 ctatgacgag ggggcctata accgggcctc ttcctgcctc gtcaatacct ccaagcttca
      781 tggccctgca ccccatttcc tgaataccag tgttccgtag atcatgccca ataaaatggt
      841 tgctattccg cttggaggta tgttggtagc ataagccaga agtattgaaa ggatttcgat
      901 gccaagcgtc aggaagaggc ttatgccaac gatttttctt aaatcatcgc taagcatcat
      961 cgcaattgct cctgggagaa cggcaatgac ctggagcgtg atcaaaccca ccgtcctgac
     1021 tatcaacgct ccgatggtcc ccacgattat gtagagaatc atcaggtatg ccctaacgtt
     1081 gcccccataa ctttcgaccc cttcaggatc aaagctaacg tagagaaaat ctctgtaaag
     1141 gaagagcatg attaagaata gcaaaaatcc gccaaaaatt agcatctcca gatcgctgag
     1201 tgttatcaaa aagatctccc cggtgagata ggcaactatg ctctccgaga gtgcaaaata
     1261 tggccttgat gccataacct tatagagaat cccaaagccc agaaccgtcg cccctgcaac
     1321 gaagcttgag ataattccaa tcgctgtgtc actcgtaaat cctctgttct ctatctcggc
     1381 tattactatg accacaagga tagtcacaat caatgcagtc cacacaatta tgccaagatt
     1441 tccaactaaa agccccaaaa tcatgcccag aactgctcca aagaggaggg agtgaaaggt
     1501 tgcatgggtt aaaaaagcca ggcctttcat attgattagg ggactaagca tgccaaggag
     1561 aatgcttacc attatgctcg caaggagggc tctcagaaag aactcggaaa tcatctcctc
     1621 tcacccctat ggatgtgaac gtctccgatg atgcagtata atttgtcccc aatcctcact
     1681 gccttcgact gtgcaccata gactttacct attatctcat ctctgagaac ctcgtcaggg
     1741 gtcccaaatg ccacaagtct tctgtttagg agcataactt tgtcaccaat ctctatgaga
     1801 gggttaacat cgtgggttgt gattatcatc gttatgtccc tttctctctt tattttgtcc
     1861 aggacatttg taacctctac ccttgcgctt ggatctaaag ccgataatgg ctcatccaga
     1921 aggaggagcc tcggatttga gagcagtgct ctcgccagta aaaccctttg cttttgtcct
     1981 ccgctgagtt ctttaaaggg tttgtttttc atgccttcca atccaacaaa tctcagtgct
     2041 tcctctgcag cttttagaat ttccttgggg attttaaagt ggataaaacc ctttttgtaa
     2101 attcctccca ttgcaacgac ctcaagcacc gtaaggggaa ccctctcatt taagagatga
     2161 ctctgaggca cgtaggaaat caaatctttg gatctatggg gtggtttgcc aaaaattaac
     2221 agtttcccag tatattccct atgaaatccc gccagcgttc tgagcagggt ggtttttccg
     2281 gcaccatttg gacccattag gagcagggtt tctgcttcat tgagtttgaa agttagtccc
     2341 acaactgctt tgtagtttcc atagtaaatg gcgacatttt ctgctattac cgcctccatg
     2401 cactcacccc gaatttaggt ttgcctaaat gtttgtaaga taggttttaa aaggctttat
     2461 catgtgggac tttgtcccac ggcccaaagt ttgttgcttt caaaattctt tcgtcatttt
     2521 tcatgggaaa ccaagtttcc aactaaaatg ctcaggtttt tagggaagct ccaggtttta
     2581 agtgttacaa aaacttatag tgcactagcc ccacgttctc ttgaaatcaa taaatttcaa
     2641 aaccagtgtc aacagggata aaatccaaaa tgattaaatg gaattacggt caaacccacc
     2701 atgatgcgcc atgtatgtgg tcattgttta tgatgtcaat gtctcgagag ttaataaggt
     2761 caaaaagttt ctgcgtcggt atttacactg ggttcaaaat agcgtttttg agggggaggt
     2821 taccttagct gagtttgagc gcataaaaga gggtcttctc aacctgatag atgaaaatga
     2881 ggactctgtg atcatttac
//