LOCUS       QMSD01000213            3480 bp    DNA     linear   ENV 15-OCT-2018
DEFINITION  Thermoprotei archaeon isolate B132_G9 B132_Guay9_scaffold_62931,
            whole genome shotgun sequence.
ACCESSION   QMSD01000213 QMSD01000000
VERSION     QMSD01000213.1
DBLINK      BioProject: PRJNA362212
            BioSample: SAMN09215213
KEYWORDS    WGS.
SOURCE      Thermoprotei archaeon (marine sediment metagenome)
  ORGANISM  Thermoprotei archaeon
            Archaea; Crenarchaeota; Thermoprotei; unclassified Thermoprotei.
REFERENCE   1  (bases 1 to 3480)
  AUTHORS   Dombrowski,N., Teske,A. and Baker,B.J.
  TITLE     Extensive metabolic versatility and redundancy in microbially
            diverse, dynamic hydrothermal sediments
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 3480)
  AUTHORS   Dombrowski,N., Teske,A., Baker,B.J. and Seitz,K.W.
  TITLE     Direct Submission
  JOURNAL   Submitted (08-JUN-2018) Marine Science Institute, The University
            of Texas at Austin, 750 Channel View Dr, Port Aransas, TX 78373,
            USA
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Date          :: APR-2017
            Assembly Method        :: megahit v. 1.0.6
            Genome Representation  :: Full
            Expected Final Version :: Yes
            Genome Coverage        :: 14.3874458874459x
            Sequencing Technology  :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 06/30/2018 06:28:52
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS+
            Annotation Software revision      :: 4.5
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 2,191
            CDS (total)                       :: 2,160
            Genes (coding)                    :: 2,140
            CDS (coding)                      :: 2,140
            Genes (RNA)                       :: 31
            rRNAs                             :: 1 (23S)
            partial rRNAs                     :: 1 (23S)
            tRNAs                             :: 28
            ncRNAs                            :: 2
            Pseudo Genes (total)              :: 20
            Pseudo Genes (ambiguous residues) :: 0 of 20
            Pseudo Genes (frameshifted)       :: 7 of 20
            Pseudo Genes (incomplete)         :: 11 of 20
            Pseudo Genes (internal stop)      :: 6 of 20
            Pseudo Genes (multiple problems)  :: 4 of 20
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..3480
                     /organism="Thermoprotei archaeon"
                     /mol_type="genomic DNA"
                     /isolate="B132_G9"
                     /isolation_source="deep-sea hydrothermal vent sediments
                     from dive 4571_4 depth 0-3 cm"
                     /db_xref="taxon:2250277"
                     /environmental_sample
                     /geo_loc_name="Mexico: Guaymas Basin, Gulf of California"
                     /lat_lon="29.52416667 N 113.57000000 W"
                     /collection_date="02-Dec-2009"
                     /note="metagenomic; derived from metagenome: marine
                     sediment metagenome"
     gene            complement(<1..337)
                     /locus_tag="DRJ64_06740"
     CDS             complement(<1..337)
                     /locus_tag="DRJ64_06740"
                     /inference="COORDINATES: ab initio prediction:GeneMarkS+"
                     /note="Derived by automated computational analysis using
                     gene prediction method: GeneMarkS+."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="RLF04570.1"
                     /translation="MSDSERMKETLKRFLEEKGFSIEFNRALRGSSGLEHVFDIVATR
                     EGTILCFDVINPSEISVLSSLGKAIDVSYVQFFLLCKNVQSAKLPDILRDIDALEAII
                     YSDVEDLLRK"
     gene            complement(595..2472)
                     /locus_tag="DRJ64_06745"
     CDS             complement(595..2472)
                     /locus_tag="DRJ64_06745"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_004068215.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="phosphoenolpyruvate carboxykinase (GTP)"
                     /protein_id="RLF04571.1"
                     /translation="MSFLEYLKRRLEYEQYKKIEAIENEELHKFLVEYIKLLNPKKIF
                     VCSEKREDEEYIRLKAIEYGEERRLRQRNHTVHFDNYYDQARDKKNTKILVSKAVRLP
                     FTNTLDREKGLEEIHKLMKNIMRDKELFICFFSLGPKNSPFMIPAVQLTDSAYVAHSE
                     FLLYRNAYDVFRDSGDESRFLKFVHSAGELDERKTSKNIDKRRIYLDVENGIGFAVNT
                     QYGGNTIGLKKPAFRMTIYKAVKEGWLSEHMFLMGINGPNGRVSYFAGAFPSMCGKTS
                     TCMLPHERLVGDDLVFIKIIDGEARAVNVEIGVFGILQGVNAKDDPIIWEVLHSPNEI
                     IFSNVLVYNGRPYWNDMGIEIPDEGENHSGKWWRGKRDAEGNLIPPAHKNARFTARLI
                     YFRNLDREALENPKGVKLSGLIFGGRDADTWPPVCESFNWSHGVVMKAASLESETTAA
                     TLDREGERKFNLMAILDFLSVHISDYIKNYLEFGSRLNNPPKIFAVNYFLRDANGNFL
                     NEKIDKSIWLKWMELRANDDVDAIKTPIGFIPIYSDLKSLFREVLGKEYSREDYEKQF
                     MIRVPEFLAKIRRIRRIYSNIENIPNAVFEELDREEKRLKEAKESLGDYISPFSFEES
                     Y"
BASE COUNT      982 a    833 c    512 g   1153 t
ORIGIN
        1 gtttcctgag taagtcttca acatcactgt atattatggc ttctaaagca tcaatgtccc
       61 ttaggatatc aggtaatttt gcagattgga catttttaca gagtaagaaa aattggacat
      121 agcttacgtc aattgccttt cctagagagc ttagaacaga tatttcgctg ggatttatga
      181 cgtcgaagca tagtatcgtg ccttctcgag tagcaacaat atcaaagaca tgttcaagtc
      241 cgctagatcc cctcaaagcc cggttaaatt ctatcgaaaa acccttttct tccaaaaatc
      301 tctttaatgt ttccttcatc ctctcggaat cactcatgtc atccaccgat acatccatca
      361 ttatagtcaa gttatgtcta actcggaata ttaacaatct aaaattggat ttaaacttta
      421 cctttcgcat taaagattat agaaaaataa atctagctag caacctaagc ttaggtagaa
      481 gatataattg ataatccgtg tatagaatga tattaaacca tatgcgttcg gtataaagaa
      541 tccaccttgg actaaataat ggttttagag taataattaa agtattcgtc tagattaata
      601 actttcttcg aaagaaaagg gggagatgta atctccaagg ctttccttag cctctttcag
      661 cctcttttct tctctatcta attcttcgaa aacagcatta ggaatatttt ctatattact
      721 atatatcctt ctgattcgtc taatcttggc cagaaactcc ggaacacgta tcataaactg
      781 cttttcatag tcttctctag aatactcctt acctaatact tctctaaata gactcttcag
      841 gtcactatat atgggtataa atcctattgg ggttttaata gcgtcgacgt cgtcattcgc
      901 acgtaactcc atccacttta accatatgct cttatctatt ttctcgttaa gaaaattacc
      961 attcgcgtcc cgtaaaaagt agtttacagc gaagattttt ggaggattgt tcagcctact
     1021 accgaattca agataatttt tgatatagtc acttatatga acagagagaa agtctagtat
     1081 cgccatcaag ttgaactttc tttctccttc tctatcaagt gttgctgccg tcgtctccga
     1141 ttcaagagat gcggctttca ttacaacacc atgtgaccaa ttaaatgact cgcatactgg
     1201 gggccatgta tctgcatctc ttccaccaaa tattaatccg ctcaacttca cgccctttgg
     1261 attctctaat gcttctctgt caagatttct gaagtatatc aagcgagccg tgaacctagc
     1321 attcttatga gcaggaggta tcaggttacc ttcagcatcc ctcttccccc tccaccattt
     1381 accgctatga ttctctcctt cgtccggtat ctcgataccc atatcattcc aatacggcct
     1441 accattatac actaacacgt tagagaatat tatttcatta ggggaatgca acacttccca
     1501 gataattgga tcatccttag cattcacgcc ttgtagtatc ccgaaaactc cgatctcaac
     1561 attcacagct ctagcctctc cgtctatgat ctttataaaa actaggtcgt ctccaactaa
     1621 tcgttcatgt ggtaacatgc acgtggatgt ttttccacac atcgatggaa aagcacctgc
     1681 aaaatagcta accctgccat taggcccatt gatgcccatg agaaacatgt gctcagaaag
     1741 ccatccctcc ttaacagcct tgtatatagt cattctgaat gcaggcttct tcaatccaat
     1801 cgtgttaccg ccatactgag tgttcacagc gaagcctatc ccattttcga catctagata
     1861 gatcctcctc ttgtcaatgt tcttgctagt ctttctctca tccagttccc cagcagaatg
     1921 cacaaacttt aaaaatctcg actcatcacc gctatccctg aacacatcat atgcatttct
     1981 ataaagtaaa aactcgctgt gagcaacata tgcagaatcc gttaactgta cagctggtat
     2041 cataaatggg gaatttttag gtccaagaga gaaaaagcat ataaataact ctttatcacg
     2101 cataatgttt ttcataagct tatgaatttc ttctaagccc ttctccctat ccaaagtatt
     2161 cgtgaaagga agtctgacgg ccttagatac tagtatctta gtattctttt tgtctcttgc
     2221 ctggtcataa taattgtcga agtgaacagt atgattcctc tgtcttagtc ttctctcctc
     2281 tccatactct atggctttca gcctaatata ttcctcatct tctctcttct cactgcatac
     2341 gaagatcttt ttaggattta gtagcttaat atactcgact aagaacttat gtaactcctc
     2401 gttctctatc gcttctatct tcttatattg ttcgtattct agtctccgtt ttaaatactc
     2461 caaaaagctc atacagatct tccatagata tttatttgtc ttttttaaat atagcttcta
     2521 aattgtaaga ccgcgaaagg atcagaaaga aaattatttc ttatttttat attatctttc
     2581 ttcctaagca tttattttcc atactttttc aaacatgatc tctccataag cggacataga
     2641 ttatcccaaa acccgcctac caccgtttta taacctcatc cttctcttta agctcgtctt
     2701 cgccctcgcc tacagttata tcactggact gaataatacc cagaacattg gaagatgcat
     2761 ttttatctcg cttgttggcc ccatacccgg ttatcacact acctactcct gtactttctt
     2821 gtactctact cggctaaagt cgatgtactg ttagtctcct aggctagaag catgtcaata
     2881 cccagaactt tcaatgcttc catggaatcc ccatctcgac atccatcctc tatccgtatc
     2941 atacgggcga ttttgtgttc catcaatata caccattctt cattccctca acgtcccatc
     3001 caatatcgcc ttttcttccc atgaatggtg gcctcttcca gtacacgtca tacagtatta
     3061 aaaagcgtta atctccacct cctagtagca agatagggaa ttgatctaac gccgttaccc
     3121 ttaatatcta caaaagaagc tagcccaggg agcccttttc taagaaaaat ctttaagttt
     3181 accgtctata actatttctc ccttccccta taaaagatat atgtctctgg ttttaggagc
     3241 ctaccgtctc ttgtttcaac cagcatattg gcaccaaaat cttatctgaa tatatgtata
     3301 aaggattact ggctctacca gaataatggt tttttatgtc ctaaatccta tagtataagt
     3361 gtcgcgccgc ctaattcgaa aaatacttct accatccttg atttttacag cttcgcttgc
     3421 tttcctaact gttcctttac taagaaatcc tcttttaaca ggtatcgacg gtccatacta
//