LOCUS       DUCR01000078            6684 bp    DNA     linear   ENV 04-MAY-2020
DEFINITION  TPA_asm: Gemmatimonadetes bacterium isolate UWMA-0334
            NODE_14663_length_6684_cov_0.064541, whole genome shotgun
            sequence.
ACCESSION   DUCR01000078 DUCR01000000
VERSION     DUCR01000078.1
DBLINK      BioProject: PRJNA522654
            BioSample: SAMN10967668
            Sequence Read Archive: SRR8626183
KEYWORDS    WGS; Third Party Data; TPA; TPA:assembly.
SOURCE      Gemmatimonadetes bacterium (marine metagenome)
  ORGANISM  Gemmatimonadetes bacterium
            Bacteria; Gemmatimonadetes.
REFERENCE   1  (bases 1 to 6684)
  AUTHORS   Zhou,Z., Tran,P.Q., Kieft,K. and Anantharaman,K.
  TITLE     Genome diversification in globally distributed novel marine
            Proteobacteria is linked to environmental adaptation
  JOURNAL   bioRxivorg, doi 10.1101/814418 (2019)
  REMARK    Publication Status: Online-Only
REFERENCE   2  (bases 1 to 6684)
  AUTHORS   Zhou,Z.
  TITLE     Direct Submission
  JOURNAL   Submitted (12-MAR-2019) Department of Bacteriology, University of
            Wisconsin-Madison, 4545 Microbial Sciences Building, 1550 Linden
            Drive, Madison, WI 53706, USA
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method        :: SPAdes v. 3.12.0
            Genome Representation  :: Full
            Expected Final Version :: Yes
            Genome Coverage        :: 0.69x
            Sequencing Technology  :: Illumina HiSeq 2000
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 03/21/2019 02:59:24
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS-2+
            Annotation Software revision      :: 4.8
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 3,207
            CDSs (total)                      :: 3,166
            Genes (coding)                    :: 3,010
            CDSs (with protein)               :: 3,010
            Genes (RNA)                       :: 41
            tRNAs                             :: 38
            ncRNAs                            :: 3
            Pseudo Genes (total)              :: 156
            CDSs (without protein)            :: 156
            Pseudo Genes (ambiguous residues) :: 86 of 156
            Pseudo Genes (frameshifted)       :: 40 of 156
            Pseudo Genes (incomplete)         :: 42 of 156
            Pseudo Genes (internal stop)      :: 18 of 156
            Pseudo Genes (multiple problems)  :: 27 of 156
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..6684
                     /organism="Gemmatimonadetes bacterium"
                     /mol_type="genomic DNA"
                     /submitter_seqid="NODE_14663_length_6684_cov_0.064541"
                     /isolate="UWMA-0334"
                     /isolation_source="Guaymas Basin Hydrothermal plume
                     metagenome"
                     /db_xref="taxon:2026742"
                     /environmental_sample
                     /geo_loc_name="USA: Guaymas Basin"
                     /lat_lon="27.5158333333333 N 111.425 W"
                     /collection_date="2004-07-11"
                     /metagenome_source="marine metagenome"
                     /note="metagenomic"
     gene            354..701
                     /locus_tag="EYQ64_02625"
     CDS             354..701
                     /locus_tag="EYQ64_02625"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_011554856.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="divalent-cation tolerance protein CutA"
                     /protein_id="HIF05861.1"
                     /translation="MSNTDVVTVFVTVPEKESALALGRQIVDESLAACVNVVPNVTSI
                     FRWKGKVTEEEEVLLILKSRAERVPALIARVAELHTYEVPEVLSFRVEDGFGPYLDWV
                     GECTSMESFGDRV"
     gene            688..1719
                     /locus_tag="EYQ64_02630"
     CDS             688..1719
                     /locus_tag="EYQ64_02630"
                     /inference="COORDINATES: protein
                     motif:HMM:PF13181.4,HMM:PF13414.4"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="tetratricopeptide repeat protein"
                     /protein_id="HIF05862.1"
                     /translation="MTEYRPEQDQLSEPDDAPAADDSQSWLFEAEEDTQGDIFEPAPP
                     PEAADLEEEPASRTSTAEDADPDRVVDDAGEAGAEDITAQASPYEDDDAKVAPEDEAA
                     QSSSDDDDDAKVAAEDEAALASNDPNTPAEALADADEEPPNPLVQTFLKLREGRRRET
                     ADLHQDIVKPAEPPAVAQMDRGKKHDETGRHEMAVEEFLEAVELDPENVDALTRLAAA
                     YGALGRFVEADEAIGKAMKIAPEDVEVQAGEGILSFRKGLYSEAEVRLKRVCTAHSSH
                     GPAHFYRGEALNRLGRVEEAVETMERTIQLQPRNWRAYHTLGMLFDRLEDRERASEMY
                     RRARELNPL"
     gene            1716..2198
                     /locus_tag="EYQ64_02635"
     CDS             1716..2198
                     /locus_tag="EYQ64_02635"
                     /inference="COORDINATES: protein motif:HMM:PF07045.9"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="DUF1330 domain-containing protein"
                     /protein_id="HIF05863.1"
                     /translation="MIDVRRANLATSDAECILRSVGSNLEPLTAISREVGAAGLSLLC
                     VMLAASAVSAQDDQESEPFYMFNALWFREDGGAQKYSEYLQAAGPFVSKHGGQVNDTY
                     APEQALIGEFDADLVFFVEWPNQEAFTSLFQDPGYQAIAHLREEAIVNSLLIRFRKLP
                     "
     gene            2240..3313
                     /locus_tag="EYQ64_02640"
     CDS             2240..3313
                     /locus_tag="EYQ64_02640"
                     /inference="COORDINATES: protein motif:HMM:PF04389.15"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="M20/M25/M40 family metallo-hydrolase"
                     /protein_id="HIF05864.1"
                     /translation="MFASAIGDGMTRTAHAALVGALLVGVSGCAILDSFGGESRPRRV
                     QRLLSSLAADSMEGRAMGSLGSLMAAGLIAEEFDDAGVAPAGTTGYLQEIRAVRVTIP
                     GRRSRVLTVEAADTFPDSVKEYMSDRNLVGIVRGSDPSVADEIIVVGAHFDHVGVGRP
                     VDGDSIYNGADDDASGVVAVLEAARDLALGTPPRRTVVFALFTGEEAGGVGSGWYLDH
                     PAVPLEQTVAQLQVEMIGRPDDRAGGPGNLWATGYDRSTVGATLSALGIPVVADPYPD
                     ENFFFRSDNVRFAYAGIPAHTLSSFNLHTDYHSPSDEAERVDMDHLVAAVETIIRAIR
                     ALADAPSAPVWNEGGRPEARAGA"
     gene            3329..5083
                     /locus_tag="EYQ64_02645"
     CDS             3329..5083
                     /locus_tag="EYQ64_02645"
                     /inference="COORDINATES: protein motif:HMM:PF07969.9"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="D-aminoacylase"
                     /protein_id="HIF05865.1"
                     /translation="MKRRTFIRTGGLIGAAGAAGAIGVGLTRDRGGRDVAPEIPPATG
                     SSAGRSAGPVARSGAEPDLVLRRATVFDGSGAPGVEVDVAVTGDRITEVGNVTAIGAE
                     EIDLAGMALAPGFVDIHSHADLSLFVNPNAESRIRQGVTLEIVGQDGSSVGPLSEAGS
                     RATRERYRNTYGVDVDFRDLGGFLDALDRAPATVNLATMVGHGTVRGLVVGGADRPAT
                     ADELQRMRGLIREALDQGAVGLSSGLEYTPGSFADGNELVELAKELQGTGYPYASHMR
                     SEDDGLFAAVEETLYVGQMAGVPVQISHLKAQGERNWWKAKAILRSIEEARAAGLDVH
                     FDRYPYAAYATGLSNLFPAWARAGGSGAFIRRLQDATDLPAIESFTRAKVALLGSWNA
                     VQISSTRTSGNAYARGRRLGDLARERGEEPFALAVRLIVEEGNSVGMIGFGMSEENTA
                     EILAHPLGMVCSDGGSYAPYGPLSGGSPHPRGYGTFPRLMGHYVRSGALSLALAVHKV
                     TGLPARKLGLDDRGVNKAASNGATPLFCAAASGHAEVVLFVNGIPLVVVNGQVTLRDG
                     EQTGARAGRAVRGRGAAA"
     gene            5136..6347
                     /locus_tag="EYQ64_02650"
     CDS             5136..6347
                     /locus_tag="EYQ64_02650"
                     /inference="COORDINATES: protein motif:HMM:PF04443.10"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="long-chain fatty acid--CoA ligase"
                     /protein_id="HIF05866.1"
                     /translation="MSDAFVSLAPDLGVAFRESSGEPWPDDVFDAWAQRVFAYQFATN
                     SVYAAFADKRGVTPFTVEHWTEIPWVPASAFKEVLLVSGDVSQVQRVFRTSGTTVDTR
                     PGAGRGEHHVLDLDLYKASMIPNMRRHFYSGPAGSDAAGASAVGRPILALTPSPAEVP
                     DSSLSFMLGAALDVLSGGEGGFFVTADGVIDTAGLSDALVEATARGAPILLTGTAFAF
                     VHWLDWLEERSVRFHLPAGSRIMETGGFKGRSRVLDRPEFYLSLSEAHGVPFDEIVNE
                     YGMTELLSQFYDGPVAVMSESGRSEAGASDAAMADAGAAGLGARRHVPPPWVRTRVLD
                     PQTLSALPDGNPGLLCHFDLANAGSVIAILTEDLGIAVDGGFQVLGRVQGAEPRGCSI
                     AMDDLLSGVRS"
     gene            6344..>6684
                     /locus_tag="EYQ64_02655"
     CDS             6344..>6684
                     /locus_tag="EYQ64_02655"
                     /inference="COORDINATES: ab initio
                     prediction:GeneMarkS-2+"
                     /note="Derived by automated computational analysis using
                     gene prediction method: GeneMarkS-2+."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="HIF05867.1"
                     /translation="MKSPFDACCLPPSWGESFDTSEPFVWETIQVAEDQVAEDQVVED
                     QVADVAVRYPVWSGDAAEQLLVALSEAGARLAGVPSSDVVIGIGRVADRFLDASDSLR
                     REAIDLLRPTAG"
BASE COUNT     1200 a   1906 c   2282 g   1296 t
ORIGIN
        1 atgctcgggg accgtgagag ctactgcgtg gttttggtat tgccgagttt tccgaacctc
       61 gaggcgtggg cgcgaaagag tggagtcagc gctccggata cgagcgcgtt gttggcgtcg
      121 aagcccgttc aagaacacat ggagcgccgt gtcatggcgc ggctcgagga cctggctcgc
      181 catgagaagc ccaaaaagat cggcctgatc accgagccct tcaccgtcga agatggcacg
      241 ctgacgccca cacaaaaggt caagagaagg gccgtggaag cacgctatcg gaagctcgta
      301 gaagctttct acgcagagga gaacctcgac cagacggtct tcgtggagac ctgatgtcga
      361 acacggacgt cgtcaccgtc ttcgttaccg tgcccgagaa ggagtcggcg cttgcgctcg
      421 ggcgccagat cgtggacgag tcgctcgccg cgtgcgtcaa cgtggttccg aacgtcacgt
      481 cgatattccg ctggaaagga aaagtgaccg aagaggagga ggttctgctt atcttgaaga
      541 gccgcgccga aagagtgccg gcgctcatcg cgagggttgc ggaactacac acgtacgagg
      601 tcccggaagt attgtcattt cgtgtcgaag atggtttcgg gccgtacctc gattgggtag
      661 gggaatgcac gtcgatggaa tcgtttggtg accgagtata gacccgagca ggatcaactc
      721 tcggagccgg atgacgcacc tgccgctgac gattcccagt cgtggctgtt cgaagccgag
      781 gaagacacgc agggggatat tttcgagccg gccccaccgc cagaagcggc ggatctcgag
      841 gaggaacccg cctcccgaac gagcacggcg gaagacgccg atccagaccg tgtcgtggac
      901 gatgccggtg aggcgggtgc cgaagacatt accgcccagg cgagtcccta cgaggacgac
      961 gacgccaagg tggcgcctga agacgaagca gcccagtcga gttccgacga tgatgacgac
     1021 gccaaggtgg ccgctgagga cgaggcggcc ctggcgagca atgatccgaa cactccggcc
     1081 gaggccctcg ccgacgctga cgaggagccc cccaatccgc ttgtccagac attcttgaaa
     1141 ttacgtgagg gccggcgtcg tgagaccgct gacctccacc aagacatcgt taaacccgcc
     1201 gagccaccgg cggttgccca gatggatcgt ggcaaaaaac acgacgagac cgggcgccat
     1261 gagatggccg tcgaggaatt cctcgaggcg gtggaactcg accccgagaa tgtcgacgct
     1321 ctgacccgtc tggctgcggc gtacggagca ttgggtcgtt tcgtggaggc ggacgaagcg
     1381 atcggcaagg ccatgaaaat cgcccccgag gacgtggagg ttcaggccgg cgaggggatc
     1441 ctgtctttcc ggaaggggct ttactccgaa gcggaggtcc gactcaaacg cgtttgcacg
     1501 gctcattcgt cacacggtcc ggcgcatttt taccgtggcg aagcgttgaa ccggcttggg
     1561 cgtgtggagg aagcggtcga gactatggaa cggactatcc agcttcagcc gcgcaactgg
     1621 cgcgcatatc acacgctggg gatgctgttc gatcggctcg aggatcggga gcgggcttcg
     1681 gagatgtacc ggcgggcgag agagctcaac cctctgtgat cgacgtgcgt cgggcgaatc
     1741 tggccacttc tgacgcggaa tgtattctcc gctcggtcgg ttcgaacctg gagcctttga
     1801 cggcaatcag tagagaagtc ggagcggccg gcctttcgtt gctgtgtgtg atgctggcag
     1861 cgagcgcagt ctccgctcag gacgatcaag agtctgaacc cttctatatg ttcaacgcgc
     1921 tgtggtttcg ggaagatgga ggggctcaga agtactcaga atacttgcag gcagctggac
     1981 cgtttgtctc caagcatggt ggccaggtaa acgacaccta cgcgccagag caggcgttga
     2041 tcggagaatt tgatgcggac ctggtcttct tcgtggaatg gcctaaccaa gaagcattca
     2101 ccagcctgtt tcaagatccg ggctatcagg ccatcgcaca tctgcgagaa gaagcgatcg
     2161 taaattccct gctaattcgt tttcgaaagc taccttgaac tctgtcccca aggccgcgag
     2221 gccggcacct cggagagtaa tgttcgcttc cgcgatcgga gatggaatga caagaacggc
     2281 tcatgccgcg ttggtcggcg ctctcctggt gggtgtctct ggctgcgcga tcctcgattc
     2341 gttcgggggc gaatcgaggc ctcgtagggt gcagagacta ctcagttctc ttgccgcgga
     2401 ctcgatggag gggcgggcca tggggtccct cggttcattg atggcagccg gattgatcgc
     2461 cgaggagttc gacgacgcag gagtggctcc ggccggaacc actggatacc tccaggagat
     2521 tcgcgctgtc cgggttacga tcccaggccg gagatctcgt gttctcaccg tagaggcggc
     2581 tgacaccttc cccgactcgg tcaaggaata catgtcggat cggaacctcg tcgggatcgt
     2641 gcggggctcg gatcccagcg tggcggacga gatcatcgtt gtcggcgccc acttcgatca
     2701 tgttggagtg ggcaggccgg tcgacgggga ttccatttac aacggcgccg atgacgacgc
     2761 gtccggtgtg gtcgccgtgc tggaggccgc gcgagacctc gctttgggaa cgccccctag
     2821 gagaaccgtg gtttttgcgc tgttcactgg agaggaggcc gggggggtgg gaagcgggtg
     2881 gtatctggat catcctgccg tgcccctgga gcaaactgtg gcccaactgc aggtcgagat
     2941 gatcggacgt ccggacgatc gagcgggcgg gcccggcaat ctctgggcaa ctggctatga
     3001 ccgttcaacg gtgggagcaa cgctctctgc gctcgggatc cccgtggtgg ccgacccgta
     3061 tccggatgag aatttctttt tccggagcga caacgtgagg ttcgcatacg cgggaatccc
     3121 ggcccacacc ttgtcttcgt tcaacctgca caccgactac catagcccct ccgacgaggc
     3181 cgagagagtg gacatggacc atttggtggc ggctgtcgag accatcatcc gggcgatccg
     3241 cgcccttgct gatgctccga gcgcgcctgt ttggaatgag ggggggcgcc cggaggcaag
     3301 agccggtgcg tgagggggaa acccggtttt gaagcgtcgc accttcatta gaaccggtgg
     3361 tctgatcggg gcggctggtg cggctggggc cattggggtg ggcctcacgc gggatcgcgg
     3421 gggtcgagac gtcgcgcccg agatcccgcc cgccacagga tcgtcggcgg gtcggtcagc
     3481 gggtcctgta gcgaggtccg gtgctgagcc cgacttggtg ttgcggcgcg ccacggtttt
     3541 cgacggttcc ggtgcaccag gcgtagaggt ggacgtcgcg gtaacgggcg atcgtatcac
     3601 tgaagtgggg aacgtcaccg cgatcggcgc agaggagatc gacttggcgg gaatggcgct
     3661 ggcgccgggg ttcgtcgaca tccactccca cgccgatctc tcgctcttcg tcaacccgaa
     3721 cgcggagagc cgcattcggc agggtgtgac cctggagatc gtgggccagg acggtagttc
     3781 ggtcggccct ctgtccgaag cgggctcccg agcaactcgg gagcgatacc gaaacacata
     3841 cggcgtcgac gttgacttcc gggatctcgg aggtttcttg gatgcgctgg atcgcgcgcc
     3901 cgccaccgtg aatctcgcca cgatggtcgg ccacggaacc gtccgtggtc tggtggtcgg
     3961 cggcgcggac cgacccgcca cggccgacga gcttcagcgt atgcgagggc tgattcgaga
     4021 agccctcgat cagggcgcgg tgggattgtc ttcgggactg gagtacacgc ccggatcctt
     4081 cgcggacggg aatgagttgg tcgagctcgc caaggagttg caggggaccg gttatccata
     4141 cgcgtcccac atgcggagcg aagacgacgg tctctttgcc gccgtcgagg agacccttta
     4201 tgtgggtcaa atggctggag tacccgtcca gatatcgcac ctcaaggccc aaggagaacg
     4261 caactggtgg aaggcaaagg cgattcttcg ctcgatcgaa gaggcgcgcg ccgccgggct
     4321 cgacgtccac ttcgatcgtt atccttacgc tgcctatgcg acggggctct ccaatctctt
     4381 ccccgcctgg gcgcgggcgg gcggaagtgg tgccttcatt cggcgtttgc aagatgcgac
     4441 cgatcttccg gccatcgaaa gctttacacg ggcgaaagtg gcattgttgg gttcgtggaa
     4501 tgccgtacag atcagctcga ctcggacttc gggcaatgcg tacgcaagag ggcgccgtct
     4561 gggcgacttg gcccgagaga ggggcgaaga gccgttcgcg ctggcggtcc gcttgatcgt
     4621 cgaggagggg aacagcgtgg gcatgatcgg attcggaatg tcggaggaga acaccgccga
     4681 gattctggca catccgcttg ggatggtgtg ttcagacggg ggatcgtacg ccccgtacgg
     4741 acccctttca ggtggttccc cgcatccccg gggttacggc acgttcccgc gcttgatggg
     4801 ccactacgtt cgttcgggcg ccctttcgtt ggcgctggcc gtgcacaaag tgacgggcct
     4861 gccggcgcgg aaactcggcc tggacgaccg gggcgtgaac aaggcggcca gtaatggcgc
     4921 aactccgctg ttctgtgcgg ccgcgagtgg ccacgcggag gtggtgctct ttgtgaacgg
     4981 catccccttg gtggtggtga acggccaagt gactttgaga gatggtgaac aaacaggcgc
     5041 ccgtgctggg cgggcggtgc gggggcgggg agccgctgcg tagagcttcg atgtcgaccc
     5101 gcgttcccct tccaaccgtc gtctagtccg aaaccgtgtc agacgccttt gtcagcctag
     5161 cacccgatct gggagtggcc ttccgggaga gtagcggcga gccctggccc gacgatgtat
     5221 tcgacgcgtg ggcgcagcgg gtcttcgctt atcaattcgc gaccaactct gtctacgcgg
     5281 cttttgcgga caagcggggt gtcacgccgt tcaccgtgga gcactggacc gaaattccgt
     5341 gggtccccgc ttcggcgttc aaggaggtct tgctagtgtc cggggacgtc tcccaagtgc
     5401 agcgtgtctt ccgaacgagc ggaactacgg tagacacgcg cccgggcgcg ggccggggag
     5461 agcatcacgt gctcgacctt gatctgtaca aagcttcgat gattccgaac atgagacggc
     5521 atttctattc gggtcccgct ggatcagacg ccgctggagc cagcgcggta ggacgaccga
     5581 tcctggcgct cacgccctct ccggctgagg tgcccgactc ttccttgagt ttcatgttgg
     5641 gagccgctct cgatgtgctg tcgggggggg agggtggttt tttcgtcacg gcggacggag
     5701 tcatcgacac cgccggactg agtgatgcgt tggtcgaggc cacagcccgc ggggctccaa
     5761 tccttcttac cgggacggcc ttcgcgtttg tccactggct ggattggctt gaggagcgga
     5821 gtgtccgatt ccacttgcct gccggatcac ggatcatgga gactgggggc ttcaagggcc
     5881 gctcccgggt tctggatcgc cccgagttct atttatctct ctccgaggca cacggcgtcc
     5941 ccttcgacga gatcgtgaac gagtatggca tgaccgagtt gttgtcgcag ttctacgacg
     6001 gaccggttgc ggtgatgtca gaatcgggga ggtcggaagc cggggcttcc gatgcggcga
     6061 tggctgatgc aggagcggcg ggcctggggg ctcgccggca cgtccctcct ccttgggtcc
     6121 gaacgcgggt gctcgatccc cagacgctgt cggctcttcc ggatgggaac ccgggtctgc
     6181 tctgccactt cgatctggcg aacgcggggt cggtgatagc catcctcacc gaggatctgg
     6241 gaatcgcggt cgacggagga tttcaggtgc tcgggcgggt ccagggtgcc gagccgaggg
     6301 gctgttccat cgcgatggac gacctcttgt cgggtgtgcg gtcgtgaaga gcccattcga
     6361 cgcttgctgt ctcccacctt cttggggcga atcgttcgat acctccgaac ccttcgtctg
     6421 ggaaaccatt caggtcgccg aagatcaggt cgccgaagat caggtcgttg aagaccaggt
     6481 cgctgatgtc gccgttcgtt acccagtgtg gtctggtgac gccgccgagc agctcctggt
     6541 ggctctgagc gaggcaggtg cgcgactggc cggtgtgccg tcaagtgacg tggtgatcgg
     6601 aatcggtcga gtggcggatc gtttcctcga cgcctcggac tcccttcgcc gagaggcgat
     6661 cgacttgctt cgacccacgg cggg
//