LOCUS       RPQC01000310            2114 bp    DNA     linear   ENV 25-NOV-2018
DEFINITION  Methanoregulaceae archaeon isolate metabat2.147 Contig_528731,
            whole genome shotgun sequence.
ACCESSION   RPQC01000310 RPQC01000000
VERSION     RPQC01000310.1
DBLINK      BioProject: PRJNA330672
            BioSample: SAMN10417371
KEYWORDS    WGS.
SOURCE      Methanoregulaceae archaeon (sediment metagenome)
  ORGANISM  Methanoregulaceae archaeon
            Archaea; Euryarchaeota; Stenosarchaea group; Methanomicrobia;
            Methanomicrobiales; Methanoregulaceae.
REFERENCE   1  (bases 1 to 2114)
  AUTHORS   Dalcin Martins,P., Danczak,R.E., Roux,S., Frank,J., Borton,M.A.,
            Wolfe,R.A., Burris,M.N. and Wilkins,M.J.
  TITLE     Viral and metabolic controls on high rates of microbial sulfur and
            carbon cycling in wetland ecosystems
  JOURNAL   Microbiome 6 (1), 138 (2018)
   PUBMED   30086797
  REMARK    Publication Status: Online-Only
REFERENCE   2  (bases 1 to 2114)
  AUTHORS   Dalcin Martins,P.
  TITLE     Direct Submission
  JOURNAL   Submitted (19-NOV-2018) Microbiology, Radboud University Nijmegen,
            Heyendaalseweg 135, Nijmegen, Gelderland 6525AJ, Netherlands
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/

            ##Genome-Assembly-Data-START##
            Assembly Date :: 2017
            Assembly Method :: MEGAHIT+Newbler v. OCT-2017
            Genome Representation :: Full
            Expected Final Version :: Yes
            Genome Coverage :: 13.16x
            Sequencing Technology :: Illumina HiSeq
            ##Genome-Assembly-Data-END##

            ##Genome-Annotation-Data-START##
            Annotation Provider :: NCBI
            Annotation Date :: 11/20/2018 19:29:37
            Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline
            Annotation Method :: Best-placed reference protein set;
                                 GeneMarkS-2+
            Annotation Software revision :: 4.7
            Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total) :: 2,816
            CDSs (total) :: 2,766
            Genes (coding) :: 2,721
            CDSs (with protein) :: 2,721
            Genes (RNA) :: 50
            rRNAs :: 1, 1 (5S, 16S)
            complete rRNAs :: 1 (5S)
            partial rRNAs :: 1 (16S)
            tRNAs :: 45
            ncRNAs :: 3
            Pseudo Genes (total) :: 45
            CDSs (without protein) :: 45
            Pseudo Genes (ambiguous residues) :: 0 of 45
            Pseudo Genes (frameshifted) :: 17 of 45
            Pseudo Genes (incomplete) :: 23 of 45
            Pseudo Genes (internal stop) :: 8 of 45
            Pseudo Genes (multiple problems) :: 3 of 45
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..2114
                     /organism="Methanoregulaceae archaeon"
                     /mol_type="genomic DNA"
                     /isolate="metabat2.147"
                     /isolation_source="Prairie Pothole Region wetland
                     sediments"
                     /db_xref="taxon:2485498"
                     /environmental_sample
                     /geo_loc_name="USA: North Dakota, Cottonwood Lake Study
                     Area"
                     /collection_date="2015"
                     /note="metagenomic; derived from metagenome: sediment
                     metagenome"
     gene            complement(<1..224)
                     /locus_tag="EHM53_13130"
     CDS             complement(<1..224)
                     /locus_tag="EHM53_13130"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_011991169.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="transcriptional regulator"
                     /protein_id="RPI35964.1"
                     /translation="MKSGLRNQLAEKMAGEITLSDSPGHALKKWRMNFEIAPGVLSER
                     LGVSPSVISDYEGGRRKSPGTAVVGKIVDTL"
     gene            349..1014
                     /locus_tag="EHM53_13135"
     CDS             349..1014
                     /locus_tag="EHM53_13135"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_015284073.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="DUF120 domain-containing protein"
                     /protein_id="RPI35965.1"
                     /translation="MIPAEDLQCLKAIALRGGCKGPVFVSTQSIGTMLAISQQTASRR
                     LKGLEPQGFITRAMTADGQHVTVTKLGEEDLRREYQEYTRIFSEGGKTYALHGAVVSG
                     IGEGKYYMSLPEYKDQFRTHLGFEPYPGTLNIRLTHSSIPIRKKIDALEWTRIKGFST
                     DGRTFGDAKCIPCRIGTISCGIVVPGRTHYPEDIIEVIAPMALRRKLGVEDSDSVSVE
                     VGP"
     gene            1011..1691
                     /gene="ribB"
                     /locus_tag="EHM53_13140"
     CDS             1011..1691
                     /gene="ribB"
                     /locus_tag="EHM53_13140"
                     /EC_number="4.1.99.12"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_015284072.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="3,4-dihydroxy-2-butanone-4-phosphate synthase"
                     /protein_id="RPI35966.1"
                     /translation="MIDKALAALRDGKFILLYDFDDREGETDFAIRSDAVTPKHILRM
                     RKDGGGLICTAVHPVAARRLGLPFASDALRAIAVAEREGDIPYDRKNHSSFSLWVNHR
                     NTFTGITDRDRALTVNAIAEQVKHSLNGGSVSFHETFRTPGHMALLRAADGLLDVRKG
                     QTELSIAMAQMAKITPAVTICEMLDDESGYALTKADAKLYAKKHGLVFVEGPEVLERW
                     EAEKKSRV"
BASE COUNT      480 a    607 c    593 g    434 t
ORIGIN
        1 agagtatcga caatcttgcc gacaacagcg gtgccgggac ttttcctcct gccaccctcg
       61 taatcgctga tgacggaagg cgagacacca aggcgttccg aaaggacgcc cggagcgatt
      121 tcaaagttca tgcgccactt cttgagggcg tgtcccggtg agtccgacag cgtaatttcg
      181 cccgccatct tctcggcgag ctggttccgc aacccggatt tcatgctcat gatcaatgat
      241 ccggactgtt atatatctaa cgaacacgga ttcgacataa aacgaaacac gacgaagcag
      301 ccgccgaaag gcaactcata aataataagc aactcaattt tgataagcat gattcccgca
      361 gaagatctcc agtgtctcaa ggccatagcg ctccggggcg ggtgcaaggg cccggttttt
      421 gtatcgaccc agagtatcgg aacaatgctt gcgatcagcc agcagacggc atcgcgccgg
      481 ctcaagggac tcgagccgca ggggtttatc acccgggcca tgacggcgga cggccagcat
      541 gtcactgtca cgaagctcgg cgaggaggat ctcaggcgcg aataccagga atacacacgg
      601 atcttttctg aaggcggcaa gacctatgcg ctgcacggtg ccgtggtctc cgggattggt
      661 gagggaaagt actacatgag tctgcccgaa tacaaggatc agttcaggac ccacctcggg
      721 ttcgagccct accccggaac gctcaacatc cgtctcaccc attccagcat tccgatccgt
      781 aaaaaaatcg atgcactcga atggacccgt atcaagggat tttctacgga cggtcgcacc
      841 ttcggtgatg cgaaatgtat cccgtgccgt atcggcacca tatcctgcgg tattgtcgta
      901 cccggccgga cgcattatcc ggaggacatc atcgaagtga tcgcgcccat ggcgctgcgc
      961 cggaagctcg gcgttgagga ctctgatagc gtcagcgtag aggtggggcc gtgattgaca
     1021 aggcgctcgc agcgctgcgg gacggtaagt tcatcctcct ctacgatttc gatgaccgcg
     1081 agggcgagac tgattttgca atccgttccg atgctgttac ccctaagcat attctccgga
     1141 tgcgcaaaga cggggggggg ctgatctgca cggctgtcca tccggttgca gcccggcgcc
     1201 tcggtcttcc ctttgcaagc gacgccctgc gggcgattgc ggttgccgag agggaaggcg
     1261 atatcccgta cgaccggaag aaccattcct cgttctcgct ctgggtgaac caccggaaca
     1321 ccttcaccgg catcacggac cgtgaccggg cgctcaccgt gaacgcgatc gccgagcagg
     1381 taaaacattc cctcaacggc ggcagcgtca gcttccacga gaccttccgt accccgggtc
     1441 atatggccct cctccgggca gcagacgggc ttctcgatgt gaggaaaggc cagaccgagc
     1501 tttccattgc gatggcccag atggcaaaga tcacgccggc cgtcaccatc tgcgagatgc
     1561 tggacgacga gagcggctac gcgctcacca aggcggatgc aaagctctat gccaagaagc
     1621 acgggcttgt ctttgtcgaa gggcctgaag tactggagcg gtgggaagcg gagaagaagt
     1681 cccgcgtata agtacaccgg cactgcgttt tttttgctct ggagaaaacc tcttttcata
     1741 agtgaggata acccctctgc accagccctg caaccatcgg gtgcgggggg cattgtgttc
     1801 aggtaatccg gcactatcgc aacgcatctc cctgccgctg tggcatcgcc ttatggtggc
     1861 agtgattagt accagcaccg aaatatgtgg cggtctgtcc cggtacctgc gtatcctttg
     1921 atttttttac cctgttccca atcttgatac gtccgagtgc agaatgattt ccatcaatga
     1981 cggaacaaca catcctgccg gaactccgga aatccggagg cacacgtgaa ctctacagcg
     2041 agattgagat ccacgcaagt cctgagcgga tatgggagat cctttcagat ttcaagagct
     2101 atccggaatg gaac
//