LOCUS JAFZSS010000235 4397 bp DNA linear ENV 29-MAR-2021 DEFINITION Muribaculaceae bacterium isolate RGIG8063 Water_buffalos_Rumen-2__c2683842, whole genome shotgun sequence. ACCESSION JAFZSS010000235 JAFZSS010000000 VERSION JAFZSS010000235.1 DBLINK BioProject: PRJNA657473 BioSample: SAMN16348565 KEYWORDS WGS. SOURCE Muribaculaceae bacterium (gut metagenome) ORGANISM Muribaculaceae bacterium Bacteria; Bacteroidetes; Bacteroidia; Bacteroidales; Muribaculaceae. REFERENCE 1 (bases 1 to 4397) AUTHORS Xie,F. TITLE A highly-resolved spatial and functional map of the ruminant gastrointestinal microbiome JOURNAL Unpublished REFERENCE 2 (bases 1 to 4397) AUTHORS Xie,F. TITLE Direct Submission JOURNAL Submitted (24-NOV-2020) Laboratory of Gastrointestinal Microbiology, Nanjing Agricultural University, Weigang NO. 1, Nanjing, Jiangsu 210000, China COMMENT The annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP). Information about PGAP can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Date :: 01-AUG-2020 Assembly Method :: MEGAHIT v. v.1.1.1 Genome Representation :: Full Expected Final Version :: Yes Genome Coverage :: 30x Sequencing Technology :: Illumina NovaSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 03/23/2021 16:57:40 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline (PGAP) Annotation Method :: Best-placed reference protein set; GeneMarkS-2+ Annotation Software revision :: 5.1 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 1,524 CDSs (total) :: 1,503 Genes (coding) :: 1,500 CDSs (with protein) :: 1,500 Genes (RNA) :: 21 tRNAs :: 19 ncRNAs :: 2 Pseudo Genes (total) :: 3 CDSs (without protein) :: 3 Pseudo Genes (ambiguous residues) :: 0 of 3 Pseudo Genes (frameshifted) :: 0 of 3 Pseudo Genes (incomplete) :: 3 of 3 Pseudo Genes (internal stop) :: 0 of 3 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..4397 /organism="Muribaculaceae bacterium" /mol_type="genomic DNA" /submitter_seqid="Water_buffalos_Rumen-2__c2683842" /isolate="RGIG8063" /isolation_source="ruminant gastrointestinal tract" /host="water buffalo" /db_xref="taxon:2498093" /environmental_sample /collection_date="2018-11-20" /metagenome_source="gut metagenome" /note="metagenomic" gene <1..740 /locus_tag="J5503_06545" CDS <1..740 /locus_tag="J5503_06545" /inference="COORDINATES: ab initio prediction:GeneMarkS-2+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS-2+." /codon_start=3 /transl_table=11 /product="hypothetical protein" /protein_id="MBO4804189.1" /translation="QDRFLANYIVKNHNAIAPQLMEAGIHTIFTGHLHVTDAATQYNE SRTDSIVEVATGSAICYPFALRVATLNRDKRSLDIDTRWLNATATCPTLRESGRQRII NSTPGMAATLSNKAWSKLGGRIGQIKAMLEMNGSKANVPENPQQATQLVLRHLSEVFS RAMLAVVEGNEQEKDVEDIIEQGKQGVRAMIAEVIPDEADNMWEFFLGSVYPNLEPMV RSILEDRNAVGADGESHTDDLRLTVTL" gene 834..2408 /locus_tag="J5503_06550" CDS 834..2408 /locus_tag="J5503_06550" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013064921.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="alpha-galactosidase" /protein_id="MBO4804190.1" /translation="MMKKLSAILLLLALAIPAGAQRTPTMGWSSWNTFALDISEQLIR QQADAMHNSGLQKAGYRYINIDDGYWQGRGDDGQLRLNTERFPSGMRALVDYIHSLGL KAGIYSDAGDNTCGSGNRRAWGVGVGLAGHEEQDIRLYFRDWDFDFIKVDWCGGRHLD LDEREQYTRISNAIKNCGKEDIVFNICRWAYPGTWGADVADSWRTTGDIYDNWKSVKE ILAENLYMSAYCHDGHYNDMDMLEVGRSMSAVEDETHFGMWCIMSSPLLIGCDMSNIK PRALRLLTNSDLIALNQDPLHLQAYIAAKQGECYVMVKDIKNLYGKERAFAVYNPSDA DQTVSLKFSTIDLGGKVELYDCFAQANAGTAIDEMTIQVPAHGTRIYRAKAQQRLERK VYEAETAFLSQYQEIRSNYDRFTAIYYPKAEASGGYIACYLGYRSHNDLQWQHVNIGH DGRRTMTIYYFSGEQREFNIEVNGEHVGTYTVNGKDWTKRQSMKIDINLKKGENTIRL WNDGLIMPDIDMMELD" gene 2422..3837 /locus_tag="J5503_06555" CDS 2422..3837 /locus_tag="J5503_06555" /inference="COORDINATES: protein motif:HMM:NF024258.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="metallophosphoesterase" /protein_id="MBO4804191.1" /translation="MSQTTRNIIKIAVISDLHVMAPQLVINDGTAFQEYLNQDRKMLH ESPEILDTLIGNILQLMPDLVLVTGDLTKDGELASHLLVAEKLQGLVDAGIQVLVVPG NHDINNPDARIYDGDNTQPAQTITRQQFAEIYRNMGYDNDSQRDPDTLSYRRDINNQL TILAIDACMDRLNTFIAKGDAQDHCKTSGMLEASSQQWIVDQATKAKAAGQRVIAMMH HHLVPHFHMEDTLAAPYMVKDAKQLCRQLIEAGVHVVFTGHLHISDISQTSSREGNMV EVATSAAVGYPCQWRVINCNPYSGKLQLHTRWVESLPSQPDFGERARQMFVNCIPTIV HGVLNRYWNEVSQAIDNYRHQHPFIMRYVNLPDTPEGTANLLLKHLQESVTKAYIAFA EGNEGGNDHQQLVEQLIKGMGHVVNETVRGILKPIAKIVLRLRIYGIVRLVLRSILED RNDIGTAHEAVINDHTAIVSF" BASE COUNT 1160 a 1265 c 1087 g 885 t ORIGIN 1 aacaggatag attcttggcc aattatatcg tcaagaacca caatgccatt gctccccaac 61 tgatggaggc aggcatccac accatcttca cgggacatct gcatgtgacc gatgccgcca 121 cccaatataa tgaatcccgt accgacagca tcgtcgaggt agccacaggc tcggccatct 181 gctatccctt tgccttgcgc gtggcgacac tgaatcgcga caaacgcagc cttgacatcg 241 acacacgatg gctcaacgcg acagcgacct gccccaccct gcgtgaatcg ggccgccagc 301 gcatcatcaa ttccacgccc ggcatggcgg caacgctctc caataaggct tggagcaagc 361 tgggtggacg catcggccaa atcaaggcaa tgcttgaaat gaatggaagc aaggccaatg 421 tacccgagaa tccacagcaa gcgacacaac tcgtcctgcg ccacctgagc gaggtgttct 481 cgcgtgccat gcttgccgtg gtcgagggca acgagcagga aaaggatgtc gaggatatca 541 tcgaacaggg caaacagggt gtacgcgcca tgattgccga agtgattccc gatgaagccg 601 acaacatgtg ggaattcttc ttgggcagcg tttatcctaa tttggagcct atggtgcgca 661 gcatccttga ggaccgcaat gccgtgggcg ccgatggcga gtctcatacc gacgacctgc 721 gcctgaccgt caccttataa aaattacagt ttttgtgccc atgggggcac tattcaagaa 781 ataatcgcta tatttgtggt cactgaatcc tgcaacctca caatgtgacc acaatgatga 841 aaaaactatc cgcaatcctc ttattgctgg cgcttgccat accggcaggg gcgcagcgca 901 cacccaccat gggctggagc tcgtggaaca cctttgcgct cgacatcagc gagcagctca 961 tccgtcaaca agccgacgcc atgcacaaca gcgggctgca aaaggcgggc taccgctaca 1021 tcaatatcga cgacggctac tggcagggac gcggcgacga cgggcagttg cgcctgaaca 1081 ccgagcgctt tcccagcggc atgcgagccc ttgtggacta catccactcg ctgggcctga 1141 aggcgggcat ctacagcgat gcgggcgaca acacctgcgg ttcgggcaat cgtcgggcct 1201 ggggcgtcgg cgtgggactg gcgggccacg aggagcaaga catcaggctc tacttccgcg 1261 actgggactt cgatttcatc aaggtcgact ggtgcggtgg caggcatctg gacctcgacg 1321 agcgcgaaca atacacccgt atctccaacg ccatcaagaa ctgtggcaag gaagacatcg 1381 tcttcaacat ctgccgctgg gcctaccctg ggacatgggg cgccgatgtc gcagactcgt 1441 ggcgcaccac aggagatatc tatgataact ggaaatcggt gaaagaaatc ctggccgaga 1501 acctctacat gagcgcctac tgccatgacg ggcactataa cgacatggat atgctggagg 1561 tgggacggtc gatgtcggcc gtggaggatg agacacactt cggcatgtgg tgcatcatgt 1621 cctcgccatt gctcatcggc tgtgatatgt ccaacatcaa gccacgggcc ctgcgcctgc 1681 tcaccaacag tgacctcatc gcccttaatc aggatcccct gcacctgcag gcctatattg 1741 ctgccaaaca gggcgagtgc tatgtgatgg tcaaggacat caagaacctc tatggcaagg 1801 agcgcgcctt tgcggtttac aaccccagcg atgcggacca aaccgtcagt ctcaagttca 1861 gcaccatcga cctggggggc aaggtggaac tctatgactg cttcgcccaa gccaacgcag 1921 gcaccgccat cgacgagatg accatccaag tgcccgcaca cggcacgcgc atctaccgcg 1981 ccaaggccca gcaacggctg gaacgcaagg tctatgaggc cgagacagct tttctgagcc 2041 aatatcagga gatacgcagc aattacgacc gcttcacggc catctattac cccaaagcag 2101 aggcctcagg cggctacatc gcctgctatt tgggttaccg ctcccacaat gacctgcaat 2161 ggcagcatgt caacatcggt catgacggcc gtcgcaccat gaccatctac tatttctccg 2221 gtgagcaacg cgaattcaac attgaggtca atggtgagca tgtgggcact tatactgtca 2281 acggcaagga ctggacaaaa cgccagtcga tgaagattga cattaacctc aagaagggcg 2341 agaataccat ccgcctgtgg aacgatggtc tgatcatgcc tgacatcgac atgatggaac 2401 ttgattaaca attggatacc catgagccag actaccagaa atatcatcaa aatagccgtc 2461 atcagcgacc tgcatgtgat ggcgccgcaa ctggtcatca atgacgggac tgcatttcag 2521 gaatacctga atcaagaccg caaaatgctg cacgagagcc cagaaatcct tgataccctt 2581 atcggcaaca ttcttcaact catgcccgac ctggtgcttg tgacaggcga ccttaccaag 2641 gatggcgagc tggcaagtca cctgctcgtg gccgaaaagc ttcaaggcct tgtagatgcc 2701 ggcattcagg tgctcgtcgt cccggggaac cacgacatta acaaccccga tgccagaatc 2761 tacgacgggg acaacaccca gccggcccaa acaatcaccc gccaacagtt tgccgaaatc 2821 taccggaaca tgggctacga caacgactcg cagcgggatc cggacacctt gagctaccga 2881 cgggacatca ataatcaact cacgatcctt gccattgacg cctgcatgga ccggctcaac 2941 acatttatcg ccaaaggaga cgcccaggac cactgcaaga ccagcggcat gctcgaagct 3001 tccagccaac aatggattgt ggaccaagcg accaaggcca aagcggcagg acaacgcgtc 3061 atcgcgatga tgcaccatca cctggtgccc catttccaca tggaggacac cctcgccgcc 3121 ccctatatgg tcaaggatgc caagcaactg tgccggcaac tcatcgaggc aggcgtgcat 3181 gtcgtcttca cggggcactt gcacatcagc gacatcagcc aaacaagctc acgagaggga 3241 aacatggtgg aagtggccac ctcggcggcg gtgggctacc cctgccagtg gcgcgtgatc 3301 aactgcaacc cctatagcgg aaaactgcag ctgcacactc gatgggtcga gagcctgcca 3361 tcgcaacccg acttcgggga acgggcacga cagatgtttg tcaattgtat ccctaccatc 3421 gtgcacggcg tgctcaaccg ctactggaac gaagtcagcc aagccatcga caactaccga 3481 catcagcatc ccttcatcat gcgatatgtt aacctgcccg acactcccga aggcaccgcc 3541 aacctgctgc tcaaacacct gcaggaatct gtcaccaagg cctatattgc cttcgccgag 3601 ggtaacgagg gcggcaacga ccaccagcaa ctggtcgagc agctcatcaa gggcatgggc 3661 catgtcgtca acgagaccgt gcgaggtatc ctcaagccga tcgccaagat cgtcctgagg 3721 ctgcgcatct acggcatcgt cagactggtg ctgcgcagca tcctcgagga ccgcaacgac 3781 atcggcaccg cccacgaggc cgtcatcaat gaccacacgg ccattgtttc attctaatta 3841 tctaaaaacc agtcaagtgc atcaagaact gcacttgccg tgagaacata aaaaaatgac 3901 gactatgatt tttcacaaaa gcacagtcgc cgaatttttc gagtataaga aatttacttt 3961 attaaagaaa agttttagat ttccgcctgc aaatatacag gaattttcat aaatcacaaa 4021 tttttcttgg gtaaaatcac aaaaattcta ttaaaatcac taacgctttt attaccaatc 4081 tcttatggaa taatggagtt ggtgaatcat cggtcaatcc aacaataccc cctgaaatca 4141 cgggcagaca ccccaagaga aacgcccatg aaatacctgt ttttatgggt ttcatcgaaa 4201 tccacagcga tatgaaaaca gcatttattc gtcgcaaaaa aggccataat cgcttgcatt 4261 tggttgcatt taaacagttt tctctactga aatacatagt tttgatatca ttttgcctat 4321 tttatcgcta ttttgtttgg tttgataggt tttactacct atttttgcaa ctatattaaa 4381 ataagcaact tatcaat //