LOCUS DXAW01000129 6351 bp DNA linear ENV 05-JUL-2021 DEFINITION MAG TPA_asm: Candidatus Coprenecus stercoravium isolate Gambia16-554 Gambia16__C65207_L6351_SAMN16086526, whole genome shotgun sequence. ACCESSION DXAW01000129 DXAW01000000 VERSION DXAW01000129.1 DBLINK BioProject: PRJNA543206 BioSample: SAMN15817085 Sequence Read Archive: SRR12763689 KEYWORDS WGS; Metagenome Assembled Genome; MAG; Third Party Data; TPA; TPA:assembly. SOURCE Candidatus Coprenecus stercoravium (gut metagenome) ORGANISM Candidatus Coprenecus stercoravium Bacteria; Bacteroidetes; Bacteroidia; Bacteroidales; Rikenellaceae; Rikenellaceae incertae sedis; Candidatus Coprenecus. REFERENCE 1 (bases 1 to 6351) AUTHORS Gilroy,R., Ravi,A., Getino,M., Pursley,I., Horton,D.L., Alikhan,N.F., Baker,D., Gharbi,K., Hall,N., Watson,M., Adriaenssens,E.M., Foster-Nyarko,E., Jarju,S., Secka,A., Antonio,M., Oren,A., Chaudhuri,R.R., La Ragione,R., Hildebrand,F. and Pallen,M.J. TITLE Extensive microbial diversity within the chicken gut microbiome revealed by metagenomics and culture JOURNAL PeerJ 9, e10941 (2021) PUBMED 33868800 REMARK Publication Status: Online-Only REFERENCE 2 (bases 1 to 6351) AUTHORS Gilroy,R. TITLE Direct Submission JOURNAL Submitted (09-APR-2021) Microbes in the Food Chain, Quadram Institute BioScience, Norwich Research Park, Norwich NR4 7UQ, United Kingdom COMMENT The annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP). Information about PGAP can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: MegaHIT v. 1.2.9 Genome Representation :: Full Expected Final Version :: Yes Genome Coverage :: 327576x Sequencing Technology :: Illumina NextSeq 500 ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/09/2021 19:58:28 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline (PGAP) Annotation Method :: Best-placed reference protein set; GeneMarkS-2+ Annotation Software revision :: 5.2 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 1,716 CDSs (total) :: 1,677 Genes (coding) :: 1,668 CDSs (with protein) :: 1,668 Genes (RNA) :: 39 rRNAs :: 1 (23S) partial rRNAs :: 1 (23S) tRNAs :: 35 ncRNAs :: 3 Pseudo Genes (total) :: 9 CDSs (without protein) :: 9 Pseudo Genes (ambiguous residues) :: 0 of 9 Pseudo Genes (frameshifted) :: 3 of 9 Pseudo Genes (incomplete) :: 4 of 9 Pseudo Genes (internal stop) :: 2 of 9 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..6351 /organism="Candidatus Coprenecus stercoravium" /mol_type="genomic DNA" /submitter_seqid="Gambia16__C65207_L6351_SAMN16086526" /isolate="Gambia16-554" /host="Gallus gallus" /db_xref="taxon:2840735" /environmental_sample /metagenome_source="gut metagenome" /note="metagenomic" gene <1..584 /locus_tag="IAC04_07715" CDS <1..584 /locus_tag="IAC04_07715" /inference="COORDINATES: ab initio prediction:GeneMarkS-2+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS-2+." /codon_start=3 /transl_table=11 /product="hypothetical protein" /protein_id="HIZ86361.1" /translation="SVSMPVTATAVNHGSIYKPSIVTTVKPWELQFIKDGAVENVSVD PSAVPVQNGLKVYNYVFTNAAYMYDPTIMAGEADNMVSRDNPGQTALERTIIRWYFDS DPRGARVFWRVVSSIPQVVKNTNEQYLGTTPFEETRAFNILGLTYENSRDVQIEVKVS RNGYMDQVKRFNVRQAIDQQEISSFFDLVKDEE" gene complement(663..1574) /locus_tag="IAC04_07720" CDS complement(663..1574) /locus_tag="IAC04_07720" /inference="COORDINATES: ab initio prediction:GeneMarkS-2+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS-2+." /codon_start=1 /transl_table=11 /product="fibronectin type III domain-containing protein" /protein_id="HIZ86362.1" /translation="MNFRHFGTISSLFMLLAAVSCSKDTDKTPEEPAQEPFAAISITE TGIDFAKATVETENTAELTWLCIPASEKADADRIRTEGTEAAEVSETVLLDITGLTQN TDYTLYVLAANGDMQFLASAPFKTEEDVTADALVLADGFMAYDGLDEASGMYRTNIMM CSQEMGSDNYPYYDVLIYVYTAEPLEKVDDYYRAVPFGDVTPFYTNGQGLTDMMYYIG QHVTDSEGEPNMSGSGWVYYDENGGVEYYVADDTDNTRISIADNGDGTYTVSGTLVDK TMGEELKFVYTDDKLVFSIDQSYASNQ" gene complement(1589..3442) /locus_tag="IAC04_07725" CDS complement(1589..3442) /locus_tag="IAC04_07725" /inference="COORDINATES: ab initio prediction:GeneMarkS-2+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS-2+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="HIZ86363.1" /translation="MNQHLKHLIGLSAGIFACAAIISSCSKTETIEPPITGGEDTTTV TTPEEPSVILTEGPVSQTSISFTIIPENAEECAYLITEKEDGAEVPDAETIFKDGELL EEIKSQLIEKDGMEAGTEYVVTAAVRNQDLTAVSEQLILKTDSIAQGTTMEVTISDVS ATTSTITFTVTCANAGKAAYDVVEKGVEEHTPADVMYKGKEIPTQEPQTITVEGKKDN TTYVIYAVVEATDSYKRVMDTYEITTEKLPEPEETDVEKFTDGTIKMYGAGGRNYTIT TLNEDYEIYLDFYCDEANAYMPYIASHEYTYEKAGSTSDWAIGTMSYVKEISSGNRLS FEKGSFTSSISDGEYTIEGMFVTTDNIEFKFNITGQLDFIISPYYPSAESSKTDNGMN ITINMDSFILSLMMQSDKIGGTHTVGQDIASESEFSLYYKSGSFGLQSGTVTFEDKGD DYYVFSADLILEGGYPVKINSDSYMHITAPEVIEDDVIIFTSAEAYGLPDNTGYGVLY TLDLDSDEWDVSIEFCEYGEYDELPTGKILFASWLMGGEGGGAGEITGYRITNKSTNE TIEDLDEGEMVISLNDSTYEVAIDIVRTNGESFMGKYTGPIVCEDASEYGY" gene complement(3626..5329) /gene="tssO" /locus_tag="IAC04_07730" CDS complement(3626..5329) /gene="tssO" /locus_tag="IAC04_07730" /inference="COORDINATES: protein motif:HMM:NF036668.1,HMM:NF037660.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="type VI secretion system TssO" /protein_id="HIZ86364.1" /translation="MTMKLHRYILITALLVLLEAVAAPSAVSKEDGEDGIREQLSAIK GADMTIYSLYCNEDDYSKKMRCAGIFLSGLDSAAANTALAVMNTELADWYENEKFLFS KAIQYREQSLRIYTALDCLDKIADTKYCLARLYYKKGLYHNALKYIYDALDDYTALGD KVGRAECYNIMGILYHICKDYEKSKACFSKYAENASELNDSLRMVLALSNSAAFEHAM SDSVKSETLIAESLELCRNLKDTARLCTVLQNLIGISVAQGKYEEAEQYFSRIRPLLN NIELKANYNLSRGNLYRLKGQYDSAAVLIEKAITYYEQGEFDRKRMQCYLLLENVYSM SGDTDKAYKALRNYYGIENSADRNDVFLELFRAQNDIILQHDRENLLRQQNKWRTLTL TAIFIVLVTALVIYIYYYRKSEQIKKNERELRSKTEMLELKKMQNYQMEQMAKNIVDK LKKLCSGIKEQAVRNRIQDICNELSSTKDEKQWKELSQYIPEFNSDFYNALIKDFPNL TINERRLCSLLNMNLTTKEISEITRQSPKSINMARTRLRGKLGLTDSGISIHEFLAKY N" gene complement(5477..5953) /gene="nuoE" /locus_tag="IAC04_07735" CDS complement(5477..5953) /gene="nuoE" /locus_tag="IAC04_07735" /EC_number="1.6.5.11" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013610539.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NADH-quinone oxidoreductase subunit NuoE" /protein_id="HIZ86365.1" /translation="MDKIELNKCQKDKIAEICAEFGNAPGELINVLHKCQGHFGYLPE EVQREIARNLHIPVAKVYGVVTFYSFFTMQPKGRHSISVCMGTACYVRGAESVLEELK KELKIDVGGVTPDGKFSLECLRCVGACGLAPVMLVDEKVYGRLEPKQIKGILAQYE" gene complement(5972..>6351) /locus_tag="IAC04_07740" CDS complement(5972..>6351) /locus_tag="IAC04_07740" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009135591.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=3 /transl_table=11 /product="iron hydrogenase small subunit" /protein_id="HIZ86366.1" /translation="RGLDGIRHATIDFNGTPINIGIAHTLGNARKLLDGMRAGMYNFH AIEIMACPGGCIGGAGQPFHHGDSSVIKARMDAIYREDAGKKIRKSHENPYIIKLYEE FLGHPMSEKAHHLLHTHYFDKHD" BASE COUNT 1389 a 1579 c 1608 g 1775 t ORIGIN 1 gttcagtaag tatgcctgtt acggctacgg ctgtaaatca cggaagcata tataaaccgt 61 ctattgtaac gaccgtaaaa ccttgggaat tgcaatttat aaaagatggt gctgtggaga 121 atgtatccgt cgatccgagt gctgttcctg tgcagaacgg gctgaaggta tacaattatg 181 tctttacaaa tgccgcctat atgtacgatc ctacgatcat ggccggagaa gccgacaata 241 tggtcagcag ggacaatcct ggacagacag ctctggagcg cactatcatc cgctggtatt 301 ttgattccga tccccgtgga gccagagtct tctggagagt ggtatccagc attcctcagg 361 ttgtgaagaa caccaatgag cagtatcttg gaacgactcc ctttgaagag acacgtgctt 421 ttaacatact cggtcttaca tatgagaatt cccgggatgt tcagatagaa gtaaaggtga 481 gccgcaacgg gtacatggac caggtcaaga gattcaatgt ccgtcaggct attgaccagc 541 aggagatcag cagcttcttt gatttggtga aggacgagga ataatacagt atccggtttt 601 tgataaaaaa gaaacggagt ccgtggattc gggctccgtt ttctgcaata tatatagtac 661 cattactgat tggaagcgta ggactgatcg atgctgaaca ccagcttgtc gtcggtgtag 721 acgaacttaa gttcttcgcc catggtcttg tccacgagtg tcccgcttac ggtgtatgtg 781 ccgtctccgt tgtcggctat ggatatccgg gtgttgtcgg tatcgtcagc gacatagtat 841 tcaacacctc cgttctcatc gtaataaacc catcccgaac ccgacatatt gggttctcct 901 tccgagtcgg tcacatgctg gccgatatag tacatcatgt cggtgagtcc ctgtccgttt 961 gtatagaaag gcgtcacatc accgaaagga actgcccgat aatagtcatc gaccttttcc 1021 aacggctcgg ccgtgtagac gtagataagc acgtcgtaat aaggataatt gtccgagccc 1081 atctcctggc tgcacatcat tatattggtc ctgtacatgc ctgaagcttc atccagaccg 1141 tcgtaggcca taaagccgtc cgccaggacc agtgcgtccg cagttacgtc ttcctctgtc 1201 ttgaaagggg cagaggccag aaactgcatg tcgccgtttg cggcgaggac gtaaagagtg 1261 tagtctgtgt tctgtgtgag tccggttatg tccagcagta ctgtctcaga tacctctgcc 1321 gcttcagtgc cttcagtccg tatcctgtct gcatcggctt tctcggaggc cgggatgcac 1381 agccatgtca gctccgcagt attctcggtc tcaactgtgg cctttgcgaa gtcaatgcct 1441 gtctctgtga tgcttatggc cgcaaaaggt tcctgtgccg gttcctccgg agtcttgtcg 1501 gtgtccttgc tgcagctgac agcagccaac agcataaata acgacgatat cgttccaaaa 1561 tgtctaaagt tcatggtctg tgattatttt aatatccata ttcacttgcg tcctcgcata 1621 ctatgggacc ggtatatttg cccataaagc tttctccgtt tgttctgacg atgtcgattg 1681 cgacctcata ggtgctgtcg ttcaggctga tgaccatctc tccctcgtcc agatcctcaa 1741 tagtctcgtt cgtggacttg ttggtaattc tgtatccggt aatctcacct gcgccaccgc 1801 cttcaccgcc cataagccat gatgcgaaca gtatttttcc agtgggaagc tcatcgtatt 1861 caccgtattc acagaactct atggagacat cccattcgtc gctgtcaaga tcgagagtgt 1921 agagcactcc atatccggta ttgtcaggca gtccgtatgc ctctgctgat gtaaagatga 1981 ttacatcatc ctcaatgact tccggagcag tgatgtgcat atatgagtca gaatttatct 2041 tgaccggata tcctccctcc agtataagat cggcagagaa gacataatag tcatctcctt 2101 tgtcctcgaa tgtcactgtt ccgctctgga gtccaaagga gccggatttg tagtagaggg 2161 agaattcgga ttcagaagcg atgtcctgtc ctacagtatg ggttccgcct attttatcgg 2221 actgcatcat aagggacagt atgaatgagt ccatattgat ggttatattc attccgttgt 2281 ctgtcttgct ggattcggca gaaggatagt atggactgat gatgaagtca agctgtccgg 2341 ttatattgaa cttgaactct atgttgtcgg ttgtgacaaa cataccttca atggtgtatt 2401 ccccgtctga tatagaagat gtgaaggaac ccttctcaaa agaaagtctg tttccggaag 2461 agatctcttt cacataactc atagttccta tagcccagtc agaagtagat ccggctttct 2521 cgtatgtata ttcgtgcgat gctatgtacg gcatatatgc gttggcctcg tcgcagtaaa 2581 aatccagata gatctcgtag tcctcattca atgtggttat cgtatagttg cgtccgccgg 2641 caccgtacat tttgatggtt ccgtcagtga atttttccac atcggtctcc tccggttccg 2701 gcagtttttc ggtagtaatc tcgtaagtat ccatcactct cttatatgag tcggtcgctt 2761 cgactacagc atagatgacg tatgtggtat tgtccttctt gccctctact gttatggtct 2821 gaggttcttg agtagggatc tctttgcctt tgtacattac atcggcaggc gtatgctcct 2881 ctacaccctt ctcaaccaca tcgtaagccg ctttaccggc gttggcgcag gtgacggtaa 2941 atgtgatggt gctggtagtt gcggatacat cggagatagt cacttccatg gtcgtaccct 3001 gggcgatact gtcggtcttc agtataagct gctccgatac ggccgtcagg tcctgattcc 3061 ttacggcagc ggtcaccacg tattcagttc cggcttccat tccgtccttc tcgatgagct 3121 gggactttat ctcttccaaa agttctccgt ccttgaaaat ggtctctgca tcgggaacct 3181 cggcaccgtc ttctttctca gtaataaggt atgcgcattc ctcggcattc tcagggatga 3241 tggtgaagct gatggatgtc tgggagacag ggccttccgt aagtatgaca gagggctcct 3301 caggcgttgt caccgtagta gtgtcctcac cgcctgtaat gggcggctcg atggtctcgg 3361 ttttgctgca gctggagata atagccgcgc aggcaaatat gccggcggac agtccgatca 3421 gatgtttgag atgttgattc atagtgttgt gaatttattg agttaatgtt ttgtttacgg 3481 ctcaaaagta attcacagga aatcttccgt caaaatatcg cggaaacaac ggaactacat 3541 acaggaatac ctgtgctaca aaattactac agcagcaccc agcagactta taaacagagt 3601 attaagaata taggctatgt tcggactaat tgtatttggc cagaaattcg tggatggaga 3661 taccgctgtc agtgagtccg agtttgcccc ggaggcgggt tctggccata ttgatgctct 3721 tgggcgactg acgtgtgatc tcggatatct cctttgtcgt aaggttcatg ttcagcagcg 3781 agcacaaccg cctctcgttt atggtcaggt tggggaagtc cttgatgagc gcgttataga 3841 agtcggagtt gaattcgggt atatactgac tgagctcttt ccactgcttt tcgtccttgg 3901 tgctggaaag ttcgttgcag atatcctgaa tcctgttgcg gacagcctgc tcttttatcc 3961 cggagcacag ttttttgagt ttgtccacta tgttcttcgc catctgctcc atctggtagt 4021 tctgcatctt cttcaactcc agcatttcgg tctttgagcg cagctcccgc tcgttcttct 4081 tgatctgctc ggatttcctg tagtagtata tgtagatgac gagggccgtg acaaggacta 4141 tgaatatcgc tgtcagggta agtgttctcc acttgttctg ctgcctgagc agattctccc 4201 tgtcgtgctg caggatgatg tcgttctgcg cccggaaaag ttccaggaac acatcattcc 4261 tgtcggcgct gttctctatg ccgtaatagt tcctcagtgc cttgtaggct ttgtctgtat 4321 cgccgctcat gctgtagacg ttctccagca gcaggtagca ctgcatccgt ttcctgtcga 4381 actctccctg ttcatagtac gttatggctt tctcaatgag gactgccgcc gagtcgtatt 4441 gtcccttgag gcggtacagg ttgccgcgcg ataggttgta gttggctttc agttctatgt 4501 tgttgagcag cggccgtatt ctgctgaaat actgttcggc ctcctcgtat ttgccctgag 4561 cgacagatat gccgataaga ttctgaagta ctgtacacaa ccgggcagta tccttcaggt 4621 tccggcagag ctccagggac tcggcgataa gggtctcgga tttgacggaa tcgctcatgg 4681 cgtgttcaaa cgcagctgag ttgctcagtg ccagcaccat cctcaggctg tcgttcagtt 4741 cgctggcgtt ttccgcatac ttgctgaagc aggccttgga cttttcgtag tccttgcaga 4801 tgtggtacag gatacccatg atgttgtaac attcggctcg tccgaccttg tctccgagcg 4861 cggtgtaatc gtccagggca tcgtagatgt atttcagcgc attgtggtac aggccctttt 4921 tgtagtaaag tctggccagg cagtacttcg tgtcggcaat cttgtccaga cagtccagtg 4981 cggtgtatat tctgagggac tgttcgcggt actgtatagc cttggaaaac agaaacttct 5041 cgttctcata ccagtcagcc agctccgtgt tcatcacggc cagggcagtg tttgcggcag 5101 cagaatcaag ccccgagagg aatatcccgg cgcatctcat cttcttactg tagtcatcct 5161 cattgcagta cagggaatag atagtcatgt cggccccctt gattgcggac aactgctccc 5221 gtatgccgtc ttccccatct tccttagaaa ctgcggacgg tgctgccacg gcctccagca 5281 ggaccagcag ggctgtgatc aggatatatc ggtgcaactt catggtcata tcataaatgt 5341 ccggcatttg catttctcca gagccgcaaa gttataataa aatacgtttg ccgaaacaaa 5401 aagggcgggc ccgccgaaac ggttccgccc ttcattccga aaacaacgca gcagtttgcc 5461 agttttgcac agaaggctat tcgtactggg cgaggatacc cttgatctgc ttgggctcaa 5521 gacggccgta gaccttctcg tccaccagca tcacgggtgc cagaccgcag gcgcccacgc 5581 agcgcaggca ctccagggag aacttcccgt cgggggtgac tccgccaacg tcgatcttga 5641 gctctttctt gagctcttcc agtacgctct cggctccacg cacatagcag gccgtaccca 5701 tacataccga gatggagtgg cggcccttgg gctgcatggt gaagaaggag tagaatgtaa 5761 cgactccata gaccttggcc acgggaatgt gcagattgcg ggcaatctcg cgctgcacct 5821 cctccggcag atagccgaag tgaccctggc acttatgcag gacgttgatc agctcgccgg 5881 gagcgttgcc gaactccgcg cagatctcgg cgatcttgtc cttctgacat ttgttcaatt 5941 ctatcttatc catatattgc tctccttctt gttaatcgtg tttgtcgaaa taatgtgtat 6001 gcagcaggtg atgcgctttc tcgctcatag gatgaccgag gaactcctcg tacagcttga 6061 tgatgtaggg gttctcgtgc gacttgcgga tcttcttgcc ggcatcctcg cggtagatgg 6121 cgtccatacg ggccttgatg acactgctgt ccccgtgatg gaaaggctgg ccggcaccgc 6181 ctatgcagcc gccgggacag gccatgatct caatagcgtg gaagttgtac ataccggcac 6241 gcataccgtc aagcagcttg cgggcattgc ctagagtgtg cgcgataccg atgttgatgg 6301 gagtaccgtt gaagtctatc gtggcatgac ggatgccgtc aagtcctctg a //