LOCUS DVGY01000178 4020 bp DNA linear ENV 04-JUN-2021 DEFINITION MAG TPA_asm: Candidatus Egerieicola pullicola isolate CHK184-25365 CHK184__C238304_L4020_ERR3414579, whole genome shotgun sequence. ACCESSION DVGY01000178 DVGY01000000 VERSION DVGY01000178.1 DBLINK BioProject: PRJNA543206 BioSample: SAMN15817017 Sequence Read Archive: ERR3414579 KEYWORDS WGS; Metagenome Assembled Genome; MAG; Third Party Data; TPA; TPA:assembly. SOURCE Candidatus Egerieicola pullicola (gut metagenome) ORGANISM Candidatus Egerieicola pullicola Bacteria; Firmicutes; Clostridia; Eubacteriales; Oscillospiraceae; Oscillospiraceae incertae sedis; Candidatus Egerieicola. REFERENCE 1 (bases 1 to 4020) AUTHORS Gilroy,R., Ravi,A., Getino,M., Pursley,I., Horton,D.L., Alikhan,N.F., Baker,D., Gharbi,K., Hall,N., Watson,M., Adriaenssens,E.M., Foster-Nyarko,E., Jarju,S., Secka,A., Antonio,M., Oren,A., Chaudhuri,R.R., La Ragione,R., Hildebrand,F. and Pallen,M.J. TITLE Extensive microbial diversity within the chicken gut microbiome revealed by metagenomics and culture JOURNAL PeerJ 9, e10941 (2021) PUBMED 33868800 REMARK Publication Status: Online-Only REFERENCE 2 (bases 1 to 4020) AUTHORS Gilroy,R. TITLE Direct Submission JOURNAL Submitted (20-OCT-2020) Microbes in the Food Chain, Quadram Institute BioScience, Norwich Research Park, Norwich NR4 7UQ, United Kingdom COMMENT The annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP). Information about PGAP can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: MegaHIT v. 1 Genome Representation :: Full Expected Final Version :: Yes Genome Coverage :: 69883.6x Sequencing Technology :: Illumina NovaSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 11/09/2020 17:38:36 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline (PGAP) Annotation Method :: Best-placed reference protein set; GeneMarkS-2+ Annotation Software revision :: 4.13 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 1,620 CDSs (total) :: 1,580 Genes (coding) :: 1,576 CDSs (with protein) :: 1,576 Genes (RNA) :: 40 rRNAs :: 2, 2 (5S, 23S) complete rRNAs :: 2 (5S) partial rRNAs :: 2 (23S) tRNAs :: 33 ncRNAs :: 3 Pseudo Genes (total) :: 4 CDSs (without protein) :: 4 Pseudo Genes (ambiguous residues) :: 0 of 4 Pseudo Genes (frameshifted) :: 1 of 4 Pseudo Genes (incomplete) :: 2 of 4 Pseudo Genes (internal stop) :: 1 of 4 CRISPR Arrays :: 1 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..4020 /organism="Candidatus Egerieicola pullicola" /mol_type="genomic DNA" /submitter_seqid="CHK184__C238304_L4020_ERR3414579" /isolate="CHK184-25365" /host="Gallus gallus" /db_xref="taxon:2840775" /environmental_sample /metagenome_source="gut metagenome" /note="metagenomic" gene <1..159 /locus_tag="IAB36_07715" CDS <1..159 /locus_tag="IAB36_07715" /inference="COORDINATES: ab initio prediction:GeneMarkS-2+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS-2+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="HIR41697.1" /translation="KNGKFYNANTNQEYTAEDELWGDTADDILEFNVSGDRLRDVITQ VTVLDRTI" gene 332..1756 /locus_tag="IAB36_07720" CDS 332..1756 /locus_tag="IAB36_07720" /inference="COORDINATES: protein motif:HMM:NF016689.1,HMM:TIGR00486.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Nif3-like dinuclear metal center hexameric protein" /protein_id="HIR41698.1" /translation="MDLLDARLRCCAGYVRPGTALCDVGCDHGYLPCQLALEEKIRSG LACDVNPAPLDSARRTIARTGTEKIVSTRLSDGLSAVQPEEARDIVIAGMGGELILSI LLGVSWVKERGRRLILQPMTKADLLRQGLCREGFRILEESAVMDGNRCYAILLAEYDG QMRDCSPLYAVTGEMKDDAYLAFQQQRCQRKAQGLAQGKDPEAEAVLQLAEQIQQRRN ALKGEKQMNAKQIYDAIGQVAPFDSAESWDNPGLLVGSPQAEADPLGLALDITPQVLA AAKEKGIQTILTHHPVIFHPLKQVRQDSPVFQLIQSGITVISAHTNLDRSPQLGTNRV LAERLGLWNGSLSPELEEMGVLGELPPCPAPELAKKVKTALGCASVNFYDAAIPCHRV AVIAGSGGSLLEQAAAAGADTLITGDVKQDVFVSAQWMGMNLLEVSHFDLENPVFSKL GPWLEETLGIHTCLLSPQNPVQWI" gene 1784..3550 /locus_tag="IAB36_07725" CDS 1784..3550 /locus_tag="IAB36_07725" /inference="COORDINATES: protein motif:HMM:NF017482.1,HMM:NF017633.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="NFACT family protein" /protein_id="HIR41699.1" /translation="MALDGIYLYALARQIGREALNSRVEKIHQPSKEEIIIWLRSPAG RYRLLCCANPSAPRVQLTGLNPENPKIPPQFCTLLRKHLSTGKLVGVRQEGLDRVLHL DFEAINELGDVVTLTLTMEIMGRHSNLILVNQEGKVLDAIRRVGEEQSSVRLLLPGVT YRSAPAQHKLNPFTASPEEGKAAFSSLPDMVFSKAILQVYQGISPLLSRELAEYACNG AELNKSQVSDYRYQRLFDRLAELKAQIDSDSAAYTMVLENGRPREFSVCPLEQYRNAL ESRQFSDPSSLLDGFYSQRDLQLRMAQKGQDLLRVLTSASQRAQRKLDHRQQELQRSI DRQRYQQLGDLIQANLYQIKKGDTQTTVTDYFDPEQKPVTISLDPALSPAQNAQKYYK EYRKAKTARELLGGLIQQNQDEIIYLDTIFDELARASRESDLNAIREELEQQGYLHKA RRKLKPDKPLPPMEFVTDDGFTVLCGRNNKQNDQLTLKTARNYDVWFHTKDIPGSHVI LQSQGEDRPIPERSILQAATIAATHSRGQDSNQVPVDYTLIKYVKKPNGARPGMVIFT HNKTLYVAPDLALCEKLAKEAK" gene complement(3673..3873) /locus_tag="IAB36_07730" CDS complement(3673..3873) /locus_tag="IAB36_07730" /inference="COORDINATES: protein motif:HMM:NF024081.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="FeoB-associated Cys-rich membrane protein" /protein_id="HIR41700.1" /translation="MIATIIISVILAALLGLAVWYMVKTTRRGGCVGCSACSGKQKGG VSSGCNGNCAGCSGCHSVTKAK" gene complement(3905..>4020) /locus_tag="IAB36_07735" CDS complement(3905..>4020) /locus_tag="IAB36_07735" /inference="COORDINATES: protein motif:HMM:NF017366.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=3 /transl_table=11 /product="CcoQ/FixQ family Cbb3-type cytochrome c oxidase assembly chaperone" /protein_id="HIR41701.1" /translation="VGFSIWTVIAFLLLIGVLYLLFRPAKRHSLSHAAERA" BASE COUNT 994 a 1157 c 1062 g 807 t ORIGIN 1 aaaaacggaa aattttacaa tgccaatacc aatcaggaat ataccgcaga agacgaattg 61 tggggcgata cagcggacga cattctggaa ttcaacgtct ccggcgaccg gctccgagac 121 gtcatcaccc aagtcaccgt cctggaccgc accatttgaa ctgcttgttt tctgcaaaaa 181 agccgtcccc cattcaaacg tttttcttac agactgctgc attcatggaa aagcccgtgg 241 tacaaccatg ggtttttctt tttctttcgt tgcacccccc aaaaatctat gctacaatat 301 taccatcacc aacaaagaaa ggagctgctt catggatttg ctggatgccc ggcttcgatg 361 ctgcgccggc tatgtgcgcc ccggcacagc attgtgcgac gtgggctgcg atcacggtta 421 cctgccctgc caactggcac tggaggaaaa aatccgttcc ggtttagcct gcgacgtcaa 481 tcccgcaccc ttggactccg cccggcggac catcgcccgc accggcaccg aaaagattgt 541 ctccacccgg ctcagcgatg gactgagcgc cgttcaaccg gaagaagccc gggacatcgt 601 cattgccggt atgggaggag aattgatcct ctccattttg ctgggagtct cctgggtgaa 661 agagcgggga cggcggttga ttttgcagcc catgaccaaa gccgacctgc tgcgccaggg 721 actttgccgg gaagggttcc ggatcttgga ggaatctgcc gtgatggacg gcaaccggtg 781 ttacgccatt ttgctggcgg agtacgatgg acagatgcgg gattgttctc ccctgtacgc 841 cgtcaccggc gaaatgaagg acgacgctta cctggccttc cagcagcagc gctgccagcg 901 aaaagcccaa ggtctagccc aaggcaaaga tcctgaagca gaagccgttt tgcagctggc 961 agagcagatc cagcagcggc gaaacgcctt gaaaggggaa aaacagatga acgcaaaaca 1021 aatttatgat gccattgggc aggtcgcgcc ctttgacagc gcggaaagct gggataaccc 1081 cggcctgctg gtgggcagcc cccaggcaga agcagatccc ctgggactgg cgttggacat 1141 cacccctcag gtattggcgg cggcaaaaga aaaaggaatt caaaccatcc tcacccacca 1201 tccggtgatc tttcaccctt taaagcaggt gcggcaagac agcccggtgt tccagttaat 1261 ccaaagcggc atcactgtca tcagcgccca caccaacctg gaccgttccc ctcagctggg 1321 caccaaccgg gttttggcgg agcgattggg cctttggaac ggttccctgt ccccggaatt 1381 ggaggagatg ggggttttgg gggaacttcc tccctgccct gccccggagc tggccaaaaa 1441 agtaaaaact gcgctgggct gcgcttcggt gaacttctac gacgcagcca tcccctgtca 1501 ccgggtggcc gtgattgccg gaagcggcgg aagtctgctg gaacaggcag cggcagcggg 1561 agccgacacc ctgattaccg gagacgtaaa gcaggacgtt ttcgtttccg cccagtggat 1621 gggaatgaat ctgctggaag tcagccactt tgatctagaa aacccggtgt tttccaaact 1681 ggggccctgg ctggaagaaa cgttgggcat tcacacctgc ttgctcagcc ctcaaaaccc 1741 ggtgcaatgg atttaagatt acccaattag gagttacata tctatggcat tagacggcat 1801 ttatctctac gcattggccc ggcagattgg tcgagaggct ttaaacagcc gggtagaaaa 1861 aattcatcaa ccctccaaag aagaaatcat catctggctg cgcagccctg ccgggcggta 1921 ccggctgctt tgctgcgcca atcccagcgc ccctcgggtg cagctcaccg gcttgaatcc 1981 ggaaaacccc aaaatcccac cccaattctg caccctgctg cgcaagcatc tctccaccgg 2041 caaattggtg ggggtgcgcc aggaaggatt ggaccgggtg cttcacctgg actttgaagc 2101 catcaacgag ttgggggatg tggtcaccct gaccctcacc atggagatca tgggacggca 2161 ctccaaccta attttagtca atcaggaagg taaagtattg gacgccatcc gccgggtagg 2221 ggaagagcag agcagtgtgc ggctgctgct tcccggggtg acctaccgct ctgccccggc 2281 ccagcacaaa ttaaatccct ttaccgcttc cccggaagaa gggaaagccg ccttctcctc 2341 tttgccggat atggtttttt ccaaagcgat tttacaggtt taccagggca tctcccctct 2401 tctcagccga gagctggcag aatatgcctg taacggggcg gaactgaata aaagccaggt 2461 ctccgattac cgttaccaac gtctatttga ccggctggca gaattgaagg cccagatcga 2521 ttccgactcc gccgcctaca ctatggtgct ggaaaacggg cggcctcgag aattttcggt 2581 gtgcccctta gagcagtacc ggaacgcgtt ggaaagccgg cagttttccg atcccagcag 2641 tttgttggac ggcttttact cccagcggga cctgcagctt cgcatggcgc aaaaggggca 2701 ggatctgctc cgggtgctca ccagcgcatc tcagcgagct cagcggaagt tagatcaccg 2761 gcagcaggaa cttcagcgtt ccatagaccg gcagcggtac caacagttag gggatttgat 2821 ccaagccaac ctgtaccaaa ttaaaaaagg agacacccaa accaccgtca ccgactactt 2881 cgacccggaa cagaaaccgg taaccatctc tctggatcct gccttgtctc cggcgcagaa 2941 cgcccaaaag tattacaagg aataccgcaa agccaaaact gcccgagaac tgttgggcgg 3001 actgatccag caaaaccagg atgaaattat ttatctggac accatctttg atgaactggc 3061 tcgggccagc cgggaaagcg acctgaatgc catccgggaa gagttagagc agcaagggta 3121 tctgcacaaa gcccgccgga aactaaaacc ggacaaaccc ctgccgccta tggaatttgt 3181 caccgacgac ggctttacgg tgctctgcgg acgaaacaac aagcagaacg accagctcac 3241 cctgaaaacc gcccggaatt atgatgtgtg gttccacacc aaggacatcc ccggcagcca 3301 cgtaatttta caatcccaag gggaggaccg gcccattccg gaacgttcca tccttcaggc 3361 cgccaccatt gccgccaccc attcccgggg acaggattcc aatcaggtgc cggtggacta 3421 taccttaatc aaatacgtga aaaagccaaa tggcgcccgg ccagggatgg tgatcttcac 3481 ccacaacaaa accctctatg tggctccgga cttggccctc tgcgaaaaat tagccaaaga 3541 agcaaagtaa aacaccatgc ctcgatctga acccaggatc ggggcatttt gcttgcaggg 3601 acaaaaagaa agcggcagct tccgaagaaa ccaccgcttc gctttatgaa ttgaggaaaa 3661 gcaggacaaa ctttattttg ctttagtcac actgtgacag ccgctgcagc cggcacagtt 3721 tccgttgcag ccggaggaga cgcctccctt ttgttttccg gagcaggcgc tgcatccaac 3781 gcagcctccc cggcgggtgg tctttaccat ataccataca gcaagaccca gcagggctgc 3841 taaaatcacc gaaatgataa tggttgcaat catggttgtc gcctcctttg gcaaaattta 3901 tatcttatgc ccgctcggca gcgtgggaca agctgtgtct cttggccggg cggaacagca 3961 ggtacagtac cccgatcaac agcaggaagg caatgaccgt ccaaatgctg aatccaactg //