LOCUS DXAM01000076 5339 bp DNA linear ENV 05-JUL-2021 DEFINITION MAG TPA_asm: Candidatus Microbacterium stercoravium isolate ChiHjej8B7-3636 ChiHjej8B7M5__C217808_L5339_SAMN08009862, whole genome shotgun sequence. ACCESSION DXAM01000076 DXAM01000000 VERSION DXAM01000076.1 DBLINK BioProject: PRJNA543206 BioSample: SAMN15816680 Sequence Read Archive: SRR6323450 KEYWORDS WGS; Metagenome Assembled Genome; MAG; Third Party Data; TPA; TPA:assembly. SOURCE Candidatus Microbacterium stercoravium (gut metagenome) ORGANISM Candidatus Microbacterium stercoravium Bacteria; Actinobacteria; Micrococcales; Microbacteriaceae; Microbacterium. REFERENCE 1 (bases 1 to 5339) AUTHORS Gilroy,R., Ravi,A., Getino,M., Pursley,I., Horton,D.L., Alikhan,N.F., Baker,D., Gharbi,K., Hall,N., Watson,M., Adriaenssens,E.M., Foster-Nyarko,E., Jarju,S., Secka,A., Antonio,M., Oren,A., Chaudhuri,R.R., La Ragione,R., Hildebrand,F. and Pallen,M.J. TITLE Extensive microbial diversity within the chicken gut microbiome revealed by metagenomics and culture JOURNAL PeerJ 9, e10941 (2021) PUBMED 33868800 REMARK Publication Status: Online-Only REFERENCE 2 (bases 1 to 5339) AUTHORS Gilroy,R. TITLE Direct Submission JOURNAL Submitted (09-APR-2021) Microbes in the Food Chain, Quadram Institute BioScience, Norwich Research Park, Norwich NR4 7UQ, United Kingdom COMMENT The annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP). Information about PGAP can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: MegaHIT v. 1.2.9 Genome Representation :: Full Expected Final Version :: Yes Genome Coverage :: 123690x Sequencing Technology :: Illumina HiSeq X Ten ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 06/09/2021 20:12:42 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline (PGAP) Annotation Method :: Best-placed reference protein set; GeneMarkS-2+ Annotation Software revision :: 5.2 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 2,356 CDSs (total) :: 2,306 Genes (coding) :: 2,291 CDSs (with protein) :: 2,291 Genes (RNA) :: 50 rRNAs :: 1 (5S) partial rRNAs :: 1 (5S) tRNAs :: 46 ncRNAs :: 3 Pseudo Genes (total) :: 15 CDSs (without protein) :: 15 Pseudo Genes (ambiguous residues) :: 0 of 15 Pseudo Genes (frameshifted) :: 2 of 15 Pseudo Genes (incomplete) :: 11 of 15 Pseudo Genes (internal stop) :: 2 of 15 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..5339 /organism="Candidatus Microbacterium stercoravium" /mol_type="genomic DNA" /submitter_seqid="ChiHjej8B7M5__C217808_L5339_SAMN08009862 " /isolate="ChiHjej8B7-3636" /host="Gallus gallus" /db_xref="taxon:2838697" /environmental_sample /metagenome_source="gut metagenome" /note="metagenomic" gene <1..286 /locus_tag="H9800_05405" CDS <1..286 /locus_tag="H9800_05405" /inference="COORDINATES: ab initio prediction:GeneMarkS-2+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS-2+." /codon_start=2 /transl_table=11 /product="hypothetical protein" /protein_id="HJA04279.1" /translation="DVLRLTPAPAGVSISRGLVMIARATIVHHVYADRGDLLPLCGVI PGPGDPGLGRLTSWNTGHDPMVLRAHSDIGGDCCMVCLEYAVSLPSDESV" gene complement(309..854) /locus_tag="H9800_05410" CDS complement(309..854) /locus_tag="H9800_05410" /inference="COORDINATES: ab initio prediction:GeneMarkS-2+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS-2+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="HJA04280.1" /translation="MDVKAAQHDVRRVYRGGFSGPLVSALIWAVASAVYQWGSITTAI TVLFIGGMLIFPLSTLVLKLMGSPAFLPKGHPSIALAMQSAFTVPLGLLVAIALGTAA PSLFMPASLIIVGAHYLTFISLYGMRLYGALAVVLVAVGAAAIFVAPELRDITGWIGA AVLLVASLPLFISYRREESRA" gene complement(859..1074) /locus_tag="H9800_05415" CDS complement(859..1074) /locus_tag="H9800_05415" /inference="COORDINATES: ab initio prediction:GeneMarkS-2+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS-2+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="HJA04281.1" /translation="MVRRPRMVRRLVTIAALVGVAAVVVWFSSAVVDASGAQMAAAGG LFAVIASGWSAAFVTTTAMLKTEGLRG" gene 1374..2183 /locus_tag="H9800_05420" CDS 1374..2183 /locus_tag="H9800_05420" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013600097.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="excalibur calcium-binding domain-containing protein" /protein_id="HJA04282.1" /translation="MPMRWVWWIPVVLVLVVTIAVVTIPSEADPREPADPGTAAALLA ELPVKGSAPANDYDRVGHFGESWIDVDGNGCDTRNDILQRDLTDLVLDGPCIVLAGEF ADPYTGDRIAFERGVDTSQRVQIDHVVALKDAWRTGAQDLPQERRIALANDPINLRAA DGSANGQKGDRNAASWLPSNTSFRCEYVARQVSTKAAYDLWVVPAEKDAMERVLAACP DQPAFVSEGPTASVSYRNCAAVREAGAAPIRRGDPGYASHLDRDGDGIGCA" gene 2266..3525 /locus_tag="H9800_05425" CDS 2266..3525 /locus_tag="H9800_05425" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_018186811.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sensor domain-containing protein" /protein_id="HJA04283.1" /translation="MTSIPAPPPRVHTPLRMLGAIAHLVFLGLYGVAVFTVLSALLAT GLGLLVVLGIGALFLLGFVYLLFGVAYLETARIEGLYQFGLPMLRARRSPKRGFGGFL HTVWLQFIDGGMWRAVAHLAIATVLGWVALFLVSGVVRSIAAAFAPLYAPDGVASAFG FRYDVAVAPVVGTIVAVLALAALVGLALLHGVLARVLIAPIREAQLAAAARDATTQRA DAIRASDVERTRIERDLHDGVQPRLVSVGMTLGLAQQKIDTDPQSAKDLVSEAHTSTK AAITELRQLARGIHASVLDDRGLDAALSAVASRSHIPVSVDVRMPARAGRDAEAAVYF AIAESLTNAAKHSRATEARVTVRARPDGGVLWARVEDNGIGGARVLPGGGLDGIQNRI AAIGGTARLDSPAGGPTSLEVSVPCAS" gene 3513..4211 /locus_tag="H9800_05430" CDS 3513..4211 /locus_tag="H9800_05430" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017884112.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="response regulator transcription factor" /protein_id="HJA04284.1" /translation="MRILIVEDSVLLREGLVRLLADAGHEVVAALPDASRALTEAADL DPDLAIVDVRLPPTFTDEGLRAAIALRGQDPSLAVLVFSQYVEERYAADLIAQPGGAI GYLLKDRVTDVSEFLESIERIREGATVLDPEVVAQLLTRRSRDEQISRLTERERSVLA LIAEGKSNGAISRILFVSEGAVEKHITSIFSKLGLEQDDTGNRRVLAVLAHIDATAPQ APVGPTAHQNGMNR" gene 4208..5062 /locus_tag="H9800_05435" CDS 4208..5062 /locus_tag="H9800_05435" /inference="COORDINATES: ab initio prediction:GeneMarkS-2+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS-2+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="HJA04285.1" /translation="MTNDTLAPPEPPAPQQPSQQSSTVRSGTRALAITIGVLGGGILL LGGAAAAIGAVGSTMFSASGGTGSSSLPVNGVDSLRIDVGAGDVDVAFGSGSEARLDY ESNVGEWAFERDGDTLVVSSPRRWFVFFDWIGEQRATLVLPESLEGIDADLEVAAGSL TMDGAFGDIAYDLSAGEIELEGSATTLEAGMSAGSSTIELADLETATFDVSAGSVYGD LTGDAPDAIGIDVAAGSVELQVPDVPYRVSIDRDIGNVESNVEQNNDARRTIDVQMSA GYVSLNAG" gene 5173..>5339 /locus_tag="H9800_05440" CDS 5173..>5339 /locus_tag="H9800_05440" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_017829181.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="TSUP family transporter" /protein_id="HJA04286.1" /translation="MNAARARGPRALATFAVIGLISGFMSGLFGVGGGTVIVPLLVMI ALFSQKVAAGTS" BASE COUNT 818 a 1917 c 1795 g 809 t ORIGIN 1 ggacgtcctt cgtctgaccc cggctccggc aggggtttcc atctcgcgag gactcgtcat 61 gatcgcacgc gccaccatcg tccaccacgt gtacgccgac cggggcgatc tgctgccgct 121 gtgcggcgtc atccctggtc cgggggatcc tggcctgggg aggctcacct cttggaacac 181 cggtcacgat ccgatggtgc tgcgtgctca ctccgacatt ggcggcgatt gctgcatggt 241 gtgcctggag tacgcggtgt cgttaccgag cgacgagagc gtgtgactcg aaagtcgtcg 301 cccaccactc aggcgcgcga ctcctcacgt cgatagctga tgaagagagg aagcgacgcg 361 acgagtagaa cggcggcccc gatccaaccg gtgatgtcgc gcaactctgg agcaacgaag 421 atcgcggcag cgccgacagc gacgaggact accgccaagg cgccgtacaa gcgcatcccg 481 tatagcgaga tgaacgtgag gtaatgagcg cccacgatga tgagcgaggc cggcataaac 541 agcgaaggtg cggcggtacc gagcgcgatc gctacgagga gcccgagggg caccgtgaag 601 gcactttgca tcgccagggc aatggatggg tgacccttcg gcagaaaagc cggacttccc 661 atgagcttca acacgagggt tgacaacgga aaaatcaaca tgccgccgat gaacagaacg 721 gttatggctg tcgtgatcga cccccactgg taaacggctg aggccactgc ccagatgagt 781 gcggagacca gagggcctga gaagccgcca cggtagacgc ggcggacgtc gtgctgagcg 841 gctttcacgt ccataacgtc atcctcgcag gccttcggtc ttcagcatcg ccgtggtagt 901 cacgaaggca gcagaccagc cgctggcgat gaccgcaaac aacccgccgg ctgcggccat 961 ctgcgcgccc gatgcatcga cgacagccga gctaaaccac acgacaacag cagcgacgcc 1021 gacgagcgca gcaatcgtga cgagccgccg aaccatccgt ggtcgtcgaa ccatggcact 1081 atccacgatt cctcctcgag catcgaccga acggttgtca ccgatccgtt cctcacgccc 1141 tcccgctcct gcggcgtcgc atgcctcgac cctacgcctg cgctcccgtt ccggctcgtg 1201 cgcggtgcga tgccgccgcg tacacgtgcg gccgtatctg gaagaatgga tccgtgcagc 1261 cgcgccgacg cgccgtcgag aacgcgtccc ggctccatcc gctctcttgc tgaccgcgcg 1321 ccgcgcgccg aacacatccc ggaggacccg tgcgccgacg cggccgtcag accatgccga 1381 tgcgatgggt gtggtggatc cccgtcgtgc tcgtgctcgt cgtcaccatc gccgtcgtga 1441 cgattccgtc cgaggccgat ccgcgcgagc ccgccgatcc cggaaccgcc gccgcgctgc 1501 tcgccgagct ccccgtcaaa ggatccgccc cggcgaacga ctacgaccgt gtcgggcact 1561 tcggcgagtc ctggatcgac gtcgacggaa acgggtgcga cacccgcaac gacatcctgc 1621 agcgcgacct gaccgacctc gtgctcgatg gcccctgcat cgtgctcgcc ggtgagttcg 1681 ccgatccgta tacgggcgat cgcatcgcgt tcgagcgcgg cgtcgacacg tcgcagcggg 1741 tgcagatcga ccacgtcgtc gctctcaaag acgcgtggcg caccggtgcg caggatcttc 1801 cgcaggagcg gcggatcgcg ctcgcgaacg atccgatcaa cttgcgggcg gccgacggat 1861 cggcgaacgg ccagaagggc gaccgcaacg cggcgtcgtg gctgccgagc aacacgtcgt 1921 tccggtgcga atacgtcgca cgccaggtgt cgacgaaggc ggcgtacgac ctgtgggtgg 1981 tgcccgccga gaaggacgcg atggagcgcg tgctcgccgc ctgccccgac cagcccgcgt 2041 tcgtgtcgga ggggcccacc gcatcggtct cgtaccgcaa ctgcgccgcc gtccgcgagg 2101 cgggcgcggc acccatccgc cgcggcgacc ccggatacgc gagccacctc gaccgagacg 2161 gcgacggcat cggctgcgcc tgacgcgcga accgggggat atccccctcc cgttccggcg 2221 gtcggccccg ctgtgttcgc acgcgcgatt ccgtagcttt ggagcatgac ctcgattccc 2281 gctcctccac ctcgcgtgca tacgccgctg cgaatgctcg gcgccatcgc acacctcgtg 2341 ttcctcggcc tctacggagt cgccgtgttc acggtgctgt cggcgctgct cgccaccggc 2401 ctcgggctcc tcgtggtgct cggcatcggg gcgctgttcc tgctcgggtt cgtctacctg 2461 ctcttcggcg tcgcctacct cgagacggcg cgcatcgagg gcctgtacca gttcggcctg 2521 cccatgctgc gggcccggcg ctcgccgaag cgcggcttcg gcggcttcct gcacacggtg 2581 tggctgcagt tcatcgacgg cggcatgtgg cgcgcggtgg cgcacctcgc gatcgcgacg 2641 gtgctcggct gggtcgcgct cttcctcgtc agcggcgtcg tccggtcgat cgcggccgcg 2701 ttcgctccgc tgtacgcccc ggatggcgtc gcgtcggcgt tcgggttccg gtatgacgtc 2761 gccgtcgccc cggtcgtcgg caccatcgtc gccgttctcg ccctcgccgc tctcgtgggg 2821 ctcgcgctgc tgcacggcgt gctcgcccgc gtgctgatcg ccccgatccg cgaagcgcag 2881 ctcgccgccg ccgcccgcga cgcgacgacc cagcgcgccg acgcgatccg cgcgagcgat 2941 gtcgagcgca cccgcatcga gcgcgacctg cacgacgggg tccagccccg gctcgtctcg 3001 gtgggcatga cgctcgggct cgcgcagcag aagatcgaca cggatccgca gagcgccaag 3061 gatctcgtct ccgaagcgca cacctcgacg aaggccgcga tcaccgagct gcgccagctg 3121 gcgcgcggga tccacgcctc ggtgctcgac gaccgcgggc tcgacgccgc gctgagcgcc 3181 gtcgccagca gatcccacat ccccgtctcg gtcgatgtgc gcatgccggc gcgcgcgggc 3241 cgcgacgccg aggccgccgt gtacttcgcg atcgcggagt cgctcacgaa cgccgccaag 3301 cactcccgcg ccacggaagc gcgcgtcacg gtcagggccc gccccgacgg cggcgtgctg 3361 tgggcgcgcg tcgaggacaa cggcatcggc ggagcgcgcg tgctccccgg cggcggcctc 3421 gacggcatcc agaaccgaat cgccgccatc ggcggcaccg cccggctcga cagccccgcg 3481 ggcggaccga cctccctgga agtgagcgtg ccatgcgcat cctgatcgtc gaagactccg 3541 tgctgctgcg cgagggcctc gtgaggctgc tcgccgacgc cggccacgag gtcgtcgccg 3601 ccctgcccga cgcgtcccgc gcgctcaccg aggccgcgga tctcgatccc gacctcgcga 3661 tcgtcgacgt gcgcctgccg ccgacgttca ccgacgaggg cctgcgcgcg gcgatcgcgc 3721 tgcgcgggca ggatccgtcg ctcgcggtgc tcgtgttctc gcagtacgtc gaagagcgct 3781 acgcggccga cctcatcgcc cagccgggcg gcgcgatcgg atacctgctg aaggaccggg 3841 tcaccgacgt gtccgagttc ctcgaatcga tcgaacggat ccgcgagggc gccacggtgc 3901 tcgacccgga ggtcgtcgcg cagctgctca cccgccgctc gcgcgacgag cagatctccc 3961 gcctcaccga gcgcgagcgc agcgtgctcg ccctcatcgc cgaaggaaaa tcgaacggcg 4021 cgatctcgcg gatcctcttc gtgagcgagg gcgccgtcga gaagcacatc acatcgatct 4081 tctcgaagct cggcctcgag caagacgaca ccggcaaccg ccgcgtgctc gccgtgctcg 4141 cccatatcga tgcgaccgcc ccgcaggccc ccgtcgggcc gaccgcgcac cagaacggaa 4201 tgaaccgatg acgaacgaca cccttgcacc gcccgagcct ccggccccgc agcagccctc 4261 gcagcagtcg agcaccgtgc gctccggcac gcgcgccttg gcgatcacga tcggcgtgct 4321 cggcggcggc atcctgctgc tcggcggcgc cgcggccgcg atcggcgccg tcggatccac 4381 gatgttctcg gcgtccggcg gaaccggatc ctcgtctctg ccggtcaacg gcgtggactc 4441 gctgcgcatc gacgtcggcg ccggtgacgt cgacgtcgcc ttcggcagcg gatccgaggc 4501 ccgcctcgac tacgagtcga acgtgggcga gtgggccttc gagcgcgacg gcgacaccct 4561 cgtcgtctcg tcgccgcggc ggtggttcgt gttcttcgac tggatcggcg agcagcgggc 4621 gacgctcgtg ctgcccgaat cgctcgaggg catcgatgcc gacctcgagg tggccgccgg 4681 ttcgctgacg atggacggcg cgttcggcga tatcgcctac gacctgagcg cgggggagat 4741 cgagctcgag ggctccgcga ctacgctcga ggccgggatg tcggccggga gcagcaccat 4801 cgagctcgcc gatctggaga cggccacgtt cgacgtctcg gcggggagcg tgtacggcga 4861 cctgaccggc gacgcgcccg acgcgatcgg catcgacgtc gcggcgggat ccgtcgaact 4921 gcaggtgccc gacgtgccgt accgcgtgag catcgatcgc gacatcggca acgtcgagtc 4981 gaacgtcgag cagaacaacg acgcgcgtcg cacgatcgac gtgcagatgt cggcgggcta 5041 cgtcagtctc aacgcgggct gacgccccga cgcggcaccg agaagtgcgg ctcccgagcg 5101 ggctggagcc gcgcttctcg gcgaccccgt gatttaaggg gcagtgcgcg cgggcgctag 5161 ggtgggcgag tgatgaatgc cgcgcgtgcc cgaggtccga gggcgcttgc gaccttcgcc 5221 gtcatcggcc tgatctcggg cttcatgtcg gggctcttcg gcgtcggggg cggcaccgtc 5281 atcgtgccgc tgctcgtcat gatcgcgctg ttctcgcaga aggtagcggc gggtacgtc //