LOCUS       DPCL01000292            3997 bp    DNA     linear   ENV 10-SEP-2018
DEFINITION  TPA_asm: Prevotellaceae bacterium isolate UBA8030 contig_19418,
            whole genome shotgun sequence.
ACCESSION   DPCL01000292 DPCL01000000
VERSION     DPCL01000292.1
DBLINK      BioProject: PRJNA417962
            BioSample: SAMN08019563
            Sequence Read Archive: SRR6486177
KEYWORDS    WGS; Third Party Data; TPA; TPA:assembly.
SOURCE      Prevotellaceae bacterium (gut metagenome)
  ORGANISM  Prevotellaceae bacterium
            Bacteria; Bacteroidetes; Bacteroidia; Bacteroidales;
            Prevotellaceae.
REFERENCE   1  (bases 1 to 3997)
  AUTHORS   Parks,D.H., Chuvochina,M., Waite,D.W., Rinke,C., Skarshewski,A.,
            Chaumeil,P.A. and Hugenholtz,P.
  TITLE     A standardized bacterial taxonomy based on genome phylogeny
            substantially revises the tree of life
  JOURNAL   Nat. Biotechnol. (2018) In press
   PUBMED   30148503
  REMARK    Publication Status: Available-Online prior to print
REFERENCE   2  (bases 1 to 3997)
  AUTHORS   Parks,D.H.
  TITLE     Direct Submission
  JOURNAL   Submitted (04-APR-2018) School of Chemistry and Molecular
            Biosciences, University of Queensland, Chemistry Bld, Cooper
            Road, St Lucia, Brisbane, Queensland 4072, Australia
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/

            ##Genome-Assembly-Data-START##
            Assembly Method        :: CLC de novo assembler v. 4.4.1
            Expected Final Version :: yes
            Genome Coverage        :: 6.81x
            Sequencing Technology  :: Illumina HiSeq 2500
            ##Genome-Assembly-Data-END##

            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 05/03/2018 18:20:47
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS+
            Annotation Software revision      :: 4.5
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 1,597
            CDS (total)                       :: 1,589
            Genes (coding)                    :: 1,498
            CDS (coding)                      :: 1,498
            Genes (RNA)                       :: 8
            tRNAs                             :: 7
            ncRNAs                            :: 1
            Pseudo Genes (total)              :: 91
            Pseudo Genes (ambiguous residues) :: 74 of 91
            Pseudo Genes (frameshifted)       :: 50 of 91
            Pseudo Genes (incomplete)         :: 15 of 91
            Pseudo Genes (internal stop)      :: 1 of 91
            Pseudo Genes (multiple problems)  :: 49 of 91
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..3997
                     /organism="Prevotellaceae bacterium"
                     /mol_type="genomic DNA"
                     /isolate="UBA8030"
                     /isolation_source="gut"
                     /db_xref="taxon:2049047"
                     /environmental_sample
                     /note="metagenomic; derived from metagenome: gut
                     metagenome"
     gene            <1..989
                     /gene="dnaN"
                     /locus_tag="DEQ84_07785"
     CDS             <1..989
                     /gene="dnaN"
                     /locus_tag="DEQ84_07785"
                     /inference="COORDINATES: protein motif:HMM:TIGR00663"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
/codon_start=3 /transl_table=11 /product="DNA polymerase III subunit beta" /protein_id="HCE48528.1" /translation="DQDTMLKTNIELQEANGDCKFALESKQIMNILKEIPEQPLTLEI NPGTLQIDLTYQNGHFSFQGVQGDEYPIPATPEGESNDITLEAASLVKGISTALIAAA NDETRKVMNGVFMDISPEDLSIVASDGHKLIRYRIECDTHGTTAGFTLPQKPANILKN ILDKAKGIITIRTYGTSNARIETEDYMISCRLIEEKYPNYKSVIPTQNNNVAVIDRAS FVSAMRRVLVVADKTTALVKFVFSSNKVVLTSENINYSLSAEEQLVCQYEGMPLKIGF KGTDMLELISALQGTDFIIKLADASRAGLIVPGEQSGNTDLLMLLMPLLINN" gene 997..1779 /locus_tag="DEQ84_07790" CDS 997..1779 /locus_tag="DEQ84_07790" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_005930941.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA polymerase III subunit epsilon" /protein_id="HCE48529.1" /translation="MKLNLKKPLVVFDLETTGVNVTQDRIVEISYIKILPNGNEESKM MRINPERHIPEASSKIHGITDEDVKDSPTFKEVAQGLANDFAGCDFAGYNSNHFDIPM LVEEMLRAGIDFDIHKAKLVDVQNIFHKMEQRTLVAAYKFYCHKDLIDAHSSLADTTA TYEVLKAQLDRYPETLKNDINFLSDFSRMNDNVDLAGRMIYNDQKEIVFNFGKYKGQS VIHTLHNDPGYYSWILQGDFPRETKAVLTRIKLSESHIGQKQ" gene 1787..2986 /gene="coaBC" /locus_tag="DEQ84_07795" CDS 1787..2986 /gene="coaBC" /locus_tag="DEQ84_07795" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_005847882.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="bifunctional phosphopantothenoylcysteine decarboxylase/phosphopantothenate--cysteine ligase CoaBC" /protein_id="HCE48530.1" /translation="MELQGKHIVLGITGSIAAYKACLLIRLLIKEGAEVQTVITPAGK EFITPVTLSALTRKPVICDFFAQRDGTWNSHVELGLWADVMIIAPATASTIGKMANGI ADNMLVTTYLSMKAPVVVAPAMDLDMYAHPATQRNLETLRSYGNIIIEPASGELASQL VGKGRMEEPENIVGHIRKLFSEKESLTGKTVLITAGPTYEKIDPVRFIGNYSSGKMGY ALAEECAQRGANVILVSGPTHLEVHHRNIERICVESAQEMYEMATNSFKDADIGILCA AVADFTPNDMEGCKIKREKGTQTLVLKPTRDIAASLGQMKHKGQLLIGFALETNDEEA NALYKLKKKNLDFIVLNSLKDNGAGFMYDTNKITILSEKEKIPFSLKNKKEVAKDIID KICDMCD" gene 2968..3873 /locus_tag="DEQ84_07800" CDS 2968..3873 /locus_tag="DEQ84_07800" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009346749.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF4835 domain-containing protein" /protein_id="HCE48531.1" /translation="MRYVRLIVITLILFFMQSASAQELNATVDINTQKIQGTNKNVFD DLKSTLTQFINERQWTSLKFRNNERIKCAFSIIVNKYDEGSGSMTCEAYIQSSRPVYN STYTTPTLSIHDANFNFDFREHDQLEFRDDQINNNLTALIAYYAYLIIGVDMDTMAPQ GGTDILQKAMDVVNNAQNMNTKGWKAMEDESNRYGIVNDYLNEGLQSFRQLQYDYHRK GLDQMTANSDQARETITKSMSLLEKTYSTKSRSALPRLFSEYKRDELVGIYQGKETSA KKQKVYDILTKVNASQSNYWKKLLN" gene 3880..>3997 /locus_tag="DEQ84_07805" CDS 3880..>3997 /locus_tag="DEQ84_07805" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_002661887.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="HCE48532.1" /translation="MLQRLKIQNYALIEHLDMELHAGFSTITGETGAGKSIIL" BASE COUNT 1295 a 823 c 913 g 966 t ORIGIN 1 ccgaccagga taccatgttg aaaaccaata tagagctgca agaagccaat ggcgactgca 61 agtttgcgtt ggaatcgaaa caaatcatga acatcttgaa agagataccc gagcaaccgt 121 tgacattgga aatcaatccg ggcactttac aaatagatct gacttaccaa aacgggcatt 181 tctcgtttca aggtgtgcaa ggtgatgaat atcctattcc ggctaccccc gaaggcgaat 241 cgaacgacat cacattagaa gccgcttcgc ttgtcaaggg tatcagtaca gcactaatag 301 cagccgcaaa cgatgagacg cgcaaggtaa tgaatggtgt attcatggat atatctcccg 361 aagacctttc aatcgttgcc agtgacgggc ataaattgat acgctaccgc atagaatgcg 421 atacgcatgg aacgactgca ggtttcaccc tgccgcagaa accggcaaac atattaaaga 481 acatcttgga taaggccaag ggaataatta cgatacgcac ctatggaacc agcaatgcca 541 ggattgaaac tgaagactac atgataagct gccgcttgat agaagaaaaa tatcccaatt 601 ataaaagtgt cataccgact caaaacaaca atgtggccgt catagatcgt gcctcgtttg 661 tgagtgccat gcgccgtgtg cttgtcgtgg ccgacaagac tacggctctt gtgaaatttg 721 tctttagcag taataaggta gtgcttacat ccgaaaacat caattattcg ctaagcgcag 781 aagaacagct ggtctgccag tatgaaggca tgcctctgaa aataggattc aagggcacgg 841 atatgttaga actgataagc gctctgcaag gaaccgactt tatcatcaaa ctggctgatg 901 catcacgcgc cggacttata gttcccggag aacaatcggg aaatacagat ttgctgatgc 961 tgcttatgcc tcttctaatc aataactaac aagattatga aactcaactt gaaaaagcct 1021 ttggtggttt ttgatttgga aacaaccggt gtcaatgtca cccaggatcg tatcgttgaa 1081 atcagctata taaagattct tcccaacgga aacgaagaat cgaaaatgat gcggatcaat 1141 cctgaaaggc atatacccga agcatcttcc aagatacacg gtattacgga cgaagatgtg 1201 aaagacagtc ctacatttaa ggaggtggca caagggcttg ccaatgattt tgccggttgc 1261 gattttgccg gctacaattc caatcatttt gacataccta tgttggtgga agaaatgctt 1321 cgagccggta tcgactttga tatccataaa gctaaattgg tagatgtgca aaacatcttt 1381 cataagatgg agcagcgaac attggtagca gcatataagt tttattgcca caaggatttg 1441 atcgatgcac actcctcgtt ggctgatacg acggcaactt acgaagtact gaaagcccag 1501 ttggaccgtt atcccgaaac attgaaaaac gacataaatt 
ttctgtcgga tttttcaagg 1561 atgaacgaca atgtggatct ggcgggtcgc atgatatata acgaccaaaa agaaatcgta 1621 tttaatttcg ggaaatacaa ggggcaatcg gtgattcaca cgcttcacaa tgatccggga 1681 tattacagct ggatactgca aggggacttt ccgcgtgaga caaaagccgt attgacgcgc 1741 atcaagctaa gcgagtctca catcggtcaa aaacaatgat aaaaagatgg agctgcaagg 1801 gaagcatatt gtattaggca ttaccggcag tatagcagct tataaggcat gcttgcttat 1861 ccgattgcta atcaaggaag gggctgaggt ccagacggtc ataactccgg ccggaaaaga 1921 atttattacc cccgtgacac tttcggccct gacaagaaag ccggtcatat gtgatttctt 1981 tgcccagcgc gacggaacat ggaacagcca tgtcgagttg ggactttggg cagatgtgat 2041 gattatagca ccggcaacag catcgaccat agggaaaatg gctaatggaa tagccgacaa 2101 tatgcttgtt acgacatatc tttcaatgaa agctccggta gtagtggctc cggccatgga 2161 ccttgacatg tatgcacatc cggctacaca aaggaatctg gagacccttc gcagttatgg 2221 caatatcatc attgaaccgg ccagcggaga attagccagc caactggtcg gaaaaggacg 2281 tatggaggag cccgaaaata ttgtaggaca tatccgtaaa ctattttcgg aaaaagaatc 2341 tcttaccggg aagacggtac tgattacggc cgggccgact tatgaaaaaa tagatccggt 2401 gcgttttatc ggcaattatt caagcgggaa gatgggctat gctttggccg aagaatgtgc 2461 gcaacgtgga gccaatgtca tattggtcag cggtcccaca catcttgaag tgcatcatcg 2521 caacattgag cgcatatgcg tagaaagtgc acaagaaatg tacgaaatgg caacaaacag 2581 ttttaaggac gcagacatcg gcattctatg tgccgccgta gccgatttta caccgaatga 2641 tatggaaggc tgtaaaataa aacgtgaaaa aggaactcag actttggtct tgaagccgac 2701 tcgcgatatt gccgcttcgt tgggacaaat gaaacataaa gggcaactgc tgataggatt 2761 tgcccttgaa acaaacgatg aagaagcgaa cgccctgtac aagcttaaaa agaagaactt 2821 ggattttatc gtgcttaact cactgaaaga caatggtgcc ggattcatgt atgatacaaa 2881 taaaattacc atcctcagtg aaaaagaaaa aatcccattc tcactgaaaa acaaaaaaga 2941 agtggctaaa gacatcatag acaaaatatg cgatatgtgc gactgattgt tatcaccttg 3001 attctttttt ttatgcaaag tgcatcggcg caagagttga acgcaacagt cgatatcaat 3061 acgcaaaaga ttcaagggac caataagaat gtattcgatg atttaaaatc gacattgacg 3121 caatttatca atgagcgtca atggacttcg ctgaagttta gaaacaatga gcgtatcaaa 3181 tgtgcgtttt ccattatagt caataaatat gatgagggaa gcggcagcat 
gacgtgcgaa 3241 gcatatatac aaagtagccg cccggtttac aattcaactt atacaacacc gactctaagc 3301 attcacgatg caaactttaa ctttgatttt cgtgaacacg accaattgga gttccgtgat 3361 gaccaaatca ataataatct gacggcactt attgcttatt atgcctattt gattataggt 3421 gtcgatatgg acacgatggc accgcaagga ggcactgaca tccttcagaa agcgatggat 3481 gtggtcaaca atgctcaaaa tatgaatacg aagggttgga aagccatgga agacgaaagc 3541 aatcgctacg gcatagtgaa cgactatctg aatgaaggat tgcaatcatt caggcaattg 3601 caatatgact atcaccgtaa aggacttgac caaatgacag ccaatagcga tcaagcccgt 3661 gaaactatca ccaaatcaat gtctttgctt gagaaaacgt atagtacaaa gtcacgcagt 3721 gctttgccgc ggcttttttc tgaatacaaa cgtgacgaat tagtcggcat ctatcaggga 3781 aaggaaacct ctgcaaaaaa gcaaaaggtt tatgacattc tgacgaaagt gaatgcttca 3841 caatccaact attggaagaa actgctaaat taagtgctta tgctacaacg tttgaagatt 3901 cagaattatg ctcttataga gcacctggat atggaacttc atgccgggtt ttctaccata 3961 actggtgaaa caggtgccgg gaaaagcatc attttag //