LOCUS       QHXA01000356            4354 bp    DNA     linear   ENV 12-JUN-2018
DEFINITION  Acidobacteria bacterium isolate gp5 AA66
            14_1009_16_30cm_scaffold_72378_curated, whole genome shotgun
            sequence.
ACCESSION   QHXA01000356 QHXA01000000
VERSION     QHXA01000356.1
DBLINK      BioProject: PRJNA449266
            BioSample: SAMN08912151
KEYWORDS    WGS.
SOURCE      Acidobacteria bacterium (soil metagenome)
  ORGANISM  Acidobacteria bacterium
            Bacteria; Acidobacteria.
REFERENCE   1  (bases 1 to 4354)
  AUTHORS   Crits-Christoph,A., Diamond,S., Butterfield,C.N., Thomas,B.C. and
            Banfield,J.F.
  TITLE     Novel soil bacteria possess diverse genes for secondary metabolite
            biosynthesis
  JOURNAL   Nature (2018) In press
   PUBMED   29899444
  REMARK    Publication Status: Available-Online prior to print
REFERENCE   2  (bases 1 to 4354)
  AUTHORS   Diamond,S. and Banfield,J.
  TITLE     Direct Submission
  JOURNAL   Submitted (29-MAY-2018) Earth and Planetary Science, University of
            California, Berkeley, University of California, Berkeley, CA
            94720, USA
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            
            ##Genome-Assembly-Data-START##
            Assembly Method        :: IDBA_UD v. 1.1.1
            Genome Representation  :: Full
            Expected Final Version :: Yes
            Genome Coverage        :: 6x
            Sequencing Technology  :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 05/31/2018 16:08:54
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS+
            Annotation Software revision      :: 4.5
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 7,187
            CDS (total)                       :: 7,141
            Genes (coding)                    :: 6,698
            CDS (coding)                      :: 6,698
            Genes (RNA)                       :: 46
            rRNAs                             :: 1 (5S)
            complete rRNAs                    :: 1 (5S)
            tRNAs                             :: 41
            ncRNAs                            :: 4
            Pseudo Genes (total)              :: 443
            Pseudo Genes (ambiguous residues) :: 256 of 443
            Pseudo Genes (frameshifted)       :: 184 of 443
            Pseudo Genes (incomplete)         :: 61 of 443
            Pseudo Genes (internal stop)      :: 13 of 443
            Pseudo Genes (multiple problems)  :: 71 of 443
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..4354
                     /organism="Acidobacteria bacterium"
                     /mol_type="genomic DNA"
                     /isolate="gp5 AA66"
                     /isolation_source="meadow soil"
                     /db_xref="taxon:1978231"
                     /environmental_sample
                     /geo_loc_name="USA: Angelo Coast Range Reserve, CA"
                     /lat_lon="39.74 N 123.63 W"
                     /collection_date="2014-10-09"
                     /note="metagenomic; derived from metagenome: soil
                     metagenome"
     gene            complement(<1..619)
                     /locus_tag="DMG14_20895"
     CDS             complement(<1..619)
                     /locus_tag="DMG14_20895"
                     /inference="COORDINATES: protein motif:HMM:PF00069.23"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="protein kinase"
                     /protein_id="PYS37532.1"
                     /translation="MIGKIFGHYEIQSLLGTGGMGEVFQARDTKLGRTVAIKVLPEAF
                     AENGERIARFEREAKLLASLNHPNIAALYGMEISDGRHFLVMELVEGETLADRLRRGA
                     IPVDESLHIARQIAEALEAAHERGIVHRDLKPANIKITPDDKVKVLDFGLAKAMQDAP
                     ETATLSNSPTLSLAATQAGVILGTAAYMSPEQAKGMQADPRSDIFS"
     gene            complement(920..2686)
                     /locus_tag="DMG14_20900"
     CDS             complement(920..2686)
                     /locus_tag="DMG14_20900"
                     /inference="COORDINATES: protein motif:HMM:PF13360.4"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="pyrrolo-quinoline quinone"
                     /protein_id="PYS37533.1"
                     /translation="MRTRIALSISMMAFCGGLCLFAQVREFRQVTEAMLRNPSPGDWL
                     NWRRTDSAWGYSPLDQINRQNVSQLQLAWSWAMDDTGANEAAPLVYDGIMYLPNPRGV
                     IQALDAATGDLIWQYRPQINRPAAAASAGGGEQTSIPRLAQGNAAAADGRGIQRNIAI
                     YGDKIFATTGDAHIVALDARTGKVVWNTKVADPQLGYEYTSGPIIVRGKVIAGITGCT
                     RYKDDVCFITGHDAATGKELWRTSTIARPGEPGGETWGDLPLTFRAGGDAWIAGSYDP
                     ETNLIYWGTAQAKPWARVARGTDGDALYTNSTLALDPDTGKIKWYYQHLPGETQDMDE
                     VFENILIDVGGRKSLFKMGKLGILWQLDRTNGQFIHATDLGYQTIVQVNPQNGKVTYL
                     PGKIPQIGVEVDMCPSTAGFKSWRAMAFSPQTNAFYIPLSLHCEKATFSPVEKVVGRG
                     GTGPVARTDYKHPESGGNLGEFLALDVRTGKVLWRQRTPSPANTAALATAGGVVFGGD
                     WDRHMYAYDAGTGKILWQTRLPTSAQGFPITYVAKGKQYVAMPAGIGGGSWSTLITPE
                     LAPEIKRPNSGNTLLVFALPGK"
     gene            2797..2952
                     /locus_tag="DMG14_20905"
     CDS             2797..2952
                     /locus_tag="DMG14_20905"
                     /inference="COORDINATES: protein motif:HMM:TIGR01552"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="PYS37534.1"
                     /translation="MRKSKEIHFSQARQNLSGIIDRLPHSGPVTILRHGKPAAVAISN
                     EPVKEIK"
     gene            2949..3458
                     /locus_tag="DMG14_20910"
     CDS             2949..3458
                     /locus_tag="DMG14_20910"
                     /inference="COORDINATES: protein motif:HMM:PF05163.10"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="PYS37535.1"
                     /translation="MTVQDLEDLYDYGYWATKKLVGVMSQLMPEQFTQSVGGSYGSIR
                     NTLVHILSAEWGWLDRCGGPKRGPRLNPAGYPTLESLLEIWSKVEGYVREFLSTLKDE
                     DLRRNAEYMNDAGEKRSMPIGELMQHAANHGVHHRGQVALMLRLLGYAPGNFDILFYF
                     AERRGVPAF"
     gene            complement(3455..4348)
                     /locus_tag="DMG14_20915"
     CDS             complement(3455..4348)
                     /locus_tag="DMG14_20915"
                     /inference="COORDINATES: protein motif:HMM:PF07969.9"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="PYS37536.1"
                     /translation="MNAHANSLGLTAVINASNLSDQEYPLRLWREDKLTIRMRPLFPA
                     DSPQEVEARILNNFSQSGRAVGDDLFRVAGFGERIGGNDTVSPMFEPTARVIAKHGWL
                     LQQHSITLAENDFHLSAFRSIARDYPIDGLRWSLLHLQSIDSPRLKTLMELGAGASAQ
                     TWTYLSTGGGPPFRRIVESGIRAGVGTDSTNVSALDPWLSLFYMTTGRNLAGTLTNDG
                     QQISRVEALRLYTEGAAWFSFDEHHIGSFVEGKYADLAVLSDDYLTVSDARIRKIESV
                     LTLLAGKVVHATGPFSQLHKD"
BASE COUNT      898 a   1253 c   1229 g    974 t
ORIGIN      
        1 acgaaaaaat gtcgctgcgc ggatcggctt gcattccttt tgcctgctcc ggcgacatgt
       61 atgccgcggt gccaagaatc acgccggcct gcgtcgctgc aaggctcagc gtaggcgagt
      121 tggacaaggt tgcagtctcc ggcgcatctt gcatggcctt cgcgagacca aagtccaaaa
      181 ccttcacctt gtcatccggc gtgatcttga tgttggctgg tttcagatcg cgatggacga
      241 tgcctcgctc atgcgcagct tcgagcgctt cggcaatctg tcgtgcgatg tggagagatt
      301 cgtccaccgg aatcgctccg cgccggaggc ggtcggcgag ggtttctccc tcaaccagtt
      361 ccatcacgag gaagtgtcta ccgtcagaaa tctccatccc gtacagcgct gcaatgttcg
      421 ggtggttcag cgacgcgagc agtttcgctt cgcgttcgaa ccgcgcgatg cgttcgccat
      481 tttcagcaaa cgcctccggc aacactttga tcgcaaccgt acggccgagc ttcgtgtcgc
      541 gtgcttgaaa cacttctccc atcccgccgg ttccaagcag gctctggatt tcgtaatgcc
      601 cgaagatttt tccgatcatg aaggccctca agtttgtcca ttttgcgcac ggcggagccg
      661 tgtcaggaaa ttagccgggg gtttagcgag cgggagcgag cgcaaccccc ggttgtctga
      721 aaccaaaaac aaccgcaccc tgtaaagggt gcgaggagtc ctcgacaccc cttcagggtg
      781 cgtttcccga gctatgaagt tattccaggg gtacgcaaaa aacgcatacc tctggctaat
      841 ttcctagcac cgctccgcgg tgcgaatgtc caaactccag agcaccgctc cgcggtgcct
      901 acagtgggtg cgaataggat cacttccctg gcaatgcaaa tacgagaagg gtattcccag
      961 aattcggccg cttaatttca ggggccaatt ccggtgtgat aagcgtggac caactaccgc
     1021 caccgatgcc tgcgggcatc gcgacgtatt gcttcccttt cgcgacgtac gtgatcggaa
     1081 atccctgagc agaagtcggc agccgcgttt gccaaaggat ttttcctgtg ccggcatcgt
     1141 aagcgtacat gtggcgatcc caatcaccgc cgaagaccac gccgccggcc gttgcgagcg
     1201 cggcggtgtt tgccggcgat ggagtacgtt ggcgccacaa cacctttccg gtacggacat
     1261 ccaatgccaa gaattcacca agattgccac cggattccgg atgcttgtaa tccgtgcgcg
     1321 cgaccggacc ggtgccgcct ctgccgacga ccttttcgac agggctaaat gtcgctttct
     1381 cacaatgcag gctaagcgga atgtagaacg cgttcgtttg tggactaaag gccatcgcgc
     1441 gccagctctt gaacccggca gtgcttgggc acatgtcgac ctcgacgccg atttgcggaa
     1501 tcttgccggg aaggtatgtg actttcccgt tctgcggatt gacctgcacg atcgtctgat
     1561 aaccgagatc ggtcgcgtgg atgaactggc cattggtccg atcgagctgc cagagaatgc
     1621 ccagcttgcc catcttgaaa agcgacttgc gtcctcccac atcgatcaga atgttctcga
     1681 atacttcatc catgtcctgc gtttcgccgg gcaggtgttg gtagtaccac ttgatctttc
     1741 ccgtgtccgg gtcaagcgca agtgtcgagt tggtataaag agcatcaccg tcggtgcctc
     1801 gcgcaactct ggcccatggc ttcgcttgcg cagtgcccca atagatgaga ttggtttcgg
     1861 gatcgtagct gccggcaatc caggcatcac cgccggcgcg aaacgtcagc ggcagatcac
     1921 cccacgtctc gccaccgggt tcgcccgggc gagcgatcgt tgaagtgcgc cacagttctt
     1981 ttccagtggc tgcatcatgg ccggtgatga agcagacatc atctttatag cgcgtgcaac
     2041 cagtgatgcc cgcgatcacc ttgcctcgaa cgatgatcgg tcccgatgtg tattcgtacc
     2101 cgagttgtgg atcggcaact ttagtgttcc acacgacctt ccctgtccga gcatccaggg
     2161 caacgatgtg tgcatctccc gttgtcgcaa aaatcttgtc gccatagatc gcgatgtttc
     2221 tctgaattcc tctgccgtcg gccgccgcag cgttgccttg ggcgaggcgt ggaatcgatg
     2281 tctgttcccc accgccggca gacgcagccg cagcaggccg gtttatttgt ggacggtact
     2341 gccagatcag gtcgccagtc gcggcatcca gcgcttgtat gacgccgcgc ggattcggca
     2401 ggtacatgat gccgtcatag accagcggcg cggcttcatt ggcgccggta tcatccatcg
     2461 cccacgacca cgcgagctga agctggctta cgttctgccg attgatttga tccagcggac
     2521 tgtaacccca cgcgctgtcg gtgcgcctcc agttcaacca atcgcctgga gacggattcc
     2581 gcagcatcgc ctcggtgacc tggcgaaatt ctctgacctg cgcgaagagg caaagccctc
     2641 cgcaaaacgc catcatggag atggaaagtg cgattcgcgt tctcatacga ttgccctcat
     2701 ctgaatgtct gcgcgattct attccggcca gccaccggtt caaacgtttt tctgggattt
     2761 tgagatttgt gcaatcagaa tgtacaattc agacacatga gaaaatcgaa agaaattcat
     2821 ttcagtcaag cacgccaaaa cttaagcggc attattgacc gccttccgca ctccggacct
     2881 gtgacgattt tgcggcatgg aaaaccggcc gcggtggcga tcagtaacga gccggttaag
     2941 gagatcaaat gaccgttcaa gatcttgaag atctttacga ctacggctac tgggcgacca
     3001 aaaagctagt tggtgtcatg tcgcagctta tgccagaaca attcacgcag tctgtgggtg
     3061 gaagctatgg gtccatacgg aacacgttgg tgcatatcct cagcgctgaa tggggctggc
     3121 tcgatcgttg cggcggcccg aagcggggtc cacgtctcaa tcctgccggc tatccaactc
     3181 tggaatcgct gctcgaaatc tggagcaaag tcgaagggta cgtgcgcgag tttttgtcca
     3241 cgctgaagga tgaagacctt cgtcgcaatg ctgaatatat gaatgacgct ggcgagaagc
     3301 gctccatgcc aataggagaa ctgatgcagc acgcggccaa tcatggcgtg caccaccgtg
     3361 ggcaggtggc gctgatgctg agattgctcg gctacgcgcc cggcaatttc gacatcctct
     3421 tttactttgc cgaacggcgc ggcgtacctg ccttttaatc cttgtgtagc tgcgaaaacg
     3481 gcccggtcgc gtgcacgact ttccctgcga ggagtgttag tacggattcg atttttcgaa
     3541 tccgagcatc ggagactgtg aggtagtcgt cactcaggac agccagatcg gcgtacttgc
     3601 cttccacaaa agatccgatg tggtgctcgt cgaacgagaa ccacgcggcg ccctccgtat
     3661 acaaacgaag cgcctccacg cgtgagatct gctggccgtc gttggtcagt gttcccgcga
     3721 ggtttcgtcc cgtcgtcata tagaacagcg acaaccaggg atcgagagct gaaacgttgg
     3781 ttgaatcggt gcccactccc gcccggattc cggactcgac gatccggcgg aatggtggtc
     3841 cgccgcctgt tgatagatag gtccacgtct gggcactggc tccggcgcca agttccatca
     3901 acgttttcag ccgcggcgaa tcgatgcttt ggagatgcag gagtgaccag cgaaggccgt
     3961 caatcggata atcgcgtgca atcgatcgaa acgcgctgag atgaaaatcg ttttctgcaa
     4021 gcgtgattga atgttgctgc agtagccaac catgcttcgc aatgacgcgg gcggtaggct
     4081 cgaacatcgg cgatacagtg tcgttgccgc cgattctctc gccgaagcct gccactctga
     4141 ataaatcgtc gccgactgcg cgaccagatt ggctgaagtt gttcaggatt cgtgcttcca
     4201 cctcttgcgg cgagtccgct ggaaacagcg gccgcatgcg gatggtgagc ttatcttccc
     4261 gccacagccg gagcgggtac tcctgatccg agagattact cgcattgatg actgcagtca
     4321 ggccgaggct gttcgcgtga gcgttcaatt gggc
//