LOCUS       SEHD01000796            3406 bp    DNA     linear   ENV 12-FEB-2019
DEFINITION  Xanthomonadaceae bacterium isolate PMG_165 scaffold_64764, whole
            genome shotgun sequence.
ACCESSION   SEHD01000796 SEHD01000000
VERSION     SEHD01000796.1
DBLINK      BioProject: PRJNA272922
            BioSample: SAMN10678729
KEYWORDS    WGS.
SOURCE      Xanthomonadaceae bacterium (phyllosphere metagenome)
  ORGANISM  Xanthomonadaceae bacterium
            Bacteria; Proteobacteria; Gammaproteobacteria; Xanthomonadales;
            Xanthomonadaceae.
REFERENCE   1  (bases 1 to 3406)
  AUTHORS   Crombie,A.T., Larke-Mejia,N.L., Emery,H., Dawson,R., Pratscher,J.,
            Murphy,G.P., McGenity,T.J. and Murrell,J.C.
  TITLE     Poplar phyllosphere harbors disparate isoprene-degrading bacteria
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 115 (51), 13081-13086 (2018)
   PUBMED   30498029
REFERENCE   2  (bases 1 to 3406)
  AUTHORS   Crombie,A.
  TITLE     Direct Submission
  JOURNAL   Submitted (18-JAN-2019) School of Biological Science, University of
            East Anglia, Norwich Research Park, Norwich NR4 7TJ, United Kingdom
            (Great Britain)
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Date          :: 2018
            Assembly Method        :: IDBA-UD v. 1.1.1
            Genome Representation  :: Full
            Expected Final Version :: Yes
            Genome Coverage        :: 9x
            Sequencing Technology  :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 02/01/2019 14:25:40
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS-2+
            Annotation Software revision      :: 4.7
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 9,204
            CDSs (total)                      :: 9,106
            Genes (coding)                    :: 8,911
            CDSs (with protein)               :: 8,911
            Genes (RNA)                       :: 98
            rRNAs                             :: 1 (16S)
            partial rRNAs                     :: 1 (16S)
            tRNAs                             :: 88
            ncRNAs                            :: 9
            Pseudo Genes (total)              :: 195
            CDSs (without protein)            :: 195
            Pseudo Genes (ambiguous residues) :: 0 of 195
            Pseudo Genes (frameshifted)       :: 47 of 195
            Pseudo Genes (incomplete)         :: 144 of 195
            Pseudo Genes (internal stop)      :: 6 of 195
            Pseudo Genes (multiple problems)  :: 2 of 195
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..3406
                     /organism="Xanthomonadaceae bacterium"
                     /mol_type="genomic DNA"
                     /isolate="PMG_165"
                     /isolation_source="Deciduous woodland, University campus"
                     /host="Populus alba"
                     /db_xref="taxon:1926873"
                     /environmental_sample
                     /geo_loc_name="United Kingdom: Norwich"
                     /lat_lon="52.6188 N 1.2451 E"
                     /collection_date="24-Aug-2015"
                     /note="metagenomic; derived from metagenome: phyllosphere
                     metagenome"
     gene            complement(1..113)
                     /locus_tag="EOP92_31710"
                     /pseudo
     CDS             complement(1..113)
                     /locus_tag="EOP92_31710"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_005663905.1"
                     /note="incomplete; partial in the middle of a contig;
                     missing start and stop; Derived by automated computational
                     analysis using gene prediction method: Protein Homology."
                     /pseudo
                     /codon_start=1
                     /transl_table=11
                     /product="hydrolase 1, exosortase A system-associated"
     gene            complement(163..1014)
                     /locus_tag="EOP92_31715"
     CDS             complement(163..1014)
                     /locus_tag="EOP92_31715"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_015768735.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="hydrolase 2, exosortase A system-associated"
                     /protein_id="RZA31334.1"
                     /translation="MKAPGTPPADAFFLESEGRQRFCLYHPPAGPARGAVLYVHPFAE
                     ELNRTRRMAAMQARALAVHGFGVLQIDLTGCGDSSGDFGDARWELWKDDLAAGAAWLR
                     QRVDGRFTLWGLRLGALLALDYARGARHPVDAMLLWQPVLKGTQHLNQFLRLHMAGAL
                     LTEGNGAAHGGTEALRATLHAGGALEIAGYELAPQLARALEALPSLEALAPTCPVDWI
                     ETVSAAGQEAAPGALRAVAAWRAAGVGVRLHPVQCAPFWATSEITENTAWIDATSDAL
                     LERMHGT"
     gene            complement(1011..1259)
                     /locus_tag="EOP92_31720"
     CDS             complement(1011..1259)
                     /locus_tag="EOP92_31720"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_005663900.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="acyl carrier protein"
                     /protein_id="RZA31335.1"
                     /translation="MYFDETKTILIDVLSLGENGRRLDADSPLLGALPELDSMAVISL
                     IAALEEHFDIAIDDDDISASTFATLGTLAAFVASKRDA"
     gene            complement(1299..2759)
                     /locus_tag="EOP92_31725"
     CDS             complement(1299..2759)
                     /locus_tag="EOP92_31725"
                     /inference="COORDINATES: protein motif:HMM:PF13440.4"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="RZA31336.1"
                     /translation="MAILRRSLIINFFSSSGATILQLVVSILLARILSPAEIGVYSMT
                     LVFVNFAQVFRDFGVTQYIQREAELTPEKLRAANGVAFTSSWLIASCLYLSSGVVGRW
                     FAEPGITPVMQVLAMGFVLIPFSSITNAMLTREFAAEKQAYVNAAGTIAFCISCLVLA
                     KLGFSSMSLAWANFINIVTSTVAFSLLRPRNLPWLPSFSHWRSIAHFGAGSLLSNSVS
                     AINNALPDILLGKLGSASQVGLLSRANSTVQIFIYVAGSTVSYGAVSYLAQTYHRGES
                     LVPVLRRATVLLTGVGWAALALTAVFGRDIMLALYGPTWLEAVPAILPLALAAATAMT
                     FHYIPLAVTAIGRPYLSAAPVLVTLLARIAFGVLLYDGSLDGFAWALCLATLVTTPVI
                     AFQQSRCLGLGTPSLLRALVPSAVVAVGSAAAGALLMALMPAALSPLTRLLLLGPVLA
                     AVWYLLLRLTRHELVGEVHRLAAPIKTRLALLRADT"
     gene            complement(3004..3357)
                     /locus_tag="EOP92_31730"
     CDS             complement(3004..3357)
                     /locus_tag="EOP92_31730"
                     /inference="COORDINATES: ab initio prediction:GeneMarkS-2+"
                     /note="Derived by automated computational analysis using
                     gene prediction method: GeneMarkS-2+."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="RZA31337.1"
                     /translation="MANLGLAADARYVRGCAGFTGFPRSTEMREKMIDFSRRMSEKLG
                     EDWKRWGTEQVTSNYLAANSPGIKALPFPKYGTPDCATSDTAFFHFIGSMRFINAKYD
                     TTSRHAIALIKGMAV"
BASE COUNT      643 a   1076 c   1157 g    530 t
ORIGIN
        1 cggtactgcg gtccgccgac gacgaccagc acgccgcgtc cgcctccgac ggtgggctgc
       61 gtcaggatcc cgtacagccg cgcgccgccg cagtcgaagg cgagggcgcg ctcccgcacc
      121 gcgggcgtgg ttccggactg ccgggggcca gcctgaagga tctcaagttc catgcatgcg
      181 ctccagtagt gcgtccgagg tcgcatcgat ccaggcggtg ttttcggtga tctcggaggt
      241 cgcccagaat ggcgcgcact ggaccggatg caggcgtacg ccgactcccg cagcccgcca
      301 tgccgcgacg gcacgaaggg cgcccggcgc cgcttcctgc cctgcggcgg ataccgtctc
      361 gatccagtcc accggacatg tcggcgccag tgcttccagc gagggcagtg cctccagcgc
      421 gcgcgctagc tgcggcgcca gctcgtagcc ggcgatctcc agcgccccgc ctgcgtgcag
      481 ggtcgcacgc aaggcttcgg tcccgccgtg cgcagcgccg ttgccttcgg tgagcagggc
      541 gccagccatg tgcaggcgca ggaactgatt caggtgctgc gtgcccttca gcacgggctg
      601 ccacagcagc atggcgtcga ccggatggcg cgcgccgcgc gcgtaatcga gcgccaacag
      661 ggcgcccagg cgcaggcccc agagtgtgaa gcgtccatca acccgctgcc gaagccaggc
      721 ggcgccggcc gccaggtcgt ccttccacag ttcccagcgc gcatcgccga agtcgccgct
      781 gctgtcccca cagccggtca ggtcgatttg caggacgccg aagccgtgga ccgccagggc
      841 gcgcgcctgc atggccgcca tgcgccgcgt gcggttcaat tcctcggcga agggatgcac
      901 gtacagcacc gcgccacgcg cgggcccggc gggaggatgg tagagacaaa atcgctggcg
      961 gccttcgctt tccaggaaaa aggcgtcggc cggcggcgtg ccgggcgcct tcatgcgtcg
     1021 cgcttggacg cgacgaaagc tgccagggta cccagcgtgg cgaaggtgct ggcgctgata
     1081 tcgtcgtcgt cgatcgcgat gtcgaaatgc tcctcgagtg cggcgatcag gctgatcacc
     1141 gccatcgagt cgagttccgg cagggcgccc agcaggggcg agtcggcgtc cagacggcgg
     1201 ccgttctcgc ccaggctcaa gacgtcaatc aggatggttt tggtctcatc gaagtacata
     1261 gcgattccag ataaagcgac gcggcccggg ggcgcgcctt aggtatcggc ccgcagcagg
     1321 gcgagccggg tcttgatggg cgcggcgagg cggtgcactt cccctaccag ttcgtgacgc
     1381 gtcaggcgca ggagcaggta ccagaccgca gccagcacgg ggccgagcag cagcaggcgg
     1441 gtgagcggcg acagcgcagc cggcatcagc gccatgagca gggcgcccgc cgccgcgctt
     1501 cccaccgcga ccacggcgct cggtaccagc gcgcgcagca gcgacggggt gcccaggccg
     1561 aggcagcggc tttgctggaa cgcgatgacg ggcgtcgtca cgagcgtcgc caggcacagc
     1621 gcccaggcga agccgtccag ggagccgtcg tacagcagca cgccgaaggc gatacgggcc
     1681 agcagggtga ccagcaccgg cgcggcgctc agatagggcc gtccgatggc cgtgaccgcc
     1741 agcggaatgt agtggaaggt catggccgtg gcagccgcca gcgccagcgg caggatggcg
     1801 ggcaccgctt ccagccaggt aggaccatag agggccagca tgatgtcgcg tccgaacacg
     1861 gccgtcaggg ccagcgcggc ccaaccgacg ccggtcaaca gcacggtggc gcgcctcagc
     1921 accggcacca gtgattcgcc gcgatgataa gtctgagcca ggtaggacac cgcgccatag
     1981 ctcaccgtcg agccggccac gtagatgaat atctggactg tcgagttggc gcggctgagc
     2041 aggccgacct ggctggccga gcccagcttc ccgagcagga tatccggaag cgcgttgttg
     2101 atcgcgctga cggaattcga tagcagcgag ccggcgccga agtgggcaat gctgcgccag
     2161 tgcgagaacg agggcagcca gggcaggttg cgcggacgca ataagctgaa ggcgaccgtg
     2221 ctggtcacga tattgatgaa attggcccat gccagactca tgctgctgaa gccgagcttg
     2281 gccagcacca ggcagctgat gcagaaggcg atggtgccgg ccgcgttgac gtacgcctgc
     2341 ttttcggccg cgaactcgcg cgtcagcatg gcgttggtga tggagctgaa gggaatcagg
     2401 acaaatccca tcgccagcac ctgcatcact ggcgtgatcc cgggctcggc aaaccacctg
     2461 ccgacgacgc cgctggacag gtagaggcag gacgcgatca gccatgacga agtgaaagca
     2521 acaccgttgg ctgcgcgcaa cttttcgggg gtgagctcgg cttcacgctg gatgtattgg
     2581 gtaacgccaa agtcgcggaa cacctgggcg aagttgacaa ataccagcgt catcgaataa
     2641 acgccgatct cggccggact caggatacgg gcgagcagga tgcttaccac caactgcagt
     2701 attgtcgctc cggacgagga aaaaaagttg atgatcagcg agcgacgcag aatcgccata
     2761 aggacagcaa tcgagaagga acagggcgcg attcagaaca cgctgcacgg ccggaagggc
     2821 aggacggcgc gcagtgggcg cggcaaacca gtcaaagaac gcacatgatg tgctgggtaa
     2881 acttctttgc tgtaacgcat aacattatat gccgtatcga catgcgcaga cgtttccatg
     2941 acgctgtttt tgtcatatct gcatcatctg ttggtgcttg gtcatatccc tgctggccgg
     3001 cactcagacc gccatgccct tgatcagcgc gatcgcatgg cgtgacgtgg tgtcatactt
     3061 ggcgttaatg aaacgcatcg agccgatgaa gtggaagaag gccgtatcgc tggtcgcgca
     3121 atccggcgtg ccatatttcg ggaatggcag cgccttgatg ccgggcgaat tcgcggccag
     3181 gtagttcgac gtaacctgtt cggtgcccca gcgtttccag tcttcgccca gcttctcgct
     3241 cattcggcgc gagaaatcga tcattttttc gcgcatctcg gtagagcggg ggaatccggt
     3301 gaagccggca cagccgcgaa cataccgcgc gtccgccgcc aggccgaggt tggccatttc
     3361 actctccgag atggtctgga tgtgcgcgcc gggaccaagc ttgggc
//