LOCUS       SEEA01000137            6357 bp    DNA     linear   ENV 12-FEB-2019
DEFINITION  Proteobacteria bacterium isolate PMG_207 scaffold_25563, whole
            genome shotgun sequence.
ACCESSION   SEEA01000137 SEEA01000000
VERSION     SEEA01000137.1
DBLINK      BioProject: PRJNA272922
            BioSample: SAMN10678648
KEYWORDS    WGS.
SOURCE      Proteobacteria bacterium (phyllosphere metagenome)
  ORGANISM  Proteobacteria bacterium
            Bacteria; Proteobacteria.
REFERENCE   1  (bases 1 to 6357)
  AUTHORS   Crombie,A.T., Larke-Mejia,N.L., Emery,H., Dawson,R., Pratscher,J.,
            Murphy,G.P., McGenity,T.J. and Murrell,J.C.
  TITLE     Poplar phyllosphere harbors disparate isoprene-degrading bacteria
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 115 (51), 13081-13086 (2018)
   PUBMED   30498029
REFERENCE   2  (bases 1 to 6357)
  AUTHORS   Crombie,A.
  TITLE     Direct Submission
  JOURNAL   Submitted (18-JAN-2019) School of Biological Science, University of
            East Anglia, Norwich Research Park, Norwich NR4 7TJ, United Kingdom
            (Great Britain)
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Date          :: 2018
            Assembly Method        :: IDBA-UD v. 1.1.1
            Genome Representation  :: Full
            Expected Final Version :: Yes
            Genome Coverage        :: 8x
            Sequencing Technology  :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 02/01/2019 02:58:17
            Annotation Pipeline               :: NCBI Prokaryotic Genome Annotation Pipeline
            Annotation Method                 :: Best-placed reference protein set; GeneMarkS-2+
            Annotation Software revision      :: 4.7
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total)                     :: 5,487
            CDSs (total)                      :: 5,435
            Genes (coding)                    :: 5,384
            CDSs (with protein)               :: 5,384
            Genes (RNA)                       :: 52
            tRNAs                             :: 46
            ncRNAs                            :: 6
            Pseudo Genes (total)              :: 51
            CDSs (without protein)            :: 51
            Pseudo Genes (ambiguous residues) :: 0 of 51
            Pseudo Genes (frameshifted)       :: 10 of 51
            Pseudo Genes (incomplete)         :: 38 of 51
            Pseudo Genes (internal stop)      :: 4 of 51
            Pseudo Genes (multiple problems)  :: 1 of 51
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..6357
                     /organism="Proteobacteria bacterium"
                     /mol_type="genomic DNA"
                     /isolate="PMG_207"
                     /isolation_source="Deciduous woodland, University campus"
                     /host="Populus alba"
                     /db_xref="taxon:1977087"
                     /environmental_sample
                     /geo_loc_name="United Kingdom: Norwich"
                     /lat_lon="52.6188 N 1.2451 E"
                     /collection_date="24-Aug-2015"
                     /note="metagenomic; derived from metagenome: phyllosphere
                     metagenome"
     gene            1..264
                     /locus_tag="EOP11_06655"
     CDS             1..264
                     /locus_tag="EOP11_06655"
                     /inference="COORDINATES: protein motif:HMM:PF04266.12"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
/codon_start=1 /transl_table=11 /product="ASCH domain-containing protein" /protein_id="RZA07801.1" /translation="MLLDAEAKPGCILRTERVLIHKFMDVPAEIAIAEGEGDLSLAYW RKVHGELWRPCLTAWGLAAMEEASVITEFFAIVYRGDQAPLPS" gene complement(259..810) /locus_tag="EOP11_06660" CDS complement(259..810) /locus_tag="EOP11_06660" /inference="COORDINATES: ab initio prediction:GeneMarkS-2+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS-2+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="RZA07802.1" /translation="MKKHARIDFEGTIKYVSLSPHGDPEGIVLDDGSFVKAPPHSLVK KELFKVGAKVSGTGEIISEEPHPVLHHAQIKAGSEILSDDSGDEDEREELKEAHKADL KSRPDAKEEKLTLKGRIAAIATKPKGEVDRVILEDGTSIHVSKDMKLTRDDCEIGSMI QVEGKTRLYGVARFMKAEIIKQL" gene complement(1022..1561) /locus_tag="EOP11_06665" CDS complement(1022..1561) /locus_tag="EOP11_06665" /inference="COORDINATES: protein motif:HMM:PF02660.13" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="RZA07803.1" /translation="MIFAVYLLMAYLVGSIPSGVLLARAMRAADPRHGDDKKIRAITF VLDFLKGFLPVIFAYLYWPDDSFPAAIGAFFAVLGHCYSIFLFFQGGKGLAAGAGALF PLAPFAMLVALAAWGFSYYVFRIKPQAALIAILSFFGALFIFGTAPHLLAVAGACALM MVRRHRADLDGLLAGVVSR" gene 1695..2420 /locus_tag="EOP11_06670" CDS 1695..2420 /locus_tag="EOP11_06670" /inference="COORDINATES: ab initio prediction:GeneMarkS-2+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS-2+." 
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="RZA07804.1" /translation="MAFDPNNKGFLSLVPFTLPLALIAPYAFVASRFRGLVWVWESGY RPFWICLAALVSLAVISAVLLHRLQKMGSTLYRVGVVIATVQTAYLAMSERRHSLLVL VFALFAAQVFLAEKIKNVLKLPFFDSRRRWWESYPKAIPGLQVEVSAENGDTMEVRLS NFGLEGCFVFSENGTIPFHPQMVRIFTGTQTLLEADVEAVERTHDGFGWGLRFSSSAL EGDWSKDLQDYLGYLRRSGYEVA" gene 2407..2904 /locus_tag="EOP11_06675" CDS 2407..2904 /locus_tag="EOP11_06675" /inference="COORDINATES: ab initio prediction:GeneMarkS-2+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS-2+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="RZA07805.1" /translation="MKLRNGFVVAMAFLAVACSKTPKADTPEGALHRYVTLAFEARSA GAKKDLMELSTGEALAYLQRMDDADFKKQFLDSNLKFVSLKAKDKRVENSGDVSLVYE LAYKAGQPANATVLTNKKIAYLTKEGEGWKIKATKNMKTFIERKEDLLITPETTEQNQ APAAK" gene complement(2851..3621) /locus_tag="EOP11_06680" CDS complement(2851..3621) /locus_tag="EOP11_06680" /inference="COORDINATES: protein motif:HMM:PF01790.16" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="prolipoprotein diacylglyceryl transferase" /protein_id="RZA07806.1" /translation="MQPILFHLFGWPIQAYGFFIALGYILGLILVRRLAISRRRHPAP YTDLSFFALIVGLIGARALFVLTNLDYFSSHADEIFYFWGGGLVFYGGFLLAFPFVLW FVSFRGLPLKMSLDILAPGLALGHAVGRLGCFFAGCCHGRVCDLPWGVQMNSLQVEPA LRGLALHPTQLYEAAGLFLLSGILSFFVITKRLRDGLVAVAYAMGYAILRTGVEFYRG DSIRGTLAGTPFSTSQLIALALLGGGSLVLLRGFGRDQ" gene complement(3646..4329) /gene="lspA" /locus_tag="EOP11_06685" CDS complement(3646..4329) /gene="lspA" /locus_tag="EOP11_06685" /EC_number="3.4.23.36" /inference="COORDINATES: protein motif:HMM:TIGR00077.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=1 /transl_table=11 /product="signal peptidase II" /protein_id="RZA07807.1" /translation="MLGAETAINPKIIIFIFIILCFARAEIGLGVKEELRRWEHKGKF IDLMPASTYPIEARYPVRSMMPRKYLILMATVGALVSFDQLTKLLMTETFALGMSRPV IKDFFHLSLVHNKGAAFGIMANLAPKMRDPILFLIPCLVLCLILFAFTRLKENQTLST YALSLVIGGAVGNLADRLRLGYVVDFLDFHWRNQYHFPAFNLADTAITCGVIMLFLSL FYEKETPGA" gene 4328..4654 /locus_tag="EOP11_06690" CDS 4328..4654 /locus_tag="EOP11_06690" /inference="COORDINATES: ab initio prediction:GeneMarkS-2+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS-2+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="RZA07808.1" /translation="MAFADFDCKAGQGDTALHVELDSATQIGYVTVYEGNASSLVKGS YSVESEVLSQKIVLALEGDGGKIDFVEKTQFCGRAGCWDSKPTAWKAKWSGKQGDAWF SCNETN" gene 4623..5363 /locus_tag="EOP11_06695" CDS 4623..5363 /locus_tag="EOP11_06695" /inference="COORDINATES: protein motif:HMM:PF13442.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="cytochrome c" /protein_id="RZA07809.1" /translation="MPGSPAMKLTNAGTYGGGLVICGAAALYFLATTAGAITFTSRES TTATGGPVTNSIALIEEKGQEIWMMNQSHHGAMASSEKWDRLAIVVKKENGVKRARFY QLEPGPLSWNPKAREVPRRAACYTCHANGPRGIRPQSALAWHEWPKLVAWNLKIKTYG KIALEDPPATPGQTPVKFSGPMANERLKVAACTKCHGGSGPFARNALLRQQETAIHFM LKEGIMPPMGFKISPQERQEIEEFLAGF" gene complement(5385..>6357) /locus_tag="EOP11_06700" CDS complement(5385..>6357) /locus_tag="EOP11_06700" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011163987.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/codon_start=2 /transl_table=11 /product="sigma-54-dependent Fis family transcriptional regulator" /protein_id="RZA07810.1" /translation="INKLAQVDTSVLIRGESGTGKELVAKAIHMNGPRKDERFVAVNC SAIPEALIESEFFGHEKGAFTGADGRKIGKFQYADGGTLFLDEIGDISPALQVKLLRA LQEQRFTPVGANREVEVNVRIVAATNRNLEEMIKKGEFREDLFYRLNVLPIFLPPLRE RKDDVLALVEHFLQKFNGTYGAKIQGVADEARELLMKYNWPGNIRELENVMEHAFVIE SGARITPASLPDSIRGLSRSGKSAVASDNQGADGKENISYAINLNNLDFQNQKEEFEK HFIISALKVFNGRINQTALHANIPKKTLLRKLEKYGLTAKDYQDDGD" BASE COUNT 1509 a 1682 c 1837 g 1329 t ORIGIN 1 attctactcg atgccgaggc aaagcccggg tgcatcttgc gaaccgagcg ggtgctcatc 61 cataagttta tggatgtgcc cgcggagatt gcgatcgccg aaggcgaagg ggatctatcc 121 ctggcttact ggcgcaaagt gcacggagag ctttggcggc cgtgccttac ggcgtggggg 181 ttagccgcga tggaagaggc ttcggtgatc accgagttct tcgcgatcgt ttatcgcggt 241 gatcaagccc ccttgccatc atagctgctt aatgatctcg gccttcataa agcgggcgac 301 accgtagagg cgcgttttgc cttccacctg aatcatcgag ccgatctcgc agtcgtcgcg 361 ggttagtttc atatctttag aaacgtggat ggaggtgcca tcttctaaga tcacgcgatc 421 cacttctccc ttgggcttcg tggcgatcgc cgcgatcctg ccttttagcg ttagcttttc 481 ttccttggca tcggggcggg attttagatc ggccttatgg gcctctttta gctcctcgcg 541 ctcatcctca tccccggaat catcggaaag gatctctgac ccggccttaa tctgcgcgtg 601 gtgaagcacc gggtggggct cttcgctaat aatctcgccc gtgccgctta ccttggcacc 661 aaccttaaag agctctttct ttaccaagga atggggaggg gctttcacga agctgccatc 721 atcgagcacg atcccttcgg gatcgccgtg aggagaaagg cttacgtatt taatggttcc 781 ttcaaagtcg attcgggcgt gcttcttcat ggacgctgct ccttaagctt gattggtgtt 841 taaggatttc aaagcaaagg cagcgccatc ccccttggcc gctataaggc ccgctcgagc 901 aaaaaaaggg gatatgggtg tatggaaagt gggcgcggcg cggatgaaat cacgacaacc 961 acgaccggct ttataaataa aacttagttt ttcgcggccg cgccggtggc caaagcgaag 1021 tttagcgaga aactacaccg gccagaaggc catcaagatc tgcccggtgc cggcgcacca 1081 tcatcagggc gcaggcgccc gctaccgcta acaaatgcgg cgcggttccg aagataaaga 1141 gagcgccgaa gaaagagaga atggcgatta gcgcggcttg gggcttgatc cgaaaaacgt 1201 agtagctaaa accccacgcg gccagcgcca cgagcatcgc gaaaggggcg aggggaaaga 1261 
gggcgcccgc cccggcggcg agccctttgc cgccctggaa gaataagaaa attgaatagc 1321 agtggccgag caccgcgaag aaagctccga tggcggccgg gaaagaatca tccggccagt 1381 agagataggc aaagattacg gggagaaagc ccttgagaaa atcgagcacg aaagtgatcg 1441 cgcgaatctt cttatcatcc ccgtgccgag gatccgcggc acgcatcgcc cgcgccaaaa 1501 gaaccccgct cgggatcgag ccaactagat aggccattaa caaatagacg gcaaaaatca 1561 cgataacctt cctggtagaa gtgattgcct ataaatcata ccaagcgaat cagtttaccg 1621 ctgcgattat tttcgcatcg agtcgtgtta agcgagccta cctgccaggc gggctcgcgt 1681 gctaaattac aaagatggca tttgatccca acaacaaagg cttcctgagc ctggttcctt 1741 tcacgctacc gttagccctc atcgcgccct acgccttcgt ggcctcccgc ttccgcggct 1801 tggtatgggt ttgggaatca ggctatcgcc ccttctggat ttgcttagcc gccctagtat 1861 ctctcgcggt gatcagcgcc gtgctcctcc accgtttgca gaaaatgggc tccaccctct 1921 accgcgtagg ggtggtgatc gctacggtgc aaacggccta cctcgcgatg tcggagcggc 1981 ggcactccct cctcgttttg gtttttgcgc tcttcgcggc gcaggttttt ctcgccgaaa 2041 aaatcaagaa cgtgctgaag cttcccttct ttgattcccg ccgccgttgg tgggaatctt 2101 atccgaaagc gattccgggg ctccaggtag aggtttccgc cgaaaacgga gatactatgg 2161 aagtacggtt atcgaacttc ggcttggaag gttgttttgt tttctccgaa aacggaacga 2221 tcccctttca tccgcagatg gtgcgcattt tcaccggcac gcaaacgctt ttagaagcgg 2281 atgtggaagc cgtggagcgc acccacgatg gctttggctg gggcttgcgt ttcagctcat 2341 ccgccttgga aggcgactgg agcaaagatc tccaggacta tttaggctat ttacgaagga 2401 gtgggtatga agttgcgtaa tgggttcgtg gtagcgatgg catttctagc ggtagcttgc 2461 agcaaaacgc ccaaggccga tacccccgag ggagcgctcc atcgctacgt aaccttagcg 2521 tttgaggcgc gcagcgcggg cgcgaaaaaa gatttgatgg agctaagcac cggcgaagcc 2581 ctggcctacc tgcagcggat ggatgatgcc gatttcaaga aacagtttct agattcgaac 2641 ctaaagttcg tgagccttaa agccaaagat aaacgcgtag aaaactccgg cgacgttagc 2701 ctagtttacg agctagccta caaggcgggc cagcccgcca acgccaccgt gctaaccaat 2761 aagaagatcg cttacctaac gaaggaaggc gaaggctgga aaatcaaagc tacgaagaac 2821 atgaaaactt ttatcgagcg taaagaagat ctactgatca cgcccgaaac cacggagcag 2881 aaccaggctc ccgccgccaa gtaaagccaa agcaatcagc tgcgaagtgg aaaaaggggt 2941 gcccgcgagg 
gtgcccctta tgctatctcc ccggtaaaat tccacgccgg tgcgcaagat 3001 cgcgtagccc atcgcatacg ccacggccac caaaccatcg cgcagccgct tcgtgatcac 3061 gaagaaactc aaaatcccgc tcaagagaaa taatccggcg gcttcgtaga gctgagtggg 3121 gtgcagggcc aagccacgca gcgcgggctc aacctgcaag ctattcatct gcactcccca 3181 gggaagatcg caaacgcgcc cgtggcagca accggcaaaa aaacagccca ggcggcccac 3241 cgcgtgcccc agcgctaagc ccggcgctag gatatcgagg ctcatcttta acggtaagcc 3301 gcggaagctc acgaaccaca gcacgaatgg aaaggctaag agaaagccgc cgtagaaaac 3361 gagcccgccg ccccaaaaat agaagatctc atcggcgtgc gaagaaaaat aatcaaggtt 3421 ggtgagcacg aagagcgccc gggctccgat gaggcctacg atcagcgcga aaaaagaaag 3481 atcggtatag ggcgcggggt ggcggcggcg agaaattgcg agccgccgca cgagaattaa 3541 gcccaggata tatcccagcg cgataaaaaa tccgtaggcc tggatcggcc agccaaaaag 3601 atgaaacagg atgggctgca tgaaaaaatc cctagggatg agggcttagg cgccgggggt 3661 ttctttttcg tagaagaggg agagaaaaag catgatcacg ccgcaggtga tggccgtatc 3721 ggcgaggtta aaggcgggaa aatggtattg gttgcgccag tgaaaatcga gaaaatccac 3781 cacgtagccc aggcgcaggc gatccgcgag gttgcccacg gctccgccga tcacgagcga 3841 gagcgcgtag gtgctgagcg tttgattttc ttttagccga gtaaaggcga agagaatcag 3901 gcagagcact aagcagggaa tgagaaagag aataggatct cgcatcttgg gcgcgaggtt 3961 ggccatgatc ccaaaggccg cgcccttgtt atgcacgagc gagagatgaa aaaaatcttt 4021 gatcacgggc cggctcatgc cgagcgcgaa agtttcggtc atcaaaagct tggtgagctg 4081 atcgaaagaa acgagcgcgc ccacggtggc catcaggatt aagtatttac ggggcatcat 4141 gctcctaaca gggtagcgcg cctcgatcgg ataagtcgat gccggcatca aatctatgaa 4201 ttttcctttg tgctcccacc ggcgtaattc ctctttcacg cccaagccaa tttcggcgcg 4261 ggcaaaacaa aggatgatga agatgaaaat gatgatttta ggattgatgg ccgtttcggc 4321 gccgagcatg gcttttgcgg atttcgactg caaggccggc caaggcgata cggccctgca 4381 cgtggaatta gatagcgcca cccagatcgg ctacgtaacg gtttacgaag gcaacgcaag 4441 ttcgctggta aaggggtctt attccgtgga atcggaagtt ctgagccaga aaatcgttct 4501 ggccctcgaa ggcgacggcg ggaagattga tttcgtggag aagacccaat tttgcggacg 4561 ggccggctgc tgggattcta agcccactgc ttggaaagca aagtggagcg ggaagcaggg 4621 tgatgcctgg ttctcctgca 
atgaaactaa ctaatgccgg cacgtatggc ggcggcctcg 4681 taatctgcgg ggccgccgct ctttattttt tggccaccac ggctggcgcg atcaccttca 4741 cctcgcgaga atcaaccacc gccaccggcg ggccggtaac gaattcgatc gcgctgatcg 4801 aggaaaaggg ccaagaaatt tggatgatga accagagcca ccacggagcc atggctagct 4861 ccgaaaagtg ggatcggttg gcgatcgtgg tgaagaaaga aaacggcgta aagcgagcgc 4921 gattttatca gttagagccc ggcccattga gctggaatcc gaaggcgcgg gaggtgccgc 4981 gaagagccgc ctgttatact tgccacgcca atggcccacg ggggataagg ccgcagagcg 5041 cgctagcttg gcacgaatgg cctaagctcg tggcctggaa tttaaagatc aaaacctacg 5101 gaaagattgc gctggaggat ccgcccgcca cgccggggca aacccccgtg aagttttcgg 5161 ggcccatggc caacgaacgt ttaaaggtag ccgcttgcac gaagtgccat ggcggctccg 5221 ggccttttgc ccggaatgca ctccttcggc agcaggaaac ggcgattcac ttcatgttaa 5281 aagaggggat catgccgccc atgggattta aaatttctcc gcaggaaagg caagaaattg 5341 aagagtttct cgcgggcttt taattagccc gcgaagcttt ctggttaatc gccatcatcc 5401 tgataatcct tcgcggttag cccgtatttt tcgagcttcc gcaggagggt tttcttagga 5461 atattcgcgt gcagggccgt ttggttgatg cggccgttga aaactttcag cgcgctaatg 5521 atgaagtgct tttcaaactc ttccttctgg ttttggaaat ccagattatt taggttgatg 5581 gcgtagctga tattttcttt gccatcggcg ccttggttat cggaggccac cgcggatttt 5641 ccgctgcggg agaggccacg gatagaatcg ggcaaggaag cgggcgtgat gcgggcgccg 5701 ctttcgatca cgaaagcatg ctccatcacg ttttcgagtt cgcggatgtt tccgggccag 5761 ttgtatttca tcagcaattc gcgggcttca tcggccacgc cctgaatctt ggcgccgtag 5821 gtgccattaa acttctgcaa aaaatgctct acgagggcga gcacgtcgtc tttgcgctcg 5881 cgcaaggggg gcaagaaaat aggaagcacg tttaggcggt agaagagatc ctcgcggaac 5941 tctccctttt taatcatctc ttctaagttt cggttcgtgg cggccacgat ccgcacgttc 6001 acttccactt cccggttagc gcccacgggg gtgaagcgct gctcctgaag ggcccgcaaa 6061 agctttacct gcagggcggg cgagatatcg ccgatctcat cgagaaagag cgtgccgcca 6121 tcggcgtatt gaaactttcc aattttgcgg ccatcggcgc cggtgaaggc tcctttttcg 6181 tggccgaaga attcgctttc gatcaaggct tcggggatcg cgctgcaatt taccgccacg 6241 aacctttcat ctttgcgggg gccgttcatg tggatggcct tggccacgag ctccttgccg 6301 gtgccgctct ctccgcgaat cagcacggag 
gtatccactt gggcgagttt gttgatg //