LOCUS       JACFNH010000061         7677 bp    DNA     linear   ENV 05-AUG-2021
DEFINITION  MAG: Leptonema sp. (in: bacteria) isolate BROCD019
            NODE_5128_length_7677_cov_3.881265, whole genome shotgun sequence.
ACCESSION   JACFNH010000061 JACFNH010000000
VERSION     JACFNH010000061.1
DBLINK      BioProject: PRJNA647942
            BioSample: SAMN15615540
KEYWORDS    WGS; ENV; Metagenome Assembled Genome; MAG.
SOURCE      Leptonema sp. (in: bacteria) (wastewater metagenome)
  ORGANISM  Leptonema sp. (in: bacteria)
            Bacteria; Pseudomonadati; Spirochaetota; Spirochaetia;
            Leptospirales; Leptospiraceae; Leptonema.
REFERENCE   1  (bases 1 to 7677)
  AUTHORS   Pabst,M., Grouzdev,D.S., Lawson,C.E., Kleikamp,H.B.C., de Ram,C.,
            Louwen,R., Lin,Y.M., Lucker,S., van Loosdrecht,M.C.M. and
            Laureni,M.
  TITLE     A general approach to explore prokaryotic protein glycosylation
            reveals the unique surface layer modulation of an anammox
            bacterium
  JOURNAL   ISME J (2021) In press
   PUBMED   34341504
  REMARK    Publication Status: Available-Online prior to print
REFERENCE   2  (bases 1 to 7677)
  AUTHORS   Pabst,M.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUL-2020) R&D, SciBear LLC, Tallin 13417, Estonia
COMMENT     The annotation was added by the NCBI Prokaryotic Genome
            Annotation Pipeline (PGAP). Information about PGAP can be found
            here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Date          :: MAR-2020
            Assembly Method        :: SPAdes v. 3.13.0
            Genome Representation  :: Full
            Expected Final Version :: Yes
            Genome Coverage        :: 7x
            Sequencing Technology  :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 07/28/2020 13:56:58
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline (PGAP)
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS-2+
            Annotation Software revision      :: 4.12
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 2,903
            CDSs (total)                      :: 2,868
            Genes (coding)                    :: 2,758
            CDSs (with protein)               :: 2,758
            Genes (RNA)                       :: 35
            rRNAs                             :: 1 (5S)
            complete rRNAs                    :: 1 (5S)
            tRNAs                             :: 32
            ncRNAs                            :: 2
            Pseudo Genes (total)              :: 110
            CDSs (without protein)            :: 110
            Pseudo Genes (ambiguous residues) :: 84 of 110
            Pseudo Genes (frameshifted)       :: 26 of 110
            Pseudo Genes (incomplete)         :: 8 of 110
            Pseudo Genes (internal stop)      :: 9 of 110
            Pseudo Genes (multiple problems)  :: 16 of 110
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..7677
                     /organism="Leptonema sp. (in: bacteria)"
                     /mol_type="genomic DNA"
                     /submitter_seqid="NODE_5128_length_7677_cov_3.881265"
                     /isolate="BROCD019"
                     /isolation_source="granules from the full-scale anammox
                     reactor of Dokhaven-Sluisjesdijk wastewater treatment
                     plant, further enriched at TU Delft in a chemostat over
                     two months prior to sampling"
                     /db_xref="taxon:2046886"
                     /environmental_sample
                     /geo_loc_name="Netherlands: Rotterdam"
                     /lat_lon="51.5521 N 4.2845 E"
                     /collection_date="May-2018"
                     /metagenome_source="wastewater metagenome"
                     /note="metagenomic"
     gene            134..1759
                     /locus_tag="H3C43_03615"
     CDS             134..1759
                     /locus_tag="H3C43_03615"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_002775249.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="aldehyde dehydrogenase family protein"
                     /protein_id="MBW7857390.1"
                     /translation="MSEDTSYSSLLDLFPESLPDNLSLPVLFPLEQKRYLIGGYIKEW
                     QGETIDVFSPVCFNKNGNLQPIRLGSVPAMDEATALEALNSAVQAWNNGNGVWPRMRS
                     EERIEAVLKFTRKMEELKDEIVSWIVFEIGKSKEGAEIEFDRTTEYIRDTLNAVKEGA
                     RSTARFTLEQGILAQIRRVPLGVVLCMGPFNYPLNETFATFIPALLMGNTVLFKPPKN
                     GVLLYEPLLNAMTECFPSGVVNTVYGDGPKVIPPIMKSGKVDVLAFIGSSRVADQLKK
                     QHPHPHKLRSILGMGAKNAAILLPDVDIESVIDECVKGSLAFNGQRCTALKIFFVPEN
                     KEESFVKLFSEKMTLLKAGMPYNDSVDITPLPIDKKSEYLQSLIDDAVSKGATVIGGQ
                     HAGPLFLPTLIYGVKEGMQVYEVEQFGPIVPVVSYKDPKEALDWIVNSHYGQQVSLFG
                     NQPDQVAMFVDELVHQVCRVNLNSQCQRGPDMFPFTGRKGSAEGTLSVSDALRSFSIR
                     TLVAAKETDENLSLLRTITEERLSSFISTDFLF"
     gene            1803..2852
                     /locus_tag="H3C43_03620"
     CDS             1803..2852
                     /locus_tag="H3C43_03620"
                     /inference="COORDINATES: protein
                     motif:HMM:NF012318.1,HMM:NF024578.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="trypsin-like peptidase domain-containing
                     protein"
                     /protein_id="MBW7857391.1"
                     /translation="MKNLTVALVLVSVLISLPVYSEPQSQARELQNTFHEIYENYQDS
                     VVYIATERTVKVAEDPFMQFFGRQQGPMSQRQQSMGTGFILSVDGYVCTNHHVVAGAE
                     KMMVRIDGKEYSAKLIGSDAVTDIALLKINNVSNLKPVKLGDSSGVQVGDWAIAIGNP
                     FGLDRTFTVGVISAVARKGVDDLGMEHIQTDASINPGNSGGPLINLDSEVIGMNRMIF
                     SQTGGSLGIGFAIPINRVKDIVEQLRTKGKVQRGFIGIQITPLTPDLAKEVNVPVKEG
                     IWVASVFNQGPAGKAGIQPGDVIYTINGKTVTDPQEFIQIVTATSPGKTVKLGLYRGQ
                     RQISMVVQVGQRPDK"
     gene            2944..3258
                     /locus_tag="H3C43_03625"
     CDS             2944..3258
                     /locus_tag="H3C43_03625"
                     /inference="COORDINATES: protein motif:HMM:NF016060.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="GtrA family protein"
                     /protein_id="MBW7857392.1"
                     /translation="MAIIYGLTAIGVSYIISNGIGYTAGFVNSFLMNRFWTFRSRGQI
                     GRQLFWFVLIFGISYLFQLAALLIQTNLIGVPIWLAQLISMAVYTAVNFVLNKIVTFR
                     TT"
     gene            3271..4272
                     /locus_tag="H3C43_03630"
     CDS             3271..4272
                     /locus_tag="H3C43_03630"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_002599083.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="glycosyltransferase family 2 protein"
                     /protein_id="MBW7857393.1"
                     /translation="MFTEKVSVIVPCFNEEEVIHETYNRLSSVMKTSRLTNHELIFVN
                     DGSSDNTLSILRQISKTDKSVIVVSFSRNFGHQPAVTAGLHQCSGDVAVIIDADLQDP
                     PELIPDMISKMEQEQASVVYAVRDERKGESFFKRFTAFIFYRLINGLSEVPLPMNTGD
                     FRLIDRKVIDAFCKLDEKNKYIRGLISWIGFKQVPITYIREPRFAGETKYPFAKMLKF
                     ATNALLYFTRKPLKIAMSLGFVSVVIGLLLTVYAVASKWLQPETTVSGWASTIIAIVF
                     LGGVQLLTIGVIGEYIGSIFDEIKNRPQYIIDEVIQARPQSNRSVEKKAAPKRVGSV"
     gene            4269..6221
                     /locus_tag="H3C43_03635"
     CDS             4269..6221
                     /locus_tag="H3C43_03635"
                     /inference="COORDINATES: ab initio
                     prediction:GeneMarkS-2+"
                     /note="Derived by automated computational analysis using
                     gene prediction method: GeneMarkS-2+."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="MBW7857394.1"
                     /translation="MNQRIKSISFILFWSAVVVIGTWVVWWQYQTLYQVFLSACVNKD
                     TLVWDANIRFIQAIDFVDDFKSGIYLPGISSLIASPTWPPLRTIFSILLIWWHGSPDP
                     VIDILPSVVFLFGVVWLYFGFVVWYTKTRILDITIRAKAPNRVLLYSILTTLFFSLVF
                     FTTLFLILGVFDLPAYIFSSMLEIQGMFFFTWNGWSIYSILKSKRTVSKFVIWSFFLS
                     GVGLYLTKYPYGILTGLSLIAICMLKSPVDFWNASRKVLIERYKGWHLIPLVLLILAL
                     VIVVLGPHLGTELVNTKAIKRFLYIALLLVFIDFNHYLYKRRPNYFSTELKIFYLYFI
                     LPFAIVLLSHPDRFGSLIQAQTDTVPGGSRFYPVALFAEYFMSVGPLIVIVLGGILIL
                     LVSIFLQKKSWHSLLTGEQSIEWLMTIFVWCNIFIMQFMTSNHQSRYLLQIFPLFLFF
                     HSSMFLYLFRSKNKSNGLTRLNWPSFVLPVLIFVSLPFFAIKPLFTKDRLQPVNICFA
                     ATDPTPVQEARELAASIPKSSRAIVVNDCHEVTAPLFARAQATEIDLFLRYRTWGSGI
                     IRNDSKYRYKTWQDSKLNFNEVIHVSYECGDNESTSNQKLIDRANQVGANLKLEAVSM
                     PKVNRQADQNQEKQNVNICLYRYRIE"
     gene            complement(6245..7504)
                     /gene="serS"
                     /locus_tag="H3C43_03640"
                     /pseudo
     CDS             complement(6245..7504)
                     /gene="serS"
                     /locus_tag="H3C43_03640"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_002972706.1"
                     /note="internal stop; Derived by automated computational
                     analysis using gene prediction method: Protein Homology."
/pseudo /codon_start=1 /transl_table=11 /product="serine--tRNA ligase" BASE COUNT 2135 a 1385 c 1684 g 2473 t ORIGIN 1 aaaaagtata atataagggt gtgattcttc tttaattaaa tttacttata cctatgagac 61 gaatgttgtt ttggttattg ttaccgtttg ttgctttact tgacactaca ttggtcgaac 121 aaaaatctta agtatgagcg aagacacttc ttattcaagt cttcttgatt tatttcctga 181 atcgctgcca gataatcttt cgttaccggt tctgtttcca ttagagcaga agcggtattt 241 aattggtggt tatattaaag aatggcaagg cgaaacgatc gatgtttttt cgccggtttg 301 ttttaataaa aatggcaatc tacagccgat tcggttaggt tctgtacctg caatggatga 361 ggcaactgcc ttagaagcat tgaatagcgc agtgcaagct tggaataacg gcaacggtgt 421 ttggcctcga atgcgttcag aagagcgaat tgaggcagtc ttaaagttta cccgtaaaat 481 ggaagagcta aaagacgaaa tcgtaagttg gattgtattt gaaattggta aatcgaaaga 541 aggtgctgaa attgaatttg atcgaactac cgaatatatt cgagatactt taaatgcggt 601 taaagaaggt gctcgctcaa ctgcacggtt tactttagag caaggtattt tagcacagat 661 tcgacgagtt ccgttaggtg tggtgctttg catggggccg tttaactacc cgttaaacga 721 aacttttgct acttttatac cagctttatt aatgggtaat accgttctat ttaaaccgcc 781 taaaaatgga gtgctgttat acgaaccctt gttaaatgca atgactgagt gctttccgtc 841 tggtgtggta aacactgtgt atggcgacgg gccaaaggtg ataccgccga ttatgaagtc 901 gggcaaggtt gatgtgcttg cctttatcgg ttcgagtcgg gtagccgatc agcttaaaaa 961 acaacatccg caccctcata agctacgatc aattttaggt atgggtgcaa aaaatgcagc 1021 gattctttta cccgacgttg acattgaatc ggtaatcgat gaatgcgtta aaggctcgct 1081 tgcttttaac ggtcagcgtt gcaccgcttt aaagatattt tttgtgcctg aaaacaaaga 1141 agaaagtttc gtaaaattat ttagtgaaaa aatgacactg cttaaagccg gtatgccgta 1201 taatgattct gtagatataa caccattgcc aattgataaa aaatctgaat acttacaaag 1261 cttaattgat gatgccgttt caaaaggcgc aaccgtaatt ggaggccaac atgccggccc 1321 tttatttttg ccaactttga tttacggtgt aaaagaagga atgcaagttt acgaagtaga 1381 acagttcggc ccgattgtgc cggtagtctc gtataaagac ccaaaagagg cattagactg 1441 gatcgtaaat tctcattatg gtcagcaggt ctcgcttttt ggaaaccaac ccgatcaggt 1501 ggcaatgttt gtcgatgagt tggtgcatca ggtttgccga gtgaacttaa acagtcaatg 1561 ccaacgaggc cccgatatgt ttccgtttac tggcagaaaa 
ggctcggctg agggtacctt 1621 atcggtttca gatgctttac ggtcgtttag tattcgaaca ttagttgcag caaaagagac 1681 cgatgaaaac ttatcattac ttcgcaccat taccgaagag agattatctt cttttatttc 1741 gactgatttt cttttttgat cggtgttgat ttttttaagc aatttccgaa tatgaacctg 1801 ttatgaaaaa tttaaccgtt gcgcttgttt tagtatcagt gctgattagt ttgccggttt 1861 atagtgaacc tcaaagtcaa gcacgtgaat tacaaaatac atttcatgaa atctatgaaa 1921 actatcaaga ttcagtggtg tatatcgcta ccgaacgaac cgtaaaagta gccgaagatc 1981 cgtttatgca gtttttcggg cgccaacaag ggccaatgtc gcaacgtcag caaagtatgg 2041 gcaccggctt tattttaagc gtcgatggtt atgtttgcac caatcatcat gttgtggccg 2101 gtgctgaaaa gatgatggta agaattgatg gcaaagagta ttcagcaaag ttaatcggct 2161 cagatgcagt gaccgatatt gctttactta aaattaacaa tgttagcaat ttgaagccgg 2221 ttaaattagg cgactcatct ggtgttcagg tgggtgattg ggcaatagca attggtaatc 2281 cgtttggatt agatcgaact tttacagtgg gtgtaattag tgcagtggct cgtaaaggtg 2341 tagatgattt aggaatggag catattcaaa ccgatgcatc aattaacccc ggcaattcgg 2401 gtgggccttt aatcaactta gatagtgaag ttataggcat gaatcgaatg attttttcgc 2461 aaactggcgg cagcttaggt attggttttg cgattccgat taaccgagta aaagatattg 2521 ttgaacagct tcgaaccaaa ggtaaggttc aacgaggctt tatcggaatt caaattacac 2581 cacttacccc agatttagct aaagaagtga atgtacctgt taaagaaggt atttgggttg 2641 cctctgtgtt taatcaaggg ccggcaggca aagcaggcat tcaaccgggc gatgtaattt 2701 acacaattaa cggaaaaacg gttaccgacc cccaagagtt tattcagatt gtaacagcga 2761 cttcaccggg taaaactgta aaattaggtt tgtatcgagg ccaacgccaa atttcaatgg 2821 ttgttcaagt tgggcaacga ccagataaat aaactaaatt cgttctttga tcggtcgctt 2881 ggcacaattc ttggccaggc gattcgattc ggcatagtcg gtctaattaa cacagctatt 2941 actttggcga ttatttatgg cttaacagcg attggtgttt cttatataat ttcaaatggt 3001 attggttata cggcgggctt tgttaatagt tttttgatga accgattttg gacatttcga 3061 agtcgtggtc aaattggccg gcaattgttt tggttcgttt tgattttcgg tattagctat 3121 ttatttcaat tggcggcact tttaattcaa acaaatctga ttggtgtacc gatatggctg 3181 gctcaactaa ttagtatggc tgtgtatact gcggtaaatt tcgttctgaa taagattgtt 3241 acttttagaa cgacttgaaa actggccccg atgtttactg aaaaagtatc 
tgttattgta 3301 ccttgcttta atgaagaaga agtgattcac gaaacctata atcggttgtc gtcggtaatg 3361 aaaacaagcc ggctaaccaa ccacgagctg attttcgtaa acgatggtag tagtgataat 3421 acactttcga ttttacgcca aatttctaaa accgacaagt cggtaattgt ggtttcgttt 3481 tcaagaaatt tcggccatca accggcggta actgccggcc ttcatcaatg cagtggcgac 3541 gtagctgtta ttattgatgc cgacttacaa gatccgccag agctgattcc agatatgatt 3601 tcgaaaatgg aacaagagca agctagcgta gtgtatgcgg tgcgtgatga acgaaaaggc 3661 gaaagttttt ttaagcgatt tactgctttt attttttacc ggttaattaa cggtctgtca 3721 gaggtgccgc taccaatgaa caccggtgac tttcgactta ttgatcgaaa agtaattgat 3781 gcgttttgca agcttgacga aaagaataaa tatattcgcg gtttaatcag ttggattggc 3841 tttaaacagg tgccaataac atatattcgt gagccgcgat ttgccggtga aactaagtac 3901 ccgtttgcaa agatgttgaa gtttgccacg aacgcattat tgtattttac tcgaaagcca 3961 ttaaagattg caatgagttt aggttttgta agtgttgtaa tcggtttgct gcttaccgtg 4021 tatgctgtgg catcaaaatg gctacagccc gaaactactg tctctggatg ggcttcgaca 4081 attattgcaa ttgtgttttt gggtggggtt cagttgttaa cgattggcgt tatcggtgaa 4141 tatattggct cgatttttga tgagatcaag aaccggccgc aatacattat tgatgaggta 4201 attcaagcaa ggccacagtc gaatcgatcg gtcgaaaaaa aagctgcacc taaaagagtc 4261 ggttcagtat gaatcaaaga ataaagtcaa tttcttttat tttgttttgg tcggccgttg 4321 ttgtaatcgg cacttgggta gtttggtggc aataccaaac gctgtatcag gtgtttttat 4381 cggcttgcgt taataaagac accttagttt gggatgccaa tattcggttt attcaggcaa 4441 ttgattttgt tgatgatttt aaaagtggta tttatttgcc gggtatatca agtttaattg 4501 cctcgcccac ttggccgcca cttcgcacga ttttcagtat tcttttaatt tggtggcatg 4561 gttcacccga cccggtgatt gatattttgc cgtcagtagt gtttttgttt ggtgtggttt 4621 ggctgtattt tggttttgtg gtgtggtata caaaaacacg aattcttgat attacaattc 4681 gagcgaaggc acccaatcgt gtacttttat attcgatact aacgacttta ttttttagct 4741 tagttttttt tactactttg tttttgattt taggcgtgtt cgatttgccg gcctatattt 4801 ttagttcgat gcttgaaatt caaggcatgt tcttttttac atggaatggc tggtcgattt 4861 attcgatttt aaaaagtaaa cgaaccgttt caaagttcgt aatttggtcg ttttttttgt 4921 cgggggttgg tctctattta accaagtatc cgtatggtat tttaactgga ctttcactaa 
4981 ttgctatttg catgttaaag tcgccggttg atttttggaa tgccagtcga aaagttctaa 5041 tcgaacgata caaaggttgg catttaattc cgctagtgct tttgattttg gcactagtca 5101 ttgtggtttt aggcccgcat ttaggaactg agcttgttaa tacaaaagcg attaaacgat 5161 ttttgtatat tgctcttttg ttagtattta ttgattttaa tcattatttg tataagcgac 5221 gaccgaacta tttctcaacc gaattaaaga ttttttattt gtattttatt ttgccgtttg 5281 ctatcgttct tttatcgcac cctgatcgat tcggttcttt aattcaagct caaactgaca 5341 cagtacccgg tggcagccgt ttttaccctg tggcactttt tgccgaatac tttatgtcgg 5401 tgggcccttt gatcgttatc gttttgggcg gaattcttat tttacttgtt tcgatatttt 5461 tacaaaaaaa aagttggcat tcgttattga ccggtgaaca atcgattgaa tggctaatga 5521 cgatttttgt ttggtgcaat atctttatta tgcagtttat gacatcgaat catcaatcaa 5581 ggtacttact tcaaattttt ccattgtttt tatttttcca ttcgagtatg tttctatatc 5641 tatttcgaag taaaaacaaa tcaaatgggt taactcgatt aaactggccg agttttgtat 5701 tgccggtttt gatatttgta agtcttcctt tttttgcaat caaacctctt tttacgaaag 5761 atcgattaca accagtcaat atctgctttg cagctactga ccctacgccg gttcaagagg 5821 caagagagtt agcagcatcg attccaaaat caagtagagc aatagtcgtg aatgattgcc 5881 acgaggtaac tgcgccgtta ttcgcccgag ctcaggcgac tgagattgat ttgtttttac 5941 ggtatagaac ttggggcagt ggcataattc gaaacgattc gaagtatcgg tataaaactt 6001 ggcaagactc aaaattaaac tttaacgaag taattcatgt tagctatgag tgtggcgata 6061 acgaatcgac ttcaaatcaa aagctaattg atcgcgccaa tcaagttggt gcaaacctaa 6121 agcttgaggc agtatcgatg ccaaaggtta accgacaagc cgatcaaaac caagagaaac 6181 aaaatgtaaa tatttgtctt tatcggtatc gaattgaata gctgaaaaat tcgaattcga 6241 atcgttatct ttctaagtag ggttttaata cttcaggaat ttcaaagcta ccatctcgct 6301 tttgaaagtt ttcgataatg gcaatcatgg ttcggccggc agctaagcct gatccgttta 6361 aggtatgggc aaagatatta ccttgttggg ttcgaattct gattttacat cgacgtgcct 6421 gttaatcgcc gcagttcgat accgatgaaa tttcaagaca acgattgagg ccgggcatcc 6481 atacctctaa gtcgtaagtt ttgcgtgctg ttgcacccat atcgccagac gataaaagca 6541 ccactcgata cggtagttta agagcttgca atactgactc ggcatgagag agcattgatt 6601 ggtgttcgtc aatcgatgtt tcagggtgaa caattcgaac tagttcaact ttttgaaatt 6661 
ggtgtactcg aactaaacct cttgtatctt taccggctgc accagcttca cgacgaaagc 6721 aagatgaggc cgcagtgatg gcaatcggta aatcggtatc gttaataatt tcatcgtaat 6781 acaagttaac caaaggcacc tcagcggttg gaatcaaaga caatccgtct cgttctaact 6841 gataataatc gcctttaaac ttcgggtatt gaccggtggt ggtcatacat tggtcgttaa 6901 ctaaaattgg tacccaagtt tcagtgtaac cgttttggct agtatgcaaa tcaagcataa 6961 agctaagaat agctcgctct aatcgagcgc ccaaacctcg ataagaataa aatcttgaac 7021 cggctaactt agtaccgcga tcaaagtcga agataccgat tcgttcacct aattcatagt 7081 gaggaaacgg tgtaaaatca aaagtaggaa tttcaccaac ttgacgaagc acgatattgg 7141 cagcttcatc tttgccaacc ggcacatcgg aatctaccca gtttggtagt gcaatagcga 7201 tttcgtcttg atcggctagg gtttgctcaa gcttcttttc gagttcgcta atttgattcg 7261 caatttgacg cacctgttct tttttctctt ttacatcggc gttctttttt tgagcaatta 7321 gttcgccgat ttgctgactt gattggttac gctcgtttcg aagcaggtcg atttgtgccc 7381 gtaattgacg cacctgttcg actaaatcaa ccaaccggtt aacatcgaca ccgtcaacat 7441 gacggacttt taacatttgt tctaactttt caggattttg ctgaactaag cgaagatcga 7501 gcatagctcg catagaatga ttgccgactt agctgtaaag taatttctgg aagtgtggac 7561 ttcattgctc aagattccgg aattcgttta gaccaatttt tgtatcaatc tttggtcgat 7621 cactttaacg aagaaaggta tagccggtct caaattcaaa aatggattaa aaccggt //