LOCUS       AZHW01000285            6478 bp    DNA     linear   BCT 22-JAN-2014
DEFINITION  Candidatus Entotheonella sp. TSY1 TSY1_contig00239, whole genome
            shotgun sequence.
ACCESSION   AZHW01000285 AZHW01000000
VERSION     AZHW01000285.1
DBLINK      BioProject: PRJNA230050
            BioSample: SAMN02420412
KEYWORDS    WGS.
SOURCE      Candidatus Entotheonella sp. TSY1 (sponge metagenome)
  ORGANISM  Candidatus Entotheonella sp. TSY1
            Bacteria; Nitrospinae/Tectomicrobia group; Candidatus
            Tectomicrobia; Candidatus Entotheonella.
REFERENCE   1  (bases 1 to 6478)
  AUTHORS   Wilson,M.C., Mori,T., Ruckert,C., Uria,A.R., Helf,M.J., Takada,K.,
            Gernert,C., Steffens,U.A., Heycke,N., Schmitt,S., Rinke,C.,
            Helfrich,E.J., Brachmann,A.O., Gurgui,C., Wakimoto,T., Kracht,M.,
            Crusemann,M., Hentschel,U., Abe,I., Matsunaga,S., Kalinowski,J.,
            Takeyama,H. and Piel,J.
  TITLE     An environmental bacterial taxon with a large and distinct
            metabolic repertoire
  JOURNAL   Nature 506 (7486), 58-62 (2014)
   PUBMED   24476823
REFERENCE   2  (bases 1 to 6478)
  AUTHORS   Wilson,M.C., Mori,T., Ruckert,C., Uria,A.R., Helf,M.J., Takada,K.,
            Gernert,C., Steffens,U., Heycke,N., Schmitt,S., Rinke,C.,
            Helfrich,E.J., Brachmann,A.O., Gurgui,C., Wakimoto,T., Kracht,M.,
            Crusemann,M., Hentschel,U., Abe,I., Matsunaga,S., Kalinowski,J.,
            Takeyama,H. and Piel,J.
  TITLE     Direct Submission
  JOURNAL   Submitted (03-DEC-2013) CeBiTec, Bielefeld University,
            Universitaetsstr. 27, Bielefeld, NRW 33615, Germany
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: http://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            Bacteria available from Piel Lab, ETH Zurich.
            Source DNA available from Piel Lab, ETH Zurich.
            ##Genome-Annotation-Data-START##
            Annotation Provider :: NCBI
            Annotation Date :: 01/21/2014 18:50:29
            Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline
            Annotation Method :: Best-placed reference protein set; GeneMarkS+
            Annotation Software revision :: 2.3 (rev. 423251)
            Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes :: 8,506
            CDS :: 8,438
            Pseudo Genes :: 2
            rRNAs :: 5 (5S, 16S, 23S)
            tRNAs :: 60
            ncRNA :: 1
            Frameshifted Genes :: 1
            ##Genome-Annotation-Data-END##
            ##Genome-Assembly-Data-START##
            Assembly Method :: Newbler v. 2.6
            Assembly Name :: v3
            Genome Coverage :: 60.3x chromosome; 278.5x plasmid
            Sequencing Technology :: Sanger dideoxy sequencing; 454; Illumina;
            Pacific Biosciences
            ##Genome-Assembly-Data-END##
FEATURES             Location/Qualifiers
     source          1..6478
                     /organism="Candidatus Entotheonella sp. TSY1"
                     /mol_type="genomic DNA"
                     /isolation_source="disrupted sponge"
                     /host="Theonella swinhoei"
                     /db_xref="taxon:1429438"
                     /environmental_sample
                     /geo_loc_name="Japan"
                     /note="metagenomic; derived from metagenome: sponge
                     metagenome"
     gene            118..1089
                     /locus_tag="ETSY1_09335"
     CDS             118..1089
                     /locus_tag="ETSY1_09335"
                     /inference="COORDINATES: ab initio prediction:GeneMarkS+"
                     /note="Derived by automated computational analysis using
                     gene prediction method: GeneMarkS+."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="ETX00951.1"
                     /db_xref="GI:575415803"
                     /translation="MKSRIGKRFGLAVWAALFSILAGGSLFWLAIAQDTLTEDITAQI
                     EAAAYDPDMAAGVVRDAVANTPHQAAAIVRSAVQAAPHLSTTLIAAAVVPARDQLAPM
                     QQAATDAAPKQANVIAIAAAIASKVVAHPEQDTAIVSAAIRETPGQSKAVVTAAATVC
                     RFVEGCSPISPMPLVQAAVIADLQATEGIVEATVSLFPASAGEIIAAATQSSLSARTA
                     QQGRVTAVPQLAPVANPAPAVPGLQEPVIVADNSPEPGAADVVVPPAPVPTEFPDPTS
                     SGTGDGTGTGDDEDDDDDDDEGTGTGTGTGTPVIPPLPPVFPASPFF"
     gene            complement(1354..1959)
                     /locus_tag="ETSY1_09340"
     CDS             complement(1354..1959)
                     /locus_tag="ETSY1_09340"
                     /inference="COORDINATES: ab initio prediction:GeneMarkS+"
                     /note="Derived by automated computational analysis using
                     gene prediction method: GeneMarkS+."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="ETX00952.1"
                     /db_xref="GI:575415804"
                     /translation="MDSSVHLKLAIVIPAYNASQTIERVFERIPSAVQPQIVNYIVVN
                     DGSTDDTAAVLTRLQQSYPNVIVLHHDVNQGYGAAEKTLLHCAVETEADVVVLLHADG
                     QYAPEKLPKLLEPFTAEAADIVQGSRMMEGRAALRGGMPRYKYVANRGLTAIENMAFG
                     LRWRNITAATCFTLDAPCKRFHLRNLATPFALTKKCSSWPK"
     gene            complement(1938..3026)
                     /locus_tag="ETSY1_09345"
     CDS             complement(1938..3026)
                     /locus_tag="ETSY1_09345"
                     /inference="COORDINATES: ab initio prediction:GeneMarkS+"
                     /note="Derived by automated computational analysis using
                     gene prediction method: GeneMarkS+."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="ETX00953.1"
                     /db_xref="GI:575415805"
                     /translation="MKILLLGGGGFIGCHITQKLLQSTDHLVNCYDLFDARLQDSLGH
                     SRFNYIHGDIRHDHTRVEKLIHDADVVVDLIAYANPSLYVSIPLDVFNLNFTENLKIT
                     EYCVTHQKRLIQFSTCEVYGKTVASLLQNQLPDHDNPAHAVFQEDNTAFILGPVNKHR
                     WIYSCAKQLLERIIHAYGLEDRLNYTIIRPFNFIGPRIDFLPSEQEGNPRVFSHFLDA
                     LKTGEPMKLINGGHQRRTYTYIDDAVDCIVRIVENPNQVCDKEIFNIGSPDNEISIRD
                     LAFKMRCIYKRRWWRGQTELPEPIEVSGETFYGEGYDDSDRRIPDITKAQQLLGWQPR
                     YNLDQTLEYSMGYWFNGEEGTSWIPQFT"
     gene            3452..3874
                     /locus_tag="ETSY1_09350"
     CDS             3452..3874
                     /locus_tag="ETSY1_09350"
                     /inference="COORDINATES: ab initio prediction:GeneMarkS+"
                     /note="Derived by automated computational analysis using
                     gene prediction method: GeneMarkS+."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="ETX00954.1"
                     /db_xref="GI:575415806"
                     /translation="MGQNLYGGGIRDDYITCLECGKALQLLSNRHLALHGLTPNTYRQ
                     KWGISPHILLTSHRLAQRRCQLANSLEKHELTLPGDDANMDDFFHELAEALTIIQGQV
                     HLAQKSLSSYMVPQEYLEAIQHTVMRIGQRMQGGSLPS"
     gene            complement(3971..4759)
                     /locus_tag="ETSY1_09355"
     CDS             complement(3971..4759)
                     /locus_tag="ETSY1_09355"
                     /inference="COORDINATES: ab initio prediction:GeneMarkS+"
                     /note="Derived by automated computational analysis using
                     gene prediction method: GeneMarkS+."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="ETX00955.1"
                     /db_xref="GI:575415807"
                     /translation="MQLAGKVAIVGGASRGIGRDMAVALAAAGAKVAVAARSEVEPDP
                     RLPGTIHQTVADIQAAGGDAIAVKADVSDEEQINAMVNTTLETYGRLDILVNNAAVLV
                     PRGIMDLPTRHIDLHNKVNIKGPILCIRAALPTMLEQQPGWVINVSSRAGVFPGPGPY
                     GDDVPKTRAFMYAATKAAVERLTQALAVEYQDQGISFNCLSPTGRIRTPGNVFGMTKP
                     GETPEPFEEAITMGKSAVFICSQDPKTFTGNLLFDDATVEQYHL"
     gene            4754..5143
                     /locus_tag="ETSY1_09360"
     CDS             4754..5143
                     /locus_tag="ETSY1_09360"
                     /inference="COORDINATES: ab initio prediction:GeneMarkS+"
                     /note="Derived by automated computational analysis using
                     gene prediction method: GeneMarkS+."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="ETX00956.1"
                     /db_xref="GI:575415808"
                     /translation="MHVVFPVDWSQNLTSSIALTFELGFGANVERLQNTCIDGSDHIH
                     STVQIGFVNTCFPCIRKAAFHSRLTVAHHGNRKAHEDLFTFTQIVNGVSIAVKLPEIS
                     SLNHRLSPCLIWMMPHLLARRGLDVPL"
     gene            complement(5234..5539)
                     /locus_tag="ETSY1_09365"
     CDS             complement(5234..5539)
                     /locus_tag="ETSY1_09365"
                     /inference="COORDINATES: ab initio prediction:GeneMarkS+"
                     /note="Derived by automated computational analysis using
                     gene prediction method: GeneMarkS+."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="ETX00957.1"
                     /db_xref="GI:575415809"
                     /translation="MYLGLAAGCANKSALPPQPALAEPYAILKFSAAMQLIALDQQAI
                     DTVVAIRTLRVRPGQHTLRFLHLNNGPEGSPQHAGQRTDPFVLEAFEGLIYEFEAKT"
     gene            complement(5860..6171)
                     /locus_tag="ETSY1_09370"
     CDS             complement(5860..6171)
                     /locus_tag="ETSY1_09370"
                     /inference="COORDINATES: ab initio prediction:GeneMarkS+"
                     /note="Derived by automated computational analysis using
                     gene prediction method: GeneMarkS+."
/codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="ETX00958.1" /db_xref="GI:575415810" /translation="MTNQHKLLDAAIKKVGNRYLATMLVAKRIRQLHHGAPAYVKRSE GESHFTVAMREIAEGLVILDPPVVIDSFAAAAASLRPEDALSQEVAQAPESAETVVES S" BASE COUNT 1493 a 1653 c 1760 g 1572 t ORIGIN 1 cgatgtccag atttcggggt acattataga ccacacaacc gatttcacaa taattgcgtt 61 taatattttt cactatatca cgcactcgcg atttcgtgcc ggacaggagg tgcaagcatg 121 aaaagccgga ttgggaaacg ctttgggctc gctgtatggg ctgccctatt cagcatattg 181 gccggaggta gcttgttttg gttggccatt gcacaagata cgctgactga agacatcacc 241 gctcaaatcg aagcggcggc ctatgaccca gatatggcgg ctggtgtagt acgcgatgcg 301 gtggcgaaca cgccgcatca agccgcagcc attgtgaggt ctgcggtcca agcggcacct 361 catctgagca caacattgat cgcggctgcg gttgtgccgg ccagggatca acttgcgccg 421 atgcagcaag cggcaacaga tgctgcgcca aaacaggcga atgtgattgc cattgccgca 481 gcgattgcat caaaggttgt cgcacatccc gagcaggata cggccattgt ctcggcagcc 541 attcgtgaga cgcccggcca aagcaaagcg gtcgtgaccg ctgccgcgac tgtgtgccgg 601 tttgtagagg gttgttcgcc gattagtcca atgccactcg tgcaagccgc agtgatcgcc 661 gatcttcagg ctaccgaagg gattgtcgag gcgacggtct ccctattccc agcatcggct 721 ggggagatta tcgctgcggc gacgcaaagc tctctcagcg ccagaaccgc tcagcaaggg 781 cgagtgacgg ctgtgccgca attggctccc gttgcaaacc cggcaccagc cgttcctggg 841 cttcaggaac cggtgatcgt ggcggacaac tctcccgaac ctggcgctgc tgatgttgtg 901 gtgccgccag cgccggtgcc gactgagttc cccgacccga cgagcagtgg cacgggtgac 961 ggtactggca cgggcgatga cgaagatgat gacgatgacg atgacgaagg tacagggacg 1021 gggacgggta ctgggacgcc cgtcattccg cctctaccgc ctgtgtttcc tgcgagccca 1081 ttcttttaga ccattgaggc tgtttgcaca tctgaattaa cacgcaaaac ccctgcgacg 1141 ctgttcgtgt tgcaggggtt ttgttttgtg ggataacatt tttgctggca agccggctcg 1201 cgcgaagtat ttatatttag agcgcgtggt agtgcccttt tttataggcc cacactaaag 1261 agaggacatg aagaccatag cgaatgggtt tgagatggga aatttcatcc ccatagtgtg 1321 tcgggatagg ccgctgaaca atcttgagtc ccctcacttt ggccatgatg agcatttctt 1381 ggtcaaagca aaaggagtcg ctaagtttct caaatggaat ctcttgcaag gcgcgtctag 1441 cgtaaagcat gtagccgctg 
tgatattccg ccatcggagc ccaaacgcca tattttcaat 1501 cgcggtgaga ccccgattgg ccacatattt atagcgtggc atcccccctc gaagggcagc 1561 ccgaccctcc atcatgcgcg agccttgaac gatatccgca gcttcggcag tgaatggttc 1621 aagtagtttg ggaagttttt ccggtgcata ttgaccatcg gcgtgcaata acacgaccac 1681 gtctgcctcg gtctcaactg cgcagtgcag cagggttttt tcagccgcac catatccctg 1741 attgacatca tgatgtagca caatcacatt gggatagctt tgttgcaacc gggttaaaac 1801 cgctgcggtg tcgtccgtac tgccatcgtt aaccacgata tagttcacaa tctgtggttg 1861 tacggcagac ggtatacgtt cgaaaacgcg ctcgatggtt tggctcgcat tataggcggg 1921 aatcacaatg gcaagtttca ggtgaactga ggaatccatg atgtcccttc ttctccgtta 1981 aaccaatagc ccatgctgta ctccagggtt tgatccaggt tgtagcgcgg ttgccagccg 2041 agaagctgct gcgctttcgt gatgtccgga atgcggcggt cggaatcatc gtagccttcg 2101 ccgtaaaacg tctctcccga tacttcgatc ggttcgggta gctcggtttg accacgccac 2161 caacggcgtt tgtagatgca ccgcatttta aacgccaagt cccggatgga aatctcgttg 2221 tctggcgagc cgatattgaa gatttcttta tcgcaaacct gattcggatt ttccacaatc 2281 cgcacaatac agtcgacggc atcgtcaata taggtatagg tgcggcgctg atgtccacca 2341 ttgatcaatt tcatgggttc accggtcttg agcgcgtcga gaaagtgtga aaacacacgc 2401 ggatttccct cttgctcgct ggggagaaaa tcaatgcggg ggccaataaa attgaagggg 2461 cgaatgatcg tatagttgag ccgatcttcg agaccgtaag cgtggataat gcgctctaga 2521 agctgtttcg cacagctgta gatccaacga tgcttattca ccggaccaag gataaaagcc 2581 gtgttgtctt cttgaaaaac cgcatgagct gggttatcgt gatccggtag ctgattttgg 2641 agcagactgg caactgtttt gccatagact tcacaagttg aaaactgaat aagccgcttt 2701 tggtgggtga cacaatattc tgtaattttc agattttccg taaagttcag attgaataca 2761 tccagaggga tggaaacata gagtgaggga ttggcatagg caatgagatc gacgaccaca 2821 tcggcatcgt gaatcagttt ttcaacacgt gtatggtcgt gtcgaatatc cccatgaata 2881 taattaaatc gagaatgtcc caggctgtcc tgtaacctgg catcaaacaa atcatagcaa 2941 ttaactaggt gatccgtaga ttgaagcaat ttttgcgtta tatgacagcc aataaaccct 3001 cctcctccca gcaatagtat tttcatgagt gcctcctata tgaattttta tgccgaacac 3061 tgaaacatta aaatctaacc taatattgaa tagttacttt tattctaata tggaaatttt 3121 gtcaatgttt tgtattggct tgataggtca 
aaggggacag atagtgaggt gaaaatttcc 3181 tcacttgttc cactgcatcc gtcagattga atgcagggac tggaatcgat gtgctacgtt 3241 tgttaatttg caaacagtgt gtcacaaaat ggagagttag ggaacatatt tattcagacg 3301 aatattttta aatttcgcag cggctgtacg agagccatct caacccctgg caagcaaatt 3361 ttcagttgga ttctagccgt gaccccttta cactttgagg tcatggtcga tcgtaaaaat 3421 ctactgggca tgatatgaga aagggaagag tgtggggcaa aacttgtatg gaggagggat 3481 tcgggatgat tacatcacct gtctcgaatg tggcaaagcc ttacaacttt tgtcgaatcg 3541 ccaccttgca ttacacggtc tgacaccgaa cacctaccgg cagaaatggg ggatctcgcc 3601 gcatatcctt ttgacttcgc accgcttggc ccagcgccgt tgccagctcg ccaacagttt 3661 agagaaacat gagctgaccc tgccgggcga cgacgccaat atggatgact ttttccacga 3721 acttgccgaa gccttgacga ttattcaagg gcaggtccat ctcgcccaga aaagcctctc 3781 gtcatatatg gtgcctcagg aatatctgga ggcgattcaa cataccgtga tgcggattgg 3841 gcaacgcatg cagggcgggt cattgccatc atgacgtcgc cccgtcagag ccgtatggtg 3901 caggtatgtg gccgatgctt gtgatccacc gttaacgcgt ttggcactgc gttgacaggc 3961 cggcgccggg ctagaggtga tattgctcaa ccgtggcatc gtcgaagagc agattgccgg 4021 taaaggtctt cggatcttgc gagcagataa acacggcact tttacccatg gtaatcgctt 4081 cctcgaaggg ttctggtgtc tcgccaggtt tggtcatgcc aaacacatta cctggcgtgc 4141 gaatgcgacc ggtgggcgat aagcaattga agctgatacc ctggtcttgg tattccacgg 4201 caagggcttg ggtgagccgc tcgaccgctg ctttggttgc cgcatacatg aaggctcgcg 4261 tcttgggtac atcgtctcca tatggacctg gccctggaaa cacgccagct cgcgaggaga 4321 cattgatcac ccatcctggc tgttgttcca acatggtggg cagagcggcg cgaatgcata 4381 aaatcgggcc tttaatattg actttgttgt gcaagtcaat gtgccgggtc ggtaagtcca 4441 taatgccacg gggaacgaga acggctgcat tgttgacgag aatgtccagc cgcccgtaag 4501 tctccagggt cgtattgacc atagcgttaa tttgttcttc gtcactgacg tctgccttca 4561 ccgcgatcgc gtctccgccc gcggcttgaa tatccgcaac cgtttgatgg atggtcccag 4621 gcaggcgcgg gtccggttcg acttccgatc gggcggccac ggcgaccttg gcacctgctg 4681 cggccaacgc caccgccata tctctgccaa taccccgtga ggcgccgccg acaatggcga 4741 ctttccctgc taattgcatg ttgtgtttcc cgttgattgg agccaaaatc tgacgtcatc 4801 catcgccttg acgtttgagt tgggctttgg agcgaatgtt 
gagcgattgc aaaatacctg 4861 catcgacggc agcgatcaca tccacagcac agtccagatc ggtttcgtca atacctgctt 4921 tccttgcatc cgcaaggcgg cgttccactc aaggctgaca gttgcgcacc atggcaaccg 4981 caaggcccat gaagatcttt tcacgttcac tcagattgtc aacggcgtga gcatcgcggt 5041 aaaactgccg gaaatcagct cgctcaatca tcggctttcc ccctgtttga tatggatgat 5101 gccgcactta ctggcaaggc gcggtttgga tgtgccgctg taaaatgtgg gctgccacgt 5161 gggtgcaact atgggttgcg cagtaatccg gtatcggctt gctgaaagcg accgccggtc 5221 gccagatcac ccctcaagtt ttggcctcaa attcatagat cagtccttcg aacgcttcca 5281 gcacaaacgg atctgtccgc tgccccgcgt gttgtgggct gccttcgggg ccattgttga 5341 gatgtaaaaa acgcagcgtg tgttggcctg ggcgtacacg caaggttcgg atcgcgacca 5401 ccgtgtcaat agcctgctga tctagggcaa tcagttgcat ggcggcagag aattttaaga 5461 tggcataagg ctcggccagc gcgggctgtg ggggcagggc agacttgtta gcacagcctg 5521 ccgcgagacc caggtacaaa ccaagcagac cccaccgaaa ccagtgccaa caccttggat 5581 aacatacagc agccacacat cgtcctccat ggacaattga tcatagagtg acattaggca 5641 tagagtaaca tcagattgaa ggaggcatag gcctcttaaa gccgcaagca gtatagcttc 5701 tcctccgcga agaagccacg agaatcacac ccgatggccg tacgacctgt ggcatggaag 5761 gccaacgtga ggagaagaag agccgctgca gcgccgtgga tgggccgaac cggtgacagc 5821 ctgtagatgg cgtggcggtc tgccaccggc gtgtcgtcgc tacgatgatt ctacaacagt 5881 ttccgcggat tccggggctt gggcaacttc ttgggacagg gcgtcttccg ggcgcagaga 5941 ggctgccgct gcggcaaagc tgtcaatgac aactggcggg tcgagaatga cgagcccttc 6001 cgcaatttcg cgcatcgcta cggtaaaatg actctcgcct tcgctgcgtt tcacgtaggc 6061 cggtgcgccg tgatggagct ggcgaatccg tttggccacg agcatggttg ccaagtaacg 6121 attgcccact tttttgattg ccgcatcgag cagtttgtgc tgattggtca aggtttgtgc 6181 ctcctggcag attgcacgct ggcgggaaag ctgtggattg aagttaacga aggattatag 6241 cacaatattt gcaagatgtc tttatcctca gacgatttcg ttgtcgtgat agcgcttcgg 6301 gacgtccctg caaataacta tcccaactgg ctggtctttt tgcccagggg ctgcgttgtc 6361 gcgttagtta catagcgttg ctatgcgcct gccgctccgt cttacctctg tgcaaaaatc 6421 cggcgccatt gtgggacact ttatttttgg aacgcccctt agagcaagtc cagcaaat //