LOCUS       JAGDOT010000144         4670 bp    DNA     linear   ENV 18-MAR-2022
DEFINITION  MAG: Chloroflexi bacterium isolate MT4_27 contig-80_8104, whole
            genome shotgun sequence.
ACCESSION   JAGDOT010000144 JAGDOT010000000
VERSION     JAGDOT010000144.1
DBLINK      BioProject: PRJNA692099
            BioSample: SAMN17525762
KEYWORDS    WGS; Metagenome Assembled Genome; MAG.
SOURCE      Chloroflexi bacterium (marine sediment metagenome)
  ORGANISM  Chloroflexi bacterium
            Bacteria; Chloroflexi.
REFERENCE   1  (bases 1 to 4670)
  AUTHORS   Liu,R., Wei,X., Wang,L., Cao,J., Song,W., Wu,J., Thomas,T., Jin,T.,
            Wang,Z., Wei,W., Wei,Y., Zhai,H., Yao,C., Shen,Z. and Fang,J.
  TITLE     Novel Chloroflexi Genomes From The Deepest Ocean Reveal Metabolic
            Strategies For The Adaptation To Deep-Sea Habitats
  JOURNAL   Res Sq (2021) In press
   REMARK   DOI: 10.21203/rs.3.rs-254541/v2
REFERENCE   2  (bases 1 to 4670)
  AUTHORS   Liu,R. and Wei,X.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JAN-2021) College of Marine Sciences, Shanghai Ocean
            University, No. 999, Huchenghuan Road, Shanghai 201306, China
COMMENT     The annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (PGAP). Information about PGAP can be found here:
            https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method        :: IDBA_UD v. 1.1.3
            Genome Representation  :: Full
            Expected Final Version :: Yes
            Genome Coverage        :: 40.48x
            Sequencing Technology  :: BGIseq 500
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 03/16/2021 11:59:13
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline (PGAP)
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS-2+
            Annotation Software revision      :: 5.1
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 2,021
            CDSs (total)                      :: 1,990
            Genes (coding)                    :: 1,978
            CDSs (with protein)               :: 1,978
            Genes (RNA)                       :: 31
            rRNAs                             :: 1 (16S)
            partial rRNAs                     :: 1 (16S)
            tRNAs                             :: 27
            ncRNAs                            :: 3
            Pseudo Genes (total)              :: 12
            CDSs (without protein)            :: 12
            Pseudo Genes (ambiguous residues) :: 0 of 12
            Pseudo Genes (frameshifted)       :: 3 of 12
            Pseudo Genes (incomplete)         :: 9 of 12
            Pseudo Genes (internal stop)      :: 0 of 12
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..4670
                     /organism="Chloroflexi bacterium"
                     /mol_type="genomic DNA"
                     /submitter_seqid="contig-80_8104"
                     /isolate="MT4_27"
                     /isolation_source="marine sediment"
                     /db_xref="taxon:2026724"
                     /environmental_sample
                     /geo_loc_name="Pacific Ocean: Mariana Trench"
                     /lat_lon="11.4037 N 142.3630 E"
                     /collection_date="2016-12"
                     /metagenome_source="marine sediment metagenome"
                     /note="metagenomic"
     gene            complement(<1..1781)
                     /locus_tag="J4N77_08745"
     CDS             complement(<1..1781)
                     /locus_tag="J4N77_08745"
                     /inference="COORDINATES: protein
                     motif:HMM:NF013259.1,HMM:NF015425.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="nitrite/sulfite reductase"
                     /protein_id="MCI0784399.1"
                     /translation="MTVQEPDTFQAHPKVPKDLKAGHVFVTLEEEFDNFDTETKVYQK
                     SPDLESDFVPYRLRMGTYGQRQENMQMMRIKLPYGGVNAEQMDALAEVAEKHSGMRRG
                     HITTRENIQFHFVPLDQASTVMRILAEAGLSTREACGHTVRNVTGCPYAGVTDHQLFD
                     VTPYLAAYARNMIRNPICQNMPRKWKTSFSCGPIDCAGSPFHDMGFVAALREENGEEV
                     RGFKIVVGGGTSTMVRAAETLWEFARADDGQYIRVAEAALKVFDKEGGIPNFLRKNMN
                     KARVKFIVKKLGIEEFRRQVDEELAQPWAWEPLDMPALKQLAPEGPTPGPAPNSVQPG
                     PDFERWVKTNVVQQPQPGYVAVTLTIPLGNLSPEQFRAVGDIMRRFSGGNARTQQNQN
                     LVLRWVHEAGLPALHSEIKKIGFGDPDAGLLSDVVGCPGTDSCKMGITSSTGVSEAIR
                     EAALNDWGYQDDPLVQAINVKASGCPNGCSQHHLAAIGLQGSSFSANGATIPCFDIFL
                     GGGNYIGGGKFATRVARVPSKRTPQAVKKMIDHYVANRNEGEEFVAFVDRLGAKTFDS
                     LFDEFKEVGPVHEDIDVYMDWGKEELF"
     gene            complement(1926..2165)
                     /locus_tag="J4N77_08750"
     CDS             complement(1926..2165)
                     /locus_tag="J4N77_08750"
                     /inference="COORDINATES: ab initio prediction:GeneMarkS-2+"
                     /note="Derived by automated computational analysis using
                     gene prediction method: GeneMarkS-2+."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="MCI0784400.1"
                     /translation="MDDPALSLVLEEALALLSAGEAPELIAERFPDFSDTLLPLLNVA
                     VELREGAEDAIDDPIDFLHDLGEYLQDRISGSAPP"
     gene            complement(2334..2612)
                     /locus_tag="J4N77_08755"
     CDS             complement(2334..2612)
                     /locus_tag="J4N77_08755"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_014449903.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="MoaD/ThiS family protein"
                     /protein_id="MCI0784401.1"
                     /translation="MTITVRIPMPLRKLTGEESVVAGEGSTLAECIDALEARYPGMKE
                     RLCDESGELRRFVNVYINGEDVRFQAGLATPLTGGDEVSIVPAVAGGA"
     gene            complement(2676..2918)
                     /locus_tag="J4N77_08760"
     CDS             complement(2676..2918)
                     /locus_tag="J4N77_08760"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_014449902.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="NIL domain-containing protein"
                     /protein_id="MCI0784402.1"
                     /translation="MATVRTKFTFVEQLIKDPIIWKLAKDFKVITNIRRADVTDERGW
                     VILELDGDQDEIERSLDWVREQGVRVDPVYEDVVEG"
     gene            complement(2921..4198)
                     /locus_tag="J4N77_08765"
     CDS             complement(2921..4198)
                     /locus_tag="J4N77_08765"
                     /EC_number="4.2.3.1"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_015083244.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="threonine synthase"
                     /protein_id="MCI0784403.1"
                     /translation="MSSRRNHVSVAKKLKCRECGREYQLEPSNVCEFCFGPLEVVYDY
                     AALAGHVTRERIEAGPPTMWRYADLLPVDAEDAVDIGTGYTPLLRAPNLGRELGLSRL
                     YLKNDCVNPTWSFKDRVVSVACSVARDFEFDTLACASTGNLANSVAAHAARAGMQARV
                     FIPSDLERGKIIGSAVYAPTLVAVDGSYDDVNRLCSELGDKYPWAFVNINVRPYYAEG
                     SKTLGHEVCEQLGWRAPDHCIVPMASGSLLTKIYKGIKELAWLGLTDWNPTRMSGAQA
                     LGCSPIAEAYGRDSWNIRPQKPNTIAKSLAIGNPADGYYALKTMRSTNGSAVAVPDEE
                     IVEGIQLLAETEGIFAETAGGVVVSGLRRLVREGRVEPDEVVVAFITGAGLKTQEAVA
                     PALKPALTVEANIESFERALEEREGALPLTKAV"
     regulatory      complement(4346..4447)
                     /regulatory_class="riboswitch"
                     /inference="COORDINATES: nucleotide
                     motif:Rfam:12.0:RF00162"
                     /inference="COORDINATES: profile:INFERNAL:1.1.1"
                     /note="SAM riboswitch class I; Derived by automated
                     computational analysis using gene prediction method:
                     cmsearch."
                     /bound_moiety="S-adenosylmethionine"
                     /db_xref="RFAM:RF00162"
BASE COUNT      757 a   1470 c   1541 g    902 t
ORIGIN
        1 tcgaacagct cttccttgcc ccagtccatg tagacgtcga tgtcctcgtg gaccgggccc
       61 acttccttga actcgtcgaa gagcgagtca aacgtcttcg ccccgaggcg gtcgacgaag
      121 gcgacgaact cctcgccctc gtttcggttg gcaacgtagt ggtcgatcat cttcttgacg
      181 gcctgcggcg tgcgcttgga cggcactcgc gctacgcgtg tcgcgaactt gccgccgccg
      241 atgtagttgc cgccgccgag gaagatgtcg aagcagggga tcgtagcacc gttggcgctg
      301 aacgacgatc cctggaggcc gatcgccgcg aggtggtgct gggagcagcc gttggggcag
      361 ccactggcct tgacgttgat cgcctgcacc agcgggtcgt cctggtagcc ccagtcattc
      421 aacgccgcct cgcggatcgc ctcgctgaca ccggtcgagg aggtgatgcc catcttgcag
      481 gagtcggttc cggggcagcc gacgacatcc gaaagcagtc cggcgtccgg atcgccgaag
      541 ccgatcttct tgatctcgga gtgaagcgcc ggtagaccag cttcgtgaac ccatcgcagc
      601 accaggttct ggttctgctg cgtgcgcgca ttgccaccgc taaagcgacg catgatgtcg
      661 ccgacggcgc ggaactgctc cggcgagagg ttgccgagtg ggatcgtcag tgtcacagcg
      721 acgtagccgg gctgtggctg ctgaacgacg ttcgtcttta cccagcgctc gaagtcgggg
      781 ccgggttgga cgctgttggg cgccgggcct ggcgtcgggc cttcgggcgc gagctgcttc
      841 agcgccggca tgtcgagcgg ctcccaggcc cacggctgtg ccagctcttc gtcgacctgg
      901 cggcggaact cttcgatgcc cagcttcttg acgatgaact tgacgcgggc cttgttcatg
      961 ttcttgcgca ggaagttggg tatgccgcct tccttgtcga agaccttgag cgcggcctcg
     1021 gcgacgcgga tgtactggcc gtcgtcggcg cgggcgaact cccagagagt ctcagctgcg
     1081 cgcaccatcg tcgaggtgcc gccgccgacg acgatcttga aaccgcgcac ttcctcgccg
     1141 ttctcctccc gaagggcggc gacgaagccc atgtcatgga acggactgcc ggcgcagtcg
     1201 atcgggccgc aggagaatga cgtcttccac ttgcggggca tgttctggca gatcggatta
     1261 cggatcatgt ttctggcgta ggcagccagg tacggcgtca cgtcgaacag ctggtgatcg
     1321 gtgacgccag catagggaca gcctgtcacg ttgcgcacgg tgtgcccgca ggcctcccgc
     1381 gtcgagagtc cggcttcggc gaggatgcgc atgaccgtgc tggcctggtc cagcggcacg
     1441 aagtggaact ggatgttctc acgagtggtg atgtggccgc ggcgcatgcc ggagtgcttc
     1501 tccgccactt cggcgagggc gtccatctgc tccgcgttga cgccgccgta cggcagcttg
     1561 atgcgcatca tctgcatgtt ttcctggcgc tggccgtagg tgcccatgcg caggcggtag
     1621 gggacaaagt cggactcaag gtcggggctc ttctggtaga ccttggtttc ggtgtcgaag
     1681 ttgtcgaact cttcttcgag cgtgacgaag acgtgcccgg ctttaaggtc tttgggcacc
     1741 ttcggatgcg cctggaacgt gtctggctct tgaacagtca tggccgtctc cttccggtct
     1801 gcggactcaa gcctttgcag ctaaagtcga gtattccagt ccggtaagtc aaaattgtat
     1861 ctgtcgaagc gcctgcctgt caaacgccag agccgcttca gccgcggaaa acgcgcgttc
     1921 ggcctctatg gcggggcgct gccgctgatc cggtcttgca gatactcgcc gaggtcgtgc
     1981 aggaagtcga tcgggtcgtc gatggcgtct tctgcgcctt cgcgcagttc gacggcgacg
     2041 ttgaggaggg gcaacagggt gtcagagaag tcggggaatc gctcggcgat gagttccgga
     2101 gcttccccgg cgctgagtag cgccagggcc tcttccaaaa cgagcgaaag cgccggatcg
     2161 tccaagtgtg agccctcccg tttgccgcaa tgacggggtt tctggccggc gccaccaata
     2221 catcacattg gcattgagac gtcaatacct gacgataggc tgcgcattac gggagtgtta
     2281 cgccttgaga cgatccgacg tctgtgcggc ccaggtaaac gccggcgctt ggtctaggca
     2341 ccgccggcga cggcgggcac gatgctcacc tcgtcaccgc cggtgagagg cgtggcgagt
     2401 ccggcctgaa agcggacgtc ctcaccgttg atgtagacgt tgacgaagcg tcgcagctcg
     2461 ccgctctcgt cgcagaggcg ctccttcatg ccagggtagc gcgcctccag ggcgtcgatg
     2521 cactcggcca gcgtcgatcc ctcgcccgcg acgacggact cttcgccggt cagcttgcgc
     2581 agcggcatcg gtatgcggac ggtgatggtc agaccggcgc tccccgtccc gccgccagcc
     2641 tcctgctcag gctctcgttg catcgccata tgtcactatc cctccacaac gtcttcgtag
     2701 accgggtcga cgcgaacgcc ctgttccctg acccagtcga ggctgcgttc gatctcatcc
     2761 tgatccccgt cgagctccag aatcacccac ccacgctcgt ccgtgacgtc cgcgcgccgg
     2821 atgttggtga tgaccttgaa gtccttggcc agcttccaga tgatgggatc cttaatcaat
     2881 tgctcaacga aggtgaattt ggtgcgtacc gttgccatct ctataccgct ttcgtcaagg
     2941 gcagcgcgcc ctcgcgttct tcgagcgccc gctcgaacga ttcgatgttg gcctctacgg
     3001 tcagcgccgg cttcagcgcc ggcgccacgg cttcctgcgt cttcagaccg gcgccggtga
     3061 tgaaggcgac gacgacttcg tcgggctcca cacgaccttc ccggacgaga cggcggaggc
     3121 cggagaccac gacaccgccg gcagtctcgg cgaagatgcc ttcggtctcg gccaggagct
     3181 ggatgccctc gacgatctct tcgtcgggga ctgcgacggc cgacccattc gtgctcctca
     3241 tcgtttttag agcgtagtac ccatcggccg gattgccaat ggccagcgac ttggcgatcg
     3301 tgttcggctt ctgcggccgg atgttccacg agtcgcgacc gtacgcctcg gcgatcggcg
     3361 agcagccgag ggcctgggcg ccgctcatgc gcgtcgggtt ccagtcggtc agaccgagcc
     3421 aggccagctc cttgatgcct ttgtatatct tggtcaagag cgagccgctg gccatcggca
     3481 cgatgcagtg atccggcgcg cgccagccca gctgctcgca gacctcgtgg cccagcgtct
     3541 tgctgccctc ggcgtagtac ggccggacgt tgatgttgac gaacgcccag ggatacttgt
     3601 cccccagttc gctgcacagg cggttgacat cgtcgtagct accatcgacg gcgaccagcg
     3661 tcggcgcgta gaccgccgag ccgatgatct tgccccgctc caggtccgag gggatgaaga
     3721 cgcgggcctg catcccggcg cgggcggcgt gggcggcgac gctgttcgcc aggttgcccg
     3781 tcgaggcgca ggccagggta tcgaactcga aatcgcgggc cacgctgcag gcgacggaga
     3841 cgacacgatc cttgaacgac cacgtggggt tcacgcagtc gttcttgagg tagagtcggc
     3901 tcaggcccaa ttcgcggccc agattcgggg cgcgcagtag cggcgtatag ccggtgccga
     3961 tgtcgacggc gtcctcggcg tcgacgggaa gcaggtcggc gtaacgccac atcgtcggcg
     4021 ggccggcctc gatccgttca cgggtgacgt ggcccgcgag cgctgcgtag tcgtacacga
     4081 cctccagcgg gccgaagcag aactcgcaga cgttcgaggg ttctagctga tattcacggc
     4141 cgcattcgcg gcatttgagt ttcttggcga ctgacacgtg gttcctccgg gacgacaatc
     4201 gatgtggacg ttagcgcagg cccgcgggcc tcccgagccg cctgaggcgg acggaggccc
     4261 gggccatcgc catctgtgcc cgggtccgct gttgtgtcgt catacgaaaa aagcccccgc
     4321 gcagagcgta aggggcctgg aaagctcctc atcgttctcc cgttagggag tcggcattgg
     4381 caccgtgtct cgcgggaccg gttgccgggc ttcacagggc cgtgccctcc accgctctgg
     4441 ataaggtcta ttgatttgta tccgaaatgc taacaccggc acggagcaag cgtcaacgag
     4501 cggcggtagt cgctactgcg tcacgagcag ctaggagccg aggagcgggc agcgacgacc
     4561 agggcgatgg ggcggccaag agtggcgcgg cagcgtagag tgaccggccg ctgtgctagg
     4621 cggggctcgt ctgccgggtg cgccgctcgg gcgagcgcgc gaacttctgg
//