LOCUS JACMPN010000251 2905 bp DNA linear ENV 10-FEB-2021 DEFINITION MAG: Candidatus Sericytochromatia bacterium isolate LF-bin-346 FL-bin-346-contig-k141_1000003, whole genome shotgun sequence. ACCESSION JACMPN010000251 JACMPN010000000 VERSION JACMPN010000251.1 DBLINK BioProject: PRJNA552582 BioSample: SAMN14819180 KEYWORDS WGS; ENV; Metagenome Assembled Genome; MAG. SOURCE Candidatus Sericytochromatia bacterium (glacier metagenome) ORGANISM Candidatus Sericytochromatia bacterium Bacteria; Candidatus Sericytochromatia. REFERENCE 1 (bases 1 to 2905) AUTHORS Zeng,Y. TITLE Metagenome-assembled genomes from the Lille Firn glacier at the Villum Research Station in northeast Greenland JOURNAL Unpublished REFERENCE 2 (bases 1 to 2905) AUTHORS Zeng,Y. TITLE Direct Submission JOURNAL Submitted (07-AUG-2020) Department of Environmental Science, Aarhus University, Frederiksborgvej 399, Roskilde 4000, Denmark COMMENT The isolate name was changed from FL-bin to LF-bin in Feb. 2021. #The annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP). Information about PGAP can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: MetaBAT v. 2 Genome Representation :: Full Expected Final Version :: Yes Genome Coverage :: 20x Sequencing Technology :: Illumina NovaSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 08/14/2020 21:32:16 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline (PGAP) Annotation Method :: Best-placed reference protein set; GeneMarkS-2+ Annotation Software revision :: 4.12 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 5,112 CDSs (total) :: 5,062 Genes (coding) :: 5,023 CDSs (with protein) :: 5,023 Genes (RNA) :: 50 rRNAs :: 2, 2 (5S, 23S) complete rRNAs :: 2 (5S) partial rRNAs :: 2 (23S) tRNAs :: 43 ncRNAs :: 3 Pseudo Genes (total) :: 39 CDSs (without protein) :: 39 Pseudo Genes (ambiguous residues) :: 0 of 39 Pseudo Genes (frameshifted) :: 12 of 39 Pseudo Genes (incomplete) :: 17 of 39 Pseudo Genes (internal stop) :: 14 of 39 Pseudo Genes (multiple problems) :: 4 of 39 CRISPR Arrays :: 1 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..2905 /organism="Candidatus Sericytochromatia bacterium" /mol_type="genomic DNA" /submitter_seqid="FL-bin-346-contig-k141_1000003" /isolate="LF-bin-346" /isolation_source="glacial surface ice" /db_xref="taxon:2762020" /environmental_sample /geo_loc_name="Greenland: the Little Firn glacier in the Knuths Fjeld" /lat_lon="81.566 N 16.363 W" /collection_date="2018-07-02" /metagenome_source="glacier metagenome" /note="metagenomic" gene <1..626 /locus_tag="H7338_08950" CDS <1..626 /locus_tag="H7338_08950" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012240871.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=3 /transl_table=11 /product="AMP-binding protein" /protein_id="MBC7542845.1" /translation="TIIAAVATAMQPDDLVTIGHPIANVHLYILDPQGRQVPIGVAGE LHIGGVGLARGYLHREALTAARFIADPTGLEPGGRLYRTGDLARFRTDGRIGFLGRID DQVKIRGYRIELGEVETVLAAHPAVAAAVVTRHGPPESARLVAYVVLRPGHALDRTAL VAHLGGRLPAFMLPGAIVSLPALPYSPNDKIDRRALPSPTAADWLKH" gene complement(673..933) /locus_tag="H7338_08955" CDS complement(673..933) /locus_tag="H7338_08955" /inference="COORDINATES: protein motif:HMM:NF012759.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="acyl carrier protein" /protein_id="MBC7542846.1" /translation="MTVETTTMALMRRYLADVAPWLQAETVSDTVSMSAEIGLDSLTM TSFAVALEKGLGKAVGIDLWMVDTAPDEQDTLANLAAWLDGQ" gene complement(999..2207) /locus_tag="H7338_08960" CDS complement(999..2207) /locus_tag="H7338_08960" /inference="COORDINATES: protein motif:HMM:NF024920.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sterol carrier protein domain-containing protein" /protein_id="MBC7542847.1" /translation="MVLTTRTLTDADHWTVNDLLYFSRGRPALPADAWVRPRPLQGVL LESDRTPIAVQTLSFEPIRLNGRTAVLCQVLKEAMVPGVTGKQVVAALYAAAGQAVRD QNAALMTTLSEPPAPWVPAARASAEYGATVRRAMPPFQRGWVHLVTPGVGGGAPPAWG DVRPMRTADAGAVIALYERRLGGLNGSCVRSGQDWADLWRRPDDRSFVIGEGTGLSGY LRLEPMAFPAGGGWRVGDLLETGPAAGHALWQVALALGTQAQGLWVPPLPPDRPWRRW LDGLPVTTWPAVGLIRPGAMQPFLAALAFASPAGSLVLGIVDPYRLWPRRVRITWENH ALVECGETTAEPVAAMGVGALALLGPGLTPAPALWRQRLVDADLPTIARLAATWSAGP FFANWPNAGI" gene complement(2211..>2905) /locus_tag="H7338_08965" CDS complement(2211..>2905) /locus_tag="H7338_08965" /inference="COORDINATES: protein motif:HMM:NF037610.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=3 /transl_table=11 /product="hypothetical protein" /protein_id="MBC7542848.1" /translation="PPEDLPRLAALHDRRALELTGMVRRTPGAWAQLGGRLTPHGPLM AHAYEQDGTICGYLLFTSHELPGQVRRLRVLEWLETTPAALAGLVGFLHDQQLKVVEI ELPICPDRPLSAQVAARGRWAAVYDVGPSLQVLQAAPVLAALWPRFPAALAMVVIGGA AAGEVVHLGPAGETAGPPLPVPAEVLAGWAGGAASVAGAVAVGQLQASADQLALLSAH WQGRPAYRVRGL" BASE COUNT 454 a 1079 c 988 g 384 t ORIGIN 1 cgaccatcat cgccgccgtc gccacggcca tgcagccgga cgatctggtg acgatcggtc 61 atccgatcgc caacgtgcac ctctatatcc tcgacccgca gggtcggcag gtgccgatcg 121 gcgtggccgg ggagctgcat atcggcggcg tcgggctggc ccgggggtac ctgcaccgtg 181 aggcgctgac agccgcccgg ttcattgccg atccgaccgg tctggagccg ggcggacgcc 241 tgtaccggac gggcgatctg gcccggttcc gcaccgacgg ccgtatcggg tttctgggcc 301 gcatcgacga tcaggtcaag atccgcggct accgcatcga actgggcgag gtggagacgg 361 tgctggccgc ccatcctgcc gtggcggccg ccgtggtcac ccggcacggc ccgccggagt 421 cggcccgcct cgtggcgtac gtcgtcctcc ggccgggaca cgccctcgac cgcaccgccc 481 tggtcgccca tctgggcggg cgtctgccgg ccttcatgct cccgggggcg atcgtcagcc 541 tgccggccct gccgtacagc cccaacgaca aaatcgatcg ccgggccttg ccgtcaccga 601 ctgccgccga ctggctcaaa cactgagggt gccccctgct gccgaggatc cgatcctggc 661 agcctggcgg ggttattgtc cgtcgagcca tgccgccaga ttcgccagcg tatcctgttc 721 gtcgggggcc gtgtcgacca tccacaggtc gatcccgacg gccttgccga ggcctttttc 781 gagggccacg gcaaaactgg tcatcgtcag cgagtcgagg ccgatctccg ccgacatcga 841 gaccgtgtcc gagaccgtct cggcctgcag ccagggtgcc acgtcggcca aataccgccg 901 catcagcgcc atggtggtgg tttcgactgt catgaccgct tccttccgct cagcaatgct 961 gcctcagcgt aacatggcct cccgcacgcc aaagcccgtc aaatgccggc gttgggccag 1021 ttggcgaaaa acgggccggc cgaccacgtc gccgccaacc tggcgatggt gggcaggtcg 1081 gcatcgacca aacgctgccg ccagagcgcc ggcgccggcg tcagcccggg ccccagcagc 1141 gccagggccc cgacgcccat cgccgccacc ggctcggccg tcgtctcgcc gcactcgacc 1201 agggcatggt tttcccaggt gatccgcacc cggcgtggcc acaggcggta cggatcgacg 1261 atccccagta cgaggctgcc cgccggtgag gcgaaggcga gggcggccag gaacggctgc 1321 atggcacccg gccggatgag accgacggca ggccaggttg tgacgggcag gccatccagc 1381 cagcggcgcc acggccggtc cggcggtaac ggcggcaccc agaggccctg cgcctgggtc 1441 cccagcgcca gggccacctg ccacagggcg tgaccggccg ccggccccgt ttccagcaaa 1501 tcccccaccc gccagccgcc gccagccgga aaggccatcg gctccagacg cagatacccg 1561 gacaggcccg tcccctcgcc gatgacgaag gaccggtcgt ccgggcgtcg ccagaggtcc 1621 gcccagtcct ggcctgaacg cacacaggaa ccgttcaggc caccgagacg ccgctcgtaa 1681 agggcgatga ccgccccggc atccgccgtc cgcatcggac ggacgtcacc ccacgcggga 1741 ggcgcaccgc cgccgacacc cggcgtgacg agatggaccc acccccgctg aaagggcggc 1801 atcgcacgcc gcaccgtggc cccgtattcg gcagaggcgc gggctgcagg cacccagggg 1861 gccgggggtt cgctgagcgt cgtcatcagg gcggcgttct ggtcacgcac agcctgcccg 1921 gccgccgcat agagggctgc gacgacctgc ttgccggtga cgccggggac catggcctct 1981 ttcagcactt ggcagagcac ggccgtccga ccattcagcc ggatcggctc gaacgacaag 2041 gtctgcacgg cgatgggcgt tcggtcgctc tccagcagga cgccctgcag gggccggggc 2101 cggacccacg catcggcggg cagggcgggc ctcccccggg aaaaatagag gaggtcgttg 2161 acggtccaat gatcggcatc ggtcaacgtc cgggtcgtca gcaccatcgg ctacaggccc 2221 cgcacccggt acgccggtcg gccctgccag tgggccgaaa gcagcgccag ttgatccgcc 2281 gatgcctgca actgaccgac ggcgacagca ccggccactg aggcggcacc gccggcccag 2341 cccgccagga cctcggcagg caccggcagg ggcggccctg cggtctcgcc tgccggcccg 2401 agatgcacca cctcaccggc agcggcaccg ccaatgacga ccatggccag cgccgccggg 2461 aaccggggcc agagcgccgc cagcacgggg gccgcctgga gtacctgcag gctggggccg 2521 acgtcataca cggcggccca gcgcccccgg gcggcgacct gcgccgacag gggccggtcc 2581 gggcagatcg gcagctcgat ttccacgacc ttgagctgct ggtcgtgcag gaagccgacc 2641 aatccggcaa gagccgccgg tgtcgtttcc agccattcca ggacccgcag ccggcggacc 2701 tgccccggca gttcatggct ggtgaagagc agatacccgc agattgtccc gtcctgctcg 2761 taggcgtggg ccatcaatgg cccatggggc gtcagcctgc cgcccagttg ggcccaggcc 2821 ccgggcgtcc gccggaccat gccggtcagt tcgagggccc tccggtcatg cagggcggcc 2881 aatcggggca ggtcctcggg cggga //