LOCUS       JACMJG010000437         1508 bp    DNA     linear   ENV 10-FEB-2021
DEFINITION  MAG: Pseudonocardia sp. isolate ES-bin-151
            ES-bin-151-contig-k141_4776224, whole genome shotgun sequence.
ACCESSION   JACMJG010000437 JACMJG010000000
VERSION     JACMJG010000437.1
DBLINK      BioProject: PRJNA552582
            BioSample: SAMN14819016
KEYWORDS    WGS; ENV; Metagenome Assembled Genome; MAG.
SOURCE      Pseudonocardia sp. (glacier metagenome)
  ORGANISM  Pseudonocardia sp.
            Bacteria; Actinobacteria; Pseudonocardiales; Pseudonocardiaceae;
            Pseudonocardia.
REFERENCE   1  (bases 1 to 1508)
  AUTHORS   Zeng,Y.
  TITLE     Metagenome-assembled genomes from the Lille Firn glacier at the
            Villum Research Station in northeast Greenland
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 1508)
  AUTHORS   Zeng,Y.
  TITLE     Direct Submission
  JOURNAL   Submitted (07-AUG-2020) Department of Environmental Science,
            Aarhus University, Frederiksborgvej 399, Roskilde 4000, Denmark
COMMENT     The annotation was added by the NCBI Prokaryotic Genome
            Annotation Pipeline (PGAP). Information about PGAP can be found
            here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Method :: MetaBAT v. 2
            Genome Representation :: Full
            Expected Final Version :: Yes
            Genome Coverage :: 20x
            Sequencing Technology :: Illumina NovaSeq
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider :: NCBI
            Annotation Date :: 08/14/2020 14:08:51
            Annotation Pipeline :: NCBI Prokaryotic Genome Annotation
                                   Pipeline (PGAP)
            Annotation Method :: Best-placed reference protein set;
                                 GeneMarkS-2+
            Annotation Software revision :: 4.12
            Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total) :: 4,002
            CDSs (total) :: 3,957
            Genes (coding) :: 3,826
            CDSs (with protein) :: 3,826
            Genes (RNA) :: 45
            rRNAs :: 1, 1 (5S, 23S)
            complete rRNAs :: 1 (5S)
            partial rRNAs :: 1 (23S)
            tRNAs :: 41
            ncRNAs :: 2
            Pseudo Genes (total) :: 131
            CDSs (without protein) :: 131
            Pseudo Genes (ambiguous residues) :: 0 of 131
            Pseudo Genes (frameshifted) :: 47 of 131
            Pseudo Genes (incomplete) :: 88 of 131
            Pseudo Genes (internal stop) :: 13 of 131
            Pseudo Genes (multiple problems) :: 15 of 131
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..1508
                     /organism="Pseudonocardia sp."
                     /mol_type="genomic DNA"
                     /submitter_seqid="ES-bin-151-contig-k141_4776224"
                     /isolate="ES-bin-151"
                     /isolation_source="glacier surface soil"
                     /db_xref="taxon:60912"
                     /environmental_sample
                     /geo_loc_name="Greenland: the Little Firn glacier in the
                     Knuths Fjeld"
                     /lat_lon="81.567 N 16.358 W"
                     /collection_date="2018-07-02"
                     /metagenome_source="glacier metagenome"
                     /note="metagenomic"
     gene            complement(<1..258)
                     /locus_tag="H7Y15_07970"
     CDS             complement(<1..258)
                     /locus_tag="H7Y15_07970"
                     /inference="COORDINATES: protein motif:HMM:NF024949.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="helix-turn-helix transcriptional regulator"
                     /protein_id="MBC8091859.1"
                     /translation="MAEEAMQRPHRISRPVGTRLMGEADGGARVARADLQMLRLRLLL
                     SGEELAQKAGIALSTYQRIERGEQNPHPKTMRGLALALGVDE"
     gene            complement(251..466)
                     /locus_tag="H7Y15_07975"
     CDS             complement(251..466)
                     /locus_tag="H7Y15_07975"
                     /inference="COORDINATES: ab initio
                     prediction:GeneMarkS-2+"
                     /note="Derived by automated computational analysis using
                     gene prediction method: GeneMarkS-2+."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="MBC8091860.1"
                     /translation="MVDVDAVLAVQRRAATALLLLTGTYLGSVGPVAVPDEALVGVDR
                     ALVDLGDYFDACRRRPGIGVRWWPDHG"
     gene            complement(484..705)
                     /locus_tag="H7Y15_07980"
     CDS             complement(484..705)
                     /locus_tag="H7Y15_07980"
                     /inference="COORDINATES: protein motif:HMM:NF024949.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="helix-turn-helix transcriptional regulator"
                     /protein_id="MBC8091861.1"
                     /translation="MIAARWRAGLSRSDAAGRLGVSVETYIGWESGTGVPPRRDRPAV
                     ASVLGVRLVDVDGWLESGLDDPYRIGRPR"
     gene            complement(834..>1508)
                     /locus_tag="H7Y15_07985"
     CDS             complement(834..>1508)
                     /locus_tag="H7Y15_07985"
                     /inference="COORDINATES: ab initio
                     prediction:GeneMarkS-2+"
                     /note="Derived by automated computational analysis using
                     gene prediction method: GeneMarkS-2+."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="MBC8091862.1"
                     /translation="GLARRLDRSAAVGLAAYSRAHAAAIGAGYEVMMLVAERAADEMR
                     PSLDTPEHDPDSHRVYGSLLLTASLGAATALRRERVDPYAAEAVEVARRIGDAPRAGD
                     PWQTYFGPSNAAIWQMTIAADIGNAGDVLAHAATVDLDVLDSRPRRAAYWTELGRGLA
                     LMPGREDDAIEALLQAHSTSPARVATSRYVLGALDELLDRPLRRPSTRAKLALLARRL
                     GHLTAA"
BASE COUNT      225 a    590 c    505 g    188 t
ORIGIN
        1 ctcgtcgacc cccagcgcca gcgccaaccc ccgcatcgtc ttcgggtggg ggttctgctc
       61 gccgcgttcg atccgctggt aggtcgacag cgcgatcccc gccttctgcg ccagctcttc
      121 ccccgacagc aggagacgca gccgcagcat ctgcaggtcc gcccgcgcga cccgcgcgcc
      181 accgtcggcc tcacccatca gccgcgtccc gaccggccgg ctgatccgat gcgggcgctg
      241 catcgcctcc tcagccatgg tcaggccacc accggacgcc gatccccggg cggcgacggc
      301 acgcatcgaa gtaatccccc aggtcaacca gcgcccggtc gaccccgacc agcgcctcat
      361 cgggcaccgc caccggcccg acgctgccca gatacgtgcc cgtgagcagc agcagagccg
      421 tcgccgcacg acgctgcacc gccagcaccg catcgacgtc gaccacctcc gccgggttcc
      481 cgatcaccgc ggccgcccga tccggtacgg gtcgtcaagg cccgactcca gccacccgtc
      541 gacgtcgacc agacggacac cgagcaccga cgcgaccgcc ggccggtcac gccgcggcgg
      601 aacccccgtc ccggactccc aaccgatgta ggtctccacc gacaccccca gccgaccggc
      661 ggcgtccgaa cggctcaggc cggccctcca acgggcggca atcatcggcc gaacattcac
      721 cgacggtaga tcaggcggat gatgtgcatt gatgtgtcgc gtcatgggca cggaccgtag
      781 ggccgtcgtc ccccgcgctg gtttccgctc ggaaaccctg ccgcagggtg gggtcaggcc
      841 gcggtgaggt gccccagtct gcgggcgagc agcgccagct tggcccgtgt cgacggccga
      901 cggagaggcc gatcaagtag ttcgtcgagc gccccgagca cgtaccgaga ggtggccacc
      961 cgggccggag aggtggagtg cgcctgcagc agggcctcga tcgcgtcgtc ctcccggcca
     1021 ggcatcagcg ccaacccccg cccgagctcg gtccagtacg ccgcacgtcg gggccgggag
     1081 tcgaggacgt cgaggtcgac cgtggcggcg tgggccagga cgtcgccggc gttgccgatg
     1141 tcggccgcga tcgtcatctg ccagatcgca gcgttgctag gcccgaagta ggtctgccag
     1201 gggtcgccgg cgcggggcgc gtcgccgatc cgccgggcga cctcgacggc ctccgcggcg
     1261 taggggtcaa cccgctcccg ccgaagggcg gtcgccgcgc cgagcgacgc ggtcagcagc
     1321 agcgacccgt agacccggtg agagtccggg tcatgctcag gggtgtccaa gcttggccgc
     1381 atctcgtcgg cggcccgctc ggctacgagc atcatcacct cgtagccggc accgatcgcc
     1441 gcggcgtgtg cccgggagta cgccgccagc ccgaccgcgg ccgaccggtc gaggcgtcgc
     1501 gcaaggcc
//