LOCUS JACDGJ010000212 4675 bp DNA linear ENV 28-JUL-2020 DEFINITION Nostocaceae cyanobacterium isolate MGR_bin409 SD2908_NODE_9235_length_4675_cov_0.530123, whole genome shotgun sequence. ACCESSION JACDGJ010000212 JACDGJ010000000 VERSION JACDGJ010000212.1 DBLINK BioProject: PRJNA630822 BioSample: SAMN15052300 KEYWORDS WGS. SOURCE Nostocaceae cyanobacterium (soil metagenome) ORGANISM Nostocaceae cyanobacterium Bacteria; Cyanobacteria; Nostocales; Nostocaceae. REFERENCE 1 (bases 1 to 4675) AUTHORS Leung,P.M., Ortiz,M., Shelley,G., Cowan,D.A. and Greening,C. TITLE Metagenome from Mackay Glaciers Regions JOURNAL Unpublished REFERENCE 2 (bases 1 to 4675) AUTHORS Leung,P.M., Ortiz,M., Shelley,G., Cowan,D.A. and Greening,C. TITLE Direct Submission JOURNAL Submitted (29-MAY-2020) Department of Microbiology, Biomedicine Discovery Institute, Monash University, Clayton, Melbourne 3168, Australia COMMENT The annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP). Information about PGAP can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Date :: 2020 Assembly Method :: SPAdes v. 3.14.0 Genome Representation :: Full Expected Final Version :: Yes Genome Coverage :: 6.596709688504491x Sequencing Technology :: Illumina NextSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 07/16/2020 05:43:41 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline (PGAP) Annotation Method :: Best-placed reference protein set; GeneMarkS-2+ Annotation Software revision :: 4.12 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 4,916 CDSs (total) :: 4,885 Genes (coding) :: 4,553 CDSs (with protein) :: 4,553 Genes (RNA) :: 30 rRNAs :: 2, 1, 1 (5S, 16S, 23S) complete rRNAs :: 2, 1, 1 (5S, 16S, 23S) tRNAs :: 24 ncRNAs :: 2 Pseudo Genes (total) :: 332 CDSs (without protein) :: 332 Pseudo Genes (ambiguous residues) :: 69 of 332 Pseudo Genes (frameshifted) :: 156 of 332 Pseudo Genes (incomplete) :: 91 of 332 Pseudo Genes (internal stop) :: 90 of 332 Pseudo Genes (multiple problems) :: 66 of 332 CRISPR Arrays :: 4 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..4675 /organism="Nostocaceae cyanobacterium" /mol_type="genomic DNA" /submitter_seqid="SD2908_NODE_9235_length_4675_cov_0.53012 3" /isolate="MGR_bin409" /isolation_source="mineral soil" /db_xref="taxon:2723028" /environmental_sample /geo_loc_name="Antarctica: Mackay Glacier regions, Towle Glacier" /lat_lon="76.4373 S 161.007 E" /collection_date="2015-01" /metagenome_source="soil metagenome" /note="metagenomic" gene complement(<1..757) /locus_tag="H0X31_07715" CDS complement(<1..757) /locus_tag="H0X31_07715" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_016877228.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyltransferase family 2 protein" /protein_id="MBA3921580.1" /translation="MSKLPVTVLIPAKDEEANLPACLQSVQRAAEIFVVDSQSSDRTA EIAKNYGATVVQFNFNGHWPKKKNWSLDNLPFHNEWVLIVDCDERIPDELWTEIAEVI QNPDYDGYYLNRRVFFLGQWIRHGGKYPDWNLRLFKHKKGRYENLCTEDIPNTGDNEV HEHVILPGKVGYLKNDMLHEDFRDLYHWLARHNRYSNWEARVYLNLLTGKDDSGTIGA SLFGDAVQRKRFLKKVWVRLPFKPLLRFVLFYII" gene complement(762..1688) /locus_tag="H0X31_07720" CDS complement(762..1688) /locus_tag="H0X31_07720" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015198711.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="glycosyltransferase family 2 protein" /protein_id="MBA3921581.1" /translation="MPDPQISAIICTHNRDMYLGAAIDSLLAQEETPAFEVVVVDNGS SDRTREVVEQRSHPRLKYVFEPVLGLSVARNTGAKASCGEIIAYLDDDAVASDRWLQV LSTAYANNEKLAIAGGKVTLLWPEGIQAPSWLSPGLAGNLGAYDLGDQMLLIDRPGLT PRGLNYSLRRSFLEEIGGFDPHLGRVGKNLLSNEELQMTELALQAGWQVAYLPSALVA HNVSPERINRSWFLSRGWWQGISECHREQIAGRAGFAQLGRGGERLIRGLYKSLQYFR DPAERFDKFVYAYGQIGYLNAVIQGLLSPTKN" gene complement(1823..2569) /locus_tag="H0X31_07725" CDS complement(1823..2569) /locus_tag="H0X31_07725" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015207204.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="MBA3921582.1" /translation="MKPALKRIEATLHTLDQETPTYTYASSCKEPAHSYKLNTNTPLS EPEDQNAANSRSFFPHQGSVQTFPIEDQGQKTPALPKFKTPNFSSHRHGANPALALNL LQEIQANITSWQIELHKLVREIQDIYLEGPIVDGWLESHKTEFPTGGTATLRHADIER LMDYVEEICHPTGNNTQSNRAGYRLCSLDETGKVWSRPCPPEQVPALSIAIARNQKLR QMLGRKQYLETRLSQLAETLVVLHSHIQSA" gene 2827..3372 /gene="cobU" /locus_tag="H0X31_07730" CDS 2827..3372 /gene="cobU" /locus_tag="H0X31_07730" /EC_number="2.7.1.156" /EC_number="2.7.7.62" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015137406.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="bifunctional adenosylcobinamide kinase/adenosylcobinamide-phosphate guanylyltransferase" /protein_id="MBA3921583.1" /translation="MSKIILVTGPARSGKSEWAETLALQSGKTVVYVATAYENIADIE WCDRLTQHKLRRPQNWLTLEIPVALGATVGEATAQNCLLVDSLGTWVANLLEQDEEIW LQVVQDFLVTARHCEGEIIFVAEETGWGVVPAYPIGRTFRDRLGSLVRRLSTISDAVY LVTGGHVLNLSALGSPLPELK" gene 3390..3809 /locus_tag="H0X31_07735" CDS 3390..3809 /locus_tag="H0X31_07735" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015137405.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="MBA3921584.1" /translation="MASQEEVKQYLAYWFQLGKKVVLGNGVERLLPQPVIRGASYSPE FEQCWQRIISPTSGDCYLEGTQETIAELLTPTWDLTSCGRCEMPVPMRSVGMPTLLCP CNDLPNWPNTELPYPRQPVDTQTQLKEIRSRLLAKNE" gene 3913..>4675 /locus_tag="H0X31_07740" CDS 3913..>4675 /locus_tag="H0X31_07740" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015128190.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="substrate-binding domain-containing protein" /protein_id="MBA3921585.1" /translation="MGQTTPHKHKIPVLTLAILAIAGLASSSFWLFNKHTQPHVTEVN HKESKTAVQPEEFTNVQNVPHGVFNYGGSTSWAPIRQVADSGIQAAHPEFKLNYVLPT ANGTSGIDGGIKMLIDGKISFAQSSRPITESEYKQAQQQGFKLEQIGVAIDMVAVAVN PNLNISGLTIDQLRSIYTGKITNWKNLGGPNIAIKPFSRPIHNNGTVELFQKEILENQ PFGSNVQIVTTTTEGVQKLSAPGAIYFASAAEIVPQ" BASE COUNT 1269 a 993 c 1108 g 1305 t ORIGIN 1 gaatgatata aaacaataca aatcgtaaca atggtttaaa tggcagccgc acccaaactt 61 ttttgagaaa gcgcttacgt tgtacagcat cgccaaacaa acttgcgcca atcgtcccac 121 tgtcatcttt acctgtgagc aggttgagat aaacacgagc ttcccagtta gaatagcggt 181 tgtgtcttgc taaccagtgg tagaggtcgc ggaagtcttc gtgcagcata tcgtttttga 241 gatagccgac tttccctggc agaattacat gttcgtgaac ttcgttgtcg ccagtgttgg 301 gaatatcttc ggtgcagagg ttttcgtagc gtcctttttt gtgcttgaat aagcggagat 361 tccagtcagg gtatttaccg ccgtggcgaa tccattgacc taagaagaag acgcgacgat 421 tgaggtaata gccatcgtag tcgggatttt ggatgacttc cgcgatttct gtccaaagtt 481 cgtcgggaat gcgctcatca caatcgacaa ttagcaccca ctcgttgtgg aatgggaggt 541 tgtctaatga ccaatttttc tttttcggcc agtgaccgtt gaaattgaac tggacgacag 601 tcgcaccata atttttggca atttcggcag ttcgatcgct actttgagaa tcgacaacaa 661 aaatttctgc ggctcgttgt acactttgca agcaagctgg gagatttgct tcttcgtctt 721 tagcgggaat taagactgta actggtaatt tagacatata gttaattttt tgtgggtgag 781 agcaaacctt ggatgaccgc atttaaatag cctatttgac cataggcgta gacaaattta 841 tcaaaacgtt ctgccggatc gcgaaagtat tgcagtgatt tatacaaacc tcgtattaac 901 cgttcgccac cgcgtcctaa ctgagcaaat ccggctctcc cggcgatttg ttcccggtga 961 cattcgctaa ttccttgcca ccagcctcgg ctgagaaacc agctgcggtt gatacgttct 1021 ggagaaacgt tgtgagcaac tagcgccgat ggaagataag cgacttgcca tcctgcttgt 1081 agtgctagtt cggtcatttg cagttcttcg tttgataata agtttttgcc aacacgaccg 1141 agatgtgggt caaaaccgcc aatctcttcg aggaagctgc ggcgcaggga ataatttaaa 1201 cctctggggg ttaaaccggg gcgatcaatc aaaagcatct gatcgcctaa atcgtaggct 1261 cctaagtttc ctgcgagtcc aggagataac caagacggtg cttgaattcc ctccggccaa 1321 agcaaggtga ctttaccacc agcaattgcg agtttttcat tattggcgta agcagttgac 1381 aaaacttgca gccagcgatc gctagctaca gcatcgtcat caagataagc aataatttct 1441 ccacaagaag ctttcgcgcc agtgttgcgg gcgactgaga gaccgagcac tggctcaaaa 1501 acatacttga ggcgcggatg ggatctttgt tcgacaacct cccgcgtgcg atcgcttgag 1561 ccattgtcca caactactac ttcaaaggca ggagtttcct cttgcgccaa aagactgtca 1621 atggctgcgc ctaaatacat gtctcgattg tgtgtacaga tgatggcgga aatttgcgga 1681 tctggcatag ggtcgttttg agtttttggc tttgagattt tggcactaaa tcacaactgc 1741 caacattcgc ctgctgcccg ctaatttaca ctaaaattca tctacctacc aaccgagttt 1801 ttgcggaaaa ctacttgaca ttttaggcag attgaatatg gctgtgtaaa actacgagag 1861 tttctgccaa ttgacttagg cgagtttcca aatattgctt ccgtcccaac atctgccgta 1921 atttttggtt gcgggcgatg gcgatactca aagccggtac ctgttcaggt ggacaaggac 1981 gcgaccacac tttaccagtc tcatctaaac tacaaaggcg atagccagca cggttcgatt 2041 gcgtgttatt accagtggga tgacaaatct cttctacata gtccatcagg cgttcaatat 2101 ccgcatgacg gagtgtagca gttccgcctg tcgggaattc ggttttgtgg gactctaacc 2161 agccatcaac tattggtcct tctaagtaaa tatcctggat ttcgcgcaca agtttgtgca 2221 gttcgatttg ccaactagta atatttgctt ggatttcttg taagagattt aaagccaaag 2281 ctggattagc gccgtggcga tggctactga agttcggagt tttgaatttg ggtagggcgg 2341 gtgttttttg accttggtct tcgatgggaa aggtttgaac tgaaccttga tgcggaaaaa 2401 atgagcggga attagcggcg ttctgatctt ctggttcact taagggagta ttcgtgttta 2461 gcttataaga gtgggcaggt tctttacaac tactcgcata agtgtacgta ggagtttctt 2521 gatctagagt gtggagggtt gcctcaattc gctttaaagc tggtttcata aaacctcatc 2581 ctgaatggga taaatgggat gaacggaaat cacgtaatac tgatgtcagt aattctatct 2641 atttaaaatg gttacaactt aaaaaaccat gtgtaaaccc acagtttatc taagaaaatt 2701 gtcactcagt atacatatat ccgtgttttt acgcgccgcc acagtaatgt tactagttga 2761 cagaagctac atcagcttaa ataaattgtc acaggcggca atgagtgccc aggagaatta 2821 cctactttga gtaaaatcat cttagttaca ggaccagcac gttctggtaa aagtgaatgg 2881 gcagaaactc tggcgttgca atctggcaaa acagttgttt acgtggcaac ggcgtatgaa 2941 aatattgctg atattgaatg gtgcgatcgc ttaacacaac acaaattacg tcgtccgcaa 3001 aattggctaa cattagagat accagttgcc ctaggagcga ctgtaggcga ggctacggct 3061 caaaattgtc tcttggtgga ttctttaggt acttgggttg ccaatcttct ggaacaggat 3121 gaagagatct ggctacaagt ggtgcaggat tttctggtga ctgcgcggca ttgtgaggga 3181 gagataattt ttgttgccga ggaaacaggt tggggtgtag taccagctta cccaatcggc 3241 agaactttcc gcgatcgctt gggttctttg gtgcggcgtc taagtacaat ttctgatgct 3301 gtttaccttg tcactggcgg acacgttctt aacctcagcg cccttggttc accgttacct 3361 gaacttaaat aaaatagtag agtggaatta tggcatctca agaagaagtt aaacaatacc 3421 ttgcttactg gtttcaattg ggtaagaaag ttgttctcgg taatggtgta gaaagattgc 3481 taccacaacc agtaatccga ggcgcaagtt acagcccaga atttgaacag tgctggcagc 3541 gaattatttc accgacatca ggcgactgct acttggaggg tacacaagaa actattgccg 3601 aactgctgac ccctacttgg gatctgacat cttgcgggcg ttgcgagatg ccagtaccca 3661 tgcgtagtgt tggtatgccg actttattgt gtccgtgcaa tgacttacct aattggccga 3721 atacggaact tccttacccg cgtcagcctg ttgacactca aacacaactc aaggaaatac 3781 gctctaggct tttggctaag aatgaataat tactgatttc tatctgcata agttcaatat 3841 taaaacccct atgtaattta taggggtctt tttttgaagt tttattaaaa aagttaaatc 3901 aggggagcga aaatgggaca gactacgcct cacaaacaca aaatacctgt tttgactttg 3961 gcaatcctgg cgatagctgg gttagcaagc agtagctttt ggttgttcaa taaacacact 4021 caacctcacg tcacagaggt gaatcacaaa gagagtaaaa cagcagtcca gccagaagaa 4081 tttaccaacg tgcagaatgt tccacatggc gtgtttaact atggtggtag cacttcttgg 4141 gcaccgattc gacaggtcgc tgattcagga attcaagcgg cgcatcctga gtttaagctg 4201 aattatgtct tgccgaccgc taatggaaca tctggcattg atggtgggat taagatgtta 4261 atcgacggta aaatcagctt tgctcaatcc tcaaggccga ttacggaaag cgagtataag 4321 caggcgcagc agcaggggtt taaactagag caaattggcg ttgccattga tatggtggcg 4381 gtcgcagtta accctaactt aaatatttcc ggacttacca tagaccaact gaggtctatt 4441 tatactggca agataactaa ctggaaaaat ttaggaggtc ctaatatagc gattaagcca 4501 ttttcacgcc caatacacaa taacggtact gttgaacttt ttcagaaaga aattttagaa 4561 aatcaacctt ttggctcgaa tgtgcagatt gttactacca ccactgaagg tgtgcaaaaa 4621 ttgagcgctc caggagcaat ttactttgcc tcggcggcag aaattgtccc gcagt //