LOCUS       JAAUUY010000299         4341 bp    DNA     linear   ENV 05-APR-2020
DEFINITION  Leptolyngbyaceae cyanobacterium SU_3_3
            NODE_3969_length_4341_cov_1.213336, whole genome shotgun sequence.
ACCESSION   JAAUUY010000299 JAAUUY010000000
VERSION     JAAUUY010000299.1
DBLINK      BioProject: PRJNA612530
            BioSample: SAMN14376599
KEYWORDS    WGS.
SOURCE      Leptolyngbyaceae cyanobacterium SU_3_3 (stromatolite metagenome)
  ORGANISM  Leptolyngbyaceae cyanobacterium SU_3_3
            Bacteria; Cyanobacteria.
REFERENCE   1  (bases 1 to 4341)
  AUTHORS   Waterworth,S.C., Isemonger,E.W., Rees,E.R., Dorrington,R.A. and
            Kwan,J.C.
  TITLE     Conserved bacterial genomes from two geographically distinct
            peritidal stromatolite formations shed light on potential
            functional guilds
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 4341)
  AUTHORS   Waterworth,S.C.
  TITLE     Direct Submission
  JOURNAL   Submitted (13-MAR-2020) Pharmaceutical Sciences, University of
            Wisconsin, Madison, 777 Highland Avenue, Madison, WI 53705, USA
COMMENT     The annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (PGAP). Information about PGAP can be found here:
            https://www.ncbi.nlm.nih.gov/genome/annotation_prok/

            ##Genome-Assembly-Data-START##
            Assembly Method        :: SPAdes v. 3.12.0
            Genome Representation  :: Full
            Expected Final Version :: Yes
            Genome Coverage        :: 3.29x
            Sequencing Technology  :: IonTorrent
            ##Genome-Assembly-Data-END##

            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 03/31/2020 08:46:48
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline (PGAP)
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS-2+
            Annotation Software revision      :: 4.11
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 3,941
            CDSs (total)                      :: 3,930
            Genes (coding)                    :: 2,918
            CDSs (with protein)               :: 2,918
            Genes (RNA)                       :: 11
            tRNAs                             :: 10
            ncRNAs                            :: 1
            Pseudo Genes (total)              :: 1,012
            CDSs (without protein)            :: 1,012
            Pseudo Genes (ambiguous residues) :: 0 of 1,012
            Pseudo Genes (frameshifted)       :: 868 of 1,012
            Pseudo Genes (incomplete)         :: 155 of 1,012
            Pseudo Genes (internal stop)      :: 78 of 1,012
            Pseudo Genes (multiple problems)  :: 87 of 1,012
            CRISPR Arrays                     :: 1
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..4341
                     /organism="Leptolyngbyaceae cyanobacterium SU_3_3"
                     /mol_type="genomic DNA"
                     /submitter_seqid="NODE_3969_length_4341_cov_1.213336"
                     /isolate="SU_3_3"
                     /isolation_source="Stromatolite"
                     /db_xref="taxon:2720479"
                     /environmental_sample
                     /geo_loc_name="South Africa: Schoenmakerskop"
                     /lat_lon="34.041167 S 25.5385 E"
                     /collection_date="Apr-2018"
                     /metagenome_source="stromatolite metagenome"
                     /note="metagenomic"
     gene            complement(129..695)
                     /locus_tag="HC936_11995"
     CDS             complement(129..695)
                     /locus_tag="HC936_11995"
                     /inference="COORDINATES: ab initio prediction:GeneMarkS-2+"
                     /note="Derived by automated computational analysis using
                     gene prediction method: GeneMarkS-2+."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="NJK53344.1"
                     /translation="MGSLLLYLLVPELRARFNPILPPISLPSISLPSASPVAPANSPA
                     PNATASISPSPSATTSLAPSPLLEPFLEVGSFLQVAPRSTPNSTPIDLLSLPGSPIDS
                     TPDPGVGQIPNGSILQVVGKQTMTDRRSWLRLKVCSVANNSGSASGSKFVRAGDVGWL
                     EAKIMTGAIAQNFSIKPTQLGSCANSTP"
     gene            complement(643..1071)
                     /locus_tag="HC936_12000"
     CDS             complement(643..1071)
                     /locus_tag="HC936_12000"
                     /inference="COORDINATES: ab initio prediction:GeneMarkS-2+"
                     /note="Derived by automated computational analysis using
                     gene prediction method: GeneMarkS-2+."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="NJK53345.1"
                     /translation="MKLGNRNRPPLDGTTDVATVSRTLVEIANTRNGHDNVTIGLVYC
                     QVDRQANAERSDRNLRVANLPKIPQTLKPRRVSAHPQRRIDRRASHPYSVVRFSGQDA
                     TNCSSAFTVGLSAFPADSIGFIGLGQLIALLAGSRTAGAV"
     gene            complement(1090..1527)
                     /locus_tag="HC936_12005"
     CDS             complement(1090..1527)
                     /locus_tag="HC936_12005"
                     /inference="COORDINATES: protein motif:HMM:NF012693.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="NJK53346.1"
                     /translation="MSNRNLLAGGQLDSATLIHGLERATLAANDAISERNDQEQRHER
                     QRMGTTLVMGLAHAHEFYLTHVGDSRAYRITRTGCDQVTLDDDVASREVRLGYALYRD
                     ALQQPASGALVQALGMGNSSFLHPTVQRFVVDEDCVFCSALMG"
     gene            complement(1637..2548)
                     /locus_tag="HC936_12010"
     CDS             complement(1637..2548)
                     /locus_tag="HC936_12010"
                     /inference="COORDINATES: ab initio prediction:GeneMarkS-2+"
                     /note="Derived by automated computational analysis using
                     gene prediction method: GeneMarkS-2+."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="NJK53347.1"
                     /translation="MDNDRPILQCPNYLCQALNPEGHKFCHKCRTPLPKLFLWAVGLE
                     GYRLGEVLGDRYLVKADQILLDTKPGLPLEMPGEPPRHWESYLRLFPYRLHVPQIHGW
                     VCEKGRSNSPILLLEGAPIFPIGERQADARLASVELAGHLMPELTSLWGQSFPLRQLN
                     WLWQMAALWQPLSSEKVISTLLDPTLLRVDHSCLRLLELKFDPPTAPSLAELGQLWRS
                     WNTQLERAAPGGVAQEIVPFLGQLCDQMIHGQIKTPDHLLDLLDQALSHEGQFQIRQV
                     QIATCTDQGPSAREMRMLAIRRVGRVL"
     gene            2772..3047
                     /locus_tag="HC936_12015"
     CDS             2772..3047
                     /locus_tag="HC936_12015"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_017286628.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="helix-turn-helix transcriptional regulator"
                     /protein_id="NJK53348.1"
                     /translation="MAEGNSQVQAQAPLSERELQVIELVAAGLTNQDIAIQLEISKRT
                     VDNHISNILTKTETENRVALVRWALQWGKVCIDEVNCCVLPPYKGER"
     gene            3094..3359
                     /locus_tag="HC936_12020"
                     /pseudo
     CDS             3094..3359
                     /locus_tag="HC936_12020"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_007353879.1"
                     /note="frameshifted; Derived by automated computational
                     analysis using gene prediction method: Protein Homology."
                     /pseudo
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
     gene            3376..>4341
                     /locus_tag="HC936_12025"
     CDS             3376..>4341
                     /locus_tag="HC936_12025"
                     /EC_number="6.1.1.21"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_007353880.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="histidine--tRNA ligase"
                     /protein_id="NJK53349.1"
                     /translation="MSSIQAIRGTRDILPDEVGYWQQIEAIARKIFATAVYQEIRTPI
                     FEQTALFERGMGEATDVVSKEMYTFTDRGDRAITLRPEGTAGVVRSFVEHGMQAQGGV
                     QRLWYLGSMFRYERPQAGRQRQFHQLGVEVLGSADFRADAEAIALATTILQALGLKNL
                     RIELNSIGNGDDRQKYRAALVNYLTPHQADLDSDSQDRLTRNPLRILDSKNSGTQKIL
                     KDAPSILDYLSPDSQQRFDRLQQLLGDLGIVCNINDRLVRGLDYYTHTVFEIQSDDLG
                     SQATVCAGGRYDGLVAELGGPIPLVLVGAMGLERLVILLQQIQPAH"
BASE COUNT     1036 a   1063 c   1251 g    991 t
ORIGIN
        1 tcacgaggtt cgatctaggc aagtggcgtt gatccaggaa ggctcaaccc ttgaggttgg
       61 gaatccccat ccgcctatgc ggtagggagg atgtcaattg aatctacgtg acctgagggt
      121 tgcccaagtt agggagtgga attggcacag gagccgagtt gggttggctt gatgctaaag
      181 ttttgagcga tcgctcctgt cattatcttt gcttctagcc aacccacatc acccgcccgg
      241 acgaattttg agccactggc actcccagaa ttattcgcaa ccgaacaaac cttgagccgc
      301 agccacgatc ggcgatcggt catcgtttgc ttgcccacca cttgcagaat gctgccatta
      361 ggaatctgcc ccaccccagg atcgggggtg gagtcgatcg gcgatccggg cagactgagc
      421 aagtcgatgg gcgtagaatt tggagtcgat cgcggcgcaa cctgtagaaa tgagccaact
      481 tctaggaaag gttctaacaa agggctgggc gcaaggctgg tcgttgcaga cgggctagga
      541 gaaatcgacg cagttgcatt aggagcggga ctattggcag gagcaacggg gctggcagac
      601 ggcaacgaga tcgacggcaa cgagatcggc ggcaaaatcg gattaaaccg cgcccgcagt
      661 tctggaacca gcaagtaaag caataagctg cccaagccca ataaagccaa tagagtcagc
      721 agggaaggca gacagcccaa cggtgaacgc cgaggagcaa ttagttgcgt cttgaccgga
      781 gaatctgacg actgagtagg gatgggaggc tcggcgatcg attctacgtt gagggtgggc
      841 agagacacgt cggggcttaa gcgtttgcgg aattttaggt aggttggcta cgcgcaggtt
      901 gcgatcgctc ctttcagcat tggcttgacg atcgacctgg cagtagacta acccgatcgt
      961 cacattgtca tgaccattgc gcgtgttagc aatttcgacc agagtgcggc tgacggttgc
     1021 tacatccgtc gtcccatcca aagggggacg atttctgttt cccaatttca tccacacgat
     1081 cgttatcgct caacccatca gagcagagca aaatacgcag tcttcatcca ccacaaatcg
     1141 ctgcacagtc ggatgcaaaa agctagaatt tcccatgccc agcgcctgaa ctagcgcccc
     1201 cgacgcgggt tgttgcaggg catctcgata taaggcgtag cccagccgca cttctctaga
     1261 agccacatcg tcgtctaacg tcacctgatc acagcctgtg cgcgtgatgc ggtaggcacg
     1321 gctatcgcca acgtgggtga gataaaactc gtgggcgtgg gccagcccca tcaccagtgt
     1381 tgtgcccatg cgctggcgtt cgtggcgttg ttcttggtcg ttgcgttcgc tgatggcatc
     1441 gttagccgcc agggtggcac gttctagccc atgaattagg gttgccgaat cgagttgacc
     1501 gcctgctagc aggtttcggt ttgagattgt agagtggcga tcgccagccc cgaagccact
     1561 tctccgcctt catgcccacc aatgccatcg cagacaatca ccaacggccg cgtcggctct
     1621 cctgaacgaa tcggaactac aagactcgtc ccacccggcg gatagcaagc atcctcattt
     1681 ctctggcgct ggggccctgg tcagtgcaag ttgcaatctg cacctgtcga atttgaaact
     1741 gaccctcatg ggacaaggcc tgatccagca gatccaataa atgatcagga gttttgatct
     1801 gcccgtggat catttggtcg cacagttgac cgagaaaggg cacaatttct tgggcaaccc
     1861 cacccggtgc agccctttct aattgggtgt tccacgatcg ccacagttga ccgagttctg
     1921 caaggctagg ggcagtcggg ggatcgaatt tcaattccag taaccgcagg caggaatgat
     1981 cgacccgcaa caacgtggga tcaagcaaag tagaaatcac cttttcagaa ctgaggggct
     2041 gccaaagggc ggccatttgc cagagccaat ttagctggcg cagtggaaag ctctgccccc
     2101 agaggcttgt gagttctggc atcaaatgcc ctgcgagttc aacacttgcc aagcgcgcat
     2161 cggcttgtct ctcgcctatt ggaaaaatcg gagcaccttc taacaaaaga atgggtgaat
     2221 tcgatcgtcc cttctcgcag acccagccgt gaatctgtgg tacatgtagc cgatagggaa
     2281 acaaccgcaa ataactttcc cagtggcggg gcggttcgcc tggcatttct aggggcaaac
     2341 ctggcttggt gtcgagcaaa atttgatcag ctttgaccag gtagcgatcg cccaaaacct
     2401 ctcccaggcg gtatccctct aaacccactg cccacaaaaa cagctttggc aacggggtgc
     2461 gacacttgtg gcaaaatttg tggccttcag ggtttagagc ctgacaaaga tagttagggc
     2521 attgaagaat aggacgatcg ttatccattg aggttagggg ggtgtgcggt cagattaggt
     2581 ctcagactac ctgtttgaag tcttcagtct agaggttaaa gcccatagtc taccgtccga
     2641 tctgcgaaca tttcatcaaa aaaggacgcg ctcctttttc aatattagtt ctgcggcttt
     2701 tcgggttgac cagaatcctc ctagaattga gagacaatga gtagatatac acagcatagg
     2761 aaaaaatcga catggctgag ggcaactctc aagtgcaagc ccaggctcct ctttctgaga
     2821 gagagttgca ggtgattgag ctagtagctg ctggcttgac aaaccaagac attgccatac
     2881 agctagaaat tagtaagcgg acggttgata accacatcag caacattttg acaaaaactg
     2941 agaccgagaa tcgcgttgcc ctagtgcgat gggcgttgca gtggggtaag gtctgtattg
     3001 atgaggtgaa ttgctgtgta ttgccgcctt ataaggggga gagatgagtc agaagtgctg
     3061 tagagaaatg ttgtagagaa atgctgtaga gggctggagg ataaaggttt gaggtattgg
     3121 gtgcgctgta gggggctgcc attagcggtc tatcgggagg tcgcggcaca cttgtcgcag
     3181 gtggatggtg tgcagacagg gctggtggct cagtcgtcgc ggcagtttga gtatgggcaa
     3241 agtcaagtag aggggctgtt gggttcaggt gtcggtaggc aggtcgatcg caggtggagc
     3301 agattttggc ctactatggc gatcgctatt ggcgcttggg aaacagaagt tcaggctgag
     3361 taagttctaa atttaatgag ttcaattcaa gcaattcgag gaacgcggga tattttgccc
     3421 gatgaagttg gctactggca gcagattgag gcgatcgcgc gcaaaatttt cgccacggcg
     3481 gtctatcaag aaattcggac tcccattttt gagcagacgg ctttgtttga gcgaggcatg
     3541 ggcgaagcga cggatgtcgt cagcaaagaa atgtatacct ttactgatcg cggagatcgg
     3601 gcgattactc tgcgaccgga aggaacggct ggcgtggtgc gatcgttcgt agagcacggg
     3661 atgcaggctc aaggaggcgt acagcggctc tggtatctcg gctctatgtt tcgctatgag
     3721 cgccctcagg cgggtcggca gcgccagttt catcagttgg gtgtagaagt gttgggcagt
     3781 gctgactttc gggctgatgc cgaagcgatc gctctggcta ccaccatttt gcaagcgctg
     3841 gggctaaaga atctgcgaat cgagttgaat tctattggca atggggacga tcgccaaaaa
     3901 tatcgggcag cgctggtcaa ttatttgact cctcaccaag cagatcttga ttccgactcc
     3961 caagatcgtc tgactcgcaa cccgttgaga attctcgaca gcaagaactc tggaactcag
     4021 aaaattctca aggatgcgcc cagcatttta gattacctta gccccgattc tcagcaacgg
     4081 ttcgatcgcc tacaacagtt gctcggcgat ttgggcattg tctgcaatat caacgatcgc
     4141 ctagtccgtg gtttagacta ctacacccat actgtgtttg agattcaatc tgacgattta
     4201 ggatcgcagg cgacggtctg cgcgggcggg cgctacgacg gacttgtcgc cgaactgggg
     4261 ggccctatac ccctggtatt ggttggggca atgggcttag agcgcttggt cattcttttg
     4321 cagcagatac aaccagcaca c
//