LOCUS       VGCQ01000271            6760 bp    DNA     linear   ENV 11-FEB-2021
DEFINITION  Alphaproteobacteria bacterium isolate K_Offshore_120m_m2_043
            K_Offshore_120m_m2_043_Ga0194131_10019445, whole genome shotgun
            sequence.
ACCESSION   VGCQ01000271 VGCQ01000000
VERSION     VGCQ01000271.1
DBLINK      BioProject: PRJNA523022
            BioSample: SAMN10966325
KEYWORDS    WGS.
SOURCE      Alphaproteobacteria bacterium (freshwater metagenome)
  ORGANISM  Alphaproteobacteria bacterium
            Bacteria; Proteobacteria; Alphaproteobacteria.
REFERENCE   1  (bases 1 to 6760)
  AUTHORS   Tran,P.
  TITLE     Lake Tanganyika Metagenome-Assembled Genomes (MAGs)
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 6760)
  AUTHORS   Tran,P.
  TITLE     Direct Submission
  JOURNAL   Submitted (14-MAR-2019) Bacteriology, University of Wisconsin -
            Madison, 1550 Linden Drive, Madison, WI 53706, USA
COMMENT     The annotation was added by the NCBI Prokaryotic Genome
            Annotation Pipeline (PGAP). Information about PGAP can be found
            here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Date          :: 21-AUG-2017
            Assembly Method        :: SPAdes v. 3.10.1
            Assembly Name          :: JGI 2017-08-21
            Genome Representation  :: Full
            Expected Final Version :: No
            Genome Coverage        :: 5019.780936x
            Sequencing Technology  :: Illumina HiSeq 2500-1TB
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 06/20/2019 06:26:04
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline (PGAP)
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS-2+
            Annotation Software revision      :: 4.8
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 5,230
            CDSs (total)                      :: 5,191
            Genes (coding)                    :: 5,119
            CDSs (with protein)               :: 5,119
            Genes (RNA)                       :: 39
            tRNAs                             :: 35
            ncRNAs                            :: 4
            Pseudo Genes (total)              :: 72
            CDSs (without protein)            :: 72
            Pseudo Genes (ambiguous residues) :: 0 of 72
            Pseudo Genes (frameshifted)       :: 14 of 72
            Pseudo Genes (incomplete)         :: 44 of 72
            Pseudo Genes (internal stop)      :: 18 of 72
            Pseudo Genes (multiple problems)  :: 4 of 72
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..6760
                     /organism="Alphaproteobacteria bacterium"
                     /mol_type="genomic DNA"
                     /submitter_seqid="K_Offshore_120m_m2_043_Ga0194131_10019445"
                     /isolate="K_Offshore_120m_m2_043"
                     /isolation_source="freshwater lake"
                     /db_xref="taxon:1913988"
                     /environmental_sample
                     /geo_loc_name="Tanzania: Lake Tanganyika, Kigoma"
                     /lat_lon="6.2556 S 29.5108 E"
                     /collection_date="2015"
                     /metagenome_source="freshwater metagenome"
                     /note="metagenomic"
     gene            complement(<1..385)
                     /locus_tag="FJX11_18400"
     CDS             complement(<1..385)
                     /locus_tag="FJX11_18400"
                     /inference="COORDINATES: protein motif:HMM:PF00440.21"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="TetR/AcrR family transcriptional regulator"
                     /protein_id="MBM3649756.1"
                     /translation="MTRRKPTAAAPPPPVERDPRPPVEAIRAAAFAQFAERGYPVVTV
                     RDIMKACGLTQGALYNHFRSKDELLHDIIASTQGELERICQQAVTEAGDDPRAKLAAF
                     VRVYVVRHCRLRVEALVANREMSWLD"
     gene            complement(382..1524)
                     /locus_tag="FJX11_18405"
     CDS             complement(382..1524)
                     /locus_tag="FJX11_18405"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_012045071.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="phosphotransferase family protein"
                     /protein_id="MBM3649757.1"
                     /translation="MPADGRANSGPESEQLFRDWGVARRGAEYTGTMAVQEKHRFDER
                     SLERFMATEIEGFVPPVTVEQFKGGQSNPTYRLTDGAGRRYVLRRKPPGKLLPSAHAV
                     DREFRVISALNKTDVPTPRAYALCEDEAVVGTAFYVMEYCDGRVLWDPLLPELPREAR
                     AAIHYAKVETLARLHAVDYVALGLADFGRPGSYVARQISRWGKQYKASETERIEAMDR
                     LLDWLPAHLPANDETVLVHGDYRLDNMVFHPTEPRVLGIIDWEISTLGDPLAELSYLC
                     MLWRTPKDWGGLLGHDLSALGLPAEHEMVERYCALAERGLPEPALWEFYMAYNLFRVS
                     CIRQGVYARALDGTASNVRAAESGKLVRPAADLAWSIVTSMDGATP"
     gene            1731..2153
                     /locus_tag="FJX11_18410"
     CDS             1731..2153
                     /locus_tag="FJX11_18410"
                     /inference="COORDINATES: protein motif:HMM:PF07331.9"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="tripartite tricarboxylate transporter TctB
                     family protein"
                     /protein_id="MBM3649758.1"
                     /translation="MSGLMFIGFGLLALYFGQKLALGTPVRMGPGYVPRMLAFILMGL
                     GGLICVVTLFAGGEPVERPKWRPITLVTIGIVCFALLFERAGLLPALIVLVFITSLAG
                     EEFKLTEVIGNMVVLAILCTVVFKVGLGMNISIIRGVW"
     gene            2155..3657
                     /locus_tag="FJX11_18415"
     CDS             2155..3657
                     /locus_tag="FJX11_18415"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_018008350.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="tripartite tricarboxylate transporter permease"
                     /protein_id="MBM3649759.1"
                     /translation="MDLLNNLLLGLSVAVSFQNLLYALIGCMVGTLIGVLPGIGPVAT
                     IAMLLPITFHLPPTASLIMLAGIYYGAQYGGSTTSILVNLPGEAASVVTCLDGYQMAR
                     RGRAGAALSISAVGSFFAGTVGTIIIVIFAEPLTRMAQKFGPADYCSLMALGLVAAVV
                     LASGSVLKAIAMVFLGLLFGLVGTDVNTGAQRFTFDIPELSDGIDFAPIAMGLFGIAE
                     IVVNLERRLERTGAIKVGSLWPSAEEIRRAIPAVLRGTTLGSLLGVLPGGGPTLGAFS
                     SYTLEKKISRNPSEFGKGAVEGVAAPEAANNAAAQTSFIPMLTLGIPSNAVMALMVGA
                     MIIQGIQPGPEVMTKKPDLFWGMIVSMWIGNLMLVVINLPMIGMWVKLLTVPYRFLAP
                     AILLFCCIGAYSLQNSTFHVLQVAGFGVLGYIFVRLGCEGAPFLLGLVLGPQMEEYFR
                     RAMLLSRGDAMVFLQRPISLGLLITTSLLLVLMALPSIKKARQQAFQEEG"
     gene            3676..4632
                     /locus_tag="FJX11_18420"
     CDS             3676..4632
                     /locus_tag="FJX11_18420"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_011810957.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="hydroxymethylglutaryl-CoA lyase"
                     /protein_id="MBM3649760.1"
                     /translation="MSDLPKRVLIGEEGPREGFQSEKKTIPVADKVRLIEALADTGLM
                     HIACVSYVNPKRVPTMADAEDVAAAIRRKPGIQYSALWLNQQGLERALRGPLHVDGGV
                     RITASDTFSRKNIGKSVPDSLVEQRLSLKTLKDRGIAVEWGIVLGAFGCNYEGELSTE
                     LILQRVRSVLDEADLAGFKLKGIKLTDAMGWATPRSVEQLIGAIRSKWPELEIALHLH
                     DTRGTGLATAYAGLRLGVSKFDASIAGLGGCPFAATDGAAGNICTEDFAFMCEEMGVE
                     TGLDIDHLIEVAKLAEDVVGRSLPGHVMRGGTLKAAKQRAAA"
     gene            4629..4976
                     /locus_tag="FJX11_18425"
     CDS             4629..4976
                     /locus_tag="FJX11_18425"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:YP_006895265.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="MBM3649761.1"
                     /translation="MNVETLIPDSTYVDVAAIPWKPTRFPGVEAKTLMENPATGMSTV
                     LMRWAPGARLPRHEHVEIEQTFVLEGSFADHAGVCRAGNYVWRRAGSRHDAWTDEGCL
                     MLAFFLKPNTFFD"
     gene            5053..6198
                     /locus_tag="FJX11_18430"
     CDS             5053..6198
                     /locus_tag="FJX11_18430"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_007609061.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="TAXI family TRAP transporter solute-binding
                     subunit"
                     /protein_id="MBM3649762.1"
                     /translation="MIRRAVLTGAVVAWFAYGTAFAQALPKEMSWTAYDTGSSGFNIA
                     VAIGQQFKQGYQSDVRILPSGNDTGRLAPIKANRAVVSQMGIGTYFAQEGVFEFGTRS
                     WGPQASRLIMAATARNGLGLAVAKDTGVKEVKDLKGKRLGVVVGSPPLTQGALALIAF
                     GGLGEKNMIVVQFSSNNAMWKGVINSEADAVLTSTISGQSKEADSSPRGIVFPPMPAF
                     DKEGWARVHKKAPYFVPVKATCGAGGLSPENPVEMAGYAYPIFMTFADRPADTIYAIT
                     NVMIDAYDKYKDGAPGAEGMALKLQNLTWVVPYHDSTIRAFKEKGVWTDAAQKHNDAL
                     LKRQQILADAWKAYGINAPADDKAFADGWRKARADALAKAGLDVIFD"
     gene            6288..>6760
                     /locus_tag="FJX11_18435"
     CDS             6288..>6760
                     /locus_tag="FJX11_18435"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_009735799.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
/codon_start=1 /transl_table=11 /product="TRAP transporter permease" /protein_id="MBM3649763.1" /translation="MRVRALSAFWRWALVAATAVTIFLCVNQQFALRFFVGFTQLNTE YFYLLILCMLPFTFLIFPGSPRASLTRMPWYDVVLFVATAAASLHLMLHIREAAELGW EFGDPPKSIIWAGYVMWLVLLEALRRTGGWSLMLCVLPFTVYPMFAGASWLGPLKG" BASE COUNT 1160 a 2193 c 2142 g 1265 t ORIGIN 1 catcgagcca gctcatttcg cgattggcga ccaacgcctc cacccggagc cgacagtggc 61 gcacgacgta gacgcgcacg aaagccgcga gcttggctcg cggatcgtcg cccgcctccg 121 tcaccgcctg ctggcaaatc cgctcgagtt cgccctgggt cgaagcgatg atgtcatgca 181 gcaactcgtc cttggacctg aaatgattgt agagcgcgcc ctgagtcagg ccacaggctt 241 tcatgatgtc gcgcaccgtc acgaccgggt agccgcgctc ggcgaactgc gcgaaagctg 301 ctgcccggat cgcctcgaca ggcggccgcg gatcgcgctc gaccggcggc ggcggagcag 361 ccgctgtggg cttgcgccgg gtcatggcgt cgctccgtcc atgctcgtga cgatcgacca 421 ggcgaggtcg gcggccggcc gcacgagctt gcccgactcc gcggcccgga cgttcgaggc 481 ggtgccgtcg agtgcgcggg cgtagacgcc ctggcgaatg cacgacacgc ggaaaagatt 541 gtacgccata taaaactccc agagcgccgg ctctggcaag ccgcgctccg cgagagcgca 601 atagcgctcg accatctcgt gctcagccgg caggcccagg gcagacaggt catgaccgag 661 caggccgccc cagtccttcg gtgtgcgcca cagcatgcag aggtaggaaa gctcggccag 721 cgggtcgccg agcgtggaaa tctcccagtc gatgatgccc agcacgcgcg gttcggtcgg 781 atggaaaacc atgttgtcca agcgatagtc gccgtgcacc aggacggtct cgtcgttggc 841 cggcaagtgc gccggcagcc aatcgagcag ccggtccatg gcctcgatgc gttcggtctc 901 cgacgccttg tactgcttgc cccatcgcga gatctgacgc gcgacgtagc tcccgggtcg 961 accgaaatcg gccaagccca gagcgacata gtccacggcg tgaagacgcg ccaaggtctc 1021 gaccttggca tagtggatgg ccgctctcgc ctccctcggc agttctggca gcaacggatc 1081 ccacaggacc cggccgtcgc agtactccat gacatagaag gcggtcccca cgacggcctc 1141 gtcctcacac agggcatagg ctctgggagt cgggacgtcg gtcttgttga gggccgagat 1201 cacccggaac tcgcgatcga ccgcatgggc cgacggcagg agcttgccgg gcggcttgcg 1261 gcgcagcaca tagcgccgac cggcgccgtc ggtcaggcgg taggtcgggt tcgattgccc 1321 tcccttgaac tgctcgaccg ttacaggcgg tacaaagcct tcaatctcag tcgccatgaa 1381 tctctcaagc gaccgctcgt cgaaacggtg cttttcctgc accgccatcg 
tgccagtgta 1441 ttccgcccct cgtcgcgcca cgccccagtc cctgaacagc tgttcacttt ccggcccgga 1501 gtttgctcgc ccgtcagcgg gcatgcaaca tcgaccaaag gcgatgcgcc gggacaccgt 1561 cgaccgccct cgccggtcca ccctcagcgg cggatttgcc gcttgggcaa gacgaagaaa 1621 gtgtgtttat gtagcatcaa caaggccgcg ggtttgggcc caataacggt cgcagagaga 1681 gccaaaggac tggggaggcg tcgttgaatc gattcctgag tagggacttc ttgtccgggc 1741 tgatgttcat cggcttcggg ctgttggccc tgtatttcgg ccaaaagttg gcgctcggca 1801 cgccggtccg catgggaccg ggttatgtgc cgcgcatgct ggcctttatt ttgatgggtc 1861 tcggcgggct gatctgcgtg gtcacgctgt tcgccggtgg cgagcctgtc gaacgcccga 1921 agtggcggcc gatcaccctg gtgacgatcg gcatcgtctg ttttgccttg ttgttcgagc 1981 gcgccggcct cctgccggcc ctgatcgtcc tcgtcttcat cacctccctg gccggcgagg 2041 aattcaagct aaccgaagtg atcggcaaca tggtcgtgct cgcgatcttg tgcaccgtcg 2101 tcttcaaggt tgggctgggc atgaacatct cgatcatccg gggagtgtgg tgagatggac 2161 ctgctgaaca acctgctgct cggcctcagc gtcgcggtct ccttccagaa cctgctctac 2221 gcactcatcg gctgcatggt cggcacgctg atcggcgtgc tgccgggcat cggcccggtc 2281 gccaccatcg ccatgctgct gccgatcacc ttccacctgc cgccaaccgc ctcgctgatc 2341 atgctggccg gcatctacta cggcgcgcag tacggcggct cgacgacctc gatcctggtc 2401 aacctgccgg gcgaagcggc gtcggtcgtc acatgcctcg acggttatca gatggcgcgg 2461 cgtgggcgtg ccggcgccgc tttgtcgatc tcagccgtcg gctcgttctt cgccggcacc 2521 gtcggcacca tcatcatcgt gatctttgcc gagccgctga cgcgcatggc gcagaagttc 2581 gggcctgcgg attactgttc gttgatggct ctcggcctgg tcgccgcggt cgtgctggcc 2641 agtggttcgg tgctcaaggc gatcgccatg gtgttcctcg ggctgctgtt cggcctagtc 2701 ggcaccgacg tcaataccgg cgcgcagcgc ttcaccttcg acattcccga gctcagcgac 2761 ggcatcgact tcgcccccat cgccatgggc ctgttcggca tcgccgagat cgtcgtcaac 2821 ctcgaacgac gattggaacg gaccggcgcc atcaaggtcg gctcgctttg gccgagcgcc 2881 gaggagatac ggcgagcgat cccagccgtt ctgcgcggca cgacgctcgg ttctctgctc 2941 ggcgtgttgc cgggcggcgg cccgacgctg ggcgccttct cgtcctacac cctcgagaag 3001 aagatttcgc gcaacccgag cgagttcggc aagggcgcag tcgaaggtgt ggcagcgccc 3061 gaggcggcca acaatgccgc ggcgcagacc tcattcattc cgatgctgac gctcggcatc 
3121 ccgtccaacg cggtgatggc gctgatggtc ggcgccatga tcattcaggg tattcagccc 3181 ggtcccgaag tcatgaccaa gaagccggat ttgttctggg gcatgattgt ctcgatgtgg 3241 atcggcaacc tgatgctggt cgtcatcaac ctgccgatga tcggcatgtg ggtgaagcta 3301 ctcaccgtac cgtaccgatt cctcgcccca gccatcctgc tgttctgctg catcggcgcc 3361 tacagcctgc aaaacagcac cttccatgtg ctgcaggtcg ccgggttcgg cgtgttgggc 3421 tacatcttcg tgcggctggg ctgcgagggc gcgcccttcc tgctcggcct ggtactgggg 3481 ccgcagatgg aagagtactt ccgtcgcgcc atgctgctgt cgcgtggcga tgctatggtc 3541 ttcctgcagc ggccgatcag cctcggcctc ttgatcacga cctcgctact cctggtgctg 3601 atggccttgc ccagcatcaa gaaggcgcgt cagcaggcct tccaagaaga gggttgacgg 3661 aagaggagct gtcgcatgtc cgacctgccc aagcgtgtcc tgatcggcga ggaaggacct 3721 cgcgagggct tccagtcgga aaagaagacg atcccggtag ccgacaaggt ccgcctcatc 3781 gaagccttgg cggacaccgg gctgatgcac atcgcatgcg tgtcctacgt caatcccaag 3841 cgcgtgccga ccatggccga tgccgaggac gtggcggcgg ccatccggcg caagcccggc 3901 atccaataca gtgccctatg gctcaaccag cagggtctcg agcgcgcctt gcgtggaccg 3961 ctgcatgtcg acggcggcgt gcgtatcacg gcatcggaca ccttctcgcg caagaacatc 4021 ggcaagtcgg tgcccgattc gttggtcgag cagcgcctgt cgctcaagac gttgaaggat 4081 cgcggcatcg ccgtcgaatg gggcatcgtg ctcggcgcgt tcggctgcaa ctatgaaggc 4141 gagctatcga ccgaactgat cctgcaacgg gttcggtcgg tgctcgatga ggccgacctg 4201 gcgggcttca agctcaaagg catcaagctc accgatgcga tgggttgggc gacgccgcgc 4261 tcggtcgagc agctgatcgg cgccatccgc tcgaaatggc cggagctcga gatcgcgctt 4321 cacctccacg atacgcgcgg caccgggctg gccacggcct atgccggact gcggctcggc 4381 gtcagcaagt tcgatgcctc gatcgccggt ctgggcgggt gcccattcgc cgccaccgac 4441 ggcgccgcgg gcaacatctg caccgaggac ttcgccttca tgtgcgagga gatgggcgtg 4501 gagaccgggc tcgatatcga ccacttgatc gaggtcgcca agcttgccga agacgtggtc 4561 ggccgctcgc tgcccggcca cgtgatgcgt ggcggcacgc tcaaagccgc caaacagagg 4621 gccgccgcat gaacgtcgaa accctgatcc ccgattcgac ctatgtcgat gtcgccgcga 4681 ttccctggaa gccgacgcgc tttccaggcg tcgaagccaa gacattgatg gagaatccgg 4741 ccaccggcat gtcgaccgtg ttgatgcgct gggcgccggg cgcgcgtctg ccgcggcacg 4801 
agcatgtcga gatcgagcag actttcgtcc tcgaaggttc gttcgccgac catgccggcg 4861 tttgccgagc cggcaactat gtgtggcgcc gcgccggcag tcggcacgac gcctggaccg 4921 atgaaggctg cctgatgctg gctttcttcc tgaagcccaa tacgttcttc gattgaacgt 4981 cgccaccagg cttggcaaca tcgccacgcg ttcagcgtcg cgaacaaggc gctgcgcgac 5041 gggaggattc atatgatcag acgcgccgtg ttgacgggcg ccgttgtcgc gtggttcgcg 5101 tacggtactg ctttcgcgca agcgttgccc aaggagatgt cgtggacggc ctacgacacc 5161 ggttcctcgg gcttcaatat cgccgtcgcg atcggtcagc aattcaagca gggttaccag 5221 tccgacgttc gcatcctgcc gtcgggcaac gacacgggtc ggctggcgcc aatcaaggcc 5281 aaccgtgctg tcgtgtcgca gatgggcatt ggcacctact tcgcccaaga aggggtcttc 5341 gagttcggca cgcgcagttg gggaccacag gccagccggc tgatcatggc tgccaccgcg 5401 cgcaatgggt tgggacttgc cgtcgccaag gacaccggcg tgaaggaggt caaggacctc 5461 aagggcaagc ggctcggtgt cgtggtcggc tcgccgcccc tgactcaggg agctctggcg 5521 ctcatcgcct tcggcggcct cggcgagaag aacatgatcg tcgtgcagtt ctcatccaac 5581 aacgccatgt ggaaaggcgt catcaacagc gaagccgatg ccgtcctgac atcgacgatc 5641 tccggccagt ccaaggaggc ggactcctcg ccgcgcggca tcgtcttccc gccgatgccg 5701 gcattcgaca aggaaggctg ggcacgtgtc cacaagaagg cgccctactt cgtgccggtc 5761 aaagcgacct gtggcgcggg cggtctgtcg cccgagaatc cggtcgagat ggcaggttac 5821 gcctatccga tcttcatgac tttcgcggac cgtccggcgg acacgatcta cgccatcacc 5881 aatgtcatga ttgacgccta cgacaagtac aaggacggcg cacccggcgc cgaaggcatg 5941 gcgctcaagc tgcaaaatct cacctgggtc gtgccctatc atgacagcac catccgcgcc 6001 ttcaaagaga agggcgtatg gaccgacgcg gctcaaaagc acaacgacgc gctactgaaa 6061 cggcagcaga tcttggcgga cgcctggaag gcctacggca tcaacgcgcc ggctgacgac 6121 aaggccttcg ccgacggctg gcggaaagcc cgagccgacg ctctggcgaa agcaggattg 6181 gacgtgatct tcgactagtc actaccgggc gacacagcat gacagatcaa gcgacgcggg 6241 tcgtgttcga cgatccgcac caggcggccc cggccgaagg cgagacgatg cgtgtacgcg 6301 ccctgtccgc gttctggcgc tgggcgctcg tcgcggcgac agccgtcacg atcttccttt 6361 gcgtcaatca gcagtttgcg ctgcgcttct tcgtcggttt cacgcaactc aacaccgagt 6421 acttctatct cctgatcctg tgcatgctgc cctttacctt cctgatcttc ccgggaagcc 6481 cgagggcgtc 
gttgacccgc atgccgtggt acgatgtcgt gcttttcgtc gccactgctg 6541 cggcatccct gcatttgatg ctgcacattc gcgaggccgc cgagctcggc tgggagttcg 6601 gcgacccgcc caagtcgatc atttgggcgg gctacgtgat gtggctcgtc ctgctggagg 6661 cgctgcgccg aaccggcggc tggagcctga tgctctgcgt cctgcccttc accgtctacc 6721 cgatgttcgc aggcgcatca tggttgggcc cgctcaaggg //