LOCUS VBNE01000150 604 bp DNA linear ENV 28-MAY-2019 DEFINITION Gammaproteobacteria bacterium isolate GP_13 14_1009_12_20cm_scaffold_7722_e:6670, whole genome shotgun sequence. ACCESSION VBNE01000150 VBNE01000000 VERSION VBNE01000150.1 DBLINK BioProject: PRJNA449266 BioSample: SAMN11380622 KEYWORDS WGS. SOURCE Gammaproteobacteria bacterium (soil metagenome) ORGANISM Gammaproteobacteria bacterium Bacteria; Proteobacteria; Gammaproteobacteria. REFERENCE 1 (bases 1 to 604) AUTHORS Diamond,S., Andeer,P.F., Li,Z., Crits-Christoph,A., Burstein,D., Anantharaman,K., Lane,K.R., Thomas,B.C., Pan,C., Northen,T.R. and Banfield,J.F. TITLE Mediterranean grassland soil C-N compound turnover is dependent on rainfall and depth, and is mediated by genomically divergent microorganisms JOURNAL Nat Microbiol (2019) In press PUBMED 31110364 REMARK Publication Status: Available-Online prior to print REFERENCE 2 (bases 1 to 604) AUTHORS Diamond,S. and Banfield,J.F. TITLE Direct Submission JOURNAL Submitted (01-MAY-2019) Earth and Planetary Science, Jill Banfield's Lab at Berkeley, University of California, Berkeley, CA 94720, USA COMMENT The annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP). Information about PGAP can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: IDBA_UD v. 1.1.1 Genome Representation :: Full Expected Final Version :: Yes Genome Coverage :: 17x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 05/16/2019 13:12:02 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline (PGAP) Annotation Method :: Best-placed reference protein set; GeneMarkS-2+ Annotation Software revision :: 4.8 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 4,034 CDSs (total) :: 3,993 Genes (coding) :: 3,887 CDSs (with protein) :: 3,887 Genes (RNA) :: 41 rRNAs :: 1 (16S) partial rRNAs :: 1 (16S) tRNAs :: 36 ncRNAs :: 4 Pseudo Genes (total) :: 106 CDSs (without protein) :: 106 Pseudo Genes (ambiguous residues) :: 21 of 106 Pseudo Genes (frameshifted) :: 43 of 106 Pseudo Genes (incomplete) :: 51 of 106 Pseudo Genes (internal stop) :: 3 of 106 Pseudo Genes (multiple problems) :: 12 of 106 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..604 /organism="Gammaproteobacteria bacterium" /mol_type="genomic DNA" /isolate="GP_13" /isolation_source="temperate grassland biome" /db_xref="taxon:1913989" /environmental_sample /geo_loc_name="USA: Angelo Coast Range Reserve, CA" /lat_lon="39.74 N 123.63 W" /collection_date="2014-09-03" /metagenome_source="soil metagenome" /note="metagenomic" gene 76..>604 /locus_tag="E6K32_06155" CDS 76..>604 /locus_tag="E6K32_06155" /inference="COORDINATES: protein motif:HMM:PF07969.9" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="amidohydrolase" /protein_id="TLZ44028.1" /translation="MMNRRPAPSEPCPWGLLALMFLSWAARVCSGAESIVLVSGNVYT ADDRNSRAQAVVAAHGRILYVGANADALRRAPPGARRMDMHGLTILPGLTDSHAHLAG IGFRELSFNLEGTASVADLKDRLRERAKQGTPGEWLTGRGWIESRWTPPTFPGRTDLD EIASDRPVFLERADGH" BASE COUNT 100 a 195 c 198 g 111 t ORIGIN 1 accgctcgag ctacacttca ggctcgatca tttccggccc gtgcttaccg gagacgccgt 61 cgccatggcg ttaagatgat gaatagacgg cctgcaccat ctgagccgtg cccatggggc 121 ttactggccc tgatgtttct gtcctgggcg gctcgcgtct gcagcggtgc cgagtccatc 181 gtgcttgtca gtggcaacgt gtatacggcc gatgaccgta actcacgcgc tcaggccgtc 241 gtcgcagcgc acggtcgcat tctgtacgtc ggcgccaacg ccgatgcgtt gcgacgagcg 301 ccaccaggcg cgcgccgtat ggatatgcac ggcctcacca tcctacccgg gcttacggac 361 tcacacgccc acttggccgg tatcgggttc cgggagttga gcttcaatct ggaaggcacc 421 gcgagcgttg cagacctgaa ggaccggctt cgcgagcgcg caaagcaggg gacgcccggc 481 gagtggctca cggggcgagg atggatcgag tcgcgctgga cgcctccgac ctttccgggc 541 cgcaccgacc tggatgagat cgccagcgat cggccggtgt ttctcgagcg tgcggacggg 601 cacg //