LOCUS       UJP40881.1              1308 aa            PRT       BCT 25-JAN-2022
DEFINITION  Cellulomonas palmilytica S8 family serine peptidase protein.
ACCESSION   CP062221-1031
PROTEIN_ID  UJP40881.1
SOURCE      Cellulomonas palmilytica
  ORGANISM  Cellulomonas palmilytica
            Bacteria; Actinobacteria; Micrococcales; Cellulomonadaceae;
            Cellulomonas.
REFERENCE   1  (bases 1 to 3834012)
  AUTHORS   Siriatcharanon,A.-k., Sutheeworapong,S., Pason,P., Waeonukul,R.,
            Kosugi,A., Ratanakhanokchai,K. and Tachaapaikoon,C.
  TITLE     Cellulomonas coenopalmateriei, sp. nov., a novel species of genus
            Cellulomonas for degradation of raw lignocellulose biomass; oil
            palm empty fruit bunch
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 3834012)
  AUTHORS   Siriatcharanon,A.-k., Sutheeworapong,S., Pason,P., Waeonukul,R.,
            Kosugi,A., Ratanakhanokchai,K. and Tachaapaikoon,C.
  TITLE     Direct Submission
  JOURNAL   Submitted (16-SEP-2020) Pilot Plant Development and Training
            Institute, King Mongkut's University of Technology Thonburi,
            Bangkuntien-Chaitalay, Bangkok 10150, Thailand
COMMENT     Bacteria and source DNA available from Enzyme Technology
            Laboratory, Pilot Plant Development and Training Institute, King
            Mongkut's University of Technology Thonburi.
            The annotation was added by the NCBI Prokaryotic Genome
            Annotation Pipeline (PGAP). Information about PGAP can be found
            here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/

            ##Genome-Assembly-Data-START##
            Assembly Date          :: 19-MAY-2020
            Assembly Method        :: unicycler v. v0.4.8
            Assembly Name          :: EW123_hybrid
            Genome Representation  :: Full
            Expected Final Version :: Yes
            Genome Coverage        :: 10.0x
            Sequencing Technology  :: Oxford Nanopore MinION; Illumina HiSeq
            ##Genome-Assembly-Data-END##

            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 09/30/2020 19:03:08
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline (PGAP)
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS-2+
            Annotation Software revision      :: 4.13
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 3,454
            CDSs (total)                      :: 3,400
            Genes (coding)                    :: 3,351
            CDSs (with protein)               :: 3,351
            Genes (RNA)                       :: 54
            rRNAs                             :: 2, 2, 2 (5S, 16S, 23S)
            complete rRNAs                    :: 2, 2, 2 (5S, 16S, 23S)
            tRNAs                             :: 45
            ncRNAs                            :: 3
            Pseudo Genes (total)              :: 49
            CDSs (without protein)            :: 49
            Pseudo Genes (ambiguous residues) :: 0 of 49
            Pseudo Genes (frameshifted)       :: 8 of 49
            Pseudo Genes (incomplete)         :: 41 of 49
            Pseudo Genes (internal stop)      :: 2 of 49
            Pseudo Genes (multiple problems)  :: 2 of 49
            ##Genome-Annotation-Data-END##
FEATURES             Qualifiers
     source          /organism="Cellulomonas palmilytica"
                     /mol_type="genomic DNA"
                     /strain="EW123"
                     /isolation_source="Earthworm bio-fertilizer soil"
                     /type_material="type strain of Cellulomonas palmilytica"
                     /db_xref="taxon:2608402"
                     /country="Thailand: Bangkok"
                     /lat_lon="13.6771 N 100.4591 E"
                     /collection_date="Nov-2015"
                     /collected_by="Enzyme Technology Laboratory, KMUTT"
     protein         /locus_tag="F1D97_05235"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_013769708.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /transl_table=11
BEGIN
        1 MSRPPARRST LAAATALSVA LTTGVLATSA TAAPPGPPSP ADHRIDPSRV DRTLAEARKA
       61 VGITGATGQI TALVTLDTPA GVDVAAQGKD AVQDAAAQTE AVAEAVVPTE LTTKNARSAT
      121 PKRVGTLTNL VAGTLVSGDA AKIRALASKD EVTAIYRVME KRPTNSSTVA FTKALQAWQD
      181 TGQTGAGISL AVIDTGLDYT HASFGGAGTV EAYEEAYGED GTQPVDPAWF DAAKFAGGYD
      241 FAGPLYDASL DEPGSTDVPT PDENPIDALS TSSNSGHGTH VAGTAAGYGV DATGATFRGD
      301 YTSLTDVSDW EVGPGSAPGA LLWSFKVFGD IGGSTDLTSL ALDRAADPNG DGDLSDRVDV
      361 VNMSLGGDGA PADDPDSLLV TELTKLGVLV TNSAGNAGDI TDIGGAPGNS PSALTVANSV
      421 ATPALDAVKV TAASDDALEG DLLPAQNSVA YGGPDVEAPV AYVGATFDGC TAFTAAQAAA
      481 VAGKIAYLWW DDDDTSRRCG SAARFNNATA AGAVGVVLPT ELTVFSAGIA GNATIPGAQL
      541 TKQSTDTLLP EIQAGTLTLE IGPGLALSSR LDGAQDLLND GSSRGVHGSL GVVKPDVAAP
      601 GTGILSAASG GGTAGHVLSG TSMAAPHVAG IAALVRAAHP RWSPTEVKAA IVNTATHDVT
      661 TEPDGGGLAY GPERVGAGRV DALAAVETDV VAYNAKNPAL TSVTFGVVDV GPTKVTRTAT
      721 VTVRNFGRTT QSYDASFAAA STTGGATVSV SPKRVTVPRG GKATVTLTLS VDPATLERDI
      781 DPTSSLEQGG LPREYVAALT GRLVLDSRSG DDELRVPVQA APRIVSDLKA KDVTFTGSAD
      841 TAGLALRGRG VASGGWQSLT TPLILGATSP RLPNEATLGT SRSAVRSGDL RYVGWSSTAP
      901 YVEALGADPQ DDGLLNVGIA TEGNWASLGL AVYPVIDIDV DGDGTFDLES IVWKLDEAID
      961 LTVVTTYDLN APASAPAVDI EPINYEFGDV DTGVFDSNVL VAPISLGATG IAPGDTPTIS
     1021 VWTYSPYGGD DSVVDEADVF TTDPYDPPFW FENDRASLVS TDGADGVSIP VHRSADATSG
     1081 DLLVFQHHNR DGKRVQVVDV TVPTSTTTTL AVSGGTSYGS KARLTATVAP ATARGSVAFR
     1141 DGTKLLAVQK VHRGKATASV ALGIGEHSLT ASFVPDRGSS YLASTSAPVS LTVGKSATTT
     1201 SVKVDGFGGA SRSATLPSGG KGGPSGPGGP LTAVVTVVGA TAAPSGTVTL SEGGTKLGTA
     1261 KLSARGLTGT AKVTVRDLAP GKHTLTATYP GSATTEASTG AVTVTVRR
//