LOCUS UJP39274.1 1988 aa PRT BCT 25-JAN-2022 DEFINITION Cellulomonas palmilytica DUF4157 domain-containing protein protein. ACCESSION CP062221-2581 PROTEIN_ID UJP39274.1 SOURCE Cellulomonas palmilytica ORGANISM Cellulomonas palmilytica Bacteria; Actinobacteria; Micrococcales; Cellulomonadaceae; Cellulomonas. REFERENCE 1 (bases 1 to 3834012) AUTHORS Siriatcharanon,A.-k., Sutheeworapong,S., Pason,P., Waeonukul,R., Kosugi,A., Ratanakhanokchai,K. and Tachaapaikoon,C. TITLE Cellulomonas coenopalmateriei, sp. nov., a novel species of genus Cellulomonas for degradation of raw lignocellulose biomass; oil palm empty fruit bunch JOURNAL Unpublished REFERENCE 2 (bases 1 to 3834012) AUTHORS Siriatcharanon,A.-k., Sutheeworapong,S., Pason,P., Waeonukul,R., Kosugi,A., Ratanakhanokchai,K. and Tachaapaikoon,C. TITLE Direct Submission JOURNAL Submitted (16-SEP-2020) Pilot Plant Development and Training Institute, King Mongkut's University of Technology Thonburi, Bangkuntien-Chaitalay, Bangkok 10150, Thailand COMMENT Bacteria and source DNA available from Enzyme Technology Laboratory, Pilot Plant Development and Training Institute, King Mongkut's University of Technology Thonburi. The annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP). Information about PGAP can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Date :: 19-MAY-2020 Assembly Method :: unicycler v. v0.4.8 Assembly Name :: EW123_hybrid Genome Representation :: Full Expected Final Version :: Yes Genome Coverage :: 10.0x Sequencing Technology :: Oxford Nanopore MinION; Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 09/30/2020 19:03:08 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline (PGAP) Annotation Method :: Best-placed reference protein set; GeneMarkS-2+ Annotation Software revision :: 4.13 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 3,454 CDSs (total) :: 3,400 Genes (coding) :: 3,351 CDSs (with protein) :: 3,351 Genes (RNA) :: 54 rRNAs :: 2, 2, 2 (5S, 16S, 23S) complete rRNAs :: 2, 2, 2 (5S, 16S, 23S) tRNAs :: 45 ncRNAs :: 3 Pseudo Genes (total) :: 49 CDSs (without protein) :: 49 Pseudo Genes (ambiguous residues) :: 0 of 49 Pseudo Genes (frameshifted) :: 8 of 49 Pseudo Genes (incomplete) :: 41 of 49 Pseudo Genes (internal stop) :: 2 of 49 Pseudo Genes (multiple problems) :: 2 of 49 ##Genome-Annotation-Data-END## FEATURES Qualifiers source /organism="Cellulomonas palmilytica" /mol_type="genomic DNA" /strain="EW123" /isolation_source="Earthworm bio-fertilizer soil" /type_material="type strain of Cellulomonas palmilytica" /db_xref="taxon:2608402" /country="Thailand: Bangkok" /lat_lon="13.6771 N 100.4591 E" /collection_date="Nov-2015" /collected_by="Enzyme Technology Laboratory, KMUTT" protein /locus_tag="F1D97_13115" /inference="COORDINATES: protein motif:HMM:NF025078.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /transl_table=11 BEGIN 1 MTTTDQRLDG RAATDRVTAV DLGDAGVLEV ERDAESGVLV ALRGGGRDYL TVGHDGTRAW 61 HGVCTTSERL GEGTLERRVR PTTGEAWTER YRWSGDDLVH VDGVEVRRDA SGRVVACLPG 121 GPDPAPAEHR WFYTHDERGL VEVRGPGRER RVVLGPDGRA REVHDDSGAV VLPHDADGRR 181 LTAVRTAGDT IDDEGRTWVT RTPDGRVSRV FLWDGMRCLA RVDGPLGSPL AAVLSLDPSG 241 TPVRVVTPGG VRRVPRDAYG EGLLDVPDVP GLFGGRVHAG VVHLPLRRLD PRTGTFCEPD 301 PWHGGDDDPR RPAGYTGPVP VEKEPRSAYE VCRGDPVGRS DPTGGVSAGL VLSTLTWSFQ 361 NNVMTFFGID WWFNLFLSLL AAPFAGDKYD FFSSTGLSST DRLGSFGVRR DGFMNVITGG 421 RAFTTQHIVW SPDGEFADLQ RGEVVDPRGA YEPTHYGTVL SLAPTGAAVS FLACGPTTRM 481 PGLPGNLTTW SRHGGLGVPA APGTLTPWFP SGGLHLDTSR EDTRHDVEAT LTELQPGPVG 541 VGDFEQRSVL TSTAATGLAA GARVLVDDGT ALAIATVVGV VAVPGGERVQ LAEELTVTGT 601 TLRVTPLEDA PASSETRPAG APAASLDARG TTTTYAPADL VRVTATSGQV LVARVAHLEA 661 RLPLERPLPA SLAGPIAVAT GTAGPTVPVT ATGTTLDFGT ATPPGQGTTG LLVGGTTTGV 721 RVETPPTGST TTVDLAAPAG TTGFQTVTAS TVLGSRTDAA EADPALTYTP LTAGSAPDGS 781 AGLVLVRVES AGTAHARVVP GAPAHDVVVL DRPLVGTGPF TVERWRTRGA ALTSLTLAQV 841 LSVVVPSPER FEGMPLLLTR VAGEPPTVTA ALTGVDVTDG TTTLATPTAA AGLRAGFPVR 901 VGTEHTAVRD LRCDVTFAPP VDLGGGDLRL VRLEPTGFAY DAVVAAADAI DVRPTVTVGG 961 TPVAAPFLRV RPGDLVEVTS GGTTTWHRVT AASAGRLTVT AAATALTPGA TATVRQAAVD 1021 DPDTGCPFLG IRGARTGTGP TTTATFSLWR SDDLPAGTAA LGIVDGDVTH PVQQSAAAVV 1081 RSVTFSTSFA ASAVDVSVFT HVTTAVLASV TRDGGVLLAE TPPGAGTITT APGQSIVAVA 1141 LEAVGDARTV TLGPGTLLVP DEETTEIDRG QSLTNHELTH TVQYARWGPL WFCAFPMIAL 1201 ELPAILTSDT ELPEFSAFLD ATVAAGADAL WDVTIPQRAG VSIAKDDTLQ VVQGSRIVEV 1261 EVRSVAGDVA RVRVASGTLP TGRVAVRKKQ RSAGWDVSIA ILDLMTHGGL VNLLAGSTWG 1321 GIFWLVGKAF YGLGRAIGGT GDLYAGTVTV GGTTITLAQA ADAEKIPVTG RVTIRRGEDT 1381 VIRSATRAGQ TLSLGEAITF TGDVRVAAYD SHEPGSAFDW YDYRAGTVDA ANHFAVDLGA 1441 DHGLDPEDRV VVRYRSGRPF RTDVLAVAGA RVELSERVPV TGGELSVRIA RVGASDPLGN 1501 ADSAAMVEMG MGWMKWLFDP FGQIEPAVAP GDWTRWLLRV VRWLLGTQNF SLLPFGYVWW 1561 GRLFGIQKEH EAPIEQEASS ESGDVYSPLG RLTGEVVHSD GAIADARATV GDVMRYRYTP 1621 GSRFRSFVVA GRLDGPGVHL SRTLRVMPTR SSTGAAADPN GTVRSDAGTA DPGRFVDPTL 1681 TDHDGDTDPR MLPRNGGGAG DALGFRASAL GSVPVSARVQ RNESIYAAFT RPGDHRVTTL 1741 NGIAGATEAV EAHAKGLQTL WFDLTVADVT VTAAGRTLDA SSPGTSDRLV LVPSQSVDVT 1801 TTPATPRVYR VTALDPTSSV TVSDARLTAV AATTAPVPVE VSRYYAATDG RYTGGLAYAG 1861 MHLSRDLDVP VRTFTVEVVT TLPLRAAARA DATEVTTLAR GTEAFLLVPA NVTVPPAVTS 1921 IGGALPSAGT PDPATRVDAP DAAAFLGATG SAWRVLFPAS ATPGDYELTV TVGDGATPHP 1981 LTCRVTVT //