LOCUS QJW38766.1 1129 aa PRT BCT 29-DEC-2022 DEFINITION Cellulosimicrobium protaetiae LysM peptidoglycan-binding domain-containing protein. ACCESSION CP052758-111 PROTEIN_ID QJW38766.1 SOURCE Cellulosimicrobium protaetiae ORGANISM Cellulosimicrobium protaetiae Bacteria; Actinobacteria; Micrococcales; Promicromonosporaceae; Cellulosimicrobium. REFERENCE 1 (bases 1 to 152586) AUTHORS Le Han,H., Nguyen,T.T.H., Li,Z., Shin,N.R. and Kim,S.G. TITLE Cellulosimicrobium protaetiae sp. nov., isolated from the gut of the larva of Protaetia brevitarsis seulensis JOURNAL Int J Syst Evol Microbiol 72 (3) (2022) PUBMED 35348452 REFERENCE 2 (bases 1 to 152586) AUTHORS Le Ho,H. and Kim,S.-G. TITLE Direct Submission JOURNAL Submitted (06-NOV-2019) Korean Collection for Type Cultures (KCTC), Korea Research Institute of Bioscience & Biotechnology (KRIBB), 181 Ipsin-gil, Jeongeup-si, Jeollabuk-do 56212, Republic of Korea REFERENCE 3 (bases 1 to 152586) AUTHORS Ho,H. and Kim,S.-G. TITLE Direct Submission JOURNAL Submitted (17-APR-2020) Korean Collection for Type Cultures (KCTC), Korea Research Institute of Bioscience & Biotechnology (KRIBB), 181 Ipsin-gil, Jeongeup-si, Jeollabuk-do 56212, Korea, Republic of COMMENT The annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP). Information about PGAP can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Date :: JUL-2019 Assembly Method :: HGAP v. 
3.0 Genome Representation :: Full Expected Final Version :: Yes Genome Coverage :: 178.0x Sequencing Technology :: PacBio RSII; Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 04/28/2020 06:45:21 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline (PGAP) Annotation Method :: Best-placed reference protein set; GeneMarkS-2+ Annotation Software revision :: 4.11 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 4,140 CDSs (total) :: 4,077 Genes (coding) :: 4,001 CDSs (with protein) :: 4,001 Genes (RNA) :: 63 rRNAs :: 3, 3, 3 (5S, 16S, 23S) complete rRNAs :: 3, 3, 3 (5S, 16S, 23S) tRNAs :: 51 ncRNAs :: 3 Pseudo Genes (total) :: 76 CDSs (without protein) :: 76 Pseudo Genes (ambiguous residues) :: 0 of 76 Pseudo Genes (frameshifted) :: 11 of 76 Pseudo Genes (incomplete) :: 66 of 76 Pseudo Genes (internal stop) :: 2 of 76 Pseudo Genes (multiple problems) :: 3 of 76 CRISPR Arrays :: 1 ##Genome-Annotation-Data-END## FEATURES Qualifiers source /organism="Cellulosimicrobium protaetiae" /mol_type="genomic DNA" /strain="BI34" /isolation_source="intestine from larvae" /host="wax moth" /type_material="type strain of Cellulosimicrobium protaetiae" /db_xref="taxon:2587808" /plasmid="pCPRO01" /country="South Korea: Jeongeup" /collection_date="2019-04" protein /locus_tag="FIC82_020495" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012880165.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." 
/transl_table=11 BEGIN 1 MSTLRRRLAG LVALLGILAI VVGLPVVLLA VGANPFAGLD PSVDGVIDAL TSPDDGTLFL 61 ALIKLVGWVS WAVLTVSVLL ELGSQLRGVR APRLPGLRLP QNAARSLVGT AMLLFVTVPG 121 LATTATAATA AEPAPPIEVV AVVSAPAEAT PAPAAVDVAP PAATPAPAPA LPAPATTVVA 181 HTVQPGESLW SIAADLLGDG HRFNEIVEQN EAVLGGQASL IRPGWVLQVT VPAAAPDTAQ 241 DTTASESVVV ERGDTLSGIA QAELGDAQRY PEIYEASKDI EQPGGVHLTD PNVIDVGWTL 301 QLPTETPAAT PTTAPVQTPA PDPVEEAAPA QPPAPETPAV DAAPAPDVAP APHEAPAAES 361 APAEEAAPAP APSPPAPDTA STDAGDHLEE DSAWTTRTTF GVGAVLAAGV LALIEARRRT 421 QQRRRRPGQA LPMATGDAAA TEQELRATAD ALSVEHVDVA LRTLAATCAR TGQPLPVTRA 481 ARLTATQFDL YLSEPATLPA PWVGTSDELV WTLTVEDCEN LAVDDAARAT PAPYPALVTL 541 GHDEENGHVL LNLEHLGSLA ITGDDPTTRE ILGALSVELA TSIWADDLQV TLVGAFPDLE 601 DALQTGRIRY LPSVSRVLDD LLTRAEQDRK VLADSGVADL YSARVTGDAS DAWAPEIVVI 661 ASDLTDRQRH QLTELVDQMP HVALAAITNG HGAGEWSLDL VPGATDGESS LAPIGLRVWA 721 QQLPTAQYGH LLEVVAMADV DELDDTQIAT FVPTVTEVES IAPDDRPSDP VSAIPEAILE 781 LLETPAADAP TPADEEEDED AEREGQDDDE HSEHDVDERV DERVDERVDE FEPATSAGAE 841 PDAAAPVVAV AASADAAPQP SGAAPEPEIS EPEVAGRPQV LVLGPVDITG STGRVEPSKR 901 ARLLEYATYL ALNAGVSHTA IDDAIWPDRK TEDNLNTRNT ATTKLRRWVG RTPDGEDYLP 961 RHQAGGGYGF LPEVTTDVDA WDELLNHAPA SASTDDLEAA LKLVRGIPFE GTHRKRYAWA 1021 EPLKQRLISE IVDASSELAR RRLLEGRWRA AEQAVVVGLR IEPAQENLWR LRILAAHESR 1081 NQAAEAEAID RLLTITEHLE CDLEPETEHL LAALKNPGAD FDRLMADAL //