LOCUS       QJW38727.1  1075 aa  PRT  BCT  29-DEC-2022
DEFINITION  Cellulosimicrobium protaetiae family 43 glycosylhydrolase protein.
ACCESSION   CP052758-70
PROTEIN_ID  QJW38727.1
SOURCE      Cellulosimicrobium protaetiae
  ORGANISM  Cellulosimicrobium protaetiae
            Bacteria; Actinobacteria; Micrococcales; Promicromonosporaceae;
            Cellulosimicrobium.
REFERENCE   1  (bases 1 to 152586)
  AUTHORS   Le Han,H., Nguyen,T.T.H., Li,Z., Shin,N.R. and Kim,S.G.
  TITLE     Cellulosimicrobium protaetiae sp. nov., isolated from the gut of
            the larva of Protaetia brevitarsis seulensis
  JOURNAL   Int J Syst Evol Microbiol 72 (3) (2022)
  PUBMED    35348452
REFERENCE   2  (bases 1 to 152586)
  AUTHORS   Le Ho,H. and Kim,S.-G.
  TITLE     Direct Submission
  JOURNAL   Submitted (06-NOV-2019) Korean Collection for Type Cultures
            (KCTC), Korea Research Institute of Bioscience & Biotechnology
            (KRIBB), 181 Ipsin-gil, Jeongeup-si, Jeollabuk-do 56212,
            Republic of Korea
REFERENCE   3  (bases 1 to 152586)
  AUTHORS   Ho,H. and Kim,S.-G.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-APR-2020) Korean Collection for Type Cultures
            (KCTC), Korea Research Institute of Bioscience & Biotechnology
            (KRIBB), 181 Ipsin-gil, Jeongeup-si, Jeollabuk-do 56212,
            Korea, Republic of
COMMENT     The annotation was added by the NCBI Prokaryotic Genome
            Annotation Pipeline (PGAP). Information about PGAP can be found
            here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/

            ##Genome-Assembly-Data-START##
            Assembly Date          :: JUL-2019
            Assembly Method        :: HGAP v. 3.0
            Genome Representation  :: Full
            Expected Final Version :: Yes
            Genome Coverage        :: 178.0x
            Sequencing Technology  :: PacBio RSII; Illumina HiSeq
            ##Genome-Assembly-Data-END##

            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 04/28/2020 06:45:21
            Annotation Pipeline               :: NCBI Prokaryotic Genome Annotation Pipeline (PGAP)
            Annotation Method                 :: Best-placed reference protein set; GeneMarkS-2+
            Annotation Software revision      :: 4.11
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total)                     :: 4,140
            CDSs (total)                      :: 4,077
            Genes (coding)                    :: 4,001
            CDSs (with protein)               :: 4,001
            Genes (RNA)                       :: 63
            rRNAs                             :: 3, 3, 3 (5S, 16S, 23S)
            complete rRNAs                    :: 3, 3, 3 (5S, 16S, 23S)
            tRNAs                             :: 51
            ncRNAs                            :: 3
            Pseudo Genes (total)              :: 76
            CDSs (without protein)            :: 76
            Pseudo Genes (ambiguous residues) :: 0 of 76
            Pseudo Genes (frameshifted)       :: 11 of 76
            Pseudo Genes (incomplete)         :: 66 of 76
            Pseudo Genes (internal stop)      :: 2 of 76
            Pseudo Genes (multiple problems)  :: 3 of 76
            CRISPR Arrays                     :: 1
            ##Genome-Annotation-Data-END##
FEATURES             Qualifiers
     source          /organism="Cellulosimicrobium protaetiae"
                     /mol_type="genomic DNA"
                     /strain="BI34"
                     /isolation_source="intestine from larvae"
                     /host="wax moth"
                     /type_material="type strain of Cellulosimicrobium protaetiae"
                     /db_xref="taxon:2587808"
                     /plasmid="pCPRO01"
                     /country="South Korea: Jeongeup"
                     /collection_date="2019-04"
     protein         /locus_tag="FIC82_020290"
                     /inference="COORDINATES: protein motif:HMM:NF016497.1,HMM:NF019170.1,HMM:NF024777.1"
                     /note="Derived by automated computational analysis using gene prediction method: Protein Homology."
                     /transl_table=11
BEGIN
        1 MKLLRTAVAG LAGAALLGAG LAGPSAADET SDEGLLANFT FEEATGTVDG GAATAAVHGT
       61 VNLVDGPDGK AARLGRDFWL DVTNRDGSPV LAGHDAVTIS YDSKPDSAGN TGWTVFAASG
      121 PGTQTYGSEH YLGYLDRTTG LTVERYANTG GRDSTGNLTS TRQNDGWKHV DLVLDGTEAR
      181 LYVDQRLVTM NRAGKPLTEI LGDAGGILQV GRANWGAGEY YSGLLDNLRV YDRALTSAEL
      241 GVPPAPVDLA AAIAIPSFVT DDLPAEVLGR EVTWSATGEW ADLVAPDGTI THPETGSAHA
      301 TVTAHVDGLD DPITGEIEIL AVGGAVASYV KTVTTTSGVK DDPLAYNDDR RADAWYVAAR
      361 ARGAEAWEPL NRSQAILYAA WAGDQGAQPN AQLGSPTPLR FADGSLGAVA AQNNATDSIY
      421 VWDSPDGATF TNQRTVRVSA DGSPVSDPRI VHDAAGGVYK VFWNDPLDGG GRVATLADLD
      481 ADAVPSQASQ ADPRQMGVTG PGLPAFAAQD EASTFSLTPQ EYDTFVTNYV DLRNTGVVEL
      541 DDVAVDSGQD VTAADLPATA RMTYTDGSTK DLPVRWDQEQ LEALDSSTSG EYVIEGAVQQ
      601 TTEAMVNDAR ADPHLFFNDD DGYWYLTGSH YSIPSDAPDS QIHDRNAYRK IGLKRARTIE
      661 GLSDATEQIV IDPDDGTVGH EDQYPNTFYG WGGYIWAQEF HKINGTWWIV AGMNRGYAPT
      721 GGWCDNTVMI PYTGDEASFA NGGFLKEENW GQPVVLEGAA FDVSYLEREE DGVTQGYWVM
      781 PRNAELWVAK ATMGETGTVP LVDGAMKRIY AIHQPWEYGK SAPTPSDTNE GRDQGIVEAP
      841 FMIEHGDHVY LTYSGGTVDK YYDLGLLRAP KDADLQDPNS WTAVDFPVLD ANDTADGQIG
      901 GEAHAGTGHN SFAIDDTGNL VLAYHARPYP EQHGGNGAGG LFDPDRNSWF KAVNVRANGM
      961 LDLSLSSEQE VAPENRTVRV TVVVRAAAPV VTVTATTRCV AGKNVLAVTT SNDGDGPVGL
     1021 TVRTPFGFRE VTVPGGRSTT TSFTTRQVSV DAGEVLVSAD GGDSTSTTYP AASCG
//