LOCUS       QJW38600.1   1197 aa   PRT   BCT   29-DEC-2022
DEFINITION  Cellulosimicrobium protaetiae hypothetical protein.
ACCESSION   CP052757-3491
PROTEIN_ID  QJW38600.1
SOURCE      Cellulosimicrobium protaetiae
  ORGANISM  Cellulosimicrobium protaetiae
            Bacteria; Actinobacteria; Micrococcales; Promicromonosporaceae;
            Cellulosimicrobium.
REFERENCE   1  (bases 1 to 4631595)
  AUTHORS   Le Han,H., Nguyen,T.T.H., Li,Z., Shin,N.R. and Kim,S.G.
  TITLE     Cellulosimicrobium protaetiae sp. nov., isolated from the gut of
            the larva of Protaetia brevitarsis seulensis
  JOURNAL   Int J Syst Evol Microbiol 72 (3) (2022)
   PUBMED   35348452
REFERENCE   2  (bases 1 to 4631595)
  AUTHORS   Le Ho,H. and Kim,S.-G.
  TITLE     Direct Submission
  JOURNAL   Submitted (06-NOV-2019) Korean Collection for Type Cultures
            (KCTC), Korea Research Institute of Bioscience & Biotechnology
            (KRIBB), 181 Ipsin-gil, Jeongeup-si, Jeollabuk-do 56212,
            Republic of Korea
REFERENCE   3  (bases 1 to 4631595)
  AUTHORS   Ho,H. and Kim,S.-G.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-APR-2020) Korean Collection for Type Cultures
            (KCTC), Korea Research Institute of Bioscience & Biotechnology
            (KRIBB), 181 Ipsin-gil, Jeongeup-si, Jeollabuk-do 56212, Korea,
            Republic of
COMMENT     The annotation was added by the NCBI Prokaryotic Genome
            Annotation Pipeline (PGAP). Information about PGAP can be found
            here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/

            ##Genome-Assembly-Data-START##
            Assembly Date          :: JUL-2019
            Assembly Method        :: HGAP v. 3.0
            Genome Representation  :: Full
            Expected Final Version :: Yes
            Genome Coverage        :: 178.0x
            Sequencing Technology  :: PacBio RSII; Illumina HiSeq
            ##Genome-Assembly-Data-END##

            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 04/28/2020 06:45:21
            Annotation Pipeline               :: NCBI Prokaryotic Genome Annotation Pipeline (PGAP)
            Annotation Method                 :: Best-placed reference protein set; GeneMarkS-2+
            Annotation Software revision      :: 4.11
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total)                     :: 4,140
            CDSs (total)                      :: 4,077
            Genes (coding)                    :: 4,001
            CDSs (with protein)               :: 4,001
            Genes (RNA)                       :: 63
            rRNAs                             :: 3, 3, 3 (5S, 16S, 23S)
            complete rRNAs                    :: 3, 3, 3 (5S, 16S, 23S)
            tRNAs                             :: 51
            ncRNAs                            :: 3
            Pseudo Genes (total)              :: 76
            CDSs (without protein)            :: 76
            Pseudo Genes (ambiguous residues) :: 0 of 76
            Pseudo Genes (frameshifted)       :: 11 of 76
            Pseudo Genes (incomplete)         :: 66 of 76
            Pseudo Genes (internal stop)      :: 2 of 76
            Pseudo Genes (multiple problems)  :: 3 of 76
            CRISPR Arrays                     :: 1
            ##Genome-Annotation-Data-END##
FEATURES             Qualifiers
     source          /organism="Cellulosimicrobium protaetiae"
                     /mol_type="genomic DNA"
                     /strain="BI34"
                     /isolation_source="intestine from larvae"
                     /host="wax moth"
                     /type_material="type strain of Cellulosimicrobium protaetiae"
                     /db_xref="taxon:2587808"
                     /country="South Korea: Jeongeup"
                     /collection_date="2019-04"
     protein         /locus_tag="FIC82_017725"
                     /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007071535.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /transl_table=11
BEGIN
        1 MRGGPPGPHR TSSPGGRAAH GTGARLVAGL GAGALVAGCA LAALPAHASP LGATAASTGS
       61 TASSATAAAA DVAGVTVTPD PSYAGAPFEG WGTSLVWFAN ATGGYPDEIR DRLADMVFGD
      121 EGLNLNIARY NVGGGNAPDV PDYLRAGGAV DGWWKAPEGT TRTDVDWWDP ENPDHWDADA
      181 DATQRWWVDR IKDDVTHWET FSNSPPWFMT VSGYVSGGFD ANADQLKTES IDDFAAYMVG
      241 VTERLEDAHG IEVDTIDPFN EPNTNYWGTQ LGGDGNPTGG RQEGAHMGPE LQAKVVPALA
      301 AALDGSSTDA VISAMDETNP GRFATNWYAY PDAVRDQVSQ LNVHTYGTGQ RTTVRDIAKG
      361 EDKPLWMSEV GGNWSSTGQD FETMESGLGS AQHMVDDLRE LEPSAWVFWQ PVEDYANMAP
      421 GGESANGMNW GEIQIPFDCT AEDTLETCPI YTNTKYWATQ NFTHYIEPGD SLIRSDDASS
      481 TAAVSADGTS ATVVHVNATK AERAVTLDLS KFGAVGSDAT VTPVVTSTAG YLVEGEPVAV
      541 TTDENGPSVT LVVPAESVTT FVVDGVSGVA DDAALVQDGH AYRLDGVQAD RSLAPSASGT
      601 GVVIRTDAPV AEQAWELTAL GASEGSGTHR TRYAVTNAAT GRQLAVAADT SAVLQDAPAD
      661 VADTPLAAQW ILSTTGDGTF TLVNASSKTL LEVGGQATAD GSPVGTYLAN SGVNQRWRIV
      721 DETVLGTEPV EAFTTPGTAP ELPATVTPVY RDGARGSLPV TWDVPGDDAW AEPGTVEVAG
      781 TVVAPTGGTV AATATVVVDE LTSTLPARAK AYAGGTPSLP ATVTAVAAGG AQVQRPVVWD
      841 AAPAGAYDAV GVVALTGTAD AGAGATLPAT VRVQVTEAAS ANGALAGGTT ASATFTEPGY
      901 GVGGVVNGAL TDKAWSNWVS GTKRASDTLT VTLPADRDVT GVVTRFWKDG SSASWAQSVR
      961 LQALVDGTWT DVGTPVTVDA SPDGPAPAVE VPADVRTSSV RVVLTARANT HMVVSEIEVL
     1021 AKVPGTGTDT TASGITVDGE PLAGFDPAVT SYDVAADGEV PEVAAAATDP YASVQVEPAD
     1081 AVPGTTTVRV TAEDGTEQAY ALRWSADAGA APVTAVAETR CLAGKVYVAV RATNDGGAPL
     1141 DVTLTTPYGT RTVAAVAPGA SAYQSFASRS ASVPAGTAVV SVDGYGPLEV AFDARTC
//