LOCUS QJW37227.1 1265 aa PRT BCT 29-DEC-2022 DEFINITION Cellulosimicrobium protaetiae hypothetical protein protein. ACCESSION CP052757-2866 PROTEIN_ID QJW37227.1 SOURCE Cellulosimicrobium protaetiae ORGANISM Cellulosimicrobium protaetiae Bacteria; Actinobacteria; Micrococcales; Promicromonosporaceae; Cellulosimicrobium. REFERENCE 1 (bases 1 to 4631595) AUTHORS Le Han,H., Nguyen,T.T.H., Li,Z., Shin,N.R. and Kim,S.G. TITLE Cellulosimicrobium protaetiae sp. nov., isolated from the gut of the larva of Protaetia brevitarsis seulensis JOURNAL Int J Syst Evol Microbiol 72 (3) (2022) PUBMED 35348452 REFERENCE 2 (bases 1 to 4631595) AUTHORS Le Ho,H. and Kim,S.-G. TITLE Direct Submission JOURNAL Submitted (06-NOV-2019) Korean Collection for Type Cultures (KCTC), Korea Research Institute of Bioscience & Biotechnology (KRIBB), 181 Ipsin-gil, Jeongeup-si, Jeollabuk-do 56212, Republic of Korea REFERENCE 3 (bases 1 to 4631595) AUTHORS Ho,H. and Kim,S.-G. TITLE Direct Submission JOURNAL Submitted (17-APR-2020) Korean Collection for Type Cultures (KCTC), Korea Research Institute of Bioscience & Biotechnology (KRIBB), 181 Ipsin-gil, Jeongeup-si, Jeollabuk-do 56212, Korea, Republic of COMMENT The annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP). Information about PGAP can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Date :: JUL-2019 Assembly Method :: HGAP v. 3.0 Genome Representation :: Full Expected Final Version :: Yes Genome Coverage :: 178.0x Sequencing Technology :: PacBio RSII; Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 04/28/2020 06:45:21 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline (PGAP) Annotation Method :: Best-placed reference protein set; GeneMarkS-2+ Annotation Software revision :: 4.11 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 4,140 CDSs (total) :: 4,077 Genes (coding) :: 4,001 CDSs (with protein) :: 4,001 Genes (RNA) :: 63 rRNAs :: 3, 3, 3 (5S, 16S, 23S) complete rRNAs :: 3, 3, 3 (5S, 16S, 23S) tRNAs :: 51 ncRNAs :: 3 Pseudo Genes (total) :: 76 CDSs (without protein) :: 76 Pseudo Genes (ambiguous residues) :: 0 of 76 Pseudo Genes (frameshifted) :: 11 of 76 Pseudo Genes (incomplete) :: 66 of 76 Pseudo Genes (internal stop) :: 2 of 76 Pseudo Genes (multiple problems) :: 3 of 76 CRISPR Arrays :: 1 ##Genome-Annotation-Data-END## FEATURES Qualifiers source /organism="Cellulosimicrobium protaetiae" /mol_type="genomic DNA" /strain="BI34" /isolation_source="intestine from larvae" /host="wax moth" /type_material="type strain of Cellulosimicrobium protaetiae" /db_xref="taxon:2587808" /country="South Korea: Jeongeup" /collection_date="2019-04" protein /locus_tag="FIC82_014555" /inference="COORDINATES: protein motif:HMM:NF012376.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /transl_table=11 BEGIN 1 MDTPVPTDHT TLGRRRTLAG VLGATLVVGL LAPAATAATE GAAAPQERAE RVAAASQPEL 61 RPTSGTYLDG TVPVTADPVA AGDPVTALAV DGVPLDAAAT PATARLLFDV GSNSIERRYG 121 SHVLVNGERL ELDRDMVSER VALDVPAQWL VQGENQVRFV VGAVTTSCGS NYDDFDLTDL 181 SLELLGEAAD GSDNRYSYAF GDGSCGSNTA RVLTADLTFD LDQDPAATTG LAADLDTTTL 241 ENGAHTLTAT TAAGETASSA VTVNNGAPGA PALTPADGAL LVGTQTVLAT PTAGGDAPAA 301 VELDGTPLDT VTTHGAGTSQ FVFTVGSNSI EARYTNHLLV NGQRIDLVDR DYVSETVRVD 361 VPNAYLLPGR NTVRFVTGTY PTSCGDNRDD FAVSGLALDV TDATATGVGI APSYSLGDGD 421 CGTSATKPRE VDLVFDVVRD ADAPATGLRA DVDTATLADG EHTITSATAT GAVARRTVTT 481 DNTGPAIASS TPAAGADLAA AASLAVELVD ASGVLSGPDV TLDGEPVEIG APVGPGLPAG 541 EHALVVTATD VLGNASSHEI TFTSLGVPDV PTDLAPAHGA RDVGESTELS ARVTAPGEGD 601 VTATFSRARV ATPVLVAQGE SAGVPTTLPV EGEQPAAVDA LVPGDDATLD SPQSRDVTYQ 661 RLEIPADGDT AGQVVRWEGV VDPQRLATLH VWDGERWTPV ASGRGVAEGP TVLSAVLGSG 721 ADHDGTVHVL VTGTDPFADD IAAGGSRDET DFAPREDYDF SLVHFTDTQY LAEGAVEQET 781 AEERAVWAKA YTDLTRWVVD NAEARNIAYA AHTGDVNENY TRLPADDAMA AQIRGEYEFS 841 SSAQKILDDA NIPNGVLAGN HDNLTGQDNG PGALFNEFYG PDRYEALSAG WEDASYGGPW 901 REGDNQNHYD LFSAGGLDFV AVYLSYGVTD EEAAWADGVL KQFPDRNAIV LTHDYLVPSA 961 NPDGRDSEIS TPDGRLVHAK VVEPNPNVFL VMAGHRHGVG INVRQDVGTE GSGVVELLAD 1021 YQFYEVTAEE AGLTEIGGYT PDQGLRLGAS FLRLLQFDVD RSEMIVDTYS PWLANFGATE 1081 YDDEQRYDGG EDDFTVPVDL TSRVTSLETD AVSLYAPADE IGSVTVPSGE VATVAWDGLA 1141 PGTLHAWLVT ATSAHGGTAV SPVSTFTTAP ASGGLVVETT ARSQCMAGKA FVAVRALNAD 1201 TVPVDVTLST PYGERTVTGV QPGASAYQAF PVRSAQVAAG TAHVTATGDD RTFAGDVAFD 1261 AFACG //