LOCUS       QJW35996.1              1847 aa    PRT              BCT 29-DEC-2022
DEFINITION  Cellulosimicrobium protaetiae hypothetical protein.
ACCESSION   CP052757-1401
PROTEIN_ID  QJW35996.1
SOURCE      Cellulosimicrobium protaetiae
  ORGANISM  Cellulosimicrobium protaetiae
            Bacteria; Actinobacteria; Micrococcales; Promicromonosporaceae;
            Cellulosimicrobium.
REFERENCE   1  (bases 1 to 4631595)
  AUTHORS   Le Han,H., Nguyen,T.T.H., Li,Z., Shin,N.R. and Kim,S.G.
  TITLE     Cellulosimicrobium protaetiae sp. nov., isolated from the gut of
            the larva of Protaetia brevitarsis seulensis
  JOURNAL   Int J Syst Evol Microbiol 72 (3) (2022)
   PUBMED   35348452
REFERENCE   2  (bases 1 to 4631595)
  AUTHORS   Le Ho,H. and Kim,S.-G.
  TITLE     Direct Submission
  JOURNAL   Submitted (06-NOV-2019) Korean Collection for Type Cultures (KCTC),
            Korea Research Institute of Bioscience & Biotechnology (KRIBB), 181
            Ipsin-gil, Jeongeup-si, Jeollabuk-do 56212, Republic of Korea
REFERENCE   3  (bases 1 to 4631595)
  AUTHORS   Ho,H. and Kim,S.-G.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-APR-2020) Korean Collection for Type Cultures (KCTC),
            Korea Research Institute of Bioscience & Biotechnology (KRIBB), 181
            Ipsin-gil, Jeongeup-si, Jeollabuk-do 56212, Korea, Republic of
COMMENT     The annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (PGAP). Information about PGAP can be found here:
            https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            
            ##Genome-Assembly-Data-START##
            Assembly Date          :: JUL-2019
            Assembly Method        :: HGAP v. 3.0
            Genome Representation  :: Full
            Expected Final Version :: Yes
            Genome Coverage        :: 178.0x
            Sequencing Technology  :: PacBio RSII; Illumina HiSeq
            ##Genome-Assembly-Data-END##
            
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 04/28/2020 06:45:21
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline (PGAP)
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS-2+
            Annotation Software revision      :: 4.11
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 4,140
            CDSs (total)                      :: 4,077
            Genes (coding)                    :: 4,001
            CDSs (with protein)               :: 4,001
            Genes (RNA)                       :: 63
            rRNAs                             :: 3, 3, 3 (5S, 16S, 23S)
            complete rRNAs                    :: 3, 3, 3 (5S, 16S, 23S)
            tRNAs                             :: 51
            ncRNAs                            :: 3
            Pseudo Genes (total)              :: 76
            CDSs (without protein)            :: 76
            Pseudo Genes (ambiguous residues) :: 0 of 76
            Pseudo Genes (frameshifted)       :: 11 of 76
            Pseudo Genes (incomplete)         :: 66 of 76
            Pseudo Genes (internal stop)      :: 2 of 76
            Pseudo Genes (multiple problems)  :: 3 of 76
            CRISPR Arrays                     :: 1
            ##Genome-Annotation-Data-END##
FEATURES             Qualifiers
     source          /organism="Cellulosimicrobium protaetiae"
                     /mol_type="genomic DNA"
                     /strain="BI34"
                     /isolation_source="intestine from larvae"
                     /host="wax moth"
                     /type_material="type strain of Cellulosimicrobium
                     protaetiae"
                     /db_xref="taxon:2587808"
                     /country="South Korea: Jeongeup"
                     /collection_date="2019-04"
     protein         /locus_tag="FIC82_007075"
                     /inference="COORDINATES: protein
                     motif:HMM:NF012573.1,HMM:NF015083.1,HMM:NF016047.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /transl_table=11
BEGIN
        1 MSTPAAAVTD PDPSFRLTAH DLEFILKQIH VSEAHSAGGT LLCDEPTDRS GKCVPSPALP
       61 FGLRTVDGSF NNLLAGQEEY GSADRPFPRH LGTTWREADP GPPFSPPNPP GATDVCAAGT
      121 TCYSMFAPGS FVYDADPRVI SNLIVDQSAD NPAAVNAAES TEGGEVRPDG SVYIPNTAPD
      181 EGLSAPFNAW FTFFGQFFDH GLDLVNKGGN GTLVVPLQPD DPLYDPASRT NFLVLTRATR
      241 LPGPDGVVGT SDDEHNNQTT PWVDQNQTYT SHPAHQVFLR EYELRDGVPH DTGRLLDGTL
      301 PGGGTGGLTT WADVKRQANE VLGIALTDAD VLNVPQVAVD LYGNFVPGPG GFPLLVTDED
      361 GDPATEDPLN VEGDLADPVA TTNALGTAHA FLDDIAHGST PLFDADGNLV PQQFDDEGSP
      421 IPPGVPLFNA DGSPVTDQSL AGYDNVSLDE HFVTGDGRGN ENIGLTAVHH VFHAEHNRLV
      481 GYIDGVLAEN PELQKAYQGL EHQWPTKRSG DELPGPEDDD WSYEQRLFQA ARFATEMQYQ
      541 HLVFEEFARK IQPNVDPIVF NENSYDATVD PAIVAEYAHV VYRFGHSMLT EEIDRTGFGT
      601 QSVSLLDGFL NPRTYDDDGT LTPEQAAGAV VNGTTNQVAG QIDEHVIGTL RNNLLGLPLD
      661 LATINMLRGR DTGTPGLQEA RRTFFATTGS PTLEPYESWY DFGVGLKNGN NFGRGGSNAS
      721 LINFVAAYGT HPTILSANTV EAKRDAASLL VNGTLQGQEV TFRVGGGDRF ATAAQISAGY
      781 FGTGVPVAYV TSGMNFPDAL AGGPAAAAGG GPVLLVQPGS IPDATSTELA RLQPQRIVVL
      841 GGEGAVSQTV LNGLGTYTSG AVTRVFGADR YETAANISAT VFAPGVDVVY VASGENFPDA
      901 LAGGAVAARD GAPILLVTQG SIPAAVQAEL DRLDPGRIVV LGGAGVVSPA VQTQLGTYTD
      961 GGVTRLGGAD RYQTALLISQ YAYPSGAPAA FVATGSNFPD ALAAAPVAGL SGAPLLLVPP
     1021 TAAPPGVLAE LARLGADRVT LLGGTGVISP AIQQQLTPPP PVPADLPADR LDFVHSTGAW
     1081 ANQPDGLTTT GLENVDFWTG GLAEALDPFG GMLGSTFNFV FEKQLENLQF GDRFYYLFRN
     1141 QGNQLFAALE ANSFAKLIQR NTDASLLPAD IFSVPDPSLD LENLPDPWPS QLTQMGDGTY
     1201 RWDGDEHVEI HGNRTEADRI RGGQGDDALW GYGGNDRIEG GSGNDEIIGG PGDDILTDSF
     1261 GDDNIKGGLG NDAINGGPGV NLLLASHGND FVVGGNDTPN DIFAGTGNDV VLGGAGRTNV
     1321 FGGEGDDWIQ GGSHADLAQG DNANQFQNDT IGGNDVVVGG PGNDDIEGEG GDDVLVGRAF
     1381 GTDRHEGAIG YDWVTYYGEN GGVDADLRFS TLQRPDVQAV RDRYDLVEAL SGGGGNDVLR
     1441 GQGFGVDDIP DNEVPLHKMT QETLDLFPGL EAILRPGGGH EDYALRFMDD PLAADQDGQS
     1501 NLLIGGPGSD LVEGRGGNDF IDGDAYLRVQ LAVGDERFDN AAQLQTRVFS GEINPGDISI
     1561 VREIVEDDEA DGVIDTAVYQ FDREAYEVTD LGDGYTAVEH TGAAELEESD GRDVLRNVEM
     1621 LQFGDGCLVL ATMEACPSFG SVTFAGQIDP PTEDLPLAAT VVFDDSLVQN PTNIRFAWQL
     1681 GEGGEEWEPS ATGDTLPDAP NGRVDTFTPG DGDGGMLLRV VVTFRDDQGQ LRSIVSDALP
     1741 AVVNVNDVPT QPVLSPEVVT VGSGLNLGGF TDGDGLEESS EAGITYTWQA STDGFATVRV
     1801 LGNVVPTAFQ VTAAEAGHQI RVVVAYTDDQ GTVETVVSTV STVTGGP
//