LOCUS QJW34937.1 1438 aa PRT BCT 29-DEC-2022 DEFINITION Cellulosimicrobium protaetiae hypothetical protein protein. ACCESSION CP052757-141 PROTEIN_ID QJW34937.1 SOURCE Cellulosimicrobium protaetiae ORGANISM Cellulosimicrobium protaetiae Bacteria; Actinobacteria; Micrococcales; Promicromonosporaceae; Cellulosimicrobium. REFERENCE 1 (bases 1 to 4631595) AUTHORS Le Han,H., Nguyen,T.T.H., Li,Z., Shin,N.R. and Kim,S.G. TITLE Cellulosimicrobium protaetiae sp. nov., isolated from the gut of the larva of Protaetia brevitarsis seulensis JOURNAL Int J Syst Evol Microbiol 72 (3) (2022) PUBMED 35348452 REFERENCE 2 (bases 1 to 4631595) AUTHORS Le Ho,H. and Kim,S.-G. TITLE Direct Submission JOURNAL Submitted (06-NOV-2019) Korean Collection for Type Cultures (KCTC), Korea Research Institute of Bioscience & Biotechnology (KRIBB), 181 Ipsin-gil, Jeongeup-si, Jeollabuk-do 56212, Republic of Korea REFERENCE 3 (bases 1 to 4631595) AUTHORS Ho,H. and Kim,S.-G. TITLE Direct Submission JOURNAL Submitted (17-APR-2020) Korean Collection for Type Cultures (KCTC), Korea Research Institute of Bioscience & Biotechnology (KRIBB), 181 Ipsin-gil, Jeongeup-si, Jeollabuk-do 56212, Korea, Republic of COMMENT The annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP). Information about PGAP can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Date :: JUL-2019 Assembly Method :: HGAP v. 3.0 Genome Representation :: Full Expected Final Version :: Yes Genome Coverage :: 178.0x Sequencing Technology :: PacBio RSII; Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 04/28/2020 06:45:21 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline (PGAP) Annotation Method :: Best-placed reference protein set; GeneMarkS-2+ Annotation Software revision :: 4.11 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 4,140 CDSs (total) :: 4,077 Genes (coding) :: 4,001 CDSs (with protein) :: 4,001 Genes (RNA) :: 63 rRNAs :: 3, 3, 3 (5S, 16S, 23S) complete rRNAs :: 3, 3, 3 (5S, 16S, 23S) tRNAs :: 51 ncRNAs :: 3 Pseudo Genes (total) :: 76 CDSs (without protein) :: 76 Pseudo Genes (ambiguous residues) :: 0 of 76 Pseudo Genes (frameshifted) :: 11 of 76 Pseudo Genes (incomplete) :: 66 of 76 Pseudo Genes (internal stop) :: 2 of 76 Pseudo Genes (multiple problems) :: 3 of 76 CRISPR Arrays :: 1 ##Genome-Annotation-Data-END## FEATURES Qualifiers source /organism="Cellulosimicrobium protaetiae" /mol_type="genomic DNA" /strain="BI34" /isolation_source="intestine from larvae" /host="wax moth" /type_material="type strain of Cellulosimicrobium protaetiae" /db_xref="taxon:2587808" /country="South Korea: Jeongeup" /collection_date="2019-04" protein /locus_tag="FIC82_000705" /inference="COORDINATES: ab initio prediction:GeneMarkS-2+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS-2+." /transl_table=11 BEGIN 1 MREPWVDVAL ARLPETRSCP ACAATLRSSR CDRCLLDLTG PLAFEVAAAS TEAADALSRR 61 QVALDALRAS QPEAAAWAAR AVVATPPGDA PPGVRAVPSG PPRAMRPLDA TGRGQAGPRT 121 SPVAQPPGPV RGPVRGPVPV RPATVGPVPP GSAGSPAGAV VPGAAPARSV GLQPVLAGAG 181 AGLLAVATVV FVFFTFADDL ALRALVTGVV TVLTVGAAVL LRSRGLRSSA EAVAALAVVL 241 AVVDVELALS AWARAGALDP VGTAVARASL LAAVVVGLGL VGDRARVRAW VTSAVVLGPL 301 VPLVAAPAAG APWGWAVALL ATACLTALAA PVAARSGARV GSALRGEQAV LGVVRTVAVP 361 LAVVVGLTVT APPGLPTGSG SAAVALGAAL AAALLRVSTA ERRWYAIGGA SAVLAGALLG 421 TGGDVVWVGL APAFAALAWF VVLALTTPRV VQAVRAAPTP AVGRSDTLLG GAVVAIVAAV 481 PAVAIAGLRA AEVLMTATTG TDAELGTAPL GVGVLTAASR DDGLGVGSDG AEILGGTLLG 541 LTAVLAVVSV AGRLALRTPQ PVPAPLPPGV RPVPLPTGAR PVPLPTGARS APAPGGWAPA 601 VSRLGAAPVV RTGRAFGPWL LLALVLTLAL DPRLAGVTSL ALLAVLAAAL VVATARPVAV 661 PAVPAVSPAP ASAPVPSGTA SDVAAGSAPG AVVRAVRRVL VHVLRPAARL LPDPRAAIRH 721 GVLPVGRAER RVWRAAAVAG TIAALLLLVA GSWVARPTAT VGAVVVGVLL LAARACAPRG 781 LHAVLVGTAY GYALLVLGVT LAWGGIGTVA VLCVVSGVAS LVALTVTLVP RVDRDSWWAV 841 LGVTAVPFGL GVLTVVDERS AWSVAACVAM LALEVVLVGS RRPGTVTGLR VLAAALVLPT 901 AAVAVVGAGA LVLPGSGSPV VLPVVAVLVA AALAGARPVT ERIGLGAGRV AVEAAAALTG 961 AIAVGLAFAR PAAGPDVAVA VLLLLAAGAG LAARDRDRRA EWWLAAALTT AALWTALAAA 1021 EVGLVEAYTA PPALAAVVTG ALLARRSRRW WELASAGLVL LVVPSVLALG ASPGAGDMRA 1081 LLLVAAGAGC VVVAATLRRA DAAGGAWRRA AALRLAGAGV LAATAGTVES VHVAHAAGGG 1141 AVFLVGFAWA LAAGAVALGG GLVAARGASG RATAVVRRWA VAPAAVLVVV GAVANVRPVW 1201 GVIATVWAVE VLLLVLLVLG VRRAVRGRLD LPPAWFTWLL ALAAAIGAWS PRELRVEVFS 1261 LPLGAGLLVA GYLALAAGTT SARGTGAGNP DAEGAAAGTA GGVTAPVRTT STLAGWPVGL 1321 AGSWRTLAPG ILALLGPSVL ATYTDARTWR AVLVIALALA AVLVGTRTHL AAPFLLGVAV 1381 LPVEILVVFV SQLGTRISAG PWMLTLAAAG GLLLIIATYY ERRIAAYDGA AAYVRDLR //