LOCUS QJW35263.1 1130 aa PRT BCT 29-DEC-2022 DEFINITION Cellulosimicrobium protaetiae APHP domain-containing protein protein. ACCESSION CP052757-524 PROTEIN_ID QJW35263.1 SOURCE Cellulosimicrobium protaetiae ORGANISM Cellulosimicrobium protaetiae Bacteria; Actinobacteria; Micrococcales; Promicromonosporaceae; Cellulosimicrobium. REFERENCE 1 (bases 1 to 4631595) AUTHORS Le Han,H., Nguyen,T.T.H., Li,Z., Shin,N.R. and Kim,S.G. TITLE Cellulosimicrobium protaetiae sp. nov., isolated from the gut of the larva of Protaetia brevitarsis seulensis JOURNAL Int J Syst Evol Microbiol 72 (3) (2022) PUBMED 35348452 REFERENCE 2 (bases 1 to 4631595) AUTHORS Le Ho,H. and Kim,S.-G. TITLE Direct Submission JOURNAL Submitted (06-NOV-2019) Korean Collection for Type Cultures (KCTC), Korea Research Institute of Bioscience & Biotechnology (KRIBB), 181 Ipsin-gil, Jeongeup-si, Jeollabuk-do 56212, Republic of Korea REFERENCE 3 (bases 1 to 4631595) AUTHORS Ho,H. and Kim,S.-G. TITLE Direct Submission JOURNAL Submitted (17-APR-2020) Korean Collection for Type Cultures (KCTC), Korea Research Institute of Bioscience & Biotechnology (KRIBB), 181 Ipsin-gil, Jeongeup-si, Jeollabuk-do 56212, Korea, Republic of COMMENT The annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP). Information about PGAP can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Date :: JUL-2019 Assembly Method :: HGAP v. 3.0 Genome Representation :: Full Expected Final Version :: Yes Genome Coverage :: 178.0x Sequencing Technology :: PacBio RSII; Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 04/28/2020 06:45:21 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline (PGAP) Annotation Method :: Best-placed reference protein set; GeneMarkS-2+ Annotation Software revision :: 4.11 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 4,140 CDSs (total) :: 4,077 Genes (coding) :: 4,001 CDSs (with protein) :: 4,001 Genes (RNA) :: 63 rRNAs :: 3, 3, 3 (5S, 16S, 23S) complete rRNAs :: 3, 3, 3 (5S, 16S, 23S) tRNAs :: 51 ncRNAs :: 3 Pseudo Genes (total) :: 76 CDSs (without protein) :: 76 Pseudo Genes (ambiguous residues) :: 0 of 76 Pseudo Genes (frameshifted) :: 11 of 76 Pseudo Genes (incomplete) :: 66 of 76 Pseudo Genes (internal stop) :: 2 of 76 Pseudo Genes (multiple problems) :: 3 of 76 CRISPR Arrays :: 1 ##Genome-Annotation-Data-END## FEATURES Qualifiers source /organism="Cellulosimicrobium protaetiae" /mol_type="genomic DNA" /strain="BI34" /isolation_source="intestine from larvae" /host="wax moth" /type_material="type strain of Cellulosimicrobium protaetiae" /db_xref="taxon:2587808" /country="South Korea: Jeongeup" /collection_date="2019-04" protein /locus_tag="FIC82_002635" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_020014419.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /transl_table=11 BEGIN 1 MRTPWKHTVV AGAALALVAP LGVVAVAHAA DATNLALGKP IAASSVTQTY VAGNANDGNA 61 GSYWEGAGGQ YPSHLTVDLG AEADVDRVVV TLPPPSVWSS RTQTFSVLGR SDGETAFRTL 121 KASAAYGFDP ASGNKVEIPL DAEVDEVRLA FTANTGAGNG QVSELQVWGT PTGGTDPTDP 181 TDPTDPTGTN YAKNRPATAS SAEWQFVAGN AVDGSATTYW EGAGGQYPST LDVALAAPTQ 241 LSSVRVRLNP DAAWGPRTQT FSVWGRTGTG AWQELKASAG YAFAPGSGNL VDVPVTGTAT 301 DVRLRFTGNT GAGNGQVAEL EVYGAPAPNP NLTVTAVTAS PASPTATTPV TLTATVKNTG 361 DRASSATTLD GKLGGSTAGS AAVAALQPGA SAQVQVAAGT RPAGEYTMGA VVDPANTVAE 421 QNETDNAFTA TAKLVVGEAP GPDLEVVSVS SNPANPAVGS AVTFSVQVRN RGNQPVAAGS 481 VTRVVAGSTT LNGTTPAVAA GATVTVTPSG TWTATNGGAT VTATADATGV VAETNEGNNT 541 GTLAVTVGRG AAVPYTTYEA EDGQYTGTLL QTDAVRTFGH TNFATESSGR ESVRLTSAGQ 601 YVQFTSTNAT NSIVVRNSIP DAPGGGGQEK TISLYADGQF VQKLTLSSKH AWLYGTTDQP 661 EGLVNTPGGD ARRLFDESHA LLGRSFPAGT VFKLQRDAGD DAAFYVIDLV ELEQVAPPLA 721 KPAGCTSITE YGAVPNDGLE DTAAIQAAVT ANQNGDIDCV WIPAGQWRQE KKILTDDPLN 781 RGMHNQVGIR DVTIRGAGMW HSQLYSLIPP HLAPGVINHP HEGNFGFDID DNTQISDLAI 841 FGSGTIRGNN AQEEGGVGLN GRFGKNTKIT NVWIEHANVG VWVGRDYSNI PELWNPGDGL 901 VFSGMRIRNT YADGINFSNG TRNSTVVNST FRNTGDDALA VWANPYVKDR AVDIGHSNTF 961 RNNTVQLPWR ANGIAIYGGY DNSIENNLVY DTMNYPGIML ATDHDPLPFS GTTLIANNGL 1021 YRTGGAFWNE DQEFGAITIF PQTHDIVGVT IRDTDIVDST YDGIQFKNGG GNMPDVKITN 1081 VRIDQSNNGS GILAMGGARG NAILSNVTVT NSRDGDVAKE PGSQFTFTGQ //