LOCUS QJW35487.1 1648 aa PRT BCT 29-DEC-2022 DEFINITION Cellulosimicrobium protaetiae class I SAM-dependent DNA methyltransferase protein. ACCESSION CP052757-796 PROTEIN_ID QJW35487.1 SOURCE Cellulosimicrobium protaetiae ORGANISM Cellulosimicrobium protaetiae Bacteria; Actinobacteria; Micrococcales; Promicromonosporaceae; Cellulosimicrobium. REFERENCE 1 (bases 1 to 4631595) AUTHORS Le Han,H., Nguyen,T.T.H., Li,Z., Shin,N.R. and Kim,S.G. TITLE Cellulosimicrobium protaetiae sp. nov., isolated from the gut of the larva of Protaetia brevitarsis seulensis JOURNAL Int J Syst Evol Microbiol 72 (3) (2022) PUBMED 35348452 REFERENCE 2 (bases 1 to 4631595) AUTHORS Le Ho,H. and Kim,S.-G. TITLE Direct Submission JOURNAL Submitted (06-NOV-2019) Korean Collection for Type Cultures (KCTC), Korea Research Institute of Bioscience & Biotechnology (KRIBB), 181 Ipsin-gil, Jeongeup-si, Jeollabuk-do 56212, Republic of Korea REFERENCE 3 (bases 1 to 4631595) AUTHORS Ho,H. and Kim,S.-G. TITLE Direct Submission JOURNAL Submitted (17-APR-2020) Korean Collection for Type Cultures (KCTC), Korea Research Institute of Bioscience & Biotechnology (KRIBB), 181 Ipsin-gil, Jeongeup-si, Jeollabuk-do 56212, Korea, Republic of COMMENT The annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP). Information about PGAP can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Date :: JUL-2019 Assembly Method :: HGAP v. 3.0 Genome Representation :: Full Expected Final Version :: Yes Genome Coverage :: 178.0x Sequencing Technology :: PacBio RSII; Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 04/28/2020 06:45:21 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline (PGAP) Annotation Method :: Best-placed reference protein set; GeneMarkS-2+ Annotation Software revision :: 4.11 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 4,140 CDSs (total) :: 4,077 Genes (coding) :: 4,001 CDSs (with protein) :: 4,001 Genes (RNA) :: 63 rRNAs :: 3, 3, 3 (5S, 16S, 23S) complete rRNAs :: 3, 3, 3 (5S, 16S, 23S) tRNAs :: 51 ncRNAs :: 3 Pseudo Genes (total) :: 76 CDSs (without protein) :: 76 Pseudo Genes (ambiguous residues) :: 0 of 76 Pseudo Genes (frameshifted) :: 11 of 76 Pseudo Genes (incomplete) :: 66 of 76 Pseudo Genes (internal stop) :: 2 of 76 Pseudo Genes (multiple problems) :: 3 of 76 CRISPR Arrays :: 1 ##Genome-Annotation-Data-END## FEATURES Qualifiers source /organism="Cellulosimicrobium protaetiae" /mol_type="genomic DNA" /strain="BI34" /isolation_source="intestine from larvae" /host="wax moth" /type_material="type strain of Cellulosimicrobium protaetiae" /db_xref="taxon:2587808" /country="South Korea: Jeongeup" /collection_date="2019-04" protein /locus_tag="FIC82_003995" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013838302.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /transl_table=11 BEGIN 1 MAASDAIVVG EDWISEHYVG SDGKQSFRTR VLERRKAWDD AEKEGEQTTR ARFVGARASL 61 LSTLAGLGEE GGRFAVLPEL YAELRRVLGY TSVALQSKQD GPVERVHATG LEAAAPLVIV 121 EAVAVKDVEA LLAKGDRDKP SPRDRTLLEP YAVDDTTQIH SVARLLSYLF VQDDAPEFAL 181 VLAGGWMLVA EKARWAEGRY LAVDLQLVAE RADDKRGGET DTALTCVDAA SLAPDPEGNL 241 WWPGVLDESV KHTVGVSKDL REGVRLSIEI IANEVVRRRA AQGLDPLPQA EAQELAGQSL 301 RFLYRILFLL FAEASPELGV LPVGDPTYEQ GYSLDRLREL TLVNIADHRS QDGTHLYQSL 361 GTLFRLVDAG HDGAGGAARD ERVDGEVHRV DDGADGLTFQ PLKADLFLPS KTAHIDAVGL 421 GNAALQQVLR HLLLSKESKG KDRGFISYAE LGINQLGAVY EGLMSYTGFF ATEDLHEVAK 481 DGNAEKGSWV VPVVRSQSIA PKDFVTAPDP VTGEQKPVVH EQGTFVFRLA GRERQQSASY 541 YTPEVLTRFT VSQALVELIG PDEVKEGSVE WEGREVPRKM SAREILDLTV CEPALGSGAF 601 AIEAVRQLAA AYLRRKQDET GERIDPDRYA AELQKVKAHI ALHNVYGVDL NGTAVELAEI 661 SLWLDTMGEG LQAPWFGLHL KRGNSLIGAR RAVYRRDQLA KRAWLTAVPT DVPLSPSDAD 721 RAAGRSSSLG DVGGRIHHFL LPAAGWGSAV EAKEAKELAP EALARLKAWR KTVLVTPSKK 781 QADELVNLAH RVEALWDLAH PRLRIAEDQI RRSIDVWGAD DLPVGGAVTR KQIEEALADA 841 KGAYQRLRLV MDAWSALWFW PLTDGSTRVK SDDGSEESIE PPTLDEWIGG LRAVLGVHAE 901 SGASGRGRKW TGGDQTLAST ADWDELNEAE EFELSFAGVA SPERVLHEHP WLVVCQRVAA 961 QQGFFHWELD FASVFATRGG FDLQVGNPPW VRPDFDEAAA LGEYEVAFAL EGKLATGRAS 1021 DLRSATLELP AARDFYLDSL TATVATREAV SSPTDYPYLV GLRPDLYRCF MEQTWRHIAT 1081 SGSIGLIHPE THFTDEKAGP LRAVTYRRLR RHWQFINELV LFEIHHLVSY GVHVYGTSRA 1141 PHFLQAASLY HPDTVERSFD HQGLGEEPGL KDPDGRWDVR PHAARVIAVD EAILRTWHAT 1201 LEDADVPTSR SRVVYAVNKA TASALDKISA AERIKSLDLT FSQGWNETTD FKRGLFEKSW 1261 DVADCWEDAI IQGPHLHVAN PAYKTPNETM ANNLDWSAVD LEALGARAIP ATSYKPRGDR 1321 KTYDAAYTHW TRDVVVGPDS KPIDGSPAVD PKYVRHVETA SRADGTAVRT ETVSARAFYR 1381 IAWRTMAAPT GERTLIPALL PPGVTHVDGG FSAGLPAGSY RTLVDVAGFA SSLVLDATTR 1441 VVPKKHIRAA QLERLPFVSS HFDREIRLRA LRLNCVTGAY ADLWAECYDT AFRDDSWTGL 1501 PERTGWVDLG DVGPEWTPET PLRRAEDRRQ ALLEIDALVA LSLGLTADEL CTIYRTQFPV 1561 LYGYDRNRDH YDDNGRLVPN TVLTTWRKKG GNDGRFSEDD LTAVHPGSGV AYTYDLPFQT 1621 LDREAHMRQA YAEFERRLAA RAEPATPE //