LOCUS QJW36577.1 1197 aa PRT BCT 29-DEC-2022 DEFINITION Cellulosimicrobium protaetiae methionine synthase protein. ACCESSION CP052757-2095 PROTEIN_ID QJW36577.1 SOURCE Cellulosimicrobium protaetiae ORGANISM Cellulosimicrobium protaetiae Bacteria; Actinobacteria; Micrococcales; Promicromonosporaceae; Cellulosimicrobium. REFERENCE 1 (bases 1 to 4631595) AUTHORS Le Han,H., Nguyen,T.T.H., Li,Z., Shin,N.R. and Kim,S.G. TITLE Cellulosimicrobium protaetiae sp. nov., isolated from the gut of the larva of Protaetia brevitarsis seulensis JOURNAL Int J Syst Evol Microbiol 72 (3) (2022) PUBMED 35348452 REFERENCE 2 (bases 1 to 4631595) AUTHORS Le Ho,H. and Kim,S.-G. TITLE Direct Submission JOURNAL Submitted (06-NOV-2019) Korean Collection for Type Cultures (KCTC), Korea Research Institute of Bioscience & Biotechnology (KRIBB), 181 Ipsin-gil, Jeongeup-si, Jeollabuk-do 56212, Republic of Korea REFERENCE 3 (bases 1 to 4631595) AUTHORS Ho,H. and Kim,S.-G. TITLE Direct Submission JOURNAL Submitted (17-APR-2020) Korean Collection for Type Cultures (KCTC), Korea Research Institute of Bioscience & Biotechnology (KRIBB), 181 Ipsin-gil, Jeongeup-si, Jeollabuk-do 56212, Korea, Republic of COMMENT The annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP). Information about PGAP can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Date :: JUL-2019 Assembly Method :: HGAP v. 3.0 Genome Representation :: Full Expected Final Version :: Yes Genome Coverage :: 178.0x Sequencing Technology :: PacBio RSII; Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 04/28/2020 06:45:21 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline (PGAP) Annotation Method :: Best-placed reference protein set; GeneMarkS-2+ Annotation Software revision :: 4.11 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 4,140 CDSs (total) :: 4,077 Genes (coding) :: 4,001 CDSs (with protein) :: 4,001 Genes (RNA) :: 63 rRNAs :: 3, 3, 3 (5S, 16S, 23S) complete rRNAs :: 3, 3, 3 (5S, 16S, 23S) tRNAs :: 51 ncRNAs :: 3 Pseudo Genes (total) :: 76 CDSs (without protein) :: 76 Pseudo Genes (ambiguous residues) :: 0 of 76 Pseudo Genes (frameshifted) :: 11 of 76 Pseudo Genes (incomplete) :: 66 of 76 Pseudo Genes (internal stop) :: 2 of 76 Pseudo Genes (multiple problems) :: 3 of 76 CRISPR Arrays :: 1 ##Genome-Annotation-Data-END## FEATURES Qualifiers source /organism="Cellulosimicrobium protaetiae" /mol_type="genomic DNA" /strain="BI34" /isolation_source="intestine from larvae" /host="wax moth" /type_material="type strain of Cellulosimicrobium protaetiae" /db_xref="taxon:2587808" /country="South Korea: Jeongeup" /collection_date="2019-04" protein /gene="metH" /locus_tag="FIC82_010605" /EC_number="2.1.1.13" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012866605.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /transl_table=11 BEGIN 1 MPIVVPQDLA DARSRALADA LRSRVVVADG AMGTMIQEQD PSLEDYQGLE GCNEILNVSR 61 PDIIAAVHDA YLEVGVDAIE TNTFGANWSN LSDYDIDDRI RELAAAGARI ARERADAFTT 121 DEQPRWVLGS MGPGTKLPSL GHTTYAHLRD TFTEQAAGLL EGGVDAILVE TSQDLLQAKA 181 AVTGSRNAMR DVGREVPVIV QVTVETTGTM LMGSEIGAAL TTLQALDVDA IGLNCATGPA 241 EMSEHLRHLS RHAEIPVTCM PNAGLPVLGP NGATYPLTPD ELAAAHTQFV TEFGLGLVGG 301 CCGTTPEHLR RVVEAVRARP VVERHPEREN GVASLYSHTD LHQDASFLAI GERTNANGSK 361 AFREAMLAEN WDECVEIARA QTRDGAHLLD VCIDYVGRDG VADVKQVVSR LASASTLPLV 421 IDSTEPEVIA AGLELVGGRA VVNSVNFEDG DGPTSRFARI MPHVVEHGAA VVALTIDEEG 481 QARTASGKVE IATRLVETLT RDWGMRVDDI IVDALTFPIA TGQEETRRDA IETIEAIREI 541 TRRYPGIHTT LGVSNVSFGL NPAARTVLNS VFLHEAVEAG LDSAIVHAAK ILPLAQIPDE 601 QREAALDLVW DRRRYDDEGN LVHDPLARLL ELFDGVDSAA LKDQRAAELA ALPVGERLER 661 RIVDGARKGL EEDLDAAMAS GIKALEIVND HLLEGMKVVG DLFGRGEMQL PFVLQSAEVM 721 KTAVAYLEPH MEKIEGSTGS KGTIVLATVR GDVHDIGKNL VDIILTNNGY TVVNIGIKQP 781 ISAMIEAADE HDADVIGMSG LLVKSTVVMK ENLAELASRG LAGRWPVLLG GAALTRTYVE 841 DDLAGQFPGV VRYARDAFEG LRLMEPLVRV ARGESPDAVG LPALKKRRHA VVTVTETPVE 901 DLPARSDVAA DNPVPAPPFW GTRLVKGVQL AEYAAFLDER ATFMGQWGLK PGRGDDGASY 961 EELVETEGRP RLNAWYERIR TEGVVDPSVV YGYFPVWSEG DDVVVAHHGA GLAGIGAPDG 1021 GSDGEPGTER LRFTFPRQRR DRHLCLADFV KPRSWVEETG RYDVLPVQLV TMGASVDVHT 1081 AKLFAGNHYR EYMELHGLSV QLTEALAEMW HSRVRAELGF ADEDPTEVEG MFKLEYRGAR 1141 FSLGYPACPD MEDRTKVVAL LRPERVGVEL SAELQLHPEQ STDAFIMHHP EAKYFSV //