LOCUS       UQS86900.1              1037 aa            PRT       BCT 19-SEP-2022
DEFINITION  Nicoliella spurrieriana HsdR family type I site-specific
            deoxyribonuclease protein.
ACCESSION   CP093361-1294
PROTEIN_ID  UQS86900.1
SOURCE      Nicoliella spurrieriana
  ORGANISM  Nicoliella spurrieriana
            Bacteria; Bacillota; Bacilli; Lactobacillales; Lactobacillaceae;
            Nicoliella.
REFERENCE   1  (bases 1 to 1709727)
  AUTHORS   Oliphant,S.A., Watson-Haigh,N.S., Sumby,K.M., Gardner,J., Groom,S.
            and Jiranek,V.
  TITLE     Apilactobacillus apisilvae sp. nov., Nicolia spurrieriana gen. nov.
            sp. nov., Bombilactobacillus folatiphilus sp. nov. and
            Bombilactobacillus thymidiniphilus sp. nov., four new lactic acid
            bacterial isolates from stingless bees Tetragonula carbonaria and
            Austroplebeia australis
  JOURNAL   Int J Syst Evol Microbiol 72 (9) (2022)
   PUBMED   36094463
REFERENCE   2  (bases 1 to 1709727)
  AUTHORS   Oliphant,S.A., Sumby,K.M., Gardner,J.M., Watson-Haigh,N.S. and
            Jiranek,V.
  TITLE     Direct Submission
  JOURNAL   Submitted (11-MAR-2022) Wine Science, The University of Adelaide,
            PMB 1, Glen Osmond, South Australia 5064, Australia
COMMENT     The annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (PGAP). Information about PGAP can be found here:
            https://www.ncbi.nlm.nih.gov/genome/annotation_prok/

            ##Genome-Assembly-Data-START##
            Assembly Date                     :: NOV-2020
            Assembly Method                   :: Smrtlink v. 9.0
            Genome Representation             :: Full
            Expected Final Version            :: Yes
            Genome Coverage                   :: 6375x
            Sequencing Technology             :: PacBio Sequel II
            ##Genome-Assembly-Data-END##

            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 03/14/2022 11:59:34
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline (PGAP)
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS-2+
            Annotation Software revision      :: 6.0
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 1,728
            CDSs (total)                      :: 1,646
            Genes (coding)                    :: 1,628
            CDSs (with protein)               :: 1,628
            Genes (RNA)                       :: 82
            rRNAs                             :: 5, 5, 5 (5S, 16S, 23S)
            complete rRNAs                    :: 5, 5, 5 (5S, 16S, 23S)
            tRNAs                             :: 64
            ncRNAs                            :: 3
            Pseudo Genes (total)              :: 18
            CDSs (without protein)            :: 18
            Pseudo Genes (ambiguous residues) :: 0 of 18
            Pseudo Genes (frameshifted)       :: 8 of 18
            Pseudo Genes (incomplete)         :: 5 of 18
            Pseudo Genes (internal stop)      :: 8 of 18
            Pseudo Genes (multiple problems)  :: 3 of 18
            CRISPR Arrays                     :: 1
            ##Genome-Annotation-Data-END##
FEATURES             Qualifiers
     source          /organism="Nicoliella spurrieriana"
                     /mol_type="genomic DNA"
                     /strain="SGEP1_A5"
                     /isolation_source="Honey"
                     /host="Tetragonula carbonaria"
                     /type_material="type strain of Nicolia spurrieriana"
                     /db_xref="taxon:2925830"
                     /country="Australia: Brisbane"
                     /lat_lon="27.4810 S 153.0121 E"
                     /collection_date="2020-03-26"
     protein         /locus_tag="MOO44_08560"
                     /EC_number="3.1.21.3"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_014571485.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology.
                     GO_component: GO:0019812 - type I site-specific
                     deoxyribonuclease complex [Evidence IEA];
                     GO_function: GO:0009035 - type I site-specific
                     deoxyribonuclease activity [Evidence IEA];
                     GO_process: GO:0009307 - DNA restriction-modification
                     system [Evidence IEA]"
                     /transl_table=11
BEGIN
        1 MTVENDELKF EQKVYDYLAN LGGSKQWQQM DAIKTTADLW DNFRKIVYQL NQDKLTKPLS
       61 DAEFNQVKAA ILNQTKTPYY AGVFLYGVGG KSQVEIDRDD NKHVYLTIFD QDEIGAGNTV
      121 YQIVRQIERK NVVKGKHNRR FDVTLLINGL PIIQIEEKSD KINAKKALEQ MRQYADENQY
      181 SDIFSTVQIL IAMTPHEVRY MANTTSEDFN TDFAFEWKQN GGRKLPIQDW KEFADQFLSI
      241 PMAHRMATNY MILDNTPNHQ SIKVMRYYQV YATQEIIRKL SKHDFDDLTD GNKKVGYIWH
      301 TTGSGKTISS FKAAFLASRL PNVDKVVFLV DRIALTNQTA REYKAYDPNS DADNKGGIVS
      361 DTANVGDLRR KLKSKNKTDI VVTSIQKLHM LVSNDKFSMD TKRTVFIVDE AHRSTSGDML
      421 QKIKKSFRKS AWIGYTGTPN FDQKNGPTTK QIFGLPLHRY VISDAIADKN VLGFKVDFQT
      481 TIPYSELKSK YLPQYFHEQY PNWTDTDIQH RINSLTNEDM DDAVKSSVYD MQPNHVEQVV
      541 ADILKYWDNR SVNGLYNAML TTHVSGKNAS TPMAMMYYDE FQKQMTAKNK PLKIAITFSQ
      601 NTSNDDNQLK NNQDLFRVIS DYNQQFNTNF DDTTVNEYFD DVCSRLNRTV DDKNYLDLVI
      661 VVNQLLTGFD APNLNTLYVD RTLKGANLIQ AYSRTNRIQD MEHKPYGHIV NYRWPMHNEK
      721 LMRQALAIYS NPDSADSGID LIPDDDKILA KNYEEVKSDF SKVVDSLSYL TGSFTAAPNS
      781 EKEQRETYNQ LNRYNRILTK LKQYSEFVDE GGDKVLQAAG LPPDDEARLT GAIAYDVKKS
      841 IANVEKVDPI ELDMRMEHVK EVTVNYDYLH ELIAEYANHV HDNDVDQSEL DKIKNEILRM
      901 LDSVDETDQK YARQVRNLID SDFAKQVKYP LKPDEIEDLI KNSGRNWSQG EISAFIQQWG
      961 LENANVQQAI EELIDQHQLE KDDLQHSDKL EKVVNLGKSA YKQDATDQSV KKLSKIKYYN
     1021 QMKEAFKSLA DKIKKQF
//