LOCUS       UQS82890.1 1387 aa PRT BCT 19-SEP-2022
DEFINITION  Bombilactobacillus folatiphilus type II CRISPR RNA-guided
            endonuclease Cas9 protein.
ACCESSION   CP093366-751
PROTEIN_ID  UQS82890.1
SOURCE      Bombilactobacillus folatiphilus
  ORGANISM  Bombilactobacillus folatiphilus
            Bacteria; Firmicutes; Bacilli; Lactobacillales; Lactobacillaceae;
            Bombilactobacillus.
REFERENCE   1 (bases 1 to 1622785)
  AUTHORS   Oliphant,S.A., Watson-Haigh,N.S., Sumby,K.M., Gardner,J., Groom,S.
            and Jiranek,V.
  TITLE     Apilactobacillus apisilvae sp. nov., Nicolia spurrieriana gen. nov.
            sp. nov., Bombilactobacillus folatiphilus sp. nov. and
            Bombilactobacillus thymidiniphilus sp. nov., four new lactic acid
            bacterial isolates from stingless bees Tetragonula carbonaria and
            Austroplebeia australis
  JOURNAL   Int J Syst Evol Microbiol 72 (9) (2022)
   PUBMED   36094463
REFERENCE   2 (bases 1 to 1622785)
  AUTHORS   Oliphant,S.A., Sumby,K.M., Gardner,J.M., Watson-Haigh,N.S. and
            Jiranek,V.
  TITLE     Direct Submission
  JOURNAL   Submitted (11-MAR-2022) Wine Science, The University of Adelaide,
            PMB 1, Glen Osmond, South Australia 5064, Australia
COMMENT     The annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (PGAP). Information about PGAP can be found here:
            https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            ##Genome-Assembly-Data-START##
            Assembly Date :: NOV-2020
            Assembly Method :: Smrtlink v. 9.0
            Genome Representation :: Full
            Expected Final Version :: Yes
            Genome Coverage :: 7404x
            Sequencing Technology :: PacBio Sequel II
            ##Genome-Assembly-Data-END##
            ##Genome-Annotation-Data-START##
            Annotation Provider :: NCBI
            Annotation Date :: 03/14/2022 11:45:56
            Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline (PGAP)
            Annotation Method :: Best-placed reference protein set; GeneMarkS-2+
            Annotation Software revision :: 6.0
            Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
            Genes (total) :: 1,623
            CDSs (total) :: 1,550
            Genes (coding) :: 1,527
            CDSs (with protein) :: 1,527
            Genes (RNA) :: 73
            rRNAs :: 4, 4, 4 (5S, 16S, 23S)
            complete rRNAs :: 4, 4, 4 (5S, 16S, 23S)
            tRNAs :: 58
            ncRNAs :: 3
            Pseudo Genes (total) :: 23
            CDSs (without protein) :: 23
            Pseudo Genes (ambiguous residues) :: 0 of 23
            Pseudo Genes (frameshifted) :: 12 of 23
            Pseudo Genes (incomplete) :: 9 of 23
            Pseudo Genes (internal stop) :: 10 of 23
            Pseudo Genes (multiple problems) :: 7 of 23
            CRISPR Arrays :: 3
            ##Genome-Annotation-Data-END##
FEATURES             Qualifiers
     source          /organism="Bombilactobacillus folatiphilus"
                     /mol_type="genomic DNA"
                     /strain="SG4_D2"
                     /isolation_source="Bee"
                     /host="Tetragonula carbonaria"
                     /type_material="type strain of Bombilactobacillus folatiphilus"
                     /db_xref="taxon:2923362"
                     /country="Australia: Brisbane"
                     /lat_lon="27.4810 S 153.0121 E"
                     /collection_date="2020-03-20"
     protein         /gene="cas9"
                     /locus_tag="MOO45_03980"
                     /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009557865.1"
                     /note="Cas9, originally named Csn1, is the large,
                     multifunctional signature protein of type II CRISPR/Cas
                     systems. It is well known even to general audiences because
                     its RNA-guided endonuclease activity has made it a popular
                     tool for custom editing of eukaryotic genomes; Derived by
                     automated computational analysis using gene prediction
                     method: Protein Homology.
                     GO_component: GO:0005575 - cellular_component [Evidence IEA];
                     GO_function: GO:0004520 - endodeoxyribonuclease activity [Evidence IEA];
                     GO_process: GO:0043571 - maintenance of CRISPR repeat elements [Evidence IEA]"
                     /transl_table=11
BEGIN
        1 MVCLDIGTNS CGFAAMDMKN QLLHLQGKTA IGARLFEEGK SAAERRGFRT TRRRLKRRKW
       61 RLRLLEEFFD DEMSQVDPYF FARMRESGLS PLDHQKTAQA IVFPTPNEDH AFYCDYPTIY
      121 HLRKALMTQD KKFDLRLVYL AIHHIVKYRG NFLQKDGVDN FNASKIEVGK VLKKLNYFFA
      181 EINPDHPIQL AIQNSAEIEA VLRDVKKSKT DKVKSIGELL VSDSNHDKNT KAIASQIAKA
      241 IMGYKTQFET ILSQEIDSDS KSEWQFKLSD SDADDKLAAI TDQVDETGQE IIEVIQSLFG
      301 AITLSGIVDE GKSLSESMVR KYDDHKKDLK LLKQVIKQHP DRDKAQNLQL AYDLYVNNRH
      361 GQLLKAKNKF SAKKVMSKEE FYKTIEKNLD DSSEVNAILE KIALDTFMPK QRTSANGVIP
      421 FQLHQIELDQ IIKNQSKYYP FLAQRNPIIE HQKQAAYKLD ELIRFRVPYY VGPMITKENQ
      481 IKTSGTEFAW MIRNQNDPKP NEAITPWNFD EKVDRMATAN QFIKRMTTKD TYLLGEDVLP
      541 ANSLLYQKFT VLNELNNLRV NGQHLKAATK QDVYENLFKQ NKTVSKKCLN AYLCQSYQMA
      601 SVKIEGLADE HKFNSSLKTY NQFKKFIPLS ILDNADYQAD LEKIIEWSTI FEDRHIYQAK
      661 LEQAQTEQIS WLTGKQIGCL LKLRHQGWGS LSYKLLINLH DDNGQNIIER LWDSQLNFMQ
      721 IVKEPAFKSV IDQANSSLVK DNQENAVEDV LADAYTSPAN KKAIRQVVKV VADIVKAAGG
      781 KIPAKFAIEF TREPQKNPQL SKQRGKQLKE AYKEIANQLV EQGVKDELDS AIQSKQLVRD
      841 KYYLYFMQGG RDAYTGQTIN IDDITTKYQI DHILPQSFIK DDSLNNRVLT ASALNNAKSD
      901 DVPFKHFANK LVPDLKISVS EMWKQWQKAG MISKFKLNNL QLDPDNLDKY KRAGFVNRQL
      961 VETSQVIKLV TIILQTKYPE AEIITVKASY NHALRKRLDL YKSREVNDYH HAIDAYLSAI
     1021 CGNYLYQMYP NLRQFFVYGK FKKMNADSDR NHAAIKELNN FNFIGLLLQK DRPGHSTVEK
     1081 IYRPHTDELL FEKHPDIFDP LRHAYSFKHM LISRETYTQD QEMFGMTLYP RLERDTKKTR
     1141 TLVPKSKNLD PNIYGGYSSN TNAYLAIIKI NKASESTYKV VSVPMRILGK LNQTQNVIEH
     1201 DNLLKEYLAP TILDKRGVRD FSIVKGKVHY KQVVWDGNRK YMLGSATYLY NAKQLTLSTE
     1261 AMRVVTGDFK TNDDESLLLD QVFDEILTKV DQYLPLFDVG KAREKLHHGR TKFYDLSVID
     1321 KKYVVHQLLI GLHDNPAQGD TLKIGFSNGM KLGLMKLGSG ITLSPNTKLI YRSPTGLFEK
     1381 RIKITDL
//