LOCUS UQS87452.1 6361 aa PRT BCT 19-SEP-2022 DEFINITION Nicoliella spurrieriana hypothetical protein protein. ACCESSION CP093361-570 PROTEIN_ID UQS87452.1 SOURCE Nicoliella spurrieriana ORGANISM Nicoliella spurrieriana Bacteria; Bacillota; Bacilli; Lactobacillales; Lactobacillaceae; Nicoliella. REFERENCE 1 (bases 1 to 1709727) AUTHORS Oliphant,S.A., Watson-Haigh,N.S., Sumby,K.M., Gardner,J., Groom,S. and Jiranek,V. TITLE Apilactobacillus apisilvae sp. nov., Nicolia spurrieriana gen. nov. sp. nov., Bombilactobacillus folatiphilus sp. nov. and Bombilactobacillus thymidiniphilus sp. nov., four new lactic acid bacterial isolates from stingless bees Tetragonula carbonaria and Austroplebeia australis JOURNAL Int J Syst Evol Microbiol 72 (9) (2022) PUBMED 36094463 REFERENCE 2 (bases 1 to 1709727) AUTHORS Oliphant,S.A., Sumby,K.M., Gardner,J.M., Watson-Haigh,N.S. and Jiranek,V. TITLE Direct Submission JOURNAL Submitted (11-MAR-2022) Wine Science, The University of Adelaide, PMB 1, Glen Osmond, South Australia 5064, Australia COMMENT The annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP). Information about PGAP can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Date :: NOV-2020 Assembly Method :: Smrtlink v. 9.0 Genome Representation :: Full Expected Final Version :: Yes Genome Coverage :: 6375x Sequencing Technology :: PacBio Sequel II ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 03/14/2022 11:59:34 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline (PGAP) Annotation Method :: Best-placed reference protein set; GeneMarkS-2+ Annotation Software revision :: 6.0 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 1,728 CDSs (total) :: 1,646 Genes (coding) :: 1,628 CDSs (with protein) :: 1,628 Genes (RNA) :: 82 rRNAs :: 5, 5, 5 (5S, 16S, 23S) complete rRNAs :: 5, 5, 5 (5S, 16S, 23S) tRNAs :: 64 ncRNAs :: 3 Pseudo Genes (total) :: 18 CDSs (without protein) :: 18 Pseudo Genes (ambiguous residues) :: 0 of 18 Pseudo Genes (frameshifted) :: 8 of 18 Pseudo Genes (incomplete) :: 5 of 18 Pseudo Genes (internal stop) :: 8 of 18 Pseudo Genes (multiple problems) :: 3 of 18 CRISPR Arrays :: 1 ##Genome-Annotation-Data-END## FEATURES Qualifiers source /organism="Nicoliella spurrieriana" /mol_type="genomic DNA" /strain="SGEP1_A5" /isolation_source="Honey" /host="Tetragonula carbonaria" /type_material="type strain of Nicolia spurrieriana" /db_xref="taxon:2925830" /country="Australia: Brisbane" /lat_lon="27.4810 S 153.0121 E" /collection_date="2020-03-26" protein /locus_tag="MOO44_04670" /inference="COORDINATES: ab initio prediction:GeneMarkS-2+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS-2+." /transl_table=11 BEGIN 1 MMGSESNKID AKAQSMQSDG PAAAHQADAV SSLTAKSELL VNSVAKKVVL EYARSAIAAA 61 VDSASQRADQ LAAPARSSAS VALSQFADSA YTNAVNYANP SEAVAGVEAF TANVDHAIDT 121 ANFSSANDVV MHAANSLVNK TTVLGASAAD DTKLTINSAF DQISTELSQQ ADNIAGADSL 181 AQSGVDHLES LTSAAVLSFA KHEMEATYLQ FKDELFQLSD AVAQDAIAEI HSVTDPTYAW 241 VQSNAAAPAS ASAQVSTAIT SMNSIFASYG NVLVSAGVAE YASNAYSAVA YLSGDQLLNA 301 NRMIDSAARV LLNNSIATSN IDAQSINQIY GLIPVASNAI KSITDEVNHS FADRQLSAAT 361 VSAAATTSSL TPAGKVHVDF SINSYVQQAQ SEVAAVVNDP ATLNATLSSA TSTIASMATA 421 AMVADPYASA AHQLQTTVAS ADEQLSRLSG VAQLNATSAV HVATTSASAA LDANHFDAAQ 481 LTETVQTIGE QVDGIIDHYA NSAAVDQLNG AYSAADSMVV QLGTKAERSA ARTQIEAIRD 541 SAAQVIAAQS DVDSVVASAT DTLHSTAVQL IQSDPNRSAH YAVQMVLRSA TDKGAYLDAI 601 SRDNAVSEVA MVATRASSAM VKNAGRASAL ANDVEQASLA ADRIIDHYAV EDARSKTHSM 661 ASSAHQQMQF VSDVAFADQA ASQVAAVQNS MNMTLFHNSD QLAVVSQAVV DAQANLTSIA 721 DYYVQTDAVA AAHAKLASLA STAAVTTADL DEANRGNANR EVAHTLQSAQ AALSATSGHH 781 QVAEQAAASV TATLNHYTLA YTSVAVSSAA SSANQHLAQV GIGDDQTARE IASFVQATDA 841 NIATDRDNFA NVLLDVRNGI AQVNSYVDAA VKKNPHASAV NTVQSTAARL TAERSAANYD 901 FANFSSAVAK TTSVNCKTIS QINDTATIDQ LMSVADTENM HHYQSNVAAS ATAALSAVAK 961 SVRQKLIAVG VTIDDPLDEL MAEYAEIINR DASDARFAAI DQQLGTLELQ SLFDNTIKDH 1021 PQAAKRYELE SMASAASSVA AEYLAESVEL DRDLYRVVSS ATQQVNASAD EAGMASAVAT 1081 ASAAMDAVVN ENVAASALQL IDSAADSANR RVANIKDTGK SIDAQSEIRF NAETGIKLVK 1141 ADAQTPKLVN IDVRNTLMKL TSMTSAAVGD DTVAATLDQI NSMVASARAS VSFADSTADL 1201 RFASQVDQYV DQARSAVAGV EDRNNVAPHL AGLEVQLTTL TDQVKRDSVN AQMIDAKADA 1261 KRSLATLRGQ DQLALNGELD AITDRVADAI DHDLNDPENL KTDIQNGLTA IAQLVADAIQ 1321 KDPYATAHQL VQSAVATTTS TADALSGSVA NNLRSEIASA NSHYADQLPA ADDASVSAVA 1381 SGAVQALTMT TNNYVAAYAA NVINVARERI LNASRAYSGV STNTLERRFS AILGMAMGVI 1441 QSDFNDLNLI HLDIQNIIAQ LEAAFEEAVA ADPNGQYQLA VTKLKQSATA AANVSDQLTA 1501 ANLSSALNSV ASAADGNSLA ASAQMQSVVN AYHADRAVAA VSSAATAAQL RVLRLNDVSK 1561 ISTSAAIDDL VSSANGEISA DADYPRYTSL DAAEGIQAIN ARLASAVAND ATAVTSSVAA 1621 SATSVAFERM GSMTAAASLV ATNQIKRVVD RAVTNQQPTE TLVHQIDQVA DRYLIDSAQA 1681 RLNQTADAVN DQLDTNPDQE PIIKRIQTIT KETMAKVRAD VKMPAYLALD VTDGVAALNE 1741 LVSEYQTDPL VSIKAEINSA AQDLTTMVPA TGSVNSNVSS AAASVARTAI QNIANVADDS 1801 NAVAAFTQSA ADQFDLLRQS YVAESATAVV QSAISSAQVS AAMIADESIA AQYHDQFSGI 1861 ADSFNRQLQA DARNASLTAL DIENAVLEIN QITDQAVASD SQASAATTYS TAVSGALSAA 1921 FRNTALMTTS QVDNYHSAMA KVLPDSEEVL NASQSAVADQ VSAVQTAIAS VANRYYGLWA 1981 VSAVDQAASD LLSGAANYNG SVASWAASQA TNYQSSFNAN LAKDSAATDL IILDAQNAVS 2041 AISATASKAA EQDSTAFANI EINAQISSAN SVLSELTGHT DSTADMQMNS VAHQANASLN 2101 SVSNNLNTVM KIVSSTAVAI DTVANMTIAS SAHSLVASAA DSLMQRAGHL SRGTTQAELI 2161 DLVKQTALTI DDAIDSNASY LVSVSMAAEA GVESMGSAML ADLKQDRSAS AQIAFESLAS 2221 SAADQVGALD SFASANFSSA VGSISNRTTA YLADQLDKPK RVNSEIRYAG KTIQKLMRRA 2281 IAQSATDWVT SQAAGIREQL STIADQPLRD RVDSQIAAQL AKAEVMITSD AFDYQLASLD 2341 ADNHLSAINS LAASAFASDT AAVQSTAHQQ ALDSAYAKVG SLSGTRSVAF DSQIAALDAS 2401 AADYAAQLDA VGNHFIMTSA NAVLDATVAS ANHRLAVIND VNAQAEVLKD IEHRQSQASV 2461 AISADAHHSK FASLDANNAA SAIDLMVNKA VANDSIASAT VMIDQAASSV TAEMQAASID 2521 QLLRFEKQVQ MLTNNAEQAL LGPFTDNDSL VSQAESQLVR LANNYMVSSA YSAVGSLASA 2581 ASDWTSDLAV ADSLNVKAYI DSALNSADSQ ISVHSAAPSL VADAVNNVAN TVEHLASDAV 2641 NNSPMAAAKH SLAHLKSSAA TIDQSLSGLD AQNFSSAINS VAQSAVERFT ENGATINQQA 2701 LQKAAHLMTN LSNLHAAQGA MDQLLRAAEN AEHKIDYLAD NEKRGLAFTD VANTIEKYRD 2761 LIEFDIDHLA LINLDVINGK AKLAAIASAA VKDDVVARAI DRMDRHASSV TSQASGLDSL 2821 ASANLSSAIN DLTARAMTMI QSTDDHPDTL NSLSQSATAK MDSLLGSYVT AAAHEQLANV 2881 RARVDEQAAL IVDVDRASSL STELDALQSS AAAQVSHDAN HAAFIDLDIA NADSAMQRVA 2941 RNAVNGDVSA SAAAQVQGLL ATATSRAQPL VGSAAANFSS AMKSMASRAN RETSGVYDYQ 3001 DLDSILAATS KEVSSTVITY LVDSAHERLS ATVNVASTAV NVIDGQLRAD ANRLIASAAN 3061 SATQMVTTDS HDDQLVSLDA KEASQTILDL ADSAVASQPF ASASSVIAST ATVAMQRVGS 3121 LTSFAAANLS STMNRLNTQA RASLIANGAD SKSAQRVQSS AIMAINEVAN SYVASSASQL 3181 IASATDSASA IIARFDNQDA ASTANDAIDS ALATVANDFA KDRNQPDLMA LDATNAINAI 3241 QSAALFITTT DEVASGNISI SNAAESAYAV ADEWGRTTQF DKQVRDIIAD STGRFASGED 3301 VDIVVSEASD AIANVVSYLI LAVANARLAA LIANVGNNLK RLLNDKQYQA AQLRLLEFNE 3361 VAKHTIQKDV ASSQLIQLDL TNIERDLTRL QAELVAVDGN AVALSAVADH ASAATTNPYV 3421 HGSSDDYLNF VSAVNTVVRD FGAELDQATS AAAINQIDSL TQSYVTSQAT NAIDATVASL 3481 DDQVRDLNNF VIQSSAYRVV HSIASQASVN VQSDYADFSA VSLDADNAIS AATSVMDNAV 3541 TNDSVASGKR AARSIYSSAA SFAVGLSWSD SVEFSMALAS QSAATSYGPN ANNSYLANST 3601 ARFVDQMTAS YVASYANTAL HNAASQALAN LGPVNAQTKT NFTTVLHNVV GDLIDNVASD 3661 NADQELVDLD VQNGITMIDS LADSVVRADP GAAGQYIIAS AANSATSGMA LGAEMDEANF 3721 TSAINSVANQ ASISVNLVKK DLRLVNKAAN SAARKMNHLA TEYRNSLAQS TLYSTFDSAA 3781 LVAESFANPA VQSQAANDVQ SATEHFSSLL DADAANASYV DLDLANGTGE IDRLVASAAS 3841 QDPAVRTHLY LDSIASDAAQ RLVTFADDVQ LNFTSAASRI TNQTVGLVSY ADDPTIVSAA 3901 MSSTATTFYR MANSAVVDSA SATVSTIASD ASRALQLFTD DNVRQQLRAD ILRVESAGLA 3961 NINNDVADDD LLALDIRNVQ RDASAAVSSA IDGDRHAAAI HHVQSVAASL SAANSIHDQV 4021 ERTRMGLQLD SAAQSARADI VNAGSNTDLI NVTVEATAAS MGSIAQTFMV NDAQSQVAAL 4081 TASANSALGW FSGDRRASLA ANVDQLATVM NSQIDRYALD QNARQLMVQR GQASLSAYVD 4141 NVITNDHQAA TSAAIGDLAA STVANVHHLS GTHQANFASE VAMTTSSANA LNRQMVSDAK 4201 TATELQASTA DAFDQVARKY VAESGYAALD AVASSANARF TQYGIDLDQV AEPVQSLLRD 4261 YNSMVDSDAG VEATVALDCH NASLEMNQML AEFTAEDVTT NAKWSVQQGA ASFATSLGVP 4321 SDASVAANYS SAVASVVHSA NVALDEAAVD PDEVNQRVVS AYQALGSLTD QVVADSANAQ 4381 LNSAYSSAAQ LVSAVAADDT INAVFNSYAN LITTDIANRR FASLDADNGA AALHQVVVNT 4441 INQDVQAAFR YRVASDAATI SAASHLNRDD QVRFSSAIAS VASAATVSSV ANISQMNQVA 4501 VVTSQQMQSV ADYYAGQSAV NAVSAAVSSA TVRIAEFSDA AVNAVLQNVT KSVVSAISVD 4561 PQNQLATSLY AHDGVAAAES ITASLIADHP LVSATERVNS MVDLATHDVL MSDSLADMNF 4621 LSAVAKQSLQ ASDAFVTAES DAQALDELTS QTSAAITSIA NSYALASATS ALHRANNSAM 4681 MIANQISDEQ ARSRADQGIV EQVSLASSVI ANDVADKSRL NLDIANATAN LDSVVTEAAA 4741 ADPQTAFNNE VVSAFNAATS KAQSLTGSLA ANYDSAINSA SSAVRFENAS VSSVDPSAVS 4801 SAAKRFQSIT RDYITTSAEQ VVSAHVAEVK ARGHFKDDFM NELNFAFTKA QQTIALDAHD 4861 VELTTLDVHN AIDAIDRLYD SKMAKEPQTA FDHHFASQAA SASEKIGKLD SHSAANFSSA 4921 ANWIANENRK QIATFDNLAP AVKNAAKEMN RLANEYVRGS AANALRGMLD SANAQIELVH 4981 NYVQVERSFD QVASATNSVV ATTSDDPAIV SLHVASANRR IASMATAAVA ADSRASAMTQ 5041 IHQFADSLQS AADFHGAPQQ SFAAVVASVA SQANDQLLEH RSAGATIADV EQRLQAAAGD 5101 LVVQIADQMV TEALDVANTQ AIEIEDIHSR NRTISEIGRL VNSARELIKQ HAHRPFFAYL 5161 DAQNAVEAVA KVTGHAIDAD SIANAILSLN RIADQVVAKL QSLLKATALT HLDNEVHSVA 5221 SAARQAIKAS TADLVAVNVK FASAESQVAS IANSYVAASA NSIVASADQS AATQIEQLSN 5281 ADARNTASRD ILMLVGSAGA AIESDAAHFS YASLDADNAS AAISSVTDKA IANDPRASAN 5341 VMIATQIAST ADMLNGMGLD DQAAFSAAVM PIKASATDRF ERASGDAITI AQAADQSNLE 5401 LTSVAQTFIM TSAKAVVSSA ADSAIALLGE MFADSVAVVY GSVASAVDEA LTRLDGDQSD 5461 FATVSLDAYN AKDDLARIVR ENQNSYAHSA IDSQASHAAS MAGILNARLR SVAINGITRH 5521 ANAAHSTVDL HFGQTAATLS DVAATSATMD SLINEYVIKD PVTSANSVVA SAASSANAKA 5581 ANLNAEYVTE IDTSVASAVS QANSLIAMNP ANSLLDSQLA SDTASSMNSF VNMYQAMAAN 5641 KITTSYAASA YAEIDVMDTK FKLLANSAVA TEVDNVKHVL LDHVDEPDVI FDNVSSAATN 5701 MDSLVAEYGG QNSSAVVDAK QSFATRMSDM ADDVYSQAQV LSINNRGQIN RLNSSVANSA 5761 SVSVSGATTI SGVGSVFTSA SSLVGSVVAI AVGFARETAR AELANAVHDA DKSLGTLEPN 5821 DKRGANSAID TIMAETYGQL TIASDTPTVV DKTKQSIEKI QAAVNAALSN NLMNIKDSYK 5881 AQINSAAQAA YQEETGFNAK ANSFTNETIN QAASAANANI DGTNNLAEIQ RLTNAGINSI 5941 DEANSVAVTN FKTSGSSMLA SAANSATDRT NTLDSDERAA VNSMIGSLVN GANFGLDSAS 6001 NVAEASAAVA SGTAAIGSAV ESTIASVISS AMAMASDLKA ETKRDSIRMQ ISAAAEKART 6061 FATSFESGAA SAVDNYIDQT ASSAAQLIID ADNDDDAEQI ASSGAVNLQM AATYSVADSY 6121 AASMQRSANS VADSLADSAA SFSQATDHAD VTSAITAATN ETKDKINGSR DTKRINSLVN 6181 LLVKNLNKLI GNDNHDQFDI DRIDAKAKID QTAHALLAKL GPMTHGVKQK ITASIQSETE 6241 QLNARIDQST SAEKINRVVE GAKSRFEAQM VTINNAEVKN AKAEAKQVIS SNSQNALVKL 6301 SGIGEVFLET ATETISEVIE RAYRDIDAAE TFDRVTELTN QTSGEINHIV SEAISMNQKM 6361 H //