LOCUS UQS84930.1 3145 aa PRT BCT 19-SEP-2022 DEFINITION Apilactobacillus apisilvae DUF5776 domain-containing protein protein. ACCESSION CP093362-1277 PROTEIN_ID UQS84930.1 SOURCE Apilactobacillus apisilvae ORGANISM Apilactobacillus apisilvae Bacteria; Firmicutes; Bacilli; Lactobacillales; Lactobacillaceae; Apilactobacillus. REFERENCE 1 (bases 1 to 1469670) AUTHORS Oliphant,S.A., Watson-Haigh,N.S., Sumby,K.M., Gardner,J., Groom,S. and Jiranek,V. TITLE Apilactobacillus apisilvae sp. nov., Nicolia spurrieriana gen. nov. sp. nov., Bombilactobacillus folatiphilus sp. nov. and Bombilactobacillus thymidiniphilus sp. nov., four new lactic acid bacterial isolates from stingless bees Tetragonula carbonaria and Austroplebeia australis JOURNAL Int J Syst Evol Microbiol 72 (9) (2022) PUBMED 36094463 REFERENCE 2 (bases 1 to 1469670) AUTHORS Oliphant,S.A., Sumby,K.M., Gardner,J.M., Watson-Haigh,N.S. and Jiranek,V. TITLE Direct Submission JOURNAL Submitted (11-MAR-2022) Wine Science, The University of Adelaide, PMB 1, Glen Osmond, South Australia 5064, Australia COMMENT The annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP). Information about PGAP can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Date :: NOV-2020 Assembly Method :: Smrtlink v. 9.0 Genome Representation :: Full Expected Final Version :: Yes Genome Coverage :: 11482x Sequencing Technology :: PacBio Sequel II ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 03/14/2022 11:52:50 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline (PGAP) Annotation Method :: Best-placed reference protein set; GeneMarkS-2+ Annotation Software revision :: 6.0 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 1,575 CDSs (total) :: 1,494 Genes (coding) :: 1,480 CDSs (with protein) :: 1,480 Genes (RNA) :: 81 rRNAs :: 5, 5, 5 (5S, 16S, 23S) complete rRNAs :: 5, 5, 5 (5S, 16S, 23S) tRNAs :: 63 ncRNAs :: 3 Pseudo Genes (total) :: 14 CDSs (without protein) :: 14 Pseudo Genes (ambiguous residues) :: 0 of 14 Pseudo Genes (frameshifted) :: 5 of 14 Pseudo Genes (incomplete) :: 11 of 14 Pseudo Genes (internal stop) :: 1 of 14 Pseudo Genes (multiple problems) :: 3 of 14 CRISPR Arrays :: 1 ##Genome-Annotation-Data-END## FEATURES Qualifiers source /organism="Apilactobacillus apisilvae" /mol_type="genomic DNA" /strain="SG5_A10" /isolation_source="Bee" /host="Austroplebeia australis" /type_material="type strain of Apilactobacillus apisilvae" /db_xref="taxon:2923364" /country="Australia: Brisbane" /lat_lon="27.4810 S 153.0121 E" /collection_date="2020-04-30" protein /locus_tag="MOO46_06715" /inference="COORDINATES: protein motif:HMM:NF039855.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /transl_table=11 BEGIN 1 MQYNKKQFNK VNDKKIMKKV KKQWVVVSMA SLAVLGGFAV SGTLSMTQPS SVVAHADDGT 61 SDSTVQSKQQ EEPQGETDQQ KASDATKKDA SSANNKASTN VDNKKGDAPY TPDHENVAVN 121 SSGKSSTLPN GSNAGSGSTT QPQTSNNTQG PNQGAKNYEG ASSNSTDNDG NQKDYDKTKD 181 GFNDKLNNQN QDKSNSDKQD NTGKKDSYDY GAQLANDNKT IIDQGAQDAI NGKVKDDQYK 241 DNNYYNSSYD GSNQAKADYN NSINNQGTSD KDYTSYDNSV SDSQNNSKKD GGANTNSSSN 301 ISNDDHSGGA DNPSDASNSA KNYDSDLKKQ FNGTDANGKS TNSSVQNNDV RVPSDSNVSA 361 TTPQNTKLEG INADTYAYGY NYFLANQAAI DYESGKWKGT KASDNNGSNE SNSQNDPTHD 421 YYMDSQPDTK SAYYKGYQGA KTAAESQWTG NQASTKLSNT NGNISDVSLS ENQFYKIGYN 481 NVKNEINDNS TAFVSNGYQL DNLLSNNQVK SNVKIINDID YSNVNGIASN NANYTTVDKQ 541 ITPSGVINVN IDGQNHIADF KNMQYRFNPS DTNSQSSLSM NNFKTIYGNG YYGPVSFQTT 601 NSTINYKNIN YVGVQLISAR TTKVNFAGKN NVMLVDSYHS PFTTNVVVNT PQQNIEASNM 661 TLLPGASYYG DTNFQNLGGS DLVQVFNGGT LTLGRNSNMT LLTGGKSINN SSSGYNSGVY 721 IDNSDSTMNL NKESNLNVIY NKDYNFGGNY GSGIYNNGNI NSNGGNINIE FSGDSYGNNP 781 LIINKGKINV TNNGLMQIKL SNANGSYSSG LLNASTGNQF NIASGNLIID GSQNNQNPIT 841 LLNGSISVND PGNKGILLKT NENGQLVSSG TMNTSTVAVD DNRNNKQLYY QYNVNANGTI 901 TGISGNGKTY NGSTKGKNEV SIYRAPSVSV SGPVNAKYND DGTVTITGNL VVDRAEELDK 961 NDNGIYLRSK ADSNETNNTI QQRDFDDNTK YNNVIPNISN GTLVPFTQTV NGLSGKPDNV 1021 STTIKYGIQG ITINLHDKDD NYSKETIDAQ SYSNNPGENG GVSPDGSRPI QILNSDNSYA 1081 RRGIKDAIND KDFNSSNYKN NDLYNDNDLY ANSYNSAIAG YEAYVKKPDN NISDNYNTTD 1141 DNSYKIAKDK QATEAFDQGY QAAQSNAGTN DYIAGQSKDK SNYNGNSSSY KSGYDEASNG 1201 YKDGAEAKNA VADASNGYTK GYVAGKGTSD YLSGKDKNSS SVDQTQNNNV NQDIYNNAYD 1261 STKNGYNNTK DNQNTAYNYG KNMLNGMKAA NGNKGFNEPS DKDSSDYQAY NDGKNAYNGM 1321 NDALTGQNKG KNEDYKSNND EYNSAYEAAQ KGLTDSSNNP EKYTNPSPQY SAYEAGQAAR 1381 KGSTDAQADS YSNRPSEDSF AQKAYDNAKG NFNAGAGRSN GASTPDTNSQ AYKDGLAAQA 1441 GQQAATKGDK NGSYPNNSDG STYNDTQRTA YDNAKKAYNK GLTGDNTTGD AKADQAANNA 1501 GMSAKDGIDD AFSNGNPSND DKQKGPDTNS YDAAKRAAED GMNGQAKSNL DKQQQNAYDN 1561 GKAMYDGMQA AKQDNTNGLN HGELKQDKSS AYSNSYTEGS SQHSNYDKAY TAYKDGLAKI 1621 VDGKNENNDR PGQNPISDGN QNPNESTVPY QIAKDSETTD NAIHDAIYHT NTYRPSNNDK 1681 NQLTVYNQAQ DAYNAGLTQQ SNQSSVDHNQ GYAYNVGKDA LKGIQDAENG QQNEPGNPND 1741 QKDSQDKNTT YNKEAYDQAQ NDFKDGNKER NKLSSSNDHN TVDTSASLAY QTGQKYEATK 1801 SGIDKYVNNE DGPTLKNNDY SKTLNNGYNA YKAGYEGSNN GQPTETIKSD PSQMDAYNQG 1861 RAAQAAVKDA SSGKFGNDGS NRPLDSTQPN SSKNSSADSM YNSDDYKPSG ADWTTNQQTA 1921 YDNAYEAYLD GKKSPNSGSA PNGQKVAYNK GQGDGQLPEA INDAINGKTG NGDNDYSNNI 1981 KNFNYSLENS NKDSNSDDIH KAVDIVNKAI SDAKTTGNMS DGNLSPDQKN YYDNAFNAYQ 2041 YGFGKNSSDS SITTDNEKNN SIPYQIGLAA RKGIQDTKDV NGKTSYDKNG ANQVEKDAYT 2101 YAQSAYKSGL TGNPQPNNST QAFNEGQSDK SGIDAAIRGD SEPKDDSTEV SKVAYDATKA 2161 GLNNQNNAQN NYANNAGKAY QSGLSDAQST DNPDNSGKDL NKDHQSEYNQ AKKDYQDGLS 2221 AQSGNVNSNG YGEGLNDRAV KQGVKDASNG VAKEDGSSYT PVKQVDGKDI NNVDNQINAY 2281 KNAVNGFDYG YGNKSIDKDN SAYQSGEKAG KDAKQAVKDQ QSETKSTDAK SDSYNQAQQN 2341 YQDGFNNPKA NGDDLKKFNS QSDAFKQGQN DKNAIDSALN GKEDHSDSPA YKAAKAGLAG 2401 NESYDNNYQA AHDIGQAANQ GINDSLNGTP GKTQYVNEKD KTDAYNNAEH NFDMGLADKT 2461 DDKTSESSDA YKAGQAAKDG YKDAEANKGD FNKTSNVDNY NGNPFAKDSY DNAQAAYKAG 2521 LAGNDSNADA KKNSVANKAG LDAKQGIADA IKGQTDPTAV QSEDYNNAFN AAKAGNNVDN 2581 ANKSAADAKK DDQSTAYQAG QAAQKGIADV QTGTGSADQY KDNSAAQSAY SDAKQAYNDG 2641 LNGGDSKQAS KDNPTANAAG QAARQGMIDA QSGNGKNNPY TDNPQKDAYN KAKQAFDKGL 2701 NGNDDADNPV ANEAGKAARQ AAADAQVGKN DASPYNNNSN AHDAYQRAMN AYNDGFNSDK 2761 DNASATKAND IGKAARQGMN DAENGKDNSD SYNSNPDAKA AYDNAKKAYQ AGINGDTTSA 2821 DAKGNPAVNK AGYANFVPFS PKTENDNNNS PKPNAYQEAQ NAAKQKNSNE KAAAKDAVNA 2881 LINGKSKFTN SINLANKSPE YRKAFKKAYD QSKAGFDAGK KGHYSGSYDS SYQEGYKAGK 2941 KQFVKNTNDG KQAGKQAASK LTKLPSFKNK SQAYADAYKE AFKKEVKYNT PHYVYNLKKV 3001 YSHNNPSMTR KTRENKYAKT AMYKRETFKI TGYKINDKGQ LVYKTSKGWI SADKKSFNDV 3061 YYRHDNQNTN SARKSTQKIR VIKPQGTYIY NSKNFNKKTA VKNMRKGTTL EVKSVEQLGH 3121 ITRFYLGNGQ YISSNKTIVE KINKK //