LOCUS       BDN79879.1 2231 aa PRT BCT 09-FEB-2023
DEFINITION  Mycobacterium pseudoshottsii polyketide synthase protein.
ACCESSION   AP026367-94
PROTEIN_ID  BDN79879.1
SOURCE      Mycobacterium pseudoshottsii
  ORGANISM  Mycobacterium pseudoshottsii
            Bacteria; Bacillati; Actinomycetota; Actinomycetes;
            Mycobacteriales; Mycobacteriaceae; Mycobacterium;
            Mycobacterium ulcerans group.
REFERENCE   1 (bases 1 to 6051062)
  AUTHORS   Komine,T., Fukano,H. and Wada,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (27-JUN-2022) Contact:Takeshi Komine Nippon
            Veterinary and Life Science University, School of Veterinary
            Medicine; Kyohnan-cho 1-7-1, Musashino, Tokyo 180-8602, Japan
REFERENCE   2
  AUTHORS   Komine,T., Fukano,H., Yoshida,M., Inohana,M., Hoshino,Y.,
            Kurata,O. and Wada,S.
  TITLE     Complete Genome and Partial Megaplasmid Sequences of
            Mycobacterium pseudoshottsii Strain NJB1907-Z4, Isolated from
            an Aquarium-Reared Japanese Sardine (Sardinops melanostictus)
            in Japan
  JOURNAL   Microbiol Resour Announc 11 (12), e00785-22 (2022)
  REMARK    Publication Status: Online-Only
            DOI:10.1128/mra.00785-22
COMMENT     Annotated by DFAST https://dfast.ddbj.nig.ac.jp/
            ##Genome-Assembly-Data-START##
            Assembly Method :: Flye v. 2.9; galaxy v. 0
            Genome Coverage :: 189x
            Sequencing Technology :: Pacbio sequel; Illumina HiSeqX
            ##Genome-Assembly-Data-END##
FEATURES             Qualifiers
     source          /collection_date="2019-07-10"
                     /db_xref="taxon:265949"
                     /geo_loc_name="Japan:Tokyo"
                     /host="Sardinops melanostictus"
                     /isolation_source="liver of Japanese sardine"
                     /mol_type="genomic DNA"
                     /organism="Mycobacterium pseudoshottsii"
                     /strain="NJB1907-Z4"
     protein         /gene="pks7_1"
                     /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator"
                     /inference="similar to AA sequence:RefSeq:WP_003900387.1"
                     /locus_tag="NJB1907Z4_C00940"
                     /transl_table=11
BEGIN
        1 MAQILGDPAA ERIDRDVAFA ELGFDSRMTV ELRNRLAAVT GLRLPETVGW GYGSISQLAA
       61 HLETELAGSG GRGKPGPSVV GGAPVAIVGV GCRYPGGVES AGGLWDVVVG GRDVISGFPV
      121 DRGWDVEGVF DPDPDALGKT YCRLGGFLDG ADRFDAGFFG IGPSEALALD PQQRLLLEFS
      181 WEALEDAGID PVSLRGSVTG VFTGLMSSDY GAGRVSGDLE GYGLTSAAAS VASGRVAYLL
      241 GLEGPAVSVD TACSSSLVAL HLAAASLRSG ECDVALAGGV TVMATPATFV GFSRQRGLAA
      301 DGRCKAFAGA ADGTGFSEGA GVVVLTRLSE ARRRGLAVLG VIAGSAVNQD GASNGLTAPN
      361 GPAQQRVIEA ALANAGLTAA DVDVVEGHGT GTTLGDPIEA QALLATYGQA RPADRPLWLG
      421 SIKSNRGHTQ AAAGIAGVIK MVQAMRHELM PATLHMDVPS PHVDWSSGAV SLLTQPRPWP
      481 AVDGRPRRAG VSSFGISGTN AHVIVEQVCP AVVAEAVDVS PDSLPWVVSG KSEAAVAAQA
      541 KRLLAAVQAD EGLDRLDVGF SLARRTAFEY RAVVLGEDRQ QLISGLTELA AGQPGPTVLN
      601 GRAATVGKTV MVFPGQGSQW PGMGRELLAA SPVFAEHMRL CAEALGEFVD WSLLDGVNGV
      661 AGAPTLDRVD VVQPVLWAMM VSLAQLWRSM GMVPDAVIGH SQGEIAAACV AGALSLRDGA
      721 AVVALRSRAL VDLAGTGGMV AIACGVERVR ELLADYGDRL SLVAVNGVAA VVVSGEAEAL
      781 HGLTGRCEAE GMRARRVEVD YASHSAQVES IGSSLVEALA GVRPRSSDIE FVSTVTGASV
      841 DGASLDADYW YRNIRQTVRF DRAVRYCHEQ GCRTFVEASP HPVLLGGIEE SLAEGIGRPD
      901 SAGVIVIPTL GRNEGGVERF WMSLSQAWVA GVGVDWSAVF AGSGGRQVGL PTYAFARRRF
      961 WLDGSSSAAD VGEAGLVAAG HALLGAVVEQ PDTGAVVLTG RLSLARQPWL ADHLVGGAVL
     1021 FPGTGFVELA IRAGDEVGCG VVEELTLATP LVLNAGTAVQ VQVVVGSAGQ SGQRLVSMYS
     1081 RADQPDQHWV LHAQGSVAPA AVQPAPAASA ELSTRPPAGA EAVDIGGLYE RLARRGYGYG
     1141 PAFQGLRAVW RRGRDVFAEV GLPNDEGLDL TDVGIHPALL DAALHAWLCV GGFGGDGEAT
     1201 VLPFSWQHLS LHGSGASRLR VRIAPAGPSA VSVELADGAG LPVLSVGSLT TRPVGAAQLR
     1261 AAMSAGGDGT GRELLDVVWT PITFERDSVQ REGERRVVSW DDFLAGRCAA ARDLDRDVAV
     1321 VWQWESGGAE SVVNTVYAAT HRVLEVLQRW LADDLPAVLV VCTRGAMGLA GEANTDLAGA
     1381 AVWGLVRSAQ TEHPGRIVLI DTDTSMALPT VIGVGEPQLV VRAGDVYATR LARGRAALRT
     1441 PDAGQGWQLA ATGGGTFDDV VLEPCPRSDE PLAVGQVRVA LAALGVNFRD VLVVLGLYPG
     1501 NKPTLGGEGA GVVVEVGPGV SGLEPGDRVL GLMSGRSECV VDQRLLVPIP AGWSFAEAAS
     1561 VPIVFLTAFY GLSDLAGLRR GESVLVHAAT GGVGMAAIQL ARLWGARVFV TASRGKWDTL
     1621 RAMGFDDDQI ADSRTLEFEE KFSAATGGRG IDVVLNALAG EFTDASLRLL ADGGRFIEMG
     1681 KTDIRDGQEV AQEHPGVSYR AFDLANEVAP QRLGQMLAEL MTLFAAGKLH RLPVKSWDVR
     1741 CAPQAYQFVS QARHIGKVVL STPTELRDAL AAGTVLITGG TGLVGSVLAR HLVSAYGVRN
     1801 LVLVSRMGEQ GAGVAELVDE LSEAGARVLV AACDVADQSA VEKLIVGWGR EYPALTGVIH
     1861 AAGVLDDAVI TSMTPDQVDS VLRAKVDGAW NLHHATRGLG LSMFVLCSSI AGVVGAPGQG
     1921 NYAAANAFLD ALVTDRRAHG LAGVSLGWGL WEQASGMTKH LRGSDVSRLS RGGFAAISAQ
     1981 QALDLFDAAL IVDQPTVLAA RLDRRALENP ALNADLPTLF SDLITRPMRR NVDNDCTPTE
     2041 LALVNRLNMM AQDEQHDLLT EVVCAQAVMV LGRLNAADID PNATFSDLGF DSLTAIELRN
     2101 RLKTVTGLTL PPTLIFDHRA PSALAQHLGQ QLCATHQHES HNAMAPADSE DEKLRSVLNT
     2161 ISVADLRDAG LLDKLLRLRK DPDPNLRVDR VDNVGTKQQE LEGMINSLSP EELVAMALAE
     2221 PSGGNRMDSG G
//