LOCUS BDN79879.1 2231 aa PRT BCT 09-FEB-2023
DEFINITION Mycobacterium pseudoshottsii polyketide synthase protein.
ACCESSION AP026367-94
PROTEIN_ID BDN79879.1
SOURCE Mycobacterium pseudoshottsii
ORGANISM Mycobacterium pseudoshottsii
Bacteria; Bacillati; Actinomycetota; Actinomycetes;
Mycobacteriales; Mycobacteriaceae; Mycobacterium; Mycobacterium
ulcerans group.
REFERENCE 1 (bases 1 to 6051062)
AUTHORS Komine,T., Fukano,H. and Wada,S.
TITLE Direct Submission
JOURNAL Submitted (27-JUN-2022)
Contact:Takeshi Komine
Nippon Veterinary and Life Science University, School of
Veterinary Medicine; Kyohnan-cho 1-7-1, Musashino, Tokyo 180-8602,
Japan
REFERENCE 2
AUTHORS Komine,T., Fukano,H., Yoshida,M., Inohana,M., Hoshino,Y.,
Kurata,O. and Wada,S.
TITLE Complete Genome and Partial Megaplasmid Sequences of Mycobacterium
pseudoshottsii Strain NJB1907-Z4, Isolated from an Aquarium-Reared
Japanese Sardine (Sardinops melanostictus) in Japan
JOURNAL Microbiol Resour Announc 11 (12), e00785-22 (2022)
REMARK Publication Status: Online-Only
DOI:10.1128/mra.00785-22
COMMENT Annotated by DFAST https://dfast.ddbj.nig.ac.jp/
##Genome-Assembly-Data-START##
Assembly Method :: Flye v. 2.9; galaxy v. 0
Genome Coverage :: 189x
Sequencing Technology :: Pacbio sequel; Illumina HiSeqX
##Genome-Assembly-Data-END##
FEATURES Qualifiers
source /collection_date="2019-07-10"
/db_xref="taxon:265949"
/geo_loc_name="Japan:Tokyo"
/host="Sardinops melanostictus"
/isolation_source="liver of Japanese sardine"
/mol_type="genomic DNA"
/organism="Mycobacterium pseudoshottsii"
/strain="NJB1907-Z4"
protein /gene="pks7_1"
/inference="COORDINATES:ab initio
prediction:MetaGeneAnnotator"
/inference="similar to AA sequence:RefSeq:WP_003900387.1"
/locus_tag="NJB1907Z4_C00940"
/transl_table=11
BEGIN
1 MAQILGDPAA ERIDRDVAFA ELGFDSRMTV ELRNRLAAVT GLRLPETVGW GYGSISQLAA
61 HLETELAGSG GRGKPGPSVV GGAPVAIVGV GCRYPGGVES AGGLWDVVVG GRDVISGFPV
121 DRGWDVEGVF DPDPDALGKT YCRLGGFLDG ADRFDAGFFG IGPSEALALD PQQRLLLEFS
181 WEALEDAGID PVSLRGSVTG VFTGLMSSDY GAGRVSGDLE GYGLTSAAAS VASGRVAYLL
241 GLEGPAVSVD TACSSSLVAL HLAAASLRSG ECDVALAGGV TVMATPATFV GFSRQRGLAA
301 DGRCKAFAGA ADGTGFSEGA GVVVLTRLSE ARRRGLAVLG VIAGSAVNQD GASNGLTAPN
361 GPAQQRVIEA ALANAGLTAA DVDVVEGHGT GTTLGDPIEA QALLATYGQA RPADRPLWLG
421 SIKSNRGHTQ AAAGIAGVIK MVQAMRHELM PATLHMDVPS PHVDWSSGAV SLLTQPRPWP
481 AVDGRPRRAG VSSFGISGTN AHVIVEQVCP AVVAEAVDVS PDSLPWVVSG KSEAAVAAQA
541 KRLLAAVQAD EGLDRLDVGF SLARRTAFEY RAVVLGEDRQ QLISGLTELA AGQPGPTVLN
601 GRAATVGKTV MVFPGQGSQW PGMGRELLAA SPVFAEHMRL CAEALGEFVD WSLLDGVNGV
661 AGAPTLDRVD VVQPVLWAMM VSLAQLWRSM GMVPDAVIGH SQGEIAAACV AGALSLRDGA
721 AVVALRSRAL VDLAGTGGMV AIACGVERVR ELLADYGDRL SLVAVNGVAA VVVSGEAEAL
781 HGLTGRCEAE GMRARRVEVD YASHSAQVES IGSSLVEALA GVRPRSSDIE FVSTVTGASV
841 DGASLDADYW YRNIRQTVRF DRAVRYCHEQ GCRTFVEASP HPVLLGGIEE SLAEGIGRPD
901 SAGVIVIPTL GRNEGGVERF WMSLSQAWVA GVGVDWSAVF AGSGGRQVGL PTYAFARRRF
961 WLDGSSSAAD VGEAGLVAAG HALLGAVVEQ PDTGAVVLTG RLSLARQPWL ADHLVGGAVL
1021 FPGTGFVELA IRAGDEVGCG VVEELTLATP LVLNAGTAVQ VQVVVGSAGQ SGQRLVSMYS
1081 RADQPDQHWV LHAQGSVAPA AVQPAPAASA ELSTRPPAGA EAVDIGGLYE RLARRGYGYG
1141 PAFQGLRAVW RRGRDVFAEV GLPNDEGLDL TDVGIHPALL DAALHAWLCV GGFGGDGEAT
1201 VLPFSWQHLS LHGSGASRLR VRIAPAGPSA VSVELADGAG LPVLSVGSLT TRPVGAAQLR
1261 AAMSAGGDGT GRELLDVVWT PITFERDSVQ REGERRVVSW DDFLAGRCAA ARDLDRDVAV
1321 VWQWESGGAE SVVNTVYAAT HRVLEVLQRW LADDLPAVLV VCTRGAMGLA GEANTDLAGA
1381 AVWGLVRSAQ TEHPGRIVLI DTDTSMALPT VIGVGEPQLV VRAGDVYATR LARGRAALRT
1441 PDAGQGWQLA ATGGGTFDDV VLEPCPRSDE PLAVGQVRVA LAALGVNFRD VLVVLGLYPG
1501 NKPTLGGEGA GVVVEVGPGV SGLEPGDRVL GLMSGRSECV VDQRLLVPIP AGWSFAEAAS
1561 VPIVFLTAFY GLSDLAGLRR GESVLVHAAT GGVGMAAIQL ARLWGARVFV TASRGKWDTL
1621 RAMGFDDDQI ADSRTLEFEE KFSAATGGRG IDVVLNALAG EFTDASLRLL ADGGRFIEMG
1681 KTDIRDGQEV AQEHPGVSYR AFDLANEVAP QRLGQMLAEL MTLFAAGKLH RLPVKSWDVR
1741 CAPQAYQFVS QARHIGKVVL STPTELRDAL AAGTVLITGG TGLVGSVLAR HLVSAYGVRN
1801 LVLVSRMGEQ GAGVAELVDE LSEAGARVLV AACDVADQSA VEKLIVGWGR EYPALTGVIH
1861 AAGVLDDAVI TSMTPDQVDS VLRAKVDGAW NLHHATRGLG LSMFVLCSSI AGVVGAPGQG
1921 NYAAANAFLD ALVTDRRAHG LAGVSLGWGL WEQASGMTKH LRGSDVSRLS RGGFAAISAQ
1981 QALDLFDAAL IVDQPTVLAA RLDRRALENP ALNADLPTLF SDLITRPMRR NVDNDCTPTE
2041 LALVNRLNMM AQDEQHDLLT EVVCAQAVMV LGRLNAADID PNATFSDLGF DSLTAIELRN
2101 RLKTVTGLTL PPTLIFDHRA PSALAQHLGQ QLCATHQHES HNAMAPADSE DEKLRSVLNT
2161 ISVADLRDAG LLDKLLRLRK DPDPNLRVDR VDNVGTKQQE LEGMINSLSP EELVAMALAE
2221 PSGGNRMDSG G
//