LOCUS       BDN83809.1              1602 aa    PRT              BCT 09-FEB-2023
DEFINITION  Mycobacterium pseudoshottsii polyketide synthase protein.
ACCESSION   AP026367-4024
PROTEIN_ID  BDN83809.1
SOURCE      Mycobacterium pseudoshottsii
  ORGANISM  Mycobacterium pseudoshottsii
            Bacteria; Bacillati; Actinomycetota; Actinomycetes;
            Mycobacteriales; Mycobacteriaceae; Mycobacterium; Mycobacterium
            ulcerans group.
REFERENCE   1  (bases 1 to 6051062)
  AUTHORS   Komine,T., Fukano,H. and Wada,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (27-JUN-2022)
            Contact:Takeshi Komine
            Nippon Veterinary and Life Science University, School of
            Veterinary Medicine; Kyohnan-cho 1-7-1, Musashino, Tokyo 180-8602,
            Japan
REFERENCE   2
  AUTHORS   Komine,T., Fukano,H., Yoshida,M., Inohana,M., Hoshino,Y.,
            Kurata,O. and Wada,S.
  TITLE     Complete Genome and Partial Megaplasmid Sequences of Mycobacterium
            pseudoshottsii Strain NJB1907-Z4, Isolated from an Aquarium-Reared
            Japanese Sardine (Sardinops melanostictus) in Japan
  JOURNAL   Microbiol Resour Announc 11 (12), e00785-22 (2022)
  REMARK    Publication Status: Online-Only
            DOI:10.1128/mra.00785-22
COMMENT     Annotated by DFAST https://dfast.ddbj.nig.ac.jp/
            
            ##Genome-Assembly-Data-START##
            Assembly Method       :: Flye v. 2.9; galaxy v. 0
            Genome Coverage       :: 189x
            Sequencing Technology :: PacBio Sequel; Illumina HiSeq X
            ##Genome-Assembly-Data-END##
FEATURES             Qualifiers
     source          /collection_date="2019-07-10"
                     /db_xref="taxon:265949"
                     /geo_loc_name="Japan:Tokyo"
                     /host="Sardinops melanostictus"
                     /isolation_source="liver of Japanese sardine"
                     /mol_type="genomic DNA"
                     /organism="Mycobacterium pseudoshottsii"
                     /strain="NJB1907-Z4"
     protein         /inference="COORDINATES:ab initio
                     prediction:MetaGeneAnnotator"
                     /inference="similar to AA sequence:RefSeq:WP_010908855.1"
                     /locus_tag="NJB1907Z4_C40240"
                     /transl_table=11
BEGIN
        1 MTPSIGGEAD LRHWLVDYLV TNIGCPPDEV DPNLSLADLG VSSRDAVVLS GELTELLGRT
       61 VSPIDFWEHP TINDLAAYLT APEPSTGAEA AVSRTVRGSL EEPIAVVGMG CRFPGGISGP
      121 EALWQFLCDR KSSIGRVPDE RWAQFDDGSP AVKALLARTT RWGSYLTDID AFDADFFEIS
      181 ASEADKMDPQ QRLLLEVAWE ALEHAGIPPS SLRRSQTGVF AGSCLSEYGA IASTDLTQVD
      241 GWSNTGGAMS IIANRLSYFL DLRGPSVAVD TACSSSLVAI HLACQSLRMQ DSNLAIAAGV
      301 NLLLSPAVFR GFDQVGALSP TGNCRAFDAA ADGFVRGEGA GVVVLKRLTD AQQDGDRVLA
      361 VICGSAINQD GRSNGLMAPN PAAQQAVLRA AYTNAGMQPS EVDYVEAHGT GTLLGDPIEA
      421 RALGSVLGRG RPEESPLLIG AVKTNLGHTE AAAGIAGFIK AVLAVQHGRI PPNQRFESPN
      481 PHIAFADLRM KVVDELTDWP DTGHPRRAGV SSFGFGGTNA HVVIEQGQEA ASSPEAGLTP
      541 ALSTLVVAGK TPARVAATAG MLADWMEGPG AEVALADVAH TLNHHRSRQA RFGTVVARER
      601 AQAVAGLRAL AANQHAPGVV NPADAPPEPG TVFVYSGRGS QWAGMGRQLL ADEPAFAAAV
      661 AELEPVFLAE AGFSLHDVLA NGTELVGIEQ IQLGLIGMQL TLTELWRSYG IQPDLVIGHS
      721 MGEVAAAVVA GALTPAEGLR VTAVRSRLMA PLSGQGGMAL LELDASQTEA LIADYPQVTL
      781 GIYNSPRQTV ISGPTDQIDE LITVVRARDR FATRVNIEVA PHNPAMDALQ PQMRSELADL
      841 APRTPTIPII STTYADLGAA RESGPTFDAE HWAINMRNPV HFQQAITAAA TDKHNFIEIS
      901 AHPLLTQAIL ETLHTVQPGS KYTSLGTLQR DSDDTIVFRT NLNTVRTAPP QTPHPPEPHP
      961 QIPTTPWHHT HHWIDTPAVA SRSASTPDKD AAGSSEPSVS GDSDDAVDSC HYRVGWPTKP
     1021 LADAKASTET ASGTRWLVFA DAELGAELGL AAGAQTRVDV IDPSALTEES ELLAALAGVE
     1081 HVVYAPPAGK SLDVNAAYQL FHQVRRLVTV MTKASLTAKL LLVTRNAQPI AEGDRANPAH
     1141 GVLWGLGRTI ALEHPEIWRG IIDLDESMPA ELAAPKILGE VTGTDGEDQV VYRCGGRHVP
     1201 RLQRRTAPAV APVTLDPNSS QLVIGATGNI GPYLIRQLAQ MGAKTVVAVS RNPGQRLQEL
     1261 AESLAAEGTN LVIEAADATD EAAMTALFDR FGADLPPLEG IYLAAFAGGP VLLNEMTDAD
     1321 VRAMFAPKLD AAALLHRLSL KVPARHFVLF SSISGLIGSR WLAHYTATSG YLDALAYARH
     1381 ALGLPATTVN WGLWKSLADA EHDASQVSVG SGLLPMQDEV AIGTLPLLMN PAAGVHSVVV
     1441 EADWPLLAAA YRTRGSLHIV DDLLRDFAEA STIPARDWSH LSAQEVRTEF EAGLRRIVAR
     1501 ELRVSESDLE TDRPLAELGL NSLMAMAIRR EAEMFVGIEL SATMLFNHPT VASLASYLAN
     1561 RVAPQDNSSN DQMAELSASA GSTLDSLFDR IESSSLLPEG PG
//