LOCUS       BDN80525.1              3116 aa    PRT              BCT 09-FEB-2023
DEFINITION  Mycobacterium pseudoshottsii PPE family protein protein.
ACCESSION   AP026367-740
PROTEIN_ID  BDN80525.1
SOURCE      Mycobacterium pseudoshottsii
  ORGANISM  Mycobacterium pseudoshottsii
            Bacteria; Bacillati; Actinomycetota; Actinomycetes;
            Mycobacteriales; Mycobacteriaceae; Mycobacterium; Mycobacterium
            ulcerans group.
REFERENCE   1  (bases 1 to 6051062)
  AUTHORS   Komine,T., Fukano,H. and Wada,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (27-JUN-2022)
            Contact:Takeshi Komine
            Nippon Veterinary and Life Science University, School of
            Veterinary Medicine; Kyohnan-cho 1-7-1, Musashino, Tokyo 180-8602,
            Japan
REFERENCE   2
  AUTHORS   Komine,T., Fukano,H., Yoshida,M., Inohana,M., Hoshino,Y.,
            Kurata,O. and Wada,S.
  TITLE     Complete Genome and Partial Megaplasmid Sequences of Mycobacterium
            pseudoshottsii Strain NJB1907-Z4, Isolated from an Aquarium-Reared
            Japanese Sardine (Sardinops melanostictus) in Japan
  JOURNAL   Microbiol Resour Announc 11 (12), e00785-22 (2022)
  REMARK    Publication Status: Online-Only
            DOI:10.1128/mra.00785-22
COMMENT     Annotated by DFAST https://dfast.ddbj.nig.ac.jp/
            
            ##Genome-Assembly-Data-START##
            Assembly Method       :: Flye v. 2.9; galaxy v. 0
            Genome Coverage       :: 189x
            Sequencing Technology :: Pacbio sequel; Illumina HiSeqX
            ##Genome-Assembly-Data-END##
FEATURES             Qualifiers
     source          /collection_date="2019-07-10"
                     /db_xref="taxon:265949"
                     /geo_loc_name="Japan:Tokyo"
                     /host="Sardinops melanostictus"
                     /isolation_source="liver of Japanese sardine"
                     /mol_type="genomic DNA"
                     /organism="Mycobacterium pseudoshottsii"
                     /strain="NJB1907-Z4"
     protein         /gene="PPE8"
                     /inference="COORDINATES:ab initio
                     prediction:MetaGeneAnnotator"
                     /inference="similar to AA sequence:RefSeq:WP_010950386.1"
                     /locus_tag="NJB1907Z4_C07400"
                     /transl_table=11
BEGIN
        1 MNFAVLPPEI NSARLTIGAG LGPMLEAANA WQGLAGELGS AASAFSSVTT DLVSGGWQGA
       61 ASAAMASAAA PYLKWLTTAA AQAGQAATQV RLAAAAFEAA LAATVHPAAI SANRSQFVSL
      121 VVSNLLGQNA PAIAAAEAAY EQMWAQDVAA MFGYRSGAES IAAALTPFPL QAAGSVVTAN
      181 LGFANVGFRN FGNGNVGDYN LGSGNLGSEN VGSSNIGSGK IGFGNSGPAL TAALNNIGFG
      241 NTGDNNRGIG LTGTGRFGFG GLNSGSGNIG LFNSGTGNFG IGNSGTGNWG IGNSGNSYNT
      301 GIGNSGDANT GFFNAGVANT GIGNVGSYNT GGFNPGDANT GGFNTGSYNT GYLNRGDYNT
      361 GVANFGNVNT GALNTGDYNN GFLWRGDNQG LIFGEPGFGN STTVPSSGFF NSGEGSASGF
      421 FNSGANNSGF FNDSSGPVGN SGFANTGVLQ SGLVNSGNTI SGLLNSSLVP ITTPAYISGV
      481 LNTGSSLAGF LLGPTLFNIG FANDGVGNIL AHANVGNYNI GGNGNVGDYN IFGSGNAGSY
      541 NIFGSGNIGS LNLGSANIGS YNIGSASLGS YNVGFGNLGD YNTGFGNAGS YNQGLANTGS
      601 NNTGFANTGN GNIGIGLTGD GQIGIGALNS GIGNAGLFNS GTDNVGLFNS GSGNIGIGNS
      661 GSGNTGLLNA GVANWGLENP GTANTGIGNA GQYNTGLHNV GAANTGFYNN GSYNTGGFNV
      721 GSTNTGSFDA GSANTGNFNP GDTNTGSYNP GNVNTGFFDT GNYNNGFFVT SDNQGQIAIN
      781 LSVTTPHIPI DVQSIVPVNQ VYTLGGNTIQ VSEAGSVFPR TYYLGGLFFL GPINLGASTL
      841 TIPTVTIALG GPSTNIPISI VGAVESRAIT FLDIPAAPGF GNTTAGPSSG FFNAGTGGDS
      901 GFANFGAASS GLWNSGVAAA GNSGVQNFGS LASGWANLGN TLSGFYNTST ANLAAPANLS
      961 GLFNVGTDLA GVLRDANGTI LNLGLADLGH LNVGSGNVGD FNVGSANIGS ANFGFGNVGS
     1021 DNLGSGNIGS GNIGFGNAGA RLTDAFNNIG FGNTSDNNIG FGNTGNNNIG IGLTGAGQRG
     1081 VNFAGGLNSG SGNIGLFNSG TNNVGIGNSG TGNWGIGNSG SLNTGFGNAG STNTGLFNTG
     1141 MVNTGIADVG DYNTGWLNTG DVNTGAANRG DYNTGWYNTG NYSTGFANAS NIDTGLLISG
     1201 DMGNGFAWRG DQQGQVSASY VIHVPEIPAH VNASAPINIP LTANFTNTLY SGITMEDINF
     1261 GFTVAIGPLP VLTGTIDSVT LAPITATGPA ISFNIGDPKG TTVVSIPATA SLGPVDWTIF
     1321 DIPAVTGLFN STSSPSSGFF NGGTGTVSGI ANFGGDISGF QNFGTSAVSG FKNVGSLQSG
     1381 LLNIGDTVSG LFNTGIGTPA NVSGVSNIGT TLTGILVDQA SGMSTFSLGL ADIGQANVGF
     1441 GNTGDFNLGF ANLGDENLGL GNSGFNNLGL ANLGNFNVGS ANLGGYNIGS SNLGDYNVGF
     1501 GNAGGFNQGL ANSGNNNIGI GLTGDNQQGF NFGGGLNSGT GNTGLFNSGN ANWGIGNTGV
     1561 TNTGVFNSGQ INTGLWNAGN VNTGIGNPGN YNTGSFNTGS FNTGSFNTAG LNPGDYNTGY
     1621 LNSGNYNTGF ANSGDVNTGA FISGNYSNGL LWRGDYQGLW GIDYAITIPD TAIPLNLDVP
     1681 VYLDIPITGT LGVFTVDGFT IPASTVELDI VGIGFKAVHV SAIKFPNLEL GLPAIDINIG
     1741 TGPEPLIHIV GTGGLLPIKI PIIDIAAAPG FGNSTTTASS GFFNTGSGTA SGFGNTGGNN
     1801 SGFFNLASGS SGISGYQNLG DLVSGMSNLG NSVSGFFNTS SLGVSIPADI SGAANIGTNI
     1861 AGFFNDNPIG NLGIANLGNG NIGDGNIGSN NAGFGNVGAN NFGSGNIGSF NFLSGNHGTY
     1921 NFGLGNLGDY NVGLANLGDY NVGFGNAGSL NQGFANTGDH NIGIGNTGNN NIGIGLTGDN
     1981 QVGINFAGGL NSGTGNAGLF NSGTNNTGIF NSGTGNTGIF NSGTANWGIG NSGTANSGIA
     2041 NSGSTNWGLW NPGTGNTGIA NAGDYNTGLY NADHTNTGLA NPTDYNTGFF NTGDYNTGLA
     2101 NAGNYNTGFF NTGDYNTGLA NTGDVNTGAF ISGNYSNGMF WRGDYQGLVG AHYEIFIPEF
     2161 PVLNFDLNIP VDIPIYLDLG SLALNGFTIP TITIDALSIT DFKIGPITIP TIKGILPVID
     2221 INIGNPDGSS SIPIAIRSGL GPISIVLLDI PAAPGFGNST SAPSSGFFNS GTGTASGFGN
     2281 VGANNSGFWN TGFGDIGSSG LQNYGQQLSG WANLGNTVSG WYNSSTADLP TAANLSGLFN
     2341 IGTELSGVLR DSAGTIFNAG LGDLGQLNLG SGNVGDFNVG SANIGSFNVG FGNVGADNLG
     2401 SGNIGSGNIG FGNAGAGLTE AFNNIGFGNV GDTNIGFGNT GNGNLGFGNT GNGNIGIGLS
     2461 GDGQIGFGPL NDGVGNAGLF NLGDFNTGLA NAGNDNQGWS NTGSNNIGLF NTGDNNVGIG
     2521 LTGDGRVGFG SLSSGAGNTG FFNSGISNAG MFNSGTGNVG FFNSGTGNVG IGNMGTGGIG
     2581 IGLQGDNLVG IGGLNSGSFN VGLFNSGTGN VGIGNSGTSN WGIGNSGSHN SGIGNTGSYN
     2641 TGLFNSGSFN TGIANPGNFN TGLFNIGVFN TGIANPGDYN TGFYNTGDYN TGLANIGNFD
     2701 TGAFITGNMA NGVFWRADSM GQLSAHYAIT VNRIPAFMTI DAPVNIPITG TITDISIPAV
     2761 TFPKVPATGK VDLAIISGTV IAPIGPITIH GGDASAPLNT PIVIDFGTQP ALQLNLGNPD
     2821 GSTVVHVGGY FNFCPGQIPL IDIKPAAGFF NATTGPSSGF FNLGDGSASD FANVGANNSG
     2881 IWNVATAALG NSGFQNIGSQ QSGLANLGNA ISGFLNTGGA TPANLSGFQN IGTDLAGWYR
     2941 NGPDATTFSF GIANPGFWNV GSANIGSYNL GNANIGDSNF GFGNLGSANL GSANLGGFNL
     3001 GSANIGSYNF GFANTGPALS DAIGNIGFGN TGSYNIGFGN PGNGNLGFAN TGDGNIGIGL
     3061 AGDTMTGFGG WNSGSGNIGL FNSGTNNIGF GNSGTGNWGH RQLRQLQHRH RQHRHN
//