LOCUS       SIU02107.1               455 aa    PRT              BCT 25-MAY-2020
DEFINITION  Mycobacterium tuberculosis variant bovis AF2122/97 probable
            membrane-anchored mycosin mycp4 (serine protease) (subtilisin-
            like protease) (subtilase-like) (mycosin-4) protein.
ACCESSION   LT708304-3521
PROTEIN_ID  SIU02107.1
SOURCE      Mycobacterium tuberculosis variant bovis AF2122/97
  ORGANISM  Mycobacterium tuberculosis variant bovis AF2122/97
            Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae;
            Mycobacterium; Mycobacterium tuberculosis complex.
REFERENCE   1
  AUTHORS   Malone K.M.
  JOURNAL   Submitted (06-DEC-2016) to the INSDC. School of Veterinary
            Medicine, Tuberculosis Molecular Microbiology Research Group,
            University College Dublin, D4, Ireland
REFERENCE   2
  AUTHORS   Malone M K., Farrell D., Malone K.
  JOURNAL   Submitted (15-APR-2020) to the INSDC. School of Veterinary
            Medicine, Tuberculosis Molecular Microbiology Research Group,
            University College Dublin, D4, Ireland
FEATURES             Qualifiers
     source          /organism="Mycobacterium tuberculosis variant bovis
                     AF2122/97"
                     /chromosome="Mycobacterium_bovis_AF212297"
                     /isolate="AF2122/97"
                     /mol_type="genomic DNA"
                     /isolation_source="Mycobacterium bovis subsp. bovis strain
                     AF2122/97. This strain is a fully virulent strain that was
                     isolated in 1997 in the UK from a cow suffering necrotic
                     lesions in lung and bronchomediastinal lymph nodes. The
                     strain was also reported to infect and persist in badgers
                     that are considered to be a significant source of bovine
                     infection."
                     /db_xref="taxon:233413"
     protein         /transl_table=11
                     /gene="mycp4"
                     /locus_tag="BQ2027_MB3479"
                     /note="Mb3479, -, len: 455 aa. Equivalent to Rv3449, len:
                     455 aa, from Mycobacterium tuberculosis strain H37Rv,
                     (100.0% identity in 455 aa overlap). Probable secreted
                     serine protease (EC 3.4.21.-). Similar to hypothetical
                     unknowns or proteases from Mycobacterium tuberculosis
                     strains H37Rv and CDC1551 e.g. AAK48366|MT3998 SUBTILASE
                     FAMILY PROTEIN from Mycobacterium tuberculosis strain
                     CDC1551 (411 aa), FASTA scores: opt: 747, E(): 3.5e-33,
                     (45.65% identity in 416 aa overlap);
                     O05461|Rv3883c|MTCY15F10.29 HYPOTHETICAL PROTEIN (446 aa),
                     FASTA scores: opt: 747, E(): 3.8e-33, (45.45% identity in
                     451 aa overlap); O53695|Rv0291|MTV035.19 HYPOTHETICAL
                     PROTEIN (461 aa), FASTA scores: opt: 660, E(): 1.9e-28,
                     (44.0% identity in 457 aa overlap); etc. And similar to
                     hypothetical proteases from Mycobacterium leprae e.g.
                     O33076|MLCB628.04|ML0041 HYPOTHETICAL 45.7 KDA PROTEIN
                     (PROBABLE SECRETED PROTEASE) (446 aa), FASTA scores: opt:
                     683, E(): 1.1e-29, (43.8% identity in 450 aa overlap);
                     Q9CD36|ML2528 PUTATIVE PROTEASE (475 aa), FASTA scores:
                     opt: 608, E(): 1.3e-25, (43.0% identity in 451 aa
                     overlap); Q9CBV3|ML1538 POSSIBLE PROTEASE (567 aa), FASTA
                     scores: opt: 389, E(): 9.7e-14, (33.8% identity in 562 aa
                     overlap); etc. Also some similarity to other proteases
                     from several organisms e.g. O31788|APRX ALKALINE SERINE
                     PROTEASE from Bacillus subtilis (442 aa), FASTA scores:
                     opt: 296, E(): 8.3e-09, (29.4% identity in 313 aa
                     overlap); O86650|SC3C3.17c PUTATIVE SECRETED SERINE
                     PROTEASE from Streptomyces coelicolor (450 aa), FASTA
                     scores: opt: 279, E(): 7e-08, (33.55% identity in 343 aa
                     overlap); Q9KBJ7|APRX|BH193 INTRACELLULAR ALKALINE SERINE
                     PROTEASE from Bacillus halodurans (444 aa), FASTA scores:
                     opt: 257, E(): 1.1e-06, (28.65% identity in 335 aa
                     overlap); O86642|SC3C3.08 SERINE PROTEASE from
                     Streptomyces coelicolor (413 aa), FASTA scores: opt: 243,
                     E(): 5.7e-06, (38.25% identity in 387 aa overlap); etc.
                     Has putative signal peptide at N-terminus and hydrophobic
                     stretch at C-terminus. Contains three signatures typical
                     of subtilase family: aspartic acid active site (PS00136),
                     histidine active site (PS00137), serine active site
                     (PS00138)."
                     /db_xref="GOA:A0A1R3Y468"
                     /db_xref="InterPro:IPR000209"
                     /db_xref="InterPro:IPR015500"
                     /db_xref="InterPro:IPR022398"
                     /db_xref="InterPro:IPR023827"
                     /db_xref="InterPro:IPR023828"
                     /db_xref="InterPro:IPR023834"
                     /db_xref="InterPro:IPR036852"
                     /db_xref="UniProtKB/TrEMBL:A0A1R3Y468"
BEGIN
        1 MTTSRTLRLL VVSALATLSG LGTPVAHAVS PPPIDERWLP ESALPAPPRP TVQREVCTEV
       61 TAESGRAFGR AERSAQLADL DQVWRLTRGA GQRVAVIDTG VARHRRLPKV VAGGDYVFTG
      121 DGTADCDAHG TLVAGIIAAA PDAQSDNFSG VAPDVTLISI RQSSSKFAPV GDPSSTGVGD
      181 VDTMAKAVRT AADLGASVIN ISSIACVPAA AAPDDRALGA ALAYAVDVKN AVIVAAAGNT
      241 GGAAQCPPQA PGVTRDSVTV AVSPAWYDDY VLTVGSVNAQ GEPSAFTLAG PWVDVAATGE
      301 AVTSLSPFGD GTVNRLGGQH GSIPISGTSY AAPVVSGLAA LIRARFPTLT ARQVMQRIES
      361 TAHHPPAGWD PLVGNGTVDA LAAVSSDSIP QAGTATSDPA PVAVPVPRRS TPGPSDRRAL
      421 HTAFAGAAIC LLALMATLAT ASRRLRPGRN GIAGD
//