LOCUS       SIU01133.1              1150 aa    PRT              BCT 25-MAY-2020
DEFINITION  Mycobacterium tuberculosis variant bovis AF2122/97 PE-
            PGRS FAMILY PROTEIN [SECOND PART] protein.
ACCESSION   LT708304-2547
PROTEIN_ID  SIU01133.1
SOURCE      Mycobacterium tuberculosis variant bovis AF2122/97
  ORGANISM  Mycobacterium tuberculosis variant bovis AF2122/97
            Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae;
            Mycobacterium; Mycobacterium tuberculosis complex.
REFERENCE   1
  AUTHORS   Malone K.M.
  JOURNAL   Submitted (06-DEC-2016) to the INSDC. School of Veterinary
            Medicine, Tuberculosis Molecular Microbiology Research Group,
            University College Dublin, Tuberculosis Molecular Microbiology
            Research Group, School of Veterinary Medicine, University College
            Dublin, D4, Ireland
REFERENCE   2
  AUTHORS   Malone M K., Farrell D., Malone K.
  JOURNAL   Submitted (15-APR-2020) to the INSDC. School of Veterinary
            Medicine, Tuberculosis Molecular Microbiology Research Group,
            University College Dublin, Tuberculosis Molecular Microbiology
            Research Group, School of Veterinary Medicine,, University College
            Dublin, D4, Ireland
FEATURES             Qualifiers
     source          /organism="Mycobacterium tuberculosis variant bovis
                     AF2122/97"
                     /chromosome="Mycobacterium_bovis_AF212297"
                     /isolate="AF2122/97"
                     /mol_type="genomic DNA"
                     /isolation_source="Mycobacterium bovis subsp. bovis strain
                     AF2122/97. This strain is a fully virulent strain that was
                     isolated in 1997 in the UK from a cow suffering necrotic
                     lesions in lung and bronchomediastinal lymph nodes. The
                     strain was also reported to infect and persist in badgers
                     that are considered to be a significant source of bovine
                     infection."
                     /db_xref="taxon:233413"
     protein         /transl_table=11
                     /gene="PE_PGRS43b"
                     /locus_tag="BQ2027_MB2517C"
                     /note="Mb2517c, PE_PGRS43b, len: 1150 aa. Similar to 3'
                     end of Rv2490c, len: 1660 aa, from Mycobacterium
                     tuberculosis strain H37Rv, (98.0% identity in 1150 aa
                     overlap). Member of the Mycobacterium tuberculosis PE
                     family, PGRS-subfamily of Gly-rich proteins, similar to
                     many e.g. AAK47971|MT3612.1 PE_PGRS family protein from
                     Mycobacterium tuberculosis strain CDC1551 (1715 aa), FASTA
                     scores: opt: 5161, E(): 1.5e-187, (51.7% identity in 1752
                     aa overlap); etc. REMARK-M.bovis-M.tuberculosis: In
                     Mycobacterium tuberculosis strain H37Rv, PE_PGRS43 exists
                     as a single gene. In Mycobacterium bovis, a frameshift due
                     to a 8 bp insertion (*-gggggggg) splits PE_PGRS43 into 2
                     parts, PE_PGRS43a and PE_PGRS43b."
                     /db_xref="UniProtKB/TrEMBL:A0A1R3Y1D6"
BEGIN
        1 MTAIFLGSSG TPGEDGGNGG AGGAGGAGGA HAGDGGAGGA GGNGGAGGAG GNGAHGFNAV
       61 LVSDGGNGGD GGAGGRGGDG GAGGAGGDAP AGRAGSQGVG GDGGAGGAGG APGNGGSGGR
      121 GDMAFKDGDG GAGGDGGDPG AGGKGGAGGA GATEGVTGAT GATVHSGGNG GKGGNGADAT
      181 VAGANGGKGG AGGNGGLVGD GGAGGDGGSG AAGANGANVG EDGADGTLSG QPGEGSEANG
      241 GQGGVGGGGA GGAGGDGGAG SSALGSGGNG GRGDAGQAGG AGGAGGAGGA GGSVSGDGGP
      301 GGKGGAGGAG GAGASGGGGG KGASGADSAE AVGGAGGKGG DGGVGGVGGD GGPGGDGGAG
      361 GAAPAGQVGS HGVGGVGGDG GLGGAGGNGG DGGHGSDGGD GGDGGDPGAG GLGGLGGDSG
      421 NGTRAASGVD ASDHGPGSGG NGGNGGNGAQ ASVAGGAGGN GGDGGNAGRV GDGGAGGNGG
      481 DGAAGANGAN SGAPGSDALA LGQPGGNGGQ GDAGQAGGAG GAGGAGGSVS GDGGAGGNGG
      541 AGGNGGVGAS GGAGARGANG IDSIGGTGGA GGGGGDGGAG GVGGHGGDGG VGGAAPSGTV
      601 GSHGTGGVGG DGGLGGAGGV GGAGGNGGIG ITVGGAGGAG GNGGDPGAGG RGGLGGDSGN
      661 GTSAANGVDA SKHGPLTGGD GGVGGNGAKA AAAGGDGGQG GDGGNAGLFG DGGAGGDGAD
      721 GTAAEALGGD GGAGGAGGKG GDAGDIGDGG DGGKGGDGAH GALGGLTVAG GNGGAGGAGG
      781 AGGAGGAFLG DGGNGGAGGQ GGAGRGGSPG GGGGVGGHGG AGGDAGMNGG GGTGGQGGNG
      841 AAGGAGWSPD SDLKGFDGFD GGSGGAGGDG GAGGAGGTQT GDGGDGGAGG LGGAGGVGGN
      901 GVDGFDINET TGRDGDGGDG GYGGWGGAGG NGGAGGSAPA GEVGNRGVGG DGGDGGSGGD
      961 AGNGGLGGDG FTYLADFDGE PGGDGGDGGD GGWGRPGGQG GFGSTSGAHG KAGFGAPGGD
     1021 GGDGGNGGHG GDGNGSFADA GDGGPGGNGG NGGLGGAGRD GGAPGGDGGD GGTGGSGGFG
     1081 APPPRSIGGG DGGDGGRGGD GGRGAGGLTS GGVGSSGESG GSGNGRGDPG SGGSGGEGGE
     1141 GGPSISVNVT
//