LOCUS       SIU01624.1               227 aa    PRT              BCT 25-MAY-2020
DEFINITION  Mycobacterium tuberculosis variant bovis AF2122/97 PROBABLE
            URACIL-DNA GLYCOSYLASE UNG (UDG) protein.
ACCESSION   LT708304-3038
PROTEIN_ID  SIU01624.1
SOURCE      Mycobacterium tuberculosis variant bovis AF2122/97
  ORGANISM  Mycobacterium tuberculosis variant bovis AF2122/97
            Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae;
            Mycobacterium; Mycobacterium tuberculosis complex.
REFERENCE   1
  AUTHORS   Malone K.M.
  JOURNAL   Submitted (06-DEC-2016) to the INSDC. School of Veterinary
            Medicine, Tuberculosis Molecular Microbiology Research Group,
            University College Dublin, Tuberculosis Molecular Microbiology
            Research Group, School of Veterinary Medicine, University College
            Dublin, D4, Ireland
REFERENCE   2
  AUTHORS   Malone M K., Farrell D., Malone K.
  JOURNAL   Submitted (15-APR-2020) to the INSDC. School of Veterinary
            Medicine, Tuberculosis Molecular Microbiology Research Group,
            University College Dublin, Tuberculosis Molecular Microbiology
            Research Group, School of Veterinary Medicine,, University College
            Dublin, D4, Ireland
FEATURES             Qualifiers
     source          /organism="Mycobacterium tuberculosis variant bovis
                     AF2122/97"
                     /chromosome="Mycobacterium_bovis_AF212297"
                     /isolate="AF2122/97"
                     /mol_type="genomic DNA"
                     /isolation_source="Mycobacterium bovis subsp. bovis strain
                     AF2122/97. This strain is a fully virulent strain that was
                     isolated in 1997 in the UK from a cow suffering necrotic
                     lesions in lung and bronchomediastinal lymph nodes. The
                     strain was also reported to infect and persist in badgers
                     that are considered to be a significant source of bovine
                     infection."
                     /db_xref="taxon:233413"
     protein         /transl_table=11
                     /gene="ung"
                     /locus_tag="BQ2027_MB3000C"
                     /note="Mb3000c, ung, len: 227 aa. Equivalent to Rv2976c,
                     len: 227 aa, from Mycobacterium tuberculosis strain H37Rv,
                     (100.0% identity in 227 aa overlap). Probable ung,
                     uracil-DNA glycosylase (EC 3.2.2.-), equivalent to Q9CBS3
                     URACIL-DNA GLYCOSYLASE from Mycobacterium leprae (227 aa),
                     FASTA scores: opt: 1394, E(): 8.8e-85, (88.1% identity in
                     227 aa overlap). Also highly similar to others e.g. Q9EX12
                     from Streptomyces coelicolor (225 aa), FASTA scores: opt:
                     1134, E(): 1.3e-67, (72.75% identity in 224 aa overlap);
                     Q9K682|UNG_BACHD from Bacillus halodurans (224 aa), FASTA
                     scores: opt: 652, E(): 8.9e-36, (45.5% identity in 222 aa
                     overlap); P39615|UNG_BACSU from Bacillus subtilis (225
                     aa), FASTA scores: opt: 625, E(): 5.4e-34, (45.5% identity
                     in 222 aa overlap); etc. BELONGS TO THE URACIL-DNA
                     GLYCOSYLASE FAMILY. Protein product from Mb3000c detected
                     using shotgun mass spectrometry. Mb3000c found to be
                     expressed during exponential growth in Sauton's minimal
                     media by RNA-sequencing."
                     /db_xref="GOA:P67072"
                     /db_xref="InterPro:IPR002043"
                     /db_xref="InterPro:IPR005122"
                     /db_xref="InterPro:IPR018085"
                     /db_xref="InterPro:IPR036895"
                     /db_xref="UniProtKB/Swiss-Prot:P67072"
BEGIN
        1 MTARPLSELV ERGWAAALEP VADQVAHMGQ FLRAEIAAGR RYLPAGSNVL RAFTFPFDNV
       61 RVLIVGQDPY PTPGHAVGLS FSVAPDVRPW PRSLANIFDE YTADLGYPLP SNGDLTPWAQ
      121 RGVLLLNRVL TVRPSNPASH RGKGWEAVTE CAIRALAARA APLVAILWGR DASTLKPMLA
      181 AGNCVAIESP HPSPLSASRG FFGSRPFSRA NELLVGMGAE PIDWRLP
//