LOCUS       SIU02326.1               245 aa            PRT       BCT 25-MAY-2020
DEFINITION  Mycobacterium tuberculosis variant bovis AF2122/97 PROBABLE
            ENDONUCLEASE III NTH (DNA-(APURINIC OR APYRIMIDINIC SITE) LYASE)
            (AP LYASE) (AP ENDONUCLEASE CLASS I) (ENDODEOXYRIBONUCLEASE
            (APURINIC OR APYRIMIDINIC)) (DEOXYRIBONUCLEASE (APURINIC OR
            APYRIMIDINIC)) protein.
ACCESSION   LT708304-3740
PROTEIN_ID  SIU02326.1
SOURCE      Mycobacterium tuberculosis variant bovis AF2122/97
  ORGANISM  Mycobacterium tuberculosis variant bovis AF2122/97
            Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae;
            Mycobacterium; Mycobacterium tuberculosis complex.
REFERENCE   1
  AUTHORS   Malone K.M.
  JOURNAL   Submitted (06-DEC-2016) to the INSDC. School of Veterinary
            Medicine, Tuberculosis Molecular Microbiology Research Group,
            University College Dublin, Tuberculosis Molecular Microbiology
            Research Group, School of Veterinary Medicine, University College
            Dublin, D4, Ireland
REFERENCE   2
  AUTHORS   Malone M K., Farrell D., Malone K.
  JOURNAL   Submitted (15-APR-2020) to the INSDC. School of Veterinary
            Medicine, Tuberculosis Molecular Microbiology Research Group,
            University College Dublin, Tuberculosis Molecular Microbiology
            Research Group, School of Veterinary Medicine, University College
            Dublin, D4, Ireland
FEATURES             Qualifiers
     source          /organism="Mycobacterium tuberculosis variant bovis
                     AF2122/97"
                     /chromosome="Mycobacterium_bovis_AF212297"
                     /isolate="AF2122/97"
                     /mol_type="genomic DNA"
                     /isolation_source="Mycobacterium bovis subsp. bovis strain
                     AF2122/97. This strain is a fully virulent strain that was
                     isolated in 1997 in the UK from a cow suffering necrotic
                     lesions in lung and bronchomediastinal lymph nodes. The
                     strain was also reported to infect and persist in badgers
                     that are considered to be a significant source of bovine
                     infection."
                     /db_xref="taxon:233413"
     protein         /transl_table=11
                     /gene="nth"
                     /locus_tag="BQ2027_MB3698C"
                     /note="Mb3698c, nth, len: 245 aa.
                     Equivalent to Rv3674c, len: 245 aa, from Mycobacterium
                     tuberculosis strain H37Rv, (100.0% identity in 245 aa
                     overlap). Probable nth, endonuclease III (EC 4.2.99.18),
                     equivalent to Q9CB92|NTH|ML2301 PUTATIVE ENDONUCLEASE III
                     from Mycobacterium leprae (272 aa), FASTA scores: opt:
                     1363, E(): 3.6e-81, (89.4% identity in 226 aa overlap).
                     Also similar to many others, e.g. Q9XA44|SCH17.03c from
                     Streptomyces coelicolor (250 aa), FASTA scores: opt: 937,
                     E(): 2.2e-55, (61.65% identity in 219 aa overlap);
                     P46303|UVEN_MICLU from Micrococcus luteus (Micrococcus
                     lysodeikticus) (279 aa), FASTA scores: opt: 899, E():
                     8.1e-53, (58.45% identity in 248 aa overlap);
                     P73715|END3_SYNY3|NTH|SLR1822 from Synechocystis sp.
                     strain PCC 6803 (219 aa), FASTA scores: opt: 684, E():
                     1.7e-38, (52.2% identity in 203 aa overlap);
                     P39788|END3_BACSU|NTH|JOOB from Bacillus subtilis (219
                     aa), FASTA scores: opt: 552, E(): 1.2e-29, (43.3%
                     identity in 194 aa overlap); etc. Equivalent to AAK48142
                     from Mycobacterium tuberculosis strain CDC1551 (262 aa)
                     but 17 aa shorter. Contains PS00764 Endonuclease III
                     iron-sulfur binding region signature, and PS01155
                     Endonuclease III family signature. BELONGS TO THE
                     NTH/MUTY FAMILY. COFACTOR: BINDS A 4FE-4S CLUSTER WHICH IS
                     NOT IMPORTANT FOR THE CATALYTIC ACTIVITY, BUT WHICH IS
                     PROBABLY INVOLVED IN THE PROPER POSITIONING OF THE ENZYME
                     ALONG THE DNA STRAND (BY SIMILARITY). N-terminus extended
                     since first submission (previously 226 aa). Protein
                     product from Mb3698c detected using shotgun mass
                     spectrometry and SWATH mass spectrometry. Mb3698c found to
                     be expressed during exponential growth in Sauton's minimal
                     media by RNA-sequencing (TPM > 20)."
                     /db_xref="GOA:P63541"
                     /db_xref="InterPro:IPR000445"
                     /db_xref="InterPro:IPR003265"
                     /db_xref="InterPro:IPR003651"
                     /db_xref="InterPro:IPR004035"
                     /db_xref="InterPro:IPR004036"
                     /db_xref="InterPro:IPR005759"
                     /db_xref="InterPro:IPR011257"
                     /db_xref="InterPro:IPR023170"
                     /db_xref="UniProtKB/Swiss-Prot:P63541"
BEGIN
        1 MPGRWSAETR LALVRRARRM NRALAQAFPH VYCELDFTTP LELAVATILS AQSTDKRVNL
       61 TTPALFARYR TARDYAQADR TELESLIRPT GFYRNKAASL IGLGQALVER FGGEVPATMD
      121 KLVTLPGVGR KTANVILGNA FGIPGITVDT HFGRLVRRWR WTTAEDPVKV EQAVGELIER
      181 KEWTLLSHRV IFHGRRVCHA RRPACGVCVL AKDCPSFGLG PTEPLLAAPL VQGPETDHLL
      241 ALAGL
//