LOCUS SIU01855.1 1055 aa PRT BCT 25-MAY-2020 DEFINITION Mycobacterium tuberculosis variant bovis AF2122/97 POSSIBLE ATP-DEPENDENT DNA HELICASE protein. ACCESSION LT708304-3269 PROTEIN_ID SIU01855.1 SOURCE Mycobacterium tuberculosis variant bovis AF2122/97 ORGANISM Mycobacterium tuberculosis variant bovis AF2122/97 Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae; Mycobacterium; Mycobacterium tuberculosis complex. REFERENCE 1 AUTHORS Malone K.M. JOURNAL Submitted (06-DEC-2016) to the INSDC. School of Veterinary Medicine, Tuberculosis Molecular Microbiology Research Group, University College Dublin, Tuberculosis Molecular Microbiology Research Group, School of Veterinary Medicine, University College Dublin, D4, Ireland REFERENCE 2 AUTHORS Malone M K., Farrell D., Malone K. JOURNAL Submitted (15-APR-2020) to the INSDC. School of Veterinary Medicine, Tuberculosis Molecular Microbiology Research Group, University College Dublin, Tuberculosis Molecular Microbiology Research Group, School of Veterinary Medicine,, University College Dublin, D4, Ireland FEATURES Qualifiers source /organism="Mycobacterium tuberculosis variant bovis AF2122/97" /chromosome="Mycobacterium_bovis_AF212297" /isolate="AF2122/97" /mol_type="genomic DNA" /isolation_source="Mycobacterium bovis subsp. bovis strain AF2122/97. This strain is a fully virulent strain that was isolated in 1997 in the UK from a cow suffering necrotic lesions in lung and bronchomediastinal lymph nodes. The strain was also reported to infect and persist in badgers that are considered to be a significant source of bovine infection." /db_xref="taxon:233413" protein /transl_table=11 /locus_tag="BQ2027_MB3227C" /note="Mb3227c, -, len: 1055 aa. Equivalent to Rv3202c, len: 1055 aa, from Mycobacterium tuberculosis strain H37Rv, (99.9% identity in 1055 aa overlap). Possible ATP-dependent DNA helicase (EC 3.6.1.-), showing some similarity to UvrD proteins e.g. Q9FCK5|2SC3B6.07 PUTATIVE ATP-DEPENDENT DNA HELICASE from Streptomyces coelicolor (1159 aa), FASTA scores: opt: 666, E(): 1e-29, (34.5% identity in 1154 aa overlap); Q9L7T3|UVRD|PA5443 MISMATCH REPAIR PROTEIN MUTU (DNA HELICASE II) from Pseudomonas aeruginosa (728 aa), FASTA scores: opt: 239, E(): 7.3e-06, (23.8% identity in 677 aa overlap) (no similarity in C-terminal part for this one); etc. C-terminal region similar to Q9FDU2|ORF3 ORF3 PROTEIN (FRAGMENT) from Streptomyces griseus (551 aa), FASTA scores: opt: 800, E(): 1.7e-37, (36.2% identity in 525 aa overlap); and Q9ZG15 HYPOTHETICAL 35.5 KDA PROTEIN from Rhodococcus erythropolis (323 aa), FASTA scores: opt: 232, E(): 9.7e-06, (28.55% identity in 266 aa overlap). Mb3227c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." /db_xref="GOA:A0A1R3Y579" /db_xref="InterPro:IPR000212" /db_xref="InterPro:IPR013986" /db_xref="InterPro:IPR014016" /db_xref="InterPro:IPR014017" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR034739" /db_xref="InterPro:IPR038726" /db_xref="UniProtKB/TrEMBL:A0A1R3Y579" BEGIN 1 MSHIWGVEAG AALAPGLRGP VLVLGGPGTG KSTLLVEAAV AHIGAGTDPE SVLLLTGSGR 61 MGMRARSALT TALLRSRTNG PCRAAIREPV VRTVHSYAYA VLRKAAQRAG DALPRLLTSA 121 EQDAIIRELL AGDAEDGPAA TTTWPAHLRP ALTTAGFATE LRNLLARCAE RGLDPLELQQ 181 LGRRRGRPEW IAAGQFAQRY EQVMLLRGAV GLAAPQATAP ALSAAELVGA ALEAFAVDPE 241 LLAAERARVR TLLVDDAQQL DPQAARLVRM LAAGTELALI AGDPNQAVFG FRGGEPTGLL 301 ADDPPPAGGA PIPSVTLTVS HRCAPAVARA VTGIARRLPG RSVGRRIEGT GTEVGSVTVR 361 LAGSAHAEAA MIADALRRAH LIDGVPWSQM AVIVRSVPRA VRLPRALAAA GVPVAPPAVG 421 GPLSAEPAVR ALLTVLEATA DGLDGDQALL LLTGPIGGVD PVSLRQLRRT LQRARPGQTS 481 RKFGDLLVEV LGGDAPPSGP GSRALRRVRA VLTAAARCHR SGSLGGQDPR HTLWAAWQRS 541 GLQRRWLAAS EHGGAAAVQA TRDLETVTAL FDITDHYVSR TSGASLRGLV EHVTALQLPV 601 VRPEPAAPTE QVMVLSAHAA LGHEWDLVVI AGLQDGLWPN TVPRGGVLGT QRLLDELDGV 661 TKDASMRAPL LAEERRLLVT AMGRARRRLL VTAVDSDAGG GGHEAVLPSA FFFEIAQWAD 721 GDGEPVAMQP VSAPRVLSAA AVVGRLRAVV CAPACAVDDA DRDCAATQLA RLAKAGVPGA 781 DPSEWHGLAP VSTSDPLCDS DDLVTLTPST LQALNDCPLR WLAERHGGTN TRELPSAVGS 841 VLHALFAEPG RSESQLLAEL DRVWGHLPFG AQWYSANELA RHRAMIQAFV QWRAQSRSEL 901 TEVGVEVDID GALEDGSGQA RKIRLRGRAD RLERDPAGRL VIVDIKTGKT PVSKDDAQQH 961 AQLAMYQLAV AEGLVRAGDE PGGARLVYVG KSGAAGVAER KQDPLTPAAR DEWRNLVRQL 1021 AAATAGPQFI ARRNDGCTHC PLRPGCPAHV RGSAP //