LOCUS SIU02417.1 626 aa PRT BCT 25-MAY-2020 DEFINITION Mycobacterium tuberculosis variant bovis AF2122/97 possible hydrolase protein. ACCESSION LT708304-3831 PROTEIN_ID SIU02417.1 SOURCE Mycobacterium tuberculosis variant bovis AF2122/97 ORGANISM Mycobacterium tuberculosis variant bovis AF2122/97 Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae; Mycobacterium; Mycobacterium tuberculosis complex. REFERENCE 1 AUTHORS Malone K.M. JOURNAL Submitted (06-DEC-2016) to the INSDC. School of Veterinary Medicine, Tuberculosis Molecular Microbiology Research Group, University College Dublin, Tuberculosis Molecular Microbiology Research Group, School of Veterinary Medicine, University College Dublin, D4, Ireland REFERENCE 2 AUTHORS Malone M K., Farrell D., Malone K. JOURNAL Submitted (15-APR-2020) to the INSDC. School of Veterinary Medicine, Tuberculosis Molecular Microbiology Research Group, University College Dublin, Tuberculosis Molecular Microbiology Research Group, School of Veterinary Medicine,, University College Dublin, D4, Ireland FEATURES Qualifiers source /organism="Mycobacterium tuberculosis variant bovis AF2122/97" /chromosome="Mycobacterium_bovis_AF212297" /isolate="AF2122/97" /mol_type="genomic DNA" /isolation_source="Mycobacterium bovis subsp. bovis strain AF2122/97. This strain is a fully virulent strain that was isolated in 1997 in the UK from a cow suffering necrotic lesions in lung and bronchomediastinal lymph nodes. The strain was also reported to infect and persist in badgers that are considered to be a significant source of bovine infection." /db_xref="taxon:233413" protein /transl_table=11 /locus_tag="BQ2027_MB3788C" /note="Mb3788c, -, len: 626 aa. Equivalent to Rv3762c, len: 626 aa, from Mycobacterium tuberculosis strain H37Rv, (99.7% identity in 626 aa overlap). Possible hydrolase (EC 3.-.-.-), highly similar to hypothetical proteins and beta-lactamases (EC 3.5.2.6) e.g. Q9RL04|SC5G9.23 HYPOTHETICAL 70.3 KDA PROTEIN from Streptomyces coelicolor (648 aa), FASTA scores: opt: 2088, E(): 3.7e-124, (52.9% identity in 624 aa overlap); P32717|YJCS_ECOLI|B4083 HYPOTHETICAL 73.2 KDA PROTEIN from Escherichia coli strain K12 (661 aa), FASTA scores: opt: 1911, E(): 5.7e-113, (46.9% identity in 631 aa overlap); Q9A824|CC1540 METALLO-BETA-LACTAMASE FAMILY PROTEIN from Caulobacter crescentus (647 aa), FASTA scores: opt: 1891, E(): 1e-111, (48.55% identity in 628 aa overlap); Q08347|YOL164W CHROMOSOME XV READING FRAME ORF from Saccharomyces cerevisiae (Baker's yeast) (646 aa) FASTA scores: opt: 1829, E(): 8.4e-108, (45.7% identity in 615 aa overlap); Q9I5I9|PA0740 PROBABLE BETA-LACTAMASE from Pseudomonas aeruginosa (658 aa), FASTA scores: opt: 1699, E(): 1.4e-99, (43.15% identity in 630 aa overlap); Q52556|SDSA ALKYL SULFATASE (protein involved in the degradation of sulfate esters of long-chain primaryal cohols e.g. SDS sodium dodecyl sulfate) from Pseudomonas sp (528 aa), FASTA scores: opt: 841, E(): 1.7e-45, (33.7% identity in 534 aa overlap); etc. N-terminual end also highly similar to Q48790|SEPA SEPA PROTEIN (protein implicated in cell separation) from Listeria monocytogenes (391 aa), FASTA scores: opt: 1256, E(): 8.3e-72, (49.6% identity in 363 aa overlap). Also slight similarity to P96253|Rv0407|MTCY22G10.03 HYPOTHETICAL 37.0 KDA PROTEIN from Mycobacterium tuberculosis (336 aa). Protein product from Mb3788c detected using SWATH mass spectrometry. Mb3788c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="GOA:A0A1R3Y554" /db_xref="InterPro:IPR001279" /db_xref="InterPro:IPR029228" /db_xref="InterPro:IPR029229" /db_xref="InterPro:IPR036527" /db_xref="InterPro:IPR036866" /db_xref="InterPro:IPR038536" /db_xref="UniProtKB/TrEMBL:A0A1R3Y554" BEGIN 1 MPMEHKPPTA VIQAAHGEHS LPLHDTTDFD DADRGFIAAL SPCVIKAADG RVVWDNDAYS 61 FLDGAAPTSV HPSLWRQSQL TAKQGLYQVV PGIYQVRGFD ISNISFVEGD TGLIVIDPLV 121 STEVAAAALD LYRAHRGADR PVVAVIYTHS HVDHFGGVLG GTTQADVDAG KVAVLAPEGF 181 TAHAVQENIY AGSAMMRRAG YMYGTVLARG LRGHVGCGLG QTLSTGEVSL VVPTVDITET 241 GETHTIDGVE IEFQMAPGTE APAEMHFYFP RFRALCMAEN ATHNLHNLLT LRGALVRDPR 301 AWSGYLTEAI DTFADRTDVV FASHHWPTWG REKIVEFLSQ QRDMHSYLHD QTLRLLNQGY 361 TGVEIAEMFQ LPPALQRAWH THGYYGSVSH NVKAIYQRYM GWFDGNPGWL WPHPPEALAP 421 RYVDALGGID RVLELAREAF DAGDFRWAAT LLDHAVFADS EHAAARGLYA DTLEQLAYGA 481 ECATWRNFFL TGAAELRDGN PGSSGQVPAP TFFAQLTPDQ IFDVLAISIN GPRAWDLDLA 541 IDFTFTEPDV NYRLTLRNGV LIHRKLPADP ATANATVTVG DKVRLVAAAL GDISSPGFEV 601 FGDRTVLQTF LSVLDRPDSA FNIVTP //