LOCUS SIU01525.1 364 aa PRT BCT 25-MAY-2020
DEFINITION Mycobacterium tuberculosis variant bovis AF2122/97 CONSERVED HYPOTHETICAL PROTEIN protein.
ACCESSION LT708304-2939
PROTEIN_ID SIU01525.1
SOURCE Mycobacterium tuberculosis variant bovis AF2122/97
  ORGANISM Mycobacterium tuberculosis variant bovis AF2122/97
    Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae; Mycobacterium; Mycobacterium tuberculosis complex.
REFERENCE 1
  AUTHORS Malone K.M.
  JOURNAL Submitted (06-DEC-2016) to the INSDC. School of Veterinary Medicine, Tuberculosis Molecular Microbiology Research Group, University College Dublin, Tuberculosis Molecular Microbiology Research Group, School of Veterinary Medicine, University College Dublin, D4, Ireland
REFERENCE 2
  AUTHORS Malone M K., Farrell D., Malone K.
  JOURNAL Submitted (15-APR-2020) to the INSDC. School of Veterinary Medicine, Tuberculosis Molecular Microbiology Research Group, University College Dublin, Tuberculosis Molecular Microbiology Research Group, School of Veterinary Medicine, University College Dublin, D4, Ireland
FEATURES Qualifiers
  source
    /organism="Mycobacterium tuberculosis variant bovis AF2122/97"
    /chromosome="Mycobacterium_bovis_AF212297"
    /isolate="AF2122/97"
    /mol_type="genomic DNA"
    /isolation_source="Mycobacterium bovis subsp. bovis strain AF2122/97. This strain is a fully virulent strain that was isolated in 1997 in the UK from a cow suffering necrotic lesions in lung and bronchomediastinal lymph nodes. The strain was also reported to infect and persist in badgers that are considered to be a significant source of bovine infection."
    /db_xref="taxon:233413"
  protein
    /transl_table=11
    /locus_tag="BQ2027_MB2904C"
    /note="Mb2904c, -, len: 364 aa. Equivalent to 5' end of Rv2880c and 3' end of Rv2879c, len: 275 aa and 189 aa, from Mycobacterium tuberculosis strain H37Rv, (100.0% identity in 177 aa overlap and 100.0% identity in 188 aa overlap). Rv2880c: Conserved hypothetical protein, highly similar in N-terminus to others e.g. 
O86754|SC6A9.22c HYPOTHETICAL 40.4 KDA PROTEIN from Streptomyces coelicolor (368 aa), FASTA scores: opt: 663, E(): 2.6e-33, (52.6% identity in 213 aa overlap); Q55880|Y098_SYNY3|SLL0098 HYPOTHETICAL 38.9 KDA PROTEIN from Synechocystis sp. strain PCC 6803 (350 aa), FASTA scores: opt: 362, E(): 7.3e-15, (38.9% identity in 162 aa overlap); O66732|AQ_416 HYPOTHETICAL 40.2 KDA PROTEIN from Aquifex aeolicus (348 aa), FASTA scores: opt: 321, E(): 2.4e-12, (39.75% identity in 146 aa overlap); etc. Appears to be a frame shift with respect to preceding ORF but we can detect no error in the cosmid sequence to account for this. Rv2879c: Conserved hypothetical protein, similar to others e.g. C-terminus of Q9RVT6|DR0936 CONSERVED HYPOTHETICAL PROTEIN from Deinococcus radiodurans (346 aa), FASTA scores: opt: 505, E(): 1e-26, (46.5% identity in 185 aa overlap); O34617|YLON_BACSU HYPOTHETICAL 41.6 KDA PROTEIN from Bacillus subtilis (363 aa), FASTA scores: opt: 459, E(): 1.2e-24, (40.5% identity in 185 aa overlap); YFGB_ECOLI|P36979 hypothetical 43.1 kd protein from Escherichia coli (384 aa), FASTA scores, opt: 410, E(): 2.8e-21, (41.7% identity in 187 aa overlap); etc. Appears to be a frame shift with respect to following ORF but we can detect no error in the cosmid sequence to account for this. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, Rv2880c and Rv2879c exist as 2 genes. In Mycobacterium bovis, a single base deletion (g-*) leads to a single product. Protein product from Mb2904c detected using SWATH mass spectrometry and 0. Mb2904c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing (TPM > 20)." 
    /db_xref="GOA:P0A645"
    /db_xref="InterPro:IPR004383"
    /db_xref="InterPro:IPR007197"
    /db_xref="InterPro:IPR013785"
    /db_xref="InterPro:IPR027492"
    /db_xref="InterPro:IPR040072"
    /db_xref="UniProtKB/Swiss-Prot:P0A645"
BEGIN
    1 MVPELMFDEP RPGRPPRHLA DLDAAGRASA VAELGLPAFR AKQLAHQYYG RLIADPRQMT
   61 DLPAAVRDRI AGAMFPNLLT ASADITCDAG QTRKTLWRAV DGTMFESVLM RYPRRNTVCI
  121 SSQAGCGMAC PFCATGQGGL TRNLSTAEIL EQVRAGAAAL RDDFGDRLSN VVFMGMGEPL
  181 ANYARVLAAV QRITARPPSG FGISARAVTV STVGLAPAIR NLADARLGVT LALSLHAPDD
  241 GLRDTLVPVN NRWRISEALD AARYYANVTG RRVSIEYALI RDVNDQPWRA DLLGKRLHRV
  301 LGPLAHVNLI PLNPTPGSDW DASPKPVERE FVKRVRAKGV SCTVRDTRGR EISAACGQLA
  361 AVGG
//