LOCUS SIU02165.1 1460 aa PRT BCT 25-MAY-2020 DEFINITION Mycobacterium tuberculosis variant bovis AF2122/97 pe- pgrs family protein pe_pgrs54 protein. ACCESSION LT708304-3579 PROTEIN_ID SIU02165.1 SOURCE Mycobacterium tuberculosis variant bovis AF2122/97 ORGANISM Mycobacterium tuberculosis variant bovis AF2122/97 Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae; Mycobacterium; Mycobacterium tuberculosis complex. REFERENCE 1 AUTHORS Malone K.M. JOURNAL Submitted (06-DEC-2016) to the INSDC. School of Veterinary Medicine, Tuberculosis Molecular Microbiology Research Group, University College Dublin, Tuberculosis Molecular Microbiology Research Group, School of Veterinary Medicine, University College Dublin, D4, Ireland REFERENCE 2 AUTHORS Malone M K., Farrell D., Malone K. JOURNAL Submitted (15-APR-2020) to the INSDC. School of Veterinary Medicine, Tuberculosis Molecular Microbiology Research Group, University College Dublin, Tuberculosis Molecular Microbiology Research Group, School of Veterinary Medicine,, University College Dublin, D4, Ireland FEATURES Qualifiers source /organism="Mycobacterium tuberculosis variant bovis AF2122/97" /chromosome="Mycobacterium_bovis_AF212297" /isolate="AF2122/97" /mol_type="genomic DNA" /isolation_source="Mycobacterium bovis subsp. bovis strain AF2122/97. This strain is a fully virulent strain that was isolated in 1997 in the UK from a cow suffering necrotic lesions in lung and bronchomediastinal lymph nodes. The strain was also reported to infect and persist in badgers that are considered to be a significant source of bovine infection." /db_xref="taxon:233413" protein /transl_table=11 /gene="PE_PGRS54" /locus_tag="BQ2027_MB3538" /note="Mb3538, PE_PGRS54, len: 1460 aa. Similar to Rv3508, len: 1901 aa, from Mycobacterium tuberculosis strain H37Rv, (71.06% identity in 1901 aa overlap). Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins, similar to others from Mycobacterium tuberculosis strains H37Rv and CDC1551 e.g. downstream O53559|Rv3514|MTV023.21 (1489 aa), FASTA scores: opt: 6598, E(): 0, (71.05% identity in 1533 aa overlap). Equivalent to AAK47971 from Mycobacterium tuberculosis strain CDC1551 (1384 aa) but shorter 13 aa and with some minor differences between the proteins. Contains five PS00583 pfkB family of carbohydrate kinases signatures 1. REMARK-M.bovis-M.tuberculosis: In Mycobacterium bovis, insertions of 9 bp (*-cggcgcggg), 27 bp and 18 bp, deletions of 168 bp, 603 bp and 48 bp, and several substitutions lead to a shorter product compared to its homolog in Mycobacterium tuberculosis strain H37Rv (1460 aa versus 1901 aa). Mb3538 found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/TrEMBL:A0A1R3Y615" BEGIN 1 MSFVLIAPEF VTAAAGDLTN LGSSISAANA SAASATTQVL AAGADEVSAR IAALFGGFGL 61 EYQAISAQVA AYHQRFVQAL STGAGAYASA EAAAAEQIVL GVINAPTQAL LGRPLIGDGA 121 NATTPGGAGG AGGLLFGNGG AGAAGAPGQA GGPGGPAGLW GNGGPGGAGG SGGGTGGAGG 181 AGGWLFGVGG AGGVGGAGGG TGGAGGPGGL IWGGGGAGGV GGAGGGTGGA GGRAELLFGA 241 GGAGGAGGAG TDGGPGATGG TGGHGGVGGD GGWLAPGGAG GAGGQGGAGG AGSDGGALGG 301 TGGTGGTGGA GGAGGRGALL LGAGGQGGLG GAGGQGGTGG AGGDGVLGGV GGTGGKGGVG 361 GVAGLGGAGG AAGQLFSAGG AAGAVGVGGT GGQGGAGGMG GSGADNASGI GADGGAGGTG 421 GNAGAGGAGG AAGTGGTGGV VGAAGKAGIG GTGGQGGAGG AGSAGTDATA TGATGGTGFS 481 GGAGGAGGAG GNTGVGGTNG SGGQGGTGGA GGAGGAGGVG ADNPTGIGGA GGTGGAGGTG 541 GTGGAAGAGG AGGAVGTGGT GGVVGDVGNA GIGGTGGKGG AGGTGFAGGA GGAGGQGGSS 601 GAGGTNGSGG AGGTGGQGGA GGAGGAGADN PTGIGGAGGT GGTGGAAGAG GAGGAIGTGG 661 TGGAVGSVGN AGIGGTGGTG GVGGAGGAGA AAAAGSSATG GAGFAGGAGG EGGAGGNSGV 721 GGTNGSGGAG GAGGKGGTGG AGGSGADNPT GAGFAGGAGG TGGAAGAGGA GGATGTGGTG 781 GVVGATGSAG IGGAGGRGGD GGDGASGLGL GLSGFDGGQG GQGGDGGSAG AGGINGAGGA 841 GGDGGDGGDG ATGAAGLGDN GGVGGDGGAG GAAGNGGNAG VGLTAKAGDG GAAGNGGNGG 901 AGGAGGAGDN NFNGGQGGAG GQGGQGGLGG ASTTSINANG GAGGNGGTGG KGGAGGAGTL 961 GVGGSGGTGG DGGDAGAGGG GGFGGAAGKA GGGGNGGVGG DGGEGASGLG LDLSGFDGGQ 1021 GGQGGAGGNA GAGGINGAGG TGGTGGAGGD GAPATLIGGP DGGDGGQGGI GGDGGNAGFG 1081 AGVPGDGGIG GTGGAGGAGG AGGAGDAGAD GDPSIDGGQG GAGGHGGQGG KGGLNSTGLA 1141 SAASGDGGNG GAGGAGGNGG DGDGFIGGSG GTGGTGGDAG AGGLANTGGT AGNAGIGGAG 1201 GRGGDGGAGD SGALSQDGNG FAGGQGGQGG AGGNAGAGGI NGAGGTGGTG GAGGDGAPAT 1261 LIGGPDGGDG GQGGGAGFGS GVAGAAGAGG NGGKGGDGGT GGTGGTNFAG GQGGAGGRGG 1321 AGGNGANGVG DNAAGGDGGN GGAGGLGGGG GTGGTNGNGG LGGGGGNGGA GGAGGTPTGS 1381 GTEGTGGDGG DAGAGGNGGS ATGVGNGGNG GDGGNGGDGG NGAPGGFGGG AGAGGLGGSG 1441 AGGGTDGDDG NGGSPGTDGS //