LOCUS CAB5248067.1 3507 aa PRT BCT 25-MAY-2020
DEFINITION Mycobacterium tuberculosis variant bovis AF2122/97 ppe family protein ppe7 protein.
ACCESSION LT708304-366
PROTEIN_ID CAB5248067.1
SOURCE Mycobacterium tuberculosis variant bovis AF2122/97
  ORGANISM Mycobacterium tuberculosis variant bovis AF2122/97
  Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae; Mycobacterium; Mycobacterium tuberculosis complex.
REFERENCE 1
  AUTHORS Malone K.M.
  JOURNAL Submitted (06-DEC-2016) to the INSDC. School of Veterinary Medicine, Tuberculosis Molecular Microbiology Research Group, University College Dublin, Tuberculosis Molecular Microbiology Research Group, School of Veterinary Medicine, University College Dublin, D4, Ireland
REFERENCE 2
  AUTHORS Malone M K., Farrell D., Malone K.
  JOURNAL Submitted (15-APR-2020) to the INSDC. School of Veterinary Medicine, Tuberculosis Molecular Microbiology Research Group, University College Dublin, Tuberculosis Molecular Microbiology Research Group, School of Veterinary Medicine, University College Dublin, D4, Ireland
FEATURES Qualifiers
source
  /organism="Mycobacterium tuberculosis variant bovis AF2122/97"
  /chromosome="Mycobacterium_bovis_AF212297"
  /isolate="AF2122/97"
  /mol_type="genomic DNA"
  /isolation_source="Mycobacterium bovis subsp. bovis strain AF2122/97. This strain is a fully virulent strain that was isolated in 1997 in the UK from a cow suffering necrotic lesions in lung and bronchomediastinal lymph nodes. The strain was also reported to infect and persist in badgers that are considered to be a significant source of bovine infection."
  /db_xref="taxon:233413"
protein
  /transl_table=11
  /gene="PPE8"
  /locus_tag="BQ2027_MB0362C"
  /note="Mb0362c, PPE8, len: 3507 aa. Equivalent to Rv0355c and Rv0354c, len: 3300 aa and 141 aa, from Mycobacterium tuberculosis strain H37Rv, (99.8% identity in 3296 aa overlap and 100.0% identity in 125 aa overlap). PPE8, member of the Mycobacterium tuberculosis PPE family, similar to others e.g.
AL009198|MTV004_5 from M. tuberculosis (3716 aa), FASTA scores: opt: 2906, E(): 0, (40.9% identity in 3833 aa overlap); MTV004_3 FASTA scores: (39.0% identity in 3531 aa overlap); etc. Gene contains large number of clustered Major Polymorphic Tandem Repeats (MPTR). Related to MTCY13E10.16c, E(): 0; MTCY13E10.17c, E(): 0; MTCY48.17, E(): 0; MTCY98.0034c, E(): 0; MTCY03C7.23 E(): 0; MTCY98.0031c, E(): 0; MTCY31.06c, E(): 5.6e-17; MTCY359.33, E(): 2.3e-16. PPE7, member of the Mycobacterium tuberculosis PPE family, similar to others e.g. MTCY63_9 from Mycobacterium tuberculosis (2411 aa), FASTA scores: E(): 3.6e-11, (47.6% identity in 103 aa overlap). Possible continuation of ORF upstream, but no sequence error apparent. REMARK-M.bovis-M.tuberculosis: In Mycobacterium tuberculosis strain H37Rv, PPE7 and PPE8 exist as 2 genes. In Mycobacterium bovis, a 2 bp insertion (*-ta) resulting in the absence of a stop codon between the 2 genes, leads to a single product. Mb0362c found to be expressed during exponential growth in Sauton's minimal media by RNA-sequencing." 
BEGIN 1 MSFAVLPPEI NSARLYVGAG LAPMLDAAAA WDGLADELGS AAASFSAVTA GLAGSSWLGA 61 ASTAMTGAAA PYLGWLSAAA AQAQQAATQT RLAAAAFEAA LAATVHPAII SANRALFVSL 121 VVSNLLGQNA PAIAATEAAY EQMWAQDVAA MFGYHAGASA AVSALTPFGQ ALPTVAGGGA 181 LVSAAAAQVT TRVFRNLGLA NVGEGNVGNG NVGNFNLGSA NIGNGNIGSG NIGSSNIGFG 241 NVGPGLTAAL NNIGFGNTGS NNIGFGNTGS NNIGFGNTGD GNRGIGLTGS GLLGFGGLNS 301 GTGNIGLFNS GTGNVGIGNS GTGNWGIGNS GNSYNTGFGN SGDANTGFFN SGIANTGVGN 361 AGNYNTGSYN PGNSNTGGFN MGQYNTGYLN SGNYNTGLAN SGNVNTGAFI TGNFNNGFLW 421 RGDHQGLIFG SPGFFNSTSA PSSGFFNSGA GSASGFLNSG ANNSGFFNSS SGAIGNSGLA 481 NAGVLVSGVI NSGNTVSGLF NMSLVAITTP ALISGFFNTG SNMSGFFGGP PVFNLGLANR 541 GVVNILGNAN IGNYNILGSG NVGDFNILGS GNLGSQNILG SGNVGSFNIG SGNIGVFNVG 601 SGSLGNYNIG SGNLGIYNIG FGNVGDYNVG FGNAGDFNQG FANTGNNNIG FANTGNNNIG 661 IGLSGDNQQG FNIASGWNSG TGNSGLFNSG TNNVGIFNAG TGNVGIANSG TGNWGIGNPG 721 TDNTGILNAG SYNTGILNAG DFNTGFYNTG SYNTGGFNVG NTNTGNFNVG DTNTGSYNPG 781 DTNTGFFNPG NVNTGAFDTG DFNNGFLVAG DNQGQIAIDL SVTTPFIPIN EQMVIDVHNV 841 MTFGGNMITV TEASTVFPQT FYLSGLFFFG PVNLSASTLT VPTITLTIGG PTVTVPISIV 901 GALESRTITF LKIDPAPGIG NSTTNPSSGF FNSGTGGTSG FQNVGGGSSG VWNSGLSSAI 961 GNSGFQNLGS LQSGWANLGN SVSGFFNTST VNLSTPANVS GLNNIGTNLS GVFRGPTGTI 1021 FNAGLANLGQ LNIGSANLGD FNLGSGNVGS FNVFSGNQGS YNIGPANLGN YNIGFANLGN 1081 YNIGFGNAGD FNQGFANTGN NNIGFANTGN NNIGIGLSGD NQQGFNFAGG WNSGTANIGL 1141 FNSGTNNVGI GNSGTGNWGI GNSGSGNTGI GNTGSTNTGF FNTGIVNTGV ANAGSYNTGW 1201 YNTGDTNTGI ANLGDFNTGF YNTGNFSTGF ANQGDIATGA FITGDMGNGA FWRGDQQGLF 1261 SAGYRVHVPE IPAHVTVEVP VNIPITASFT NTVYSGITLE QINFGFTIDI AGIPLLAGAI 1321 SKAVLPPITG TGPAITVNIG DPGGSTAIRI PATASVGPFD VTFVNIAATT GFFNATTDPS 1381 SGFFNGGPGT VSGIANIGAN ISGFQNVANS ATSGFNNYGS LQSGLANLGD TVSGVFNTGI 1441 GAPANVSGMF NIGSNLAGFF HDQATGMSMF NLGLGNIGQF NVGFSNVGDS NAGLANIGSF 1501 NLGSGNLGSF NVFGGNQGSY NIGPANLGNY NIGLGNLGSY NFGFGNAGDF NLGFANTGNN 1561 NIGFANTGNN NIGIGLSGDN QQGFNFAGGW NSGSGNSGLF NSGTNNIGLF NSGTGNIGIG 1621 NSGTGNWGIA NTGDTNTGIF NTGDVNTGLL NAGNVNTGIF NTGHYNTGSF NAGSFNTAGF 1681 NPGSYNTGYL 
NTGSYNTGLA NSGDVNTGGF ITGNYSNGFW WRGDYQGLAG ISQTITVPDT 1741 AVPVKLHVPI FLDIPVTGTL GTFTVHGFRF PEITGDIFLI GIPFNAATLD AFSFPNISIV 1801 LPNIGINLGS GPDPLIDIAG TGGLLPIKIP LIDIPAAPGF GNSTTTPSSG FFNAGTGTVS 1861 GVGNVGSNSS GFFDLTSGSS GISGVQNFGE LISGGFNFGN TVSGLVNAST LGLSMPANLS 1921 GGGNVGATVA GFVNNTQILN LGFGNVGSGN VGHGNIGDSN VGLGNLGNAN VGHGNIGSFN 1981 VFSGNRGSYN IGPANLGNYN IGLGNLGSYN FGFGNAGDFN LGFANSGSNN IGFANTGNNN 2041 IGIGLSGHNQ QGFGSWNSGT ANTGLFNSGT NNIGLFNSGT GNIGIGNSGI GNTGIGNPGV 2101 GNTGLGNSGT GNWGLWNPGT GNMGVANVGT YNTGGYNVGS TNTGIANVGI ANTGSYNTGS 2161 TNTGSFNDGD FNTGFYNTGD YNTGFYNTGD VNTGAFIGGN FSNGAFWQSD HQGQWGAHYA 2221 ITVPQIPLLN FSLNIPVNIP IHLDFGTLAV NGFQIPAITL RALGVTHFSV GPIIVPRIAG 2281 TLPVIDINIG DPGGSSSIPI TITSGAGPVV IPLLDIPPAP GFGNSTTGPS SGFFNSGTGS 2341 SSGFGNVGAN NSGFWNTAFA GIGNSGLQNF GSLQSGWANL GNTVSGFYNT SAADFATPAN 2401 LSGLSNVGAD LTGVLRGPNG STFNAGLANL GQFNVGSANL GSANLGSANL GNSNVGFGNI 2461 GNANIGGANI GDFNVGIANT GPGLTAAVNN IGIGNTGNYN IGVGNTGNYN IGFGNTGNNN 2521 IGIGLSGDNQ IGFGPLNAGI ANMGLFNLGD NNFGMANAGN FNQGIANTGN NNIGLFNTGN 2581 NNVGIGLTGD GLSGFSSLNS GAGNTGFFNS GTANTGLFNS GTGNTGLFNS GTGNVGIGNM 2641 GTGGFGVGLS GDSQVGIGGT NSGSFNIGLF NSGTGNVGIG NSGTGNVGIG NTGTGNTGIG 2701 NSGNYNTGLL NAGLVNTGIA NPGNHNTGLF NIGTFNTGIA NPGHYNTGSY NTGSYNTGMA 2761 NAGDYGTGAF ITGSMNNGLL WRADRQGLLA ANYTITIERP AAFLNVDIPV NIPITGDITN 2821 VSIPAITFPR IDASGSVDIG ILSGTVLAPV GPITLHGGDA SAPLDTPIEI DFGPSPAINL 2881 NIGKPDGSTV INIVGGAGAG PISIPIIDLR PAPGFFNATT GPSSGFLNWG AGSASGLLNF 2941 GNNSGLYNFA TSSMGNSGFQ NYGSLQSGWA NLGNSISGIY NTGLGAPANV SGLLNIGTNL 3001 AGWLQNGPTE TTFSVGLANL GFWNLGSANI GNYNLGSANI GVYNLGSANI GDFNLGSANI 3061 GDFNLGSANI GSSNIGFGNV GPGLTAAIGN IGFGNTGNGN IGIGNTGTGN IGFGNTGNGN 3121 IGIGLTGDTM TGFGGWNSGT GNIGLFNSGT GNIGFGNSGT GNWGIGNSGD YNTGIGNTGS 3181 TNSGFFNTGL VNTGIGNSGD YNTGLFNAGN TNTGSFNPGD YNTGGFNPGN YNTGYFNPGN 3241 SNTGFANSGD VNTGAFNSGN YSNGFFWRGD YQGLGGFAYQ SAVSEIPWSY DIGSNIEIPI 3301 EGDINAITQD AFTIDEFEIP IKLRVSVCVI YIPFKGCVKH VSVTIPITTE HLGPYEIDAS 3361 TINPDQPIDT AFTQTLDFAG 
SGTVGAFPFG FGWQQSPGFF NSTTTPSSGF FNSGAGGASG 3421 FLNDAAAAVS GLGNVFTETS GFFNAGGVGN SGFQNFGNLL SGWANLGNTV SGFYNTSMLD 3481 LATQALISGF GNHGARLSGI LNNGSGP //