LOCUS       CAB5248067.1            3507 aa    PRT              BCT 25-MAY-2020
DEFINITION  Mycobacterium tuberculosis variant bovis AF2122/97 ppe
            family protein ppe7 protein.
ACCESSION   LT708304-366
PROTEIN_ID  CAB5248067.1
SOURCE      Mycobacterium tuberculosis variant bovis AF2122/97
  ORGANISM  Mycobacterium tuberculosis variant bovis AF2122/97
            Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae;
            Mycobacterium; Mycobacterium tuberculosis complex.
REFERENCE   1
  AUTHORS   Malone K.M.
  JOURNAL   Submitted (06-DEC-2016) to the INSDC. Tuberculosis Molecular
            Microbiology Research Group, School of Veterinary Medicine,
            University College Dublin, D4, Ireland
REFERENCE   2
  AUTHORS   Malone M K., Farrell D., Malone K.
  JOURNAL   Submitted (15-APR-2020) to the INSDC. Tuberculosis Molecular
            Microbiology Research Group, School of Veterinary Medicine,
            University College Dublin, D4, Ireland
FEATURES             Qualifiers
     source          /organism="Mycobacterium tuberculosis variant bovis
                     AF2122/97"
                     /chromosome="Mycobacterium_bovis_AF212297"
                     /isolate="AF2122/97"
                     /mol_type="genomic DNA"
                     /isolation_source="Mycobacterium bovis subsp. bovis strain
                     AF2122/97. This strain is a fully virulent strain that was
                     isolated in 1997 in the UK from a cow suffering necrotic
                     lesions in lung and bronchomediastinal lymph nodes. The
                     strain was also reported to infect and persist in badgers
                     that are considered to be a significant source of bovine
                     infection."
                     /db_xref="taxon:233413"
     protein         /transl_table=11
                     /gene="PPE8"
                     /locus_tag="BQ2027_MB0362C"
                     /note="Mb0362c, PPE8, len: 3507 aa. Equivalent to Rv0355c
                     and Rv0354c, len: 3300 aa and 141 aa, from Mycobacterium
                     tuberculosis strain H37Rv, (99.8% identity in 3296 aa
                     overlap and 100.0% identity in 125 aa overlap). PPE8,
                     member of the Mycobacterium tuberculosis PPE family,
                     similar to others e.g. AL009198|MTV004_5 from M.
                     tuberculosis (3716 aa), FASTA scores: opt: 2906, E(): 0,
                     (40.9% identity in 3833 aa overlap); MTV004_3 FASTA
                     scores: (39.0% identity in 3531 aa overlap); etc. Gene
                     contains large number of clustered Major Polymorphic
                     Tandem Repeats (MPTR). Related to MTCY13E10.16c, E(): 0;
                     MTCY13E10.17c, E(): 0; MTCY48.17, E(): 0; MTCY98.0034c,
                     E(): 0; MTCY03C7.23 E(): 0; MTCY98.0031c, E(): 0;
                     MTCY31.06c, E(): 5.6e-17; MTCY359.33, E(): 2.3e-16. PPE7,
                     member of the Mycobacterium tuberculosis PPE family,
                     similar to others e.g. MTCY63_9 from Mycobacterium
                     tuberculosis (2411 aa), FASTA scores: E(): 3.6e-11, (47.6%
                     identity in 103 aa overlap). Possible continuation of ORF
                     upstream, but no sequence error apparent.
                     REMARK-M.bovis-M.tuberculosis: In Mycobacterium
                     tuberculosis strain H37Rv, PPE7 and PPE8 exist as 2 genes.
                     In Mycobacterium bovis, a 2 bp insertion (*-ta), resulting
                     in the absence of a stop codon between the 2 genes, leads
                     to a single product. Mb0362c found to be expressed during
                     exponential growth in Sauton's minimal media by
                     RNA-sequencing."
BEGIN
        1 MSFAVLPPEI NSARLYVGAG LAPMLDAAAA WDGLADELGS AAASFSAVTA GLAGSSWLGA
       61 ASTAMTGAAA PYLGWLSAAA AQAQQAATQT RLAAAAFEAA LAATVHPAII SANRALFVSL
      121 VVSNLLGQNA PAIAATEAAY EQMWAQDVAA MFGYHAGASA AVSALTPFGQ ALPTVAGGGA
      181 LVSAAAAQVT TRVFRNLGLA NVGEGNVGNG NVGNFNLGSA NIGNGNIGSG NIGSSNIGFG
      241 NVGPGLTAAL NNIGFGNTGS NNIGFGNTGS NNIGFGNTGD GNRGIGLTGS GLLGFGGLNS
      301 GTGNIGLFNS GTGNVGIGNS GTGNWGIGNS GNSYNTGFGN SGDANTGFFN SGIANTGVGN
      361 AGNYNTGSYN PGNSNTGGFN MGQYNTGYLN SGNYNTGLAN SGNVNTGAFI TGNFNNGFLW
      421 RGDHQGLIFG SPGFFNSTSA PSSGFFNSGA GSASGFLNSG ANNSGFFNSS SGAIGNSGLA
      481 NAGVLVSGVI NSGNTVSGLF NMSLVAITTP ALISGFFNTG SNMSGFFGGP PVFNLGLANR
      541 GVVNILGNAN IGNYNILGSG NVGDFNILGS GNLGSQNILG SGNVGSFNIG SGNIGVFNVG
      601 SGSLGNYNIG SGNLGIYNIG FGNVGDYNVG FGNAGDFNQG FANTGNNNIG FANTGNNNIG
      661 IGLSGDNQQG FNIASGWNSG TGNSGLFNSG TNNVGIFNAG TGNVGIANSG TGNWGIGNPG
      721 TDNTGILNAG SYNTGILNAG DFNTGFYNTG SYNTGGFNVG NTNTGNFNVG DTNTGSYNPG
      781 DTNTGFFNPG NVNTGAFDTG DFNNGFLVAG DNQGQIAIDL SVTTPFIPIN EQMVIDVHNV
      841 MTFGGNMITV TEASTVFPQT FYLSGLFFFG PVNLSASTLT VPTITLTIGG PTVTVPISIV
      901 GALESRTITF LKIDPAPGIG NSTTNPSSGF FNSGTGGTSG FQNVGGGSSG VWNSGLSSAI
      961 GNSGFQNLGS LQSGWANLGN SVSGFFNTST VNLSTPANVS GLNNIGTNLS GVFRGPTGTI
     1021 FNAGLANLGQ LNIGSANLGD FNLGSGNVGS FNVFSGNQGS YNIGPANLGN YNIGFANLGN
     1081 YNIGFGNAGD FNQGFANTGN NNIGFANTGN NNIGIGLSGD NQQGFNFAGG WNSGTANIGL
     1141 FNSGTNNVGI GNSGTGNWGI GNSGSGNTGI GNTGSTNTGF FNTGIVNTGV ANAGSYNTGW
     1201 YNTGDTNTGI ANLGDFNTGF YNTGNFSTGF ANQGDIATGA FITGDMGNGA FWRGDQQGLF
     1261 SAGYRVHVPE IPAHVTVEVP VNIPITASFT NTVYSGITLE QINFGFTIDI AGIPLLAGAI
     1321 SKAVLPPITG TGPAITVNIG DPGGSTAIRI PATASVGPFD VTFVNIAATT GFFNATTDPS
     1381 SGFFNGGPGT VSGIANIGAN ISGFQNVANS ATSGFNNYGS LQSGLANLGD TVSGVFNTGI
     1441 GAPANVSGMF NIGSNLAGFF HDQATGMSMF NLGLGNIGQF NVGFSNVGDS NAGLANIGSF
     1501 NLGSGNLGSF NVFGGNQGSY NIGPANLGNY NIGLGNLGSY NFGFGNAGDF NLGFANTGNN
     1561 NIGFANTGNN NIGIGLSGDN QQGFNFAGGW NSGSGNSGLF NSGTNNIGLF NSGTGNIGIG
     1621 NSGTGNWGIA NTGDTNTGIF NTGDVNTGLL NAGNVNTGIF NTGHYNTGSF NAGSFNTAGF
     1681 NPGSYNTGYL NTGSYNTGLA NSGDVNTGGF ITGNYSNGFW WRGDYQGLAG ISQTITVPDT
     1741 AVPVKLHVPI FLDIPVTGTL GTFTVHGFRF PEITGDIFLI GIPFNAATLD AFSFPNISIV
     1801 LPNIGINLGS GPDPLIDIAG TGGLLPIKIP LIDIPAAPGF GNSTTTPSSG FFNAGTGTVS
     1861 GVGNVGSNSS GFFDLTSGSS GISGVQNFGE LISGGFNFGN TVSGLVNAST LGLSMPANLS
     1921 GGGNVGATVA GFVNNTQILN LGFGNVGSGN VGHGNIGDSN VGLGNLGNAN VGHGNIGSFN
     1981 VFSGNRGSYN IGPANLGNYN IGLGNLGSYN FGFGNAGDFN LGFANSGSNN IGFANTGNNN
     2041 IGIGLSGHNQ QGFGSWNSGT ANTGLFNSGT NNIGLFNSGT GNIGIGNSGI GNTGIGNPGV
     2101 GNTGLGNSGT GNWGLWNPGT GNMGVANVGT YNTGGYNVGS TNTGIANVGI ANTGSYNTGS
     2161 TNTGSFNDGD FNTGFYNTGD YNTGFYNTGD VNTGAFIGGN FSNGAFWQSD HQGQWGAHYA
     2221 ITVPQIPLLN FSLNIPVNIP IHLDFGTLAV NGFQIPAITL RALGVTHFSV GPIIVPRIAG
     2281 TLPVIDINIG DPGGSSSIPI TITSGAGPVV IPLLDIPPAP GFGNSTTGPS SGFFNSGTGS
     2341 SSGFGNVGAN NSGFWNTAFA GIGNSGLQNF GSLQSGWANL GNTVSGFYNT SAADFATPAN
     2401 LSGLSNVGAD LTGVLRGPNG STFNAGLANL GQFNVGSANL GSANLGSANL GNSNVGFGNI
     2461 GNANIGGANI GDFNVGIANT GPGLTAAVNN IGIGNTGNYN IGVGNTGNYN IGFGNTGNNN
     2521 IGIGLSGDNQ IGFGPLNAGI ANMGLFNLGD NNFGMANAGN FNQGIANTGN NNIGLFNTGN
     2581 NNVGIGLTGD GLSGFSSLNS GAGNTGFFNS GTANTGLFNS GTGNTGLFNS GTGNVGIGNM
     2641 GTGGFGVGLS GDSQVGIGGT NSGSFNIGLF NSGTGNVGIG NSGTGNVGIG NTGTGNTGIG
     2701 NSGNYNTGLL NAGLVNTGIA NPGNHNTGLF NIGTFNTGIA NPGHYNTGSY NTGSYNTGMA
     2761 NAGDYGTGAF ITGSMNNGLL WRADRQGLLA ANYTITIERP AAFLNVDIPV NIPITGDITN
     2821 VSIPAITFPR IDASGSVDIG ILSGTVLAPV GPITLHGGDA SAPLDTPIEI DFGPSPAINL
     2881 NIGKPDGSTV INIVGGAGAG PISIPIIDLR PAPGFFNATT GPSSGFLNWG AGSASGLLNF
     2941 GNNSGLYNFA TSSMGNSGFQ NYGSLQSGWA NLGNSISGIY NTGLGAPANV SGLLNIGTNL
     3001 AGWLQNGPTE TTFSVGLANL GFWNLGSANI GNYNLGSANI GVYNLGSANI GDFNLGSANI
     3061 GDFNLGSANI GSSNIGFGNV GPGLTAAIGN IGFGNTGNGN IGIGNTGTGN IGFGNTGNGN
     3121 IGIGLTGDTM TGFGGWNSGT GNIGLFNSGT GNIGFGNSGT GNWGIGNSGD YNTGIGNTGS
     3181 TNSGFFNTGL VNTGIGNSGD YNTGLFNAGN TNTGSFNPGD YNTGGFNPGN YNTGYFNPGN
     3241 SNTGFANSGD VNTGAFNSGN YSNGFFWRGD YQGLGGFAYQ SAVSEIPWSY DIGSNIEIPI
     3301 EGDINAITQD AFTIDEFEIP IKLRVSVCVI YIPFKGCVKH VSVTIPITTE HLGPYEIDAS
     3361 TINPDQPIDT AFTQTLDFAG SGTVGAFPFG FGWQQSPGFF NSTTTPSSGF FNSGAGGASG
     3421 FLNDAAAAVS GLGNVFTETS GFFNAGGVGN SGFQNFGNLL SGWANLGNTV SGFYNTSMLD
     3481 LATQALISGF GNHGARLSGI LNNGSGP
//