LOCUS       SIU02451.1              1094 aa    PRT              BCT 25-MAY-2020
DEFINITION  Mycobacterium tuberculosis variant bovis AF2122/97 INTEGRAL
            MEMBRANE INDOLYLACETYLINOSITOL ARABINOSYLTRANSFERASE EMBC
            (ARABINOSYLINDOLYLACETYLINOSITOL SYNTHASE) protein.
ACCESSION   LT708304-3865
PROTEIN_ID  SIU02451.1
SOURCE      Mycobacterium tuberculosis variant bovis AF2122/97
  ORGANISM  Mycobacterium tuberculosis variant bovis AF2122/97
            Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae;
            Mycobacterium; Mycobacterium tuberculosis complex.
REFERENCE   1
  AUTHORS   Malone K.M.
  JOURNAL   Submitted (06-DEC-2016) to the INSDC. School of Veterinary
            Medicine, Tuberculosis Molecular Microbiology Research Group,
            University College Dublin, Tuberculosis Molecular Microbiology
            Research Group, School of Veterinary Medicine, University College
            Dublin, D4, Ireland
REFERENCE   2
  AUTHORS   Malone M K., Farrell D., Malone K.
  JOURNAL   Submitted (15-APR-2020) to the INSDC. School of Veterinary
            Medicine, Tuberculosis Molecular Microbiology Research Group,
            University College Dublin, Tuberculosis Molecular Microbiology
            Research Group, School of Veterinary Medicine,, University College
            Dublin, D4, Ireland
FEATURES             Qualifiers
     source          /organism="Mycobacterium tuberculosis variant bovis
                     AF2122/97"
                     /chromosome="Mycobacterium_bovis_AF212297"
                     /isolate="AF2122/97"
                     /mol_type="genomic DNA"
                     /isolation_source="Mycobacterium bovis subsp. bovis strain
                     AF2122/97. This strain is a fully virulent strain that was
                     isolated in 1997 in the UK from a cow suffering necrotic
                     lesions in lung and bronchomediastinal lymph nodes. The
                     strain was also reported to infect and persist in badgers
                     that are considered to be a significant source of bovine
                     infection."
                     /db_xref="taxon:233413"
     protein         /transl_table=11
                     /gene="embC"
                     /locus_tag="BQ2027_MB3822"
                     /note="Mb3822, embC, len: 1094 aa. Equivalent to Rv3793,
                     len: 1094 aa, from Mycobacterium tuberculosis strain
                     H37Rv, (99.9% identity in 1094 aa overlap). embC, integral
                     membrane protein, indolylacetylinositol
                     arabinosyltransferase (EC 2.4.2.34) (see citations below),
                     equivalent to Q9CDA7|EMBC|ML0106 PUTATIVE ARABINOSYL
                     TRANSFERASE from Mycobacterium leprae (1070 aa) FASTA
                     scores: opt: 6078,E(): 0, (82.95% identity in 1072 aa
                     overlap); Q50393|EMBC PUTATIVE ARABINOSYL TRANSFERASE from
                     Mycobacterium smegmatis (1074 aa), FASTA scores: opt:
                     5523, E(): 0, (75.35% identity in 1072 aa overlap). Also
                     similar to Q9CDA9|EMBB| ML0104 PUTATIVE ARABINOSYL
                     TRANSFERASE from Mycobacterium leprae (1083 aa), FASTA
                     scores: opt: 2789, E(): 1.9e-156, (44.0% identity in 1095
                     aa overlap); O30406|EMBB PUTATIVE ARABINOSYL TRANSFERASE
                     from Mycobacterium smegmatis (1082 aa), FASTA scores: opt:
                     2746, E(): 6.4e-154, (44.6% identity in 1096 aa overlap);
                     etc. Also similar to to P72030|EMBB|Rv3795|MTCY13D12.29
                     INDOLYLACETYLINOSITOL ARABINOSYLTRANSFERASE from
                     Mycobacterium tuberculosis (1098 aa), FASTA scores: opt:
                     2276, E(): 3.1e-126, (44.45% identity in 1118 aa overlap);
                     and P72060|EMBA|Rv3794|MTCY13D12.28 INDOLYLACETYLINOSITOL
                     ARABINOSYLTRANSFERASE from Mycobacterium tuberculosis
                     (1094 aa), FASTA scores: opt: 1974, E(): 1.9e-108, (41.0%
                     identity in 1110 aa overlap). Contains PS00044 Bacterial
                     regulatory proteins, lysR family signature; and PS00017
                     ATP/GTP-binding site motif A (P-loop). Protein product
                     from Mb3822 detected using shotgun mass spectrometry and
                     SWATH mass spectrometry. Mb3822 found to be expressed
                     during exponential growth in Sauton's minimal media by
                     RNA-sequencing."
                     /db_xref="GOA:Q7TVN4"
                     /db_xref="InterPro:IPR007680"
                     /db_xref="InterPro:IPR027451"
                     /db_xref="InterPro:IPR032731"
                     /db_xref="InterPro:IPR040920"
                     /db_xref="InterPro:IPR042486"
                     /db_xref="UniProtKB/Swiss-Prot:Q7TVN4"
                     /experiment="experimental evidence, no additional details
                     recorded"
BEGIN
        1 MATEAAPPRI AVRLPSTSVR DAGANYRIAR YVAVVAGLLG AVLAIATPLL PVNQTTAQLN
       61 WPQNGTFASV EAPLIGYVAT DLNITVPCQA AAGLAGSQNT GKTVLLSTVP KQAPKAVDRG
      121 LLLQRANDDL VLVVRNVPLV TAPLSQVLGP TCQRLTFTAH ADRVAAEFVG LVQGPNAEHP
      181 GAPLRGERSG YDFRPQIVGV FTDLAGPAPP GLSFSASVDT RYSSSPTPLK MAAMILGVAL
      241 TGAALVALHI LDTADGMRHR RFLPARWWSI GGLDTLVIAV LVWWHFVGAN TSDDGYILTM
      301 ARVSEHAGYM ANYYRWFGTP EAPFGWYYDL LALWAHVSTA SIWMRLPTLA MALTCWWVIS
      361 REVIPRLGHA VKTSRAAAWT AAGMFLAVWL PLDNGLRPEP IIALGILLTW CSVERAVATS
      421 RLLPVAIACI IGALTLFSGP TGIASIGALL VAIGPLRTIL HRRSRRFGVL PLVAPILAAA
      481 TVTAIPIFRD QTFAGEIQAN LLKRAVGPSL KWFDEHIRYE RLFMASPDGS IARRFAVLAL
      541 VLALAVSVAM SLRKGRIPGT AAGPSRRIIG ITIISFLAMM FTPTKWTHHF GVFAGLAGSL
      601 GALAAVAVTG AAMRSRRNRT VFAAVVVFVL ALSFASVNGW WYVSNFGVPW SNSFPKWRWS
      661 LTTALLELTV LVLLLAAWFH FVANGDGRRT ARPTRFRARL AGIVQSPLAI ATWLLVLFEV
      721 VSLTQAMISQ YPAWSVGRSN LQALAGKTCG LAEDVLVELD PNAGMLAPVT APLADALGAG
      781 LSEAFTPNGI PADVTADPVM ERPGDRSFLN DDGLITGSEP GTEGGTTAAP GINGSRARLP
      841 YNLDPARTPV LGSWRAGVQV PAMLRSGWYR LPTNEQRDRA PLLVVTAAGR FDSREVRLQW
      901 ATDEQAAAGH HGGSMEFADV GAAPAWRNLR APLSAIPSTA TQVRLVADDQ DLAPQHWIAL
      961 TPPRIPRVRT LQNVVGAADP VFLDWLVGLA FPCQRPFGHQ YGVDETPKWR ILPDRFGAEA
     1021 NSPVMDHNGG GPLGITELLM RATTVASYLK DDWFRDWGAL QRLTPYYPDA QPADLNLGTV
     1081 TRSGLWSPAP LRRG
//