LOCUS       CCP44427.1              1602 aa    PRT              BCT 27-FEB-2015
DEFINITION  Mycobacterium tuberculosis H37Rv Probable polyketide synthase
            Pks8 protein.
ACCESSION   AL123456-1705
PROTEIN_ID  CCP44427.1
SOURCE      Mycobacterium tuberculosis H37Rv
  ORGANISM  Mycobacterium tuberculosis H37Rv
            Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae;
            Mycobacterium; Mycobacterium tuberculosis complex.
REFERENCE   1
  AUTHORS   Cole S.T., Brosch R., Parkhill J., Garnier T., Churcher C.,
            Harris D., Gordon S.V., Eiglmeier K., Gas S., Barry C.E.III.,
            Tekaia F., Badcock K., Basham D., Brown D., Chillingworth T.,
            Connor R., Davies R., Devlin K., Feltwell T., Gentles S.,
            Hamlin N., Holroyd S., Hornsby T., Jagels K., Krogh A., McLean J.,
            Moule S., Murphy L., Oliver K., Osborne J., Quail M.A.,
            Rajandream M.A., Rogers J., Rutter S., Seeger K., Skelton J.,
            Squares R., Squares S., Sulston J.E., Taylor K., Whitehead S.,
            Barrell B.G.
  TITLE     Deciphering the biology of Mycobacterium tuberculosis from the
            complete genome sequence
  JOURNAL   Nature 393(6685), 537-544(1998).
   PUBMED   9634230
  REMARK    Erratum:[Nature 1998 Nov 12;396(6707):190]
REFERENCE   2
  AUTHORS   Camus J.C., Pryor M.J., Medigue C., Cole S.T.
  TITLE     Re-annotation of the genome sequence of Mycobacterium tuberculosis
            H37Rv
  JOURNAL   Microbiology (Reading, Engl.) 148(Pt 10), 2967-2973(2002).
   PUBMED   12368430
REFERENCE   3
  AUTHORS   Lew J.M., Kapopoulou A., Jones L.M., Cole S.T.
  TITLE     TubercuList--10 years after
  JOURNAL   Tuberculosis (Edinb) 91(1), 1-7(2011).
   PUBMED   20980199
REFERENCE   4  (bases 1 to 4411529)
  AUTHORS   Parkhill J.
  JOURNAL   Submitted (11-JUN-1998) to the INSDC. Submitted on behalf of the
            Mycobacterium tuberculosis sequencing and mapping teams, Sanger
            Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA;
            Unite de Genetique Moleculaire Bacterienne, Institut Pasteur, 28
            rue du Docteur Roux, 75724 Paris Cedex 15, France. E-mail:
            parkhill@sanger.ac.uk
REFERENCE   5  (bases 1 to 4411532)
  AUTHORS   Lew J.M.
  JOURNAL   Submitted (18-DEC-2012) to the INSDC. Lew J., Ecole Polytechnique
            Federale de Lausanne, CH-1015, Lausanne, Switzerland, and the Swiss
            Institute of Bioinformatics, CMU - Rue Michel-Servet 1, 1211 Geneva
            4, SWITZERLAND
COMMENT     On or before Feb 1, 2013 this sequence version replaced
            gi:41352722, gi:38490165, gi:38490207, gi:41353619, gi:38490250,
            gi:38684030, gi:38490288, gi:41353667, gi:41353422, gi:41352756,
            gi:38490319, gi:41352785, gi:38490370, gi:41353971.
            Note:
            This annotation is from the TubercuList website, Release 26, Dec
            2012 (URL: http://tuberculist.epfl.ch) (email:
            tuberculist@epfl.ch).
FEATURES             Qualifiers
     source          /organism="Mycobacterium tuberculosis H37Rv"
                     /strain="H37Rv"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:83332"
     protein         /transl_table=11
                     /gene="pks8"
                     /locus_tag="Rv1662"
                     /note="Rv1662, (MTCY275.01-MTCY06H11.27), len: 1602 aa.
                     Probable pks8, polyketide synthase, similar to many
                     polyketide synthases e.g. ERY2_SACER|Q03132 erythronolide
                     synthase, modules 3 and 4 from Saccharopolyspora erythraea
                     (Streptomyces erythraeus) (3567 aa), FASTA scores: opt:
                     3319, E(): 0, (45.8% identity in 1619 aa overlap). Also
                     similar to other Mycobacterium tuberculosis probable
                     polyketide synthases e.g. pks7 and pks12. Contains PS00606
                     Beta-ketoacyl synthases active site and PS01162 Quinone
                     oxidoreductase/zeta-crystallin signature. Note that the
                     similarity extends into the downstream ORF Rv1663
                     (MTCY275.02), and this could be accounted for by a
                     frameshift, although the sequence has been checked and no
                     discrepancy was found."
                     /db_xref="EnsemblGenomes-Gn:Rv1662"
                     /db_xref="EnsemblGenomes-Tr:CCP44427"
                     /db_xref="GOA:O65933"
                     /db_xref="InterPro:IPR001227"
                     /db_xref="InterPro:IPR002364"
                     /db_xref="InterPro:IPR011032"
                     /db_xref="InterPro:IPR013149"
                     /db_xref="InterPro:IPR013154"
                     /db_xref="InterPro:IPR014030"
                     /db_xref="InterPro:IPR014031"
                     /db_xref="InterPro:IPR014043"
                     /db_xref="InterPro:IPR015083"
                     /db_xref="InterPro:IPR016035"
                     /db_xref="InterPro:IPR016036"
                     /db_xref="InterPro:IPR016039"
                     /db_xref="InterPro:IPR018201"
                     /db_xref="InterPro:IPR020801"
                     /db_xref="InterPro:IPR020807"
                     /db_xref="InterPro:IPR020841"
                     /db_xref="InterPro:IPR020843"
                     /db_xref="InterPro:IPR032821"
                     /db_xref="InterPro:IPR036291"
                     /db_xref="InterPro:IPR042104"
                     /db_xref="UniProtKB/TrEMBL:O65933"
                     /inference="protein motif:PROSITE:PS00606"
                     /inference="protein motif:PROSITE:PS01162"
                     /experiment="EXISTENCE: identified in proteomics study"
BEGIN
        1 MSGTTTHVDY LKRLTADLRR TRRRLSDLEA KLSEPVAVVG MGCRYPGGVD SPETLWELVA
       61 QGRDAVSDFP ADRGWDVDGL FDPDPDACGK MYTRRGTFLE HAGDFDAGFF GIGPSEALAM
      121 DPQQRLLLEV SWEALERTGI DPTKLRGSAT GVFAGVIHAG YGGQLSGELE GYGLTGSTLS
      181 VASGRVAYVL GLEGPAVSVD TACSSSLVAL HLAVQSLRSG ECDLALAGGV TVMATPAAFV
      241 EFSRQRALAR DGRCKVYAGA ADGTAWSEGA GVLVVERLVD ARRLGHPVLA LVRGSAVNQD
      301 GASNGLTAPN GPSQQRVIRA ALASARLRAV EVDVVEGHGT GTMLGDPIEA QALLATYGQD
      361 RVEPLWLGSI KSNIGHTSAA AGVAGVIKMV QAMRHGVMPK TLHVDVPTPH VDWSVGAVSL
      421 LTQPRAWSVH GRPRRAGVSS FGISGTNAHV ILEQAPVVES VVPEVASPTA ASAVPWVLSA
      481 RSEQALAGQA QRLLAFVAAN PDLDPIDVGW SLVKTRAMFE HRAVVVGADR GALLAGLAAL
      541 AAGESGAGVA VGRARSVGKT VFVFPGQGAQ WVGMGAQLYA ELPLFALAFD AVAEELDRHL
      601 RLPLRNVLWE GDEALLTSTE FAQPALFAIE VALATLLQHW GISPDFLIGH SVGEIAAAHL
      661 AGVLSLTDAA GLVAARGRLM AELPAGGVMV VVAASEEEVL PVLVDGANLA AVNAPHSVVV
      721 SGCEAAVSDI ADHFARRGRR VHRLAVSHAF HSLLMEPMLA EFTRIAAGIS VSKPRIPLVS
      781 NVTGQMAGAG YGDGQYWVEH ARRPVRFAEG VQLLNAVGAT RFVEVGPGGG LTALVEQSLP
      841 LGEALSVAMM RREHPEVSSV LGAVATLFTA GAQMDWPAVF GSPGRRIELP TYAFQRQRYW
      901 LPPTSAGSAD ISGVGLLAAR HGLLGAVVEQ PDSDVVVLTG RLSVGEQRWL ADHVIAGVVL
      961 LAGAAFVELA LRAADQVDCG VVEELTVVTP LVLPTVGGVQ LQVVVGVGEM GQRPVSIYSR
     1021 NAESDSGWVL HARGVLGAKA VAPAADLSVW PPLGAAPVDV DGAYQRFAEL GYEYGRAFQG
     1081 LTAMWRRESE LFADVAVPDD VDVTLSGFGI HPLVLDAALH AMGMVGEQAA TMLPFSWQGV
     1141 SLHAAGASRV RARIAPAGDG TVSVELADQA GLPVLSVQAL VMRSVSSQLL SAAVAAADAA
     1201 GRGLLEVAWL PVELAHNDIS ADLVVWELES FQDGVGPVYS ATHRVLVALQ SWLAQERAGR
     1261 LVVLTQGSVG QDATNLAGAA VWGLVRSAQA EHPGRVMLVD SDGSMDVGDV IGCGEEQLMI
     1321 RNGTAYAARL AQLRPQPILQ LPDTNSGWRL VAGGAGALED LTLASCPAKE LAPGQVRIEV
     1381 RALGVNFRDV LVALGIYPGA AELGAEGAGV VTEVGPGVTG LAVGDPVMGL LGVAGSEAVV
     1441 DARLVVKLPN RWPLTDAAGV PVVFLTAYYA LRVLAQVQPG ESVLVHAAAG GVGMAAVQLA
     1501 RLWGLEVFAT ASRGKWDTLH TMGCDNTHVA DSRTLAFEET FWLTTEGRGV DVVLNSLAGE
     1561 FTDASLRLLP RGGRFIEMGK TEFGTPRSLP RTILGWPTGL ST
//