LOCUS CCP44427.1 1602 aa PRT BCT 27-FEB-2015 DEFINITION Mycobacterium tuberculosis H37Rv Probable polyketide synthase Pks8 protein. ACCESSION AL123456-1705 PROTEIN_ID CCP44427.1 SOURCE Mycobacterium tuberculosis H37Rv ORGANISM Mycobacterium tuberculosis H37Rv Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae; Mycobacterium; Mycobacterium tuberculosis complex. REFERENCE 1 AUTHORS Cole S.T., Brosch R., Parkhill J., Garnier T., Churcher C., Harris D., Gordon S.V., Eiglmeier K., Gas S., Barry C.E.III., Tekaia F., Badcock K., Basham D., Brown D., Chillingworth T., Connor R., Davies R., Devlin K., Feltwell T., Gentles S., Hamlin N., Holroyd S., Hornsby T., Jagels K., Krogh A., McLean J., Moule S., Murphy L., Oliver K., Osborne J., Quail M.A., Rajandream M.A., Rogers J., Rutter S., Seeger K., Skelton J., Squares R., Squares S., Sulston J.E., Taylor K., Whitehead S., Barrell B.G. TITLE Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence JOURNAL Nature 393(6685), 537-544(1998). PUBMED 9634230 REMARK Erratum:[Nature 1998 Nov 12;396(6707):190] REFERENCE 2 AUTHORS Camus J.C., Pryor M.J., Medigue C., Cole S.T. TITLE Re-annotation of the genome sequence of Mycobacterium tuberculosis H37Rv JOURNAL Microbiology (Reading, Engl.) 148(Pt 10), 2967-2973(2002). PUBMED 12368430 REFERENCE 3 AUTHORS Lew J.M., Kapopoulou A., Jones L.M., Cole S.T. TITLE TubercuList--10 years after JOURNAL Tuberculosis (Edinb) 91(1), 1-7(2011). PUBMED 20980199 REFERENCE 4 (bases 1 to 4411529) AUTHORS Parkhill J. JOURNAL Submitted (11-JUN-1998) to the INSDC. Submitted on behalf of the Mycobacterium tuberculosis sequencing and mapping teams, Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA Unite de Genetique Moleculaire Bacterienne, Institut Pasteur, 28 rue du Docteur Roux, 75724 Paris Cedex 15, France E-mail: parkhill@sanger.ac.uk REFERENCE 5 (bases 1 to 4411532) AUTHORS Lew J.M. JOURNAL Submitted (18-DEC-2012) to the INSDC. Lew J., Ecole Polytechnique Federale de Lausanne, CH-1015, Lausanne, Switzerland, and the Swiss Institute of Bioinformatics, CMU - Rue Michel-Servet 1, 1211 Geneva 4, SWITZERLAND COMMENT On or before Feb 1, 2013 this sequence version replaced gi:41352722, gi:38490165, gi:38490207, gi:41353619, gi:38490250, gi:38684030, gi:38490288, gi:41353667, gi:41353422, gi:41352756, gi:38490319, gi:41352785, gi:38490370, gi:41353971. Note: This annotation is from the TubercuList website, Release 26, Dec 2012 (URL: http://tuberculist.epfl.ch) (email: tuberculist@epfl.ch). FEATURES Qualifiers source /organism="Mycobacterium tuberculosis H37Rv" /strain="H37Rv" /mol_type="genomic DNA" /db_xref="taxon:83332" protein /transl_table=11 /gene="pks8" /locus_tag="Rv1662" /note="Rv1662, (MTCY275.01-MTCY06H11.27), len: 1602 aa. Probable pks8, polyketide synthase, similar to many polyketide synthases e.g. ERY2_SACER|Q03132 erythronolide synthase, modules 3 and 4 from Saccharopolyspora erythraea (Streptomyces erythraeus) (3567 aa), FASTA scores: opt: 3319, E(): 0, (45.8% identity in 1619 aa overlap). Also similar to other Mycobacterium tuberculosis probable polyketide synthases e.g. pks7 and pks12. Contains PS00606 Beta-ketoacyl synthases active site and PS01162 Quinone oxidoreductase/zeta-crystallin signature. Note that the similarity extends into the downstream ORF Rv1663 (MTCY275.02), and this could be accounted for by a frameshift, although the sequence has been checked and no discrepancy was found." /db_xref="EnsemblGenomes-Gn:Rv1662" /db_xref="EnsemblGenomes-Tr:CCP44427" /db_xref="GOA:O65933" /db_xref="InterPro:IPR001227" /db_xref="InterPro:IPR002364" /db_xref="InterPro:IPR011032" /db_xref="InterPro:IPR013149" /db_xref="InterPro:IPR013154" /db_xref="InterPro:IPR014030" /db_xref="InterPro:IPR014031" /db_xref="InterPro:IPR014043" /db_xref="InterPro:IPR015083" /db_xref="InterPro:IPR016035" /db_xref="InterPro:IPR016036" /db_xref="InterPro:IPR016039" /db_xref="InterPro:IPR018201" /db_xref="InterPro:IPR020801" /db_xref="InterPro:IPR020807" /db_xref="InterPro:IPR020841" /db_xref="InterPro:IPR020843" /db_xref="InterPro:IPR032821" /db_xref="InterPro:IPR036291" /db_xref="InterPro:IPR042104" /db_xref="UniProtKB/TrEMBL:O65933" /inference="protein motif:PROSITE:PS00606" /inference="protein motif:PROSITE:PS01162" /experiment="EXISTENCE: identified in proteomics study" BEGIN 1 MSGTTTHVDY LKRLTADLRR TRRRLSDLEA KLSEPVAVVG MGCRYPGGVD SPETLWELVA 61 QGRDAVSDFP ADRGWDVDGL FDPDPDACGK MYTRRGTFLE HAGDFDAGFF GIGPSEALAM 121 DPQQRLLLEV SWEALERTGI DPTKLRGSAT GVFAGVIHAG YGGQLSGELE GYGLTGSTLS 181 VASGRVAYVL GLEGPAVSVD TACSSSLVAL HLAVQSLRSG ECDLALAGGV TVMATPAAFV 241 EFSRQRALAR DGRCKVYAGA ADGTAWSEGA GVLVVERLVD ARRLGHPVLA LVRGSAVNQD 301 GASNGLTAPN GPSQQRVIRA ALASARLRAV EVDVVEGHGT GTMLGDPIEA QALLATYGQD 361 RVEPLWLGSI KSNIGHTSAA AGVAGVIKMV QAMRHGVMPK TLHVDVPTPH VDWSVGAVSL 421 LTQPRAWSVH GRPRRAGVSS FGISGTNAHV ILEQAPVVES VVPEVASPTA ASAVPWVLSA 481 RSEQALAGQA QRLLAFVAAN PDLDPIDVGW SLVKTRAMFE HRAVVVGADR GALLAGLAAL 541 AAGESGAGVA VGRARSVGKT VFVFPGQGAQ WVGMGAQLYA ELPLFALAFD AVAEELDRHL 601 RLPLRNVLWE GDEALLTSTE FAQPALFAIE VALATLLQHW GISPDFLIGH SVGEIAAAHL 661 AGVLSLTDAA GLVAARGRLM AELPAGGVMV VVAASEEEVL PVLVDGANLA AVNAPHSVVV 721 SGCEAAVSDI ADHFARRGRR VHRLAVSHAF HSLLMEPMLA EFTRIAAGIS VSKPRIPLVS 781 NVTGQMAGAG YGDGQYWVEH ARRPVRFAEG VQLLNAVGAT RFVEVGPGGG LTALVEQSLP 841 LGEALSVAMM RREHPEVSSV LGAVATLFTA GAQMDWPAVF GSPGRRIELP TYAFQRQRYW 901 LPPTSAGSAD ISGVGLLAAR HGLLGAVVEQ PDSDVVVLTG RLSVGEQRWL ADHVIAGVVL 961 LAGAAFVELA LRAADQVDCG VVEELTVVTP LVLPTVGGVQ LQVVVGVGEM GQRPVSIYSR 1021 NAESDSGWVL HARGVLGAKA VAPAADLSVW PPLGAAPVDV DGAYQRFAEL GYEYGRAFQG 1081 LTAMWRRESE LFADVAVPDD VDVTLSGFGI HPLVLDAALH AMGMVGEQAA TMLPFSWQGV 1141 SLHAAGASRV RARIAPAGDG TVSVELADQA GLPVLSVQAL VMRSVSSQLL SAAVAAADAA 1201 GRGLLEVAWL PVELAHNDIS ADLVVWELES FQDGVGPVYS ATHRVLVALQ SWLAQERAGR 1261 LVVLTQGSVG QDATNLAGAA VWGLVRSAQA EHPGRVMLVD SDGSMDVGDV IGCGEEQLMI 1321 RNGTAYAARL AQLRPQPILQ LPDTNSGWRL VAGGAGALED LTLASCPAKE LAPGQVRIEV 1381 RALGVNFRDV LVALGIYPGA AELGAEGAGV VTEVGPGVTG LAVGDPVMGL LGVAGSEAVV 1441 DARLVVKLPN RWPLTDAAGV PVVFLTAYYA LRVLAQVQPG ESVLVHAAAG GVGMAAVQLA 1501 RLWGLEVFAT ASRGKWDTLH TMGCDNTHVA DSRTLAFEET FWLTTEGRGV DVVLNSLAGE 1561 FTDASLRLLP RGGRFIEMGK TEFGTPRSLP RTILGWPTGL ST //