LOCUS CCP45742.1 422 aa PRT BCT 27-FEB-2015 DEFINITION Mycobacterium tuberculosis H37Rv Possible conserved polyketide synthase associated protein PapA5 protein. ACCESSION AL123456-3020 PROTEIN_ID CCP45742.1 SOURCE Mycobacterium tuberculosis H37Rv ORGANISM Mycobacterium tuberculosis H37Rv Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae; Mycobacterium; Mycobacterium tuberculosis complex. REFERENCE 1 AUTHORS Cole S.T., Brosch R., Parkhill J., Garnier T., Churcher C., Harris D., Gordon S.V., Eiglmeier K., Gas S., Barry C.E.III., Tekaia F., Badcock K., Basham D., Brown D., Chillingworth T., Connor R., Davies R., Devlin K., Feltwell T., Gentles S., Hamlin N., Holroyd S., Hornsby T., Jagels K., Krogh A., McLean J., Moule S., Murphy L., Oliver K., Osborne J., Quail M.A., Rajandream M.A., Rogers J., Rutter S., Seeger K., Skelton J., Squares R., Squares S., Sulston J.E., Taylor K., Whitehead S., Barrell B.G. TITLE Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence JOURNAL Nature 393(6685), 537-544(1998). PUBMED 9634230 REMARK Erratum:[Nature 1998 Nov 12;396(6707):190] REFERENCE 2 AUTHORS Camus J.C., Pryor M.J., Medigue C., Cole S.T. TITLE Re-annotation of the genome sequence of Mycobacterium tuberculosis H37Rv JOURNAL Microbiology (Reading, Engl.) 148(Pt 10), 2967-2973(2002). PUBMED 12368430 REFERENCE 3 AUTHORS Lew J.M., Kapopoulou A., Jones L.M., Cole S.T. TITLE TubercuList--10 years after JOURNAL Tuberculosis (Edinb) 91(1), 1-7(2011). PUBMED 20980199 REFERENCE 4 (bases 1 to 4411529) AUTHORS Parkhill J. JOURNAL Submitted (11-JUN-1998) to the INSDC. Submitted on behalf of the Mycobacterium tuberculosis sequencing and mapping teams, Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA Unite de Genetique Moleculaire Bacterienne, Institut Pasteur, 28 rue du Docteur Roux, 75724 Paris Cedex 15, France E-mail: parkhill@sanger.ac.uk REFERENCE 5 (bases 1 to 4411532) AUTHORS Lew J.M. JOURNAL Submitted (18-DEC-2012) to the INSDC. Lew J., Ecole Polytechnique Federale de Lausanne, CH-1015, Lausanne, Switzerland, and the Swiss Institute of Bioinformatics, CMU - Rue Michel-Servet 1, 1211 Geneva 4, SWITZERLAND COMMENT On or before Feb 1, 2013 this sequence version replaced gi:41352722, gi:38490165, gi:38490207, gi:41353619, gi:38490250, gi:38684030, gi:38490288, gi:41353667, gi:41353422, gi:41352756, gi:38490319, gi:41352785, gi:38490370, gi:41353971. Note: This annotation is from the TubercuList website, Release 26, Dec 2012 (URL: http://tuberculist.epfl.ch) (email: tuberculist@epfl.ch). FEATURES Qualifiers source /organism="Mycobacterium tuberculosis H37Rv" /strain="H37Rv" /mol_type="genomic DNA" /db_xref="taxon:83332" protein /transl_table=11 /gene="papA5" /locus_tag="Rv2939" /note="Rv2939, (MTCY19H9.07), len: 422 aa. Possible papA5,conserved polyketide synthase (PKS) associated protein (see Camacho et al., 2001), equivalent to Q49939 hypothetical 45.6 KDA protein from Mycobacterium leprae (423 aa), FASTA scores: opt: 2398, E(): 4.5e-144, (84.05% identity in 426 aa overlap); and Q02279|YMA3_MYCBO hypothetical 38.1 KDA protein from Mycobacterium bovis (354 aa), FASTA scores: opt: 2193, E(): 3.6e-131, (97.4% identity in 343 aa overlap). And C-terminus highly similar to to Q9S381 hypothetical 5.0 KDA protein (fragment) from Mycobacterium leprae (44 aa), FASTA scores: opt: 275, E(): 1.4e-10,(88.65% identity in 44 aa overlap). Also similar in part to various synthetases e.g. Q9AE01|RIF20 RIF20 protein from Amycolatopsis mediterranei (Nocardia mediterranei) (403 aa), FASTA scores: opt: 282, E(): 2.7e-10, (30.3% identity in 393 aa overlap); middle part of Q00869|ESYN1 enniatin sythetase (fragment) (N-methyl peptide synthetase) from Fusarium equiseti (3131 aa), FASTA scores: opt: 180, E(): 0.0036, (26.85% identity in 242 aa overlap); N-terminus of Q9FB18 peptide synthetase NRPS2-1 from Streptomyces verticillus (2626 aa), FASTA scores: opt: 159, E(): 0.068,(23.65% identity in 351 aa overlap); etc. Note that Rv2939|papA5 belongs to the transcriptional unit Rv2930|fadD26-Rv2939|papA5 (proven experimentally)." /db_xref="EnsemblGenomes-Gn:Rv2939" /db_xref="EnsemblGenomes-Tr:CCP45742" /db_xref="GOA:P9WIN5" /db_xref="InterPro:IPR023213" /db_xref="InterPro:IPR031641" /db_xref="PDB:1Q9J" /db_xref="UniProtKB/Swiss-Prot:P9WIN5" /experiment="EXISTENCE: identified in proteomics study" BEGIN 1 MFPGSVIRKL SHSEEVFAQY EVFTSMTIQL RGVIDVDALS DAFDALLETH PVLASHLEQS 61 SDGGWNLVAD DLLHSGICVI DGTAATNGSP SGNAELRLDQ SVSLLHLQLI LREGGAELTL 121 YLHHCMADGH HGAVLVDELF SRYTDAVTTG DPGPITPQPT PLSMEAVLAQ RGIRKQGLSG 181 AERFMSVMYA YEIPATETPA VLAHPGLPQA VPVTRLWLSK QQTSDLMAFG REHRLSLNAV 241 VAAAILLTEW QLRNTPHVPI PYVYPVDLRF VLAPPVAPTE ATNLLGAASY LAEIGPNTDI 301 VDLASDIVAT LRADLANGVI QQSGLHFGTA FEGTPPGLPP LVFCTDATSF PTMRTPPGLE 361 IEDIKGQFYC SISVPLDLYS CAVYAGQLII EHHGHIAEPG KSLEAIRSLL CTVPSEYGWI 421 ME //