LOCUS       CCP45742.1               422 aa    PRT              BCT 27-FEB-2015
DEFINITION  Mycobacterium tuberculosis H37Rv Possible conserved polyketide
            synthase associated protein PapA5 protein.
ACCESSION   AL123456-3020
PROTEIN_ID  CCP45742.1
SOURCE      Mycobacterium tuberculosis H37Rv
  ORGANISM  Mycobacterium tuberculosis H37Rv
            Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae;
            Mycobacterium; Mycobacterium tuberculosis complex.
REFERENCE   1
  AUTHORS   Cole S.T., Brosch R., Parkhill J., Garnier T., Churcher C.,
            Harris D., Gordon S.V., Eiglmeier K., Gas S., Barry C.E.III.,
            Tekaia F., Badcock K., Basham D., Brown D., Chillingworth T.,
            Connor R., Davies R., Devlin K., Feltwell T., Gentles S.,
            Hamlin N., Holroyd S., Hornsby T., Jagels K., Krogh A., McLean J.,
            Moule S., Murphy L., Oliver K., Osborne J., Quail M.A.,
            Rajandream M.A., Rogers J., Rutter S., Seeger K., Skelton J.,
            Squares R., Squares S., Sulston J.E., Taylor K., Whitehead S.,
            Barrell B.G.
  TITLE     Deciphering the biology of Mycobacterium tuberculosis from the
            complete genome sequence
  JOURNAL   Nature 393(6685), 537-544(1998).
   PUBMED   9634230
  REMARK    Erratum:[Nature 1998 Nov 12;396(6707):190]
REFERENCE   2
  AUTHORS   Camus J.C., Pryor M.J., Medigue C., Cole S.T.
  TITLE     Re-annotation of the genome sequence of Mycobacterium tuberculosis
            H37Rv
  JOURNAL   Microbiology (Reading, Engl.) 148(Pt 10), 2967-2973(2002).
   PUBMED   12368430
REFERENCE   3
  AUTHORS   Lew J.M., Kapopoulou A., Jones L.M., Cole S.T.
  TITLE     TubercuList--10 years after
  JOURNAL   Tuberculosis (Edinb) 91(1), 1-7(2011).
   PUBMED   20980199
REFERENCE   4  (bases 1 to 4411529)
  AUTHORS   Parkhill J.
  JOURNAL   Submitted (11-JUN-1998) to the INSDC. Submitted on behalf of the
            Mycobacterium tuberculosis sequencing and mapping teams, Sanger
            Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA
            Unite de Genetique Moleculaire Bacterienne, Institut Pasteur, 28
            rue du Docteur Roux, 75724 Paris Cedex 15, France E-mail:
            parkhill@sanger.ac.uk
REFERENCE   5  (bases 1 to 4411532)
  AUTHORS   Lew J.M.
  JOURNAL   Submitted (18-DEC-2012) to the INSDC. Lew J., Ecole Polytechnique
            Federale de Lausanne, CH-1015, Lausanne, Switzerland, and the Swiss
            Institute of Bioinformatics, CMU - Rue Michel-Servet 1, 1211 Geneva
            4, SWITZERLAND
COMMENT     On or before Feb 1, 2013 this sequence version replaced
            gi:41352722, gi:38490165, gi:38490207, gi:41353619, gi:38490250,
            gi:38684030, gi:38490288, gi:41353667, gi:41353422, gi:41352756,
            gi:38490319, gi:41352785, gi:38490370, gi:41353971.
            Note:
            This annotation is from the TubercuList website, Release 26, Dec
            2012 (URL: http://tuberculist.epfl.ch) (email:
            tuberculist@epfl.ch).
FEATURES             Qualifiers
     source          /organism="Mycobacterium tuberculosis H37Rv"
                     /strain="H37Rv"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:83332"
     protein         /transl_table=11
                     /gene="papA5"
                     /locus_tag="Rv2939"
                     /note="Rv2939, (MTCY19H9.07), len: 422 aa. Possible
                     papA5,conserved polyketide synthase (PKS) associated
                     protein (see Camacho et al., 2001), equivalent to Q49939
                     hypothetical 45.6 KDA protein from Mycobacterium leprae
                     (423 aa), FASTA scores: opt: 2398, E(): 4.5e-144, (84.05%
                     identity in 426 aa overlap); and Q02279|YMA3_MYCBO
                     hypothetical 38.1 KDA protein from Mycobacterium bovis
                     (354 aa), FASTA scores: opt: 2193, E(): 3.6e-131, (97.4%
                     identity in 343 aa overlap). And C-terminus highly similar
                     to to Q9S381 hypothetical 5.0 KDA protein (fragment) from
                     Mycobacterium leprae (44 aa), FASTA scores: opt: 275, E():
                     1.4e-10,(88.65% identity in 44 aa overlap). Also similar
                     in part to various synthetases e.g. Q9AE01|RIF20 RIF20
                     protein from Amycolatopsis mediterranei (Nocardia
                     mediterranei) (403 aa), FASTA scores: opt: 282, E():
                     2.7e-10, (30.3% identity in 393 aa overlap); middle part
                     of Q00869|ESYN1 enniatin sythetase (fragment) (N-methyl
                     peptide synthetase) from Fusarium equiseti (3131 aa),
                     FASTA scores: opt: 180, E(): 0.0036, (26.85% identity in
                     242 aa overlap); N-terminus of Q9FB18 peptide synthetase
                     NRPS2-1 from Streptomyces verticillus (2626 aa), FASTA
                     scores: opt: 159, E(): 0.068,(23.65% identity in 351 aa
                     overlap); etc. Note that Rv2939|papA5 belongs to the
                     transcriptional unit Rv2930|fadD26-Rv2939|papA5 (proven
                     experimentally)."
                     /db_xref="EnsemblGenomes-Gn:Rv2939"
                     /db_xref="EnsemblGenomes-Tr:CCP45742"
                     /db_xref="GOA:P9WIN5"
                     /db_xref="InterPro:IPR023213"
                     /db_xref="InterPro:IPR031641"
                     /db_xref="PDB:1Q9J"
                     /db_xref="UniProtKB/Swiss-Prot:P9WIN5"
                     /experiment="EXISTENCE: identified in proteomics study"
BEGIN
        1 MFPGSVIRKL SHSEEVFAQY EVFTSMTIQL RGVIDVDALS DAFDALLETH PVLASHLEQS
       61 SDGGWNLVAD DLLHSGICVI DGTAATNGSP SGNAELRLDQ SVSLLHLQLI LREGGAELTL
      121 YLHHCMADGH HGAVLVDELF SRYTDAVTTG DPGPITPQPT PLSMEAVLAQ RGIRKQGLSG
      181 AERFMSVMYA YEIPATETPA VLAHPGLPQA VPVTRLWLSK QQTSDLMAFG REHRLSLNAV
      241 VAAAILLTEW QLRNTPHVPI PYVYPVDLRF VLAPPVAPTE ATNLLGAASY LAEIGPNTDI
      301 VDLASDIVAT LRADLANGVI QQSGLHFGTA FEGTPPGLPP LVFCTDATSF PTMRTPPGLE
      361 IEDIKGQFYC SISVPLDLYS CAVYAGQLII EHHGHIAEPG KSLEAIRSLL CTVPSEYGWI
      421 ME
//