LOCUS       CCP46705.1               666 aa    PRT              BCT 27-FEB-2015
DEFINITION  Mycobacterium tuberculosis H37Rv ESX-1 secretion-associated
            protein EspI. Conserved proline and alanine rich protein.
            protein.
ACCESSION   AL123456-3983
PROTEIN_ID  CCP46705.1
SOURCE      Mycobacterium tuberculosis H37Rv
  ORGANISM  Mycobacterium tuberculosis H37Rv
            Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae;
            Mycobacterium; Mycobacterium tuberculosis complex.
REFERENCE   1
  AUTHORS   Cole S.T., Brosch R., Parkhill J., Garnier T., Churcher C.,
            Harris D., Gordon S.V., Eiglmeier K., Gas S., Barry C.E.III.,
            Tekaia F., Badcock K., Basham D., Brown D., Chillingworth T.,
            Connor R., Davies R., Devlin K., Feltwell T., Gentles S.,
            Hamlin N., Holroyd S., Hornsby T., Jagels K., Krogh A., McLean J.,
            Moule S., Murphy L., Oliver K., Osborne J., Quail M.A.,
            Rajandream M.A., Rogers J., Rutter S., Seeger K., Skelton J.,
            Squares R., Squares S., Sulston J.E., Taylor K., Whitehead S.,
            Barrell B.G.
  TITLE     Deciphering the biology of Mycobacterium tuberculosis from the
            complete genome sequence
  JOURNAL   Nature 393(6685), 537-544(1998).
   PUBMED   9634230
  REMARK    Erratum:[Nature 1998 Nov 12;396(6707):190]
REFERENCE   2
  AUTHORS   Camus J.C., Pryor M.J., Medigue C., Cole S.T.
  TITLE     Re-annotation of the genome sequence of Mycobacterium tuberculosis
            H37Rv
  JOURNAL   Microbiology (Reading, Engl.) 148(Pt 10), 2967-2973(2002).
   PUBMED   12368430
REFERENCE   3
  AUTHORS   Lew J.M., Kapopoulou A., Jones L.M., Cole S.T.
  TITLE     TubercuList--10 years after
  JOURNAL   Tuberculosis (Edinb) 91(1), 1-7(2011).
   PUBMED   20980199
REFERENCE   4  (bases 1 to 4411529)
  AUTHORS   Parkhill J.
  JOURNAL   Submitted (11-JUN-1998) to the INSDC. Submitted on behalf of the
            Mycobacterium tuberculosis sequencing and mapping teams, Sanger
            Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA
            Unite de Genetique Moleculaire Bacterienne, Institut Pasteur, 28
            rue du Docteur Roux, 75724 Paris Cedex 15, France E-mail:
            parkhill@sanger.ac.uk
REFERENCE   5  (bases 1 to 4411532)
  AUTHORS   Lew J.M.
  JOURNAL   Submitted (18-DEC-2012) to the INSDC. Lew J., Ecole Polytechnique
            Federale de Lausanne, CH-1015, Lausanne, Switzerland, and the Swiss
            Institute of Bioinformatics, CMU - Rue Michel-Servet 1, 1211 Geneva
            4, SWITZERLAND
COMMENT     On or before Feb 1, 2013 this sequence version replaced
            gi:41352722, gi:38490165, gi:38490207, gi:41353619, gi:38490250,
            gi:38684030, gi:38490288, gi:41353667, gi:41353422, gi:41352756,
            gi:38490319, gi:41352785, gi:38490370, gi:41353971.
            Note:
            This annotation is from the TubercuList website, Release 26, Dec
            2012 (URL: http://tuberculist.epfl.ch) (email:
            tuberculist@epfl.ch).
FEATURES             Qualifiers
     source          /organism="Mycobacterium tuberculosis H37Rv"
                     /strain="H37Rv"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:83332"
     protein         /transl_table=11
                     /gene="espI"
                     /gene_synonym="snm3"
                     /locus_tag="Rv3876"
                     /note="Rv3876, (MTV027.11), len: 666 aa. EspI, ESX-1
                     secretion-associated protein, conserved pro-, ala-rich
                     protein, similar to several proteins from Mycobacterium
                     leprae e.g. Q9CDD8|ML0048 hypothetical protein (586
                     aa),FASTA scores: opt: 1682, E(): 2.1e-45, (50.75%
                     identity in 672 aa overlap); O33082|MLCB628.11c
                     hypothetical 52.0 KDA protein (478 aa), FASTA scores: opt:
                     1588, E(): 1.5e-42,(53.5% identity in 542 aa overlap)
                     (also has a proline rich N-terminus); etc. Also similar to
                     other proteins from Mycobacterium tuberculosis, especially
                     in C-terminus, e.g. O06396|Rv0530|MTCY25D10.09 (405 aa),
                     FASTA scores: opt: 670, E(): 2.5e-14, (34.85% identity in
                     396 aa overlap) (also has Pro-rich N-terminus); etc. Note
                     that N-terminus is repetitive and highly Proline rich."
                     /db_xref="EnsemblGenomes-Gn:Rv3876"
                     /db_xref="EnsemblGenomes-Tr:CCP46705"
                     /db_xref="GOA:P9WJC5"
                     /db_xref="InterPro:IPR002586"
                     /db_xref="InterPro:IPR027417"
                     /db_xref="UniProtKB/Swiss-Prot:P9WJC5"
                     /experiment="EXISTENCE: identified in proteomics study"
BEGIN
        1 MAADYDKLFR PHEGMEAPDD MAAQPFFDPS ASFPPAPASA NLPKPNGQTP PPTSDDLSER
       61 FVSAPPPPPP PPPPPPPTPM PIAAGEPPSP EPAASKPPTP PMPIAGPEPA PPKPPTPPMP
      121 IAGPEPAPPK PPTPPMPIAG PAPTPTESQL APPRPPTPQT PTGAPQQPES PAPHVPSHGP
      181 HQPRRTAPAP PWAKMPIGEP PPAPSRPSAS PAEPPTRPAP QHSRRARRGH RYRTDTERNV
      241 GKVATGPSIQ ARLRAEEASG AQLAPGTEPS PAPLGQPRSY LAPPTRPAPT EPPPSPSPQR
      301 NSGRRAERRV HPDLAAQHAA AQPDSITAAT TGGRRRKRAA PDLDATQKSL RPAAKGPKVK
      361 KVKPQKPKAT KPPKVVSQRG WRHWVHALTR INLGLSPDEK YELDLHARVR RNPRGSYQIA
      421 VVGLKGGAGK TTLTAALGST LAQVRADRIL ALDADPGAGN LADRVGRQSG ATIADVLAEK
      481 ELSHYNDIRA HTSVNAVNLE VLPAPEYSSA QRALSDADWH FIADPASRFY NLVLADCGAG
      541 FFDPLTRGVL STVSGVVVVA SVSIDGAQQA SVALDWLRNN GYQDLASRAC VVINHIMPGE
      601 PNVAVKDLVR HFEQQVQPGR VVVMPWDRHI AAGTEISLDL LDPIYKRKVL ELAAALSDDF
      661 ERAGRR
//