LOCUS CCP46705.1 666 aa PRT BCT 27-FEB-2015 DEFINITION Mycobacterium tuberculosis H37Rv ESX-1 secretion-associated protein EspI. Conserved proline and alanine rich protein. protein. ACCESSION AL123456-3983 PROTEIN_ID CCP46705.1 SOURCE Mycobacterium tuberculosis H37Rv ORGANISM Mycobacterium tuberculosis H37Rv Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae; Mycobacterium; Mycobacterium tuberculosis complex. REFERENCE 1 AUTHORS Cole S.T., Brosch R., Parkhill J., Garnier T., Churcher C., Harris D., Gordon S.V., Eiglmeier K., Gas S., Barry C.E.III., Tekaia F., Badcock K., Basham D., Brown D., Chillingworth T., Connor R., Davies R., Devlin K., Feltwell T., Gentles S., Hamlin N., Holroyd S., Hornsby T., Jagels K., Krogh A., McLean J., Moule S., Murphy L., Oliver K., Osborne J., Quail M.A., Rajandream M.A., Rogers J., Rutter S., Seeger K., Skelton J., Squares R., Squares S., Sulston J.E., Taylor K., Whitehead S., Barrell B.G. TITLE Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence JOURNAL Nature 393(6685), 537-544(1998). PUBMED 9634230 REMARK Erratum:[Nature 1998 Nov 12;396(6707):190] REFERENCE 2 AUTHORS Camus J.C., Pryor M.J., Medigue C., Cole S.T. TITLE Re-annotation of the genome sequence of Mycobacterium tuberculosis H37Rv JOURNAL Microbiology (Reading, Engl.) 148(Pt 10), 2967-2973(2002). PUBMED 12368430 REFERENCE 3 AUTHORS Lew J.M., Kapopoulou A., Jones L.M., Cole S.T. TITLE TubercuList--10 years after JOURNAL Tuberculosis (Edinb) 91(1), 1-7(2011). PUBMED 20980199 REFERENCE 4 (bases 1 to 4411529) AUTHORS Parkhill J. JOURNAL Submitted (11-JUN-1998) to the INSDC. Submitted on behalf of the Mycobacterium tuberculosis sequencing and mapping teams, Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA Unite de Genetique Moleculaire Bacterienne, Institut Pasteur, 28 rue du Docteur Roux, 75724 Paris Cedex 15, France E-mail: parkhill@sanger.ac.uk REFERENCE 5 (bases 1 to 4411532) AUTHORS Lew J.M. JOURNAL Submitted (18-DEC-2012) to the INSDC. Lew J., Ecole Polytechnique Federale de Lausanne, CH-1015, Lausanne, Switzerland, and the Swiss Institute of Bioinformatics, CMU - Rue Michel-Servet 1, 1211 Geneva 4, SWITZERLAND COMMENT On or before Feb 1, 2013 this sequence version replaced gi:41352722, gi:38490165, gi:38490207, gi:41353619, gi:38490250, gi:38684030, gi:38490288, gi:41353667, gi:41353422, gi:41352756, gi:38490319, gi:41352785, gi:38490370, gi:41353971. Note: This annotation is from the TubercuList website, Release 26, Dec 2012 (URL: http://tuberculist.epfl.ch) (email: tuberculist@epfl.ch). FEATURES Qualifiers source /organism="Mycobacterium tuberculosis H37Rv" /strain="H37Rv" /mol_type="genomic DNA" /db_xref="taxon:83332" protein /transl_table=11 /gene="espI" /gene_synonym="snm3" /locus_tag="Rv3876" /note="Rv3876, (MTV027.11), len: 666 aa. EspI, ESX-1 secretion-associated protein, conserved pro-, ala-rich protein, similar to several proteins from Mycobacterium leprae e.g. Q9CDD8|ML0048 hypothetical protein (586 aa),FASTA scores: opt: 1682, E(): 2.1e-45, (50.75% identity in 672 aa overlap); O33082|MLCB628.11c hypothetical 52.0 KDA protein (478 aa), FASTA scores: opt: 1588, E(): 1.5e-42,(53.5% identity in 542 aa overlap) (also has a proline rich N-terminus); etc. Also similar to other proteins from Mycobacterium tuberculosis, especially in C-terminus, e.g. O06396|Rv0530|MTCY25D10.09 (405 aa), FASTA scores: opt: 670, E(): 2.5e-14, (34.85% identity in 396 aa overlap) (also has Pro-rich N-terminus); etc. Note that N-terminus is repetitive and highly Proline rich." /db_xref="EnsemblGenomes-Gn:Rv3876" /db_xref="EnsemblGenomes-Tr:CCP46705" /db_xref="GOA:P9WJC5" /db_xref="InterPro:IPR002586" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/Swiss-Prot:P9WJC5" /experiment="EXISTENCE: identified in proteomics study" BEGIN 1 MAADYDKLFR PHEGMEAPDD MAAQPFFDPS ASFPPAPASA NLPKPNGQTP PPTSDDLSER 61 FVSAPPPPPP PPPPPPPTPM PIAAGEPPSP EPAASKPPTP PMPIAGPEPA PPKPPTPPMP 121 IAGPEPAPPK PPTPPMPIAG PAPTPTESQL APPRPPTPQT PTGAPQQPES PAPHVPSHGP 181 HQPRRTAPAP PWAKMPIGEP PPAPSRPSAS PAEPPTRPAP QHSRRARRGH RYRTDTERNV 241 GKVATGPSIQ ARLRAEEASG AQLAPGTEPS PAPLGQPRSY LAPPTRPAPT EPPPSPSPQR 301 NSGRRAERRV HPDLAAQHAA AQPDSITAAT TGGRRRKRAA PDLDATQKSL RPAAKGPKVK 361 KVKPQKPKAT KPPKVVSQRG WRHWVHALTR INLGLSPDEK YELDLHARVR RNPRGSYQIA 421 VVGLKGGAGK TTLTAALGST LAQVRADRIL ALDADPGAGN LADRVGRQSG ATIADVLAEK 481 ELSHYNDIRA HTSVNAVNLE VLPAPEYSSA QRALSDADWH FIADPASRFY NLVLADCGAG 541 FFDPLTRGVL STVSGVVVVA SVSIDGAQQA SVALDWLRNN GYQDLASRAC VVINHIMPGE 601 PNVAVKDLVR HFEQQVQPGR VVVMPWDRHI AAGTEISLDL LDPIYKRKVL ELAAALSDDF 661 ERAGRR //