LOCUS CCP46700.1 591 aa PRT BCT 27-FEB-2015 DEFINITION Mycobacterium tuberculosis H37Rv ESX conserved component EccCb1. ESX-1 type VII secretion system protein. protein. ACCESSION AL123456-3978 PROTEIN_ID CCP46700.1 SOURCE Mycobacterium tuberculosis H37Rv ORGANISM Mycobacterium tuberculosis H37Rv Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae; Mycobacterium; Mycobacterium tuberculosis complex. REFERENCE 1 AUTHORS Cole S.T., Brosch R., Parkhill J., Garnier T., Churcher C., Harris D., Gordon S.V., Eiglmeier K., Gas S., Barry C.E.III., Tekaia F., Badcock K., Basham D., Brown D., Chillingworth T., Connor R., Davies R., Devlin K., Feltwell T., Gentles S., Hamlin N., Holroyd S., Hornsby T., Jagels K., Krogh A., McLean J., Moule S., Murphy L., Oliver K., Osborne J., Quail M.A., Rajandream M.A., Rogers J., Rutter S., Seeger K., Skelton J., Squares R., Squares S., Sulston J.E., Taylor K., Whitehead S., Barrell B.G. TITLE Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence JOURNAL Nature 393(6685), 537-544(1998). PUBMED 9634230 REMARK Erratum:[Nature 1998 Nov 12;396(6707):190] REFERENCE 2 AUTHORS Camus J.C., Pryor M.J., Medigue C., Cole S.T. TITLE Re-annotation of the genome sequence of Mycobacterium tuberculosis H37Rv JOURNAL Microbiology (Reading, Engl.) 148(Pt 10), 2967-2973(2002). PUBMED 12368430 REFERENCE 3 AUTHORS Lew J.M., Kapopoulou A., Jones L.M., Cole S.T. TITLE TubercuList--10 years after JOURNAL Tuberculosis (Edinb) 91(1), 1-7(2011). PUBMED 20980199 REFERENCE 4 (bases 1 to 4411529) AUTHORS Parkhill J. JOURNAL Submitted (11-JUN-1998) to the INSDC. Submitted on behalf of the Mycobacterium tuberculosis sequencing and mapping teams, Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA Unite de Genetique Moleculaire Bacterienne, Institut Pasteur, 28 rue du Docteur Roux, 75724 Paris Cedex 15, France E-mail: parkhill@sanger.ac.uk REFERENCE 5 (bases 1 to 4411532) AUTHORS Lew J.M. JOURNAL Submitted (18-DEC-2012) to the INSDC. Lew J., Ecole Polytechnique Federale de Lausanne, CH-1015, Lausanne, Switzerland, and the Swiss Institute of Bioinformatics, CMU - Rue Michel-Servet 1, 1211 Geneva 4, SWITZERLAND COMMENT On or before Feb 1, 2013 this sequence version replaced gi:41352722, gi:38490165, gi:38490207, gi:41353619, gi:38490250, gi:38684030, gi:38490288, gi:41353667, gi:41353422, gi:41352756, gi:38490319, gi:41352785, gi:38490370, gi:41353971. Note: This annotation is from the TubercuList website, Release 26, Dec 2012 (URL: http://tuberculist.epfl.ch) (email: tuberculist@epfl.ch). FEATURES Qualifiers source /organism="Mycobacterium tuberculosis H37Rv" /strain="H37Rv" /mol_type="genomic DNA" /db_xref="taxon:83332" protein /transl_table=11 /gene="eccCb1" /gene_synonym="snm2" /locus_tag="Rv3871" /note="Rv3871, (MTV027.06), len: 591 aa. EccCb1, esx conserved component, ESX-1 type VII secretion system protein, equivalent to Q9CDD7|ML0052 hypothetical protein from Mycobacterium leprae (597 aa) FASTA scores: opt: 3341,E(): 9.8e-192, (80.85% identity in 596 aa overlap); and O33086|MLCB628.15c hypothetical protein from Mycobacterium leprae (597 aa), FASTA scores: opt: 3329, E(): 5.1e-191,(80.55% identity in 596 aa overlap). And similar to C-terminal end of others e.g. Q9Z5I2|ML1543|MLCB596.28 possible SPOIIIE-family membrane protein from Mycobacterium leprae (1345 aa), FASTA scores: opt: 601, E(): 5.6e-28,(32.3% identity in 613 aa overlap); O86653|SC3C3.20c ATP/GTP binding protein from Streptomyces coelicolor (1321 aa), FASTA scores: opt: 977, E(): 2.1e-50, (35.15% identity in 583 aa overlap); Q9L0T6|SCD35.15c putative cell division-related protein from Streptomyces coelicolor (1525 aa), FASTA scores: opt: 414, E(): 9e-17, (27.6% identity in 424 aa overlap);P71068|YUKA YUKA protein from Bacillus subtilis (1207 aa), FASTA scores: opt: 343, E(): 1.3e-12,(25.8% identity in 395 aa overlap); etc. And similar to to C-terminal end of hypothetical proteins from Mycobacterium tuberculosis e.g. O06264|Rv3447c|MTCY77.19c (1236 aa) FASTA scores: opt: 845, E(): 1.5e-42, (35.3% identity in 586 aa overlap); O53689|Rv0284|MTV035.12 (1330 aa) FASTA scores: opt: 646, E(): 1.2e-30, (33.35% identity in 606 aa overlap); O53935|Rv1784|MTV049.06 (932 aa) FASTA scores: opt: 589, E(): 2.1e-27, (33.1% identity in 619 aa overlap); etc. Contains 2 X PS00017 ATP/GTP-binding site motif A (P-loop). Note some similarity (with hypothetical proteins from Mycobacterium tuberculosis and P71068|YUKA) continues in upstream ORF MTV027.05." /db_xref="EnsemblGenomes-Gn:Rv3871" /db_xref="EnsemblGenomes-Tr:CCP46700" /db_xref="GOA:P9WNB1" /db_xref="InterPro:IPR002543" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR023837" /db_xref="InterPro:IPR027417" /db_xref="UniProtKB/Swiss-Prot:P9WNB1" /inference="protein motif:PROSITE:PS00017" /experiment="EXISTENCE: identified in proteomics study" BEGIN 1 MTAEPEVRTL REVVLDQLGT AESRAYKMWL PPLTNPVPLN ELIARDRRQP LRFALGIMDE 61 PRRHLQDVWG VDVSGAGGNI GIGGAPQTGK STLLQTMVMS AAATHSPRNV QFYCIDLGGG 121 GLIYLENLPH VGGVANRSEP DKVNRVVAEM QAVMRQRETT FKEHRVGSIG MYRQLRDDPS 181 QPVASDPYGD VFLIIDGWPG FVGEFPDLEG QVQDLAAQGL AFGVHVIIST PRWTELKSRV 241 RDYLGTKIEF RLGDVNETQI DRITREIPAN RPGRAVSMEK HHLMIGVPRF DGVHSADNLV 301 EAITAGVTQI ASQHTEQAPP VRVLPERIHL HELDPNPPGP ESDYRTRWEI PIGLRETDLT 361 PAHCHMHTNP HLLIFGAAKS GKTTIAHAIA RAICARNSPQ QVRFMLADYR SGLLDAVPDT 421 HLLGAGAINR NSASLDEAVQ ALAVNLKKRL PPTDLTTAQL RSRSWWSGFD VVLLVDDWHM 481 IVGAAGGMPP MAPLAPLLPA AADIGLHIIV TCQMSQAYKA TMDKFVGAAF GSGAPTMFLS 541 GEKQEFPSSE FKVKRRPPGQ AFLVSPDGKE VIQAPYIEPP EEVFAAPPSA G //