LOCUS CCP46271.1 455 aa PRT BCT 27-FEB-2015 DEFINITION Mycobacterium tuberculosis H37Rv Probable membrane-anchored mycosin MycP4 (serine protease) (subtilisin-like protease) (subtilase-like) (mycosin-4) protein. ACCESSION AL123456-3549 PROTEIN_ID CCP46271.1 SOURCE Mycobacterium tuberculosis H37Rv ORGANISM Mycobacterium tuberculosis H37Rv Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae; Mycobacterium; Mycobacterium tuberculosis complex. REFERENCE 1 AUTHORS Cole S.T., Brosch R., Parkhill J., Garnier T., Churcher C., Harris D., Gordon S.V., Eiglmeier K., Gas S., Barry C.E.III., Tekaia F., Badcock K., Basham D., Brown D., Chillingworth T., Connor R., Davies R., Devlin K., Feltwell T., Gentles S., Hamlin N., Holroyd S., Hornsby T., Jagels K., Krogh A., McLean J., Moule S., Murphy L., Oliver K., Osborne J., Quail M.A., Rajandream M.A., Rogers J., Rutter S., Seeger K., Skelton J., Squares R., Squares S., Sulston J.E., Taylor K., Whitehead S., Barrell B.G. TITLE Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence JOURNAL Nature 393(6685), 537-544(1998). PUBMED 9634230 REMARK Erratum:[Nature 1998 Nov 12;396(6707):190] REFERENCE 2 AUTHORS Camus J.C., Pryor M.J., Medigue C., Cole S.T. TITLE Re-annotation of the genome sequence of Mycobacterium tuberculosis H37Rv JOURNAL Microbiology (Reading, Engl.) 148(Pt 10), 2967-2973(2002). PUBMED 12368430 REFERENCE 3 AUTHORS Lew J.M., Kapopoulou A., Jones L.M., Cole S.T. TITLE TubercuList--10 years after JOURNAL Tuberculosis (Edinb) 91(1), 1-7(2011). PUBMED 20980199 REFERENCE 4 (bases 1 to 4411529) AUTHORS Parkhill J. JOURNAL Submitted (11-JUN-1998) to the INSDC. Submitted on behalf of the Mycobacterium tuberculosis sequencing and mapping teams, Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA Unite de Genetique Moleculaire Bacterienne, Institut Pasteur, 28 rue du Docteur Roux, 75724 Paris Cedex 15, France E-mail: parkhill@sanger.ac.uk REFERENCE 5 (bases 1 to 4411532) AUTHORS Lew J.M. JOURNAL Submitted (18-DEC-2012) to the INSDC. Lew J., Ecole Polytechnique Federale de Lausanne, CH-1015, Lausanne, Switzerland, and the Swiss Institute of Bioinformatics, CMU - Rue Michel-Servet 1, 1211 Geneva 4, SWITZERLAND COMMENT On or before Feb 1, 2013 this sequence version replaced gi:41352722, gi:38490165, gi:38490207, gi:41353619, gi:38490250, gi:38684030, gi:38490288, gi:41353667, gi:41353422, gi:41352756, gi:38490319, gi:41352785, gi:38490370, gi:41353971. Note: This annotation is from the TubercuList website, Release 26, Dec 2012 (URL: http://tuberculist.epfl.ch) (email: tuberculist@epfl.ch). FEATURES Qualifiers source /organism="Mycobacterium tuberculosis H37Rv" /strain="H37Rv" /mol_type="genomic DNA" /db_xref="taxon:83332" protein /transl_table=11 /gene="mycP4" /locus_tag="Rv3449" /note="Rv3449, (MTCY13E12.02), len: 455 aa. Probable mycP4,membrane-anchored serine protease (mycosin) (see citation below), similar to hypothetical unknowns or proteases from Mycobacterium tuberculosis strains H37Rv and CDC1551 e.g. AAK48366|MT3998 subtilase family protein from Mycobacterium tuberculosis strain CDC1551 (411 aa), FASTA scores: opt: 747, E(): 3.5e-33, (45.65% identity in 416 aa overlap); O05461|Rv3883c|MTCY15F10.29 membrane-anchored mycosin MYCP1 (446 aa), FASTA scores: opt: 747, E(): 3.8e-33, (45.45% identity in 451 aa overlap); O53695|Rv0291|MTV035.19 probable membrane-anchored mycosin MYCP2 (461 aa), FASTA scores: opt: 660, E(): 1.9e-28, (44.0% identity in 457 aa overlap); etc. And similar to hypothetical proteases from Mycobacterium leprae e.g. O33076|MLCB628.04|ML0041 hypothetical 45.7 KDA protein (probable secreted protease) (446 aa), FASTA scores: opt: 683, E(): 1.1e-29, (43.8% identity in 450 aa overlap); Q9CD36|ML2528 putative protease (475 aa), FASTA scores: opt: 608, E(): 1.3e-25,(43.0% identity in 451 aa overlap); Q9CBV3|ML1538 possible protease (567 aa), FASTA scores: opt: 389, E(): 9.7e-14,(33.8% identity in 562 aa overlap); etc. Also some similarity to other proteases from several organisms e.g. O31788|APRX alkaline serine protease from Bacillus subtilis (442 aa), FASTA scores: opt: 296, E(): 8.3e-09, (29.4% identity in 313 aa overlap); O86650|SC3C3.17c putative secreted serine protease from Streptomyces coelicolor (450 aa), FASTA scores: opt: 279, E(): 7e-08, (33.55% identity in 343 aa overlap); Q9KBJ7|APRX|BH193 intracellular alkaline serine protease from Bacillus halodurans (444 aa),FASTA scores: opt: 257, E(): 1.1e-06, (28.65% identity in 335 aa overlap); O86642|SC3C3.08 serine protease from Streptomyces coelicolor (413 aa), FASTA scores: opt: 243,E(): 5.7e-06, (38.25% identity in 387 aa overlap); etc. Has putative signal peptide at N-terminus and hydrophobic stretch at C-terminus. Contains three signatures typical of subtilase family: aspartic acid active site (PS00136),histidine active site (PS00137), and serine active site (PS00138). Belongs to peptidase family S8 (also known as the subtilase family), pyrolysin subfamily." /db_xref="EnsemblGenomes-Gn:Rv3449" /db_xref="EnsemblGenomes-Tr:CCP46271" /db_xref="GOA:I6YC58" /db_xref="InterPro:IPR000209" /db_xref="InterPro:IPR015500" /db_xref="InterPro:IPR022398" /db_xref="InterPro:IPR023827" /db_xref="InterPro:IPR023828" /db_xref="InterPro:IPR023834" /db_xref="InterPro:IPR036852" /db_xref="UniProtKB/Swiss-Prot:I6YC58" /inference="protein motif:PROSITE:PS00136" /inference="protein motif:PROSITE:PS00137" /inference="protein motif:PROSITE:PS00138" /experiment="EXISTENCE: identified in proteomics study" BEGIN 1 MTTSRTLRLL VVSALATLSG LGTPVAHAVS PPPIDERWLP ESALPAPPRP TVQREVCTEV 61 TAESGRAFGR AERSAQLADL DQVWRLTRGA GQRVAVIDTG VARHRRLPKV VAGGDYVFTG 121 DGTADCDAHG TLVAGIIAAA PDAQSDNFSG VAPDVTLISI RQSSSKFAPV GDPSSTGVGD 181 VDTMAKAVRT AADLGASVIN ISSIACVPAA AAPDDRALGA ALAYAVDVKN AVIVAAAGNT 241 GGAAQCPPQA PGVTRDSVTV AVSPAWYDDY VLTVGSVNAQ GEPSAFTLAG PWVDVAATGE 301 AVTSLSPFGD GTVNRLGGQH GSIPISGTSY AAPVVSGLAA LIRARFPTLT ARQVMQRIES 361 TAHHPPAGWD PLVGNGTVDA LAAVSSDSIP QAGTATSDPA PVAVPVPRRS TPGPSDRRAL 421 HTAFAGAAIC LLALMATLAT ASRRLRPGRN GIAGD //