LOCUS CCP43021.1 461 aa PRT BCT 27-FEB-2015 DEFINITION Mycobacterium tuberculosis H37Rv Probable membrane-anchored mycosin MycP3 (serine protease) (subtilisin-like protease) (subtilase-like) (mycosin-3) protein. ACCESSION AL123456-299 PROTEIN_ID CCP43021.1 SOURCE Mycobacterium tuberculosis H37Rv ORGANISM Mycobacterium tuberculosis H37Rv Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae; Mycobacterium; Mycobacterium tuberculosis complex. REFERENCE 1 AUTHORS Cole S.T., Brosch R., Parkhill J., Garnier T., Churcher C., Harris D., Gordon S.V., Eiglmeier K., Gas S., Barry C.E.III., Tekaia F., Badcock K., Basham D., Brown D., Chillingworth T., Connor R., Davies R., Devlin K., Feltwell T., Gentles S., Hamlin N., Holroyd S., Hornsby T., Jagels K., Krogh A., McLean J., Moule S., Murphy L., Oliver K., Osborne J., Quail M.A., Rajandream M.A., Rogers J., Rutter S., Seeger K., Skelton J., Squares R., Squares S., Sulston J.E., Taylor K., Whitehead S., Barrell B.G. TITLE Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence JOURNAL Nature 393(6685), 537-544(1998). PUBMED 9634230 REMARK Erratum:[Nature 1998 Nov 12;396(6707):190] REFERENCE 2 AUTHORS Camus J.C., Pryor M.J., Medigue C., Cole S.T. TITLE Re-annotation of the genome sequence of Mycobacterium tuberculosis H37Rv JOURNAL Microbiology (Reading, Engl.) 148(Pt 10), 2967-2973(2002). PUBMED 12368430 REFERENCE 3 AUTHORS Lew J.M., Kapopoulou A., Jones L.M., Cole S.T. TITLE TubercuList--10 years after JOURNAL Tuberculosis (Edinb) 91(1), 1-7(2011). PUBMED 20980199 REFERENCE 4 (bases 1 to 4411529) AUTHORS Parkhill J. JOURNAL Submitted (11-JUN-1998) to the INSDC. Submitted on behalf of the Mycobacterium tuberculosis sequencing and mapping teams, Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA Unite de Genetique Moleculaire Bacterienne, Institut Pasteur, 28 rue du Docteur Roux, 75724 Paris Cedex 15, France E-mail: parkhill@sanger.ac.uk REFERENCE 5 (bases 1 to 4411532) AUTHORS Lew J.M. JOURNAL Submitted (18-DEC-2012) to the INSDC. Lew J., Ecole Polytechnique Federale de Lausanne, CH-1015, Lausanne, Switzerland, and the Swiss Institute of Bioinformatics, CMU - Rue Michel-Servet 1, 1211 Geneva 4, SWITZERLAND COMMENT On or before Feb 1, 2013 this sequence version replaced gi:41352722, gi:38490165, gi:38490207, gi:41353619, gi:38490250, gi:38684030, gi:38490288, gi:41353667, gi:41353422, gi:41352756, gi:38490319, gi:41352785, gi:38490370, gi:41353971. Note: This annotation is from the TubercuList website, Release 26, Dec 2012 (URL: http://tuberculist.epfl.ch) (email: tuberculist@epfl.ch). FEATURES Qualifiers source /organism="Mycobacterium tuberculosis H37Rv" /strain="H37Rv" /mol_type="genomic DNA" /db_xref="taxon:83332" protein /transl_table=11 /gene="mycP3" /locus_tag="Rv0291" /note="Rv0291, (MTV035.19), len: 461 aa. Probable mycP3,membrane-anchored serine protease (mycosin) (see Brown et al., 2000), similar to several others in mycobacteria e.g. Z94121|MTY15F10_28|Rv1796 from Mycobacterium tuberculosis (446 aa), FASTA scores: opt: 1168, E(): 0, (44.6% identity in 453 aa overlap); Rv3886c; Rv3883c; Rv3449; and Y14967|MLCB628_4|MLCB628.04 from Mycobacterium leprae (446 aa), FASTA scores: opt: 1159, E(): 0, (43.5 identity in 446 aa overlap). Has signal sequence and hydrophobic stretch at C-terminus, followed by short positively charged segment,that seems to act as a membrane anchor. Contains PS00137 Serine proteases, subtilase family, histidine active site signature. Belongs to peptidase family S8 (also known as the subtilase family), pyrolysin subfamily. Conserved in M. tuberculosis, M. leprae, M. bovis and M. avium paratuberculosis; predicted to be essential for in vivo survival and pathogenicity (See Ribeiro-Guimaraes and Pessolani, 2007)." /db_xref="EnsemblGenomes-Gn:Rv0291" /db_xref="EnsemblGenomes-Tr:CCP43021" /db_xref="GOA:O53695" /db_xref="InterPro:IPR000209" /db_xref="InterPro:IPR015500" /db_xref="InterPro:IPR022398" /db_xref="InterPro:IPR023834" /db_xref="InterPro:IPR036852" /db_xref="UniProtKB/Swiss-Prot:O53695" /inference="protein motif:PROSITE:PS00137" /experiment="EXISTENCE: identified in proteomics study" BEGIN 1 MIRAAFACLA ATVVVAGWWT PPAWAIGPPV VDAAAQPPSG DPGPVAPMEQ RGACSVSGVI 61 PGTDPGVPTP SQTMLNLPAA WQFSRGEGQL VAIIDTGVQP GPRLPNVDAG GDFVESTDGL 121 TDCDGHGTLV AGIVAGQPGN DGFSGVAPAA RLLSIRAMST KFSPRTSGGD PQLAQATLDV 181 AVLAGAIVHA ADLGAKVINV STITCLPADR MVDQAALGAA IRYAAVDKDA VIVAAAGNTG 241 ASGSVSASCD SNPLTDLSRP DDPRNWAGVT SVSIPSWWQP YVLSVASLTS AGQPSKFSMP 301 GPWVGIAAPG ENIASVSNSG DGALANGLPD AHQKLVALSG TSYAAGYVSG VAALVRSRYP 361 GLNATEVVRR LTATAHRGAR ESSNIVGAGN LDAVAALTWQ LPAEPGGGAA PAKPVADPPV 421 PAPKDTTPRN VAFAGAAALS VLVGLTAATV AIARRRREPT E //