LOCUS       CCP45581.1   438 aa   PRT   BCT   27-FEB-2015
DEFINITION  Mycobacterium tuberculosis H37Rv Probable zinc protease PepR
            protein.
ACCESSION   AL123456-2859
PROTEIN_ID  CCP45581.1
SOURCE      Mycobacterium tuberculosis H37Rv
  ORGANISM  Mycobacterium tuberculosis H37Rv
            Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae;
            Mycobacterium; Mycobacterium tuberculosis complex.
REFERENCE   1
  AUTHORS   Cole S.T., Brosch R., Parkhill J., Garnier T., Churcher C.,
            Harris D., Gordon S.V., Eiglmeier K., Gas S., Barry C.E.III.,
            Tekaia F., Badcock K., Basham D., Brown D., Chillingworth T.,
            Connor R., Davies R., Devlin K., Feltwell T., Gentles S.,
            Hamlin N., Holroyd S., Hornsby T., Jagels K., Krogh A., McLean J.,
            Moule S., Murphy L., Oliver K., Osborne J., Quail M.A.,
            Rajandream M.A., Rogers J., Rutter S., Seeger K., Skelton J.,
            Squares R., Squares S., Sulston J.E., Taylor K., Whitehead S.,
            Barrell B.G.
  TITLE     Deciphering the biology of Mycobacterium tuberculosis from the
            complete genome sequence
  JOURNAL   Nature 393(6685), 537-544(1998).
  PUBMED    9634230
  REMARK    Erratum:[Nature 1998 Nov 12;396(6707):190]
REFERENCE   2
  AUTHORS   Camus J.C., Pryor M.J., Medigue C., Cole S.T.
  TITLE     Re-annotation of the genome sequence of Mycobacterium
            tuberculosis H37Rv
  JOURNAL   Microbiology (Reading, Engl.) 148(Pt 10), 2967-2973(2002).
  PUBMED    12368430
REFERENCE   3
  AUTHORS   Lew J.M., Kapopoulou A., Jones L.M., Cole S.T.
  TITLE     TubercuList--10 years after
  JOURNAL   Tuberculosis (Edinb) 91(1), 1-7(2011).
  PUBMED    20980199
REFERENCE   4  (bases 1 to 4411529)
  AUTHORS   Parkhill J.
  JOURNAL   Submitted (11-JUN-1998) to the INSDC.
            Submitted on behalf of the Mycobacterium tuberculosis sequencing
            and mapping teams, Sanger Centre, Wellcome Trust Genome Campus,
            Hinxton, Cambridge CB10 1SA Unite de Genetique Moleculaire
            Bacterienne, Institut Pasteur, 28 rue du Docteur Roux, 75724
            Paris Cedex 15, France E-mail: parkhill@sanger.ac.uk
REFERENCE   5  (bases 1 to 4411532)
  AUTHORS   Lew J.M.
  JOURNAL   Submitted (18-DEC-2012) to the INSDC.
            Lew J., Ecole Polytechnique Federale de Lausanne, CH-1015,
            Lausanne, Switzerland, and the Swiss Institute of Bioinformatics,
            CMU - Rue Michel-Servet 1, 1211 Geneva 4, SWITZERLAND
COMMENT     On or before Feb 1, 2013 this sequence version replaced
            gi:41352722, gi:38490165, gi:38490207, gi:41353619, gi:38490250,
            gi:38684030, gi:38490288, gi:41353667, gi:41353422, gi:41352756,
            gi:38490319, gi:41352785, gi:38490370, gi:41353971.
            Note: This annotation is from the TubercuList website, Release
            26, Dec 2012 (URL: http://tuberculist.epfl.ch) (email:
            tuberculist@epfl.ch).
FEATURES             Qualifiers
     source          /organism="Mycobacterium tuberculosis H37Rv"
                     /strain="H37Rv"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:83332"
     protein         /transl_table=11
                     /gene="pepR"
                     /locus_tag="Rv2782c"
                     /note="Rv2782c, (MTV002.47c), len: 438 aa. Probable pepR,
                     protease/peptidase, equivalent to
                     O32965|YR82_MYCLE|ML0855|MLCB22.26c hypothetical zinc
                     protease from Mycobacterium leprae (445 aa), FASTA
                     scores: opt: 2346, E(): 4.3e-146, (84.3% identity in 421
                     aa overlap). Also highly similar to others e.g.
                     O86835|YA12_STRCO|SC9A10.02 from Streptomyces coelicolor
                     (459 aa), FASTA scores: opt: 1394, E(): 1.1e-83, (51.9%
                     identity in 416 aa overlap); Q04805|YMXG_BACSU|YMXG from
                     Bacillus subtilis (409 aa), FASTA scores: opt: 1014,
                     E(): 7.9e-59, (37.55% identity in 410 aa overlap);
                     Q9KA85|BH2405 from Bacillus halodurans (413 aa), FASTA
                     scores: opt: 967, E(): 9.6e-56, (38.6% identity in 417
                     aa overlap); etc. Contains PS00143 Insulinase family,
                     zinc-binding region signature. Belongs to peptidase
                     family M16, also known as the insulinase family.
                     Cofactor: requires divalent cations for activity. Binds
                     zinc. Conserved in M. tuberculosis, M. leprae, M. bovis
                     and M. avium paratuberculosis; predicted to be essential
                     for in vivo survival and pathogenicity (See
                     Ribeiro-Guimaraes and Pessolani, 2007)."
                     /db_xref="EnsemblGenomes-Gn:Rv2782c"
                     /db_xref="EnsemblGenomes-Tr:CCP45581"
                     /db_xref="GOA:P9WHT5"
                     /db_xref="InterPro:IPR001431"
                     /db_xref="InterPro:IPR007863"
                     /db_xref="InterPro:IPR011249"
                     /db_xref="InterPro:IPR011765"
                     /db_xref="UniProtKB/Swiss-Prot:P9WHT5"
                     /inference="protein motif:PROSITE:PS00143"
                     /experiment="EXISTENCE: identified in proteomics study"
BEGIN
        1 MPRRSPADPA AALAPRRTTL PGGLRVVTEF LPAVHSASVG VWVGVGSRDE GATVAGAAHF
       61 LEHLLFKSTP TRSAVDIAQA MDAVGGELNA FTAKEHTCYY AHVLGSDLPL AVDLVADVVL
      121 NGRCAADDVE VERDVVLEEI AMRDDDPEDA LADMFLAALF GDHPVGRPVI GSAQSVSVMT
      181 RAQLQSFHLR RYTPERMVVA AAGNVDHDGL VALVREHFGS RLVRGRRPVA PRKGTGRVNG
      241 SPRLTLVSRD AEQTHVSLGI RTPGRGWEHR WALSVLHTAL GGGLSSRLFQ EVRETRGLAY
      301 SVYSALDLFA DSGALSVYAA CLPERFADVM RVTADVLESV ARDGITEAEC GIAKGSLRGG
      361 LVLGLEDSSS RMSRLGRSEL NYGKHRSIEH TLRQIEQVTV EEVNAVARHL LSRRYGAAVL
      421 GPHGSKRSLP QQLRAMVG
//