LOCUS       CCP43733.1               464 aa    PRT    BCT 27-FEB-2015
DEFINITION  Mycobacterium tuberculosis H37Rv Probable serine protease PepD
            (serine proteinase) (MTB32B) protein.
ACCESSION   AL123456-1011
PROTEIN_ID  CCP43733.1
SOURCE      Mycobacterium tuberculosis H37Rv
  ORGANISM  Mycobacterium tuberculosis H37Rv
            Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae;
            Mycobacterium; Mycobacterium tuberculosis complex.
REFERENCE   1
  AUTHORS   Cole S.T., Brosch R., Parkhill J., Garnier T., Churcher C.,
            Harris D., Gordon S.V., Eiglmeier K., Gas S., Barry C.E.III.,
            Tekaia F., Badcock K., Basham D., Brown D., Chillingworth T.,
            Connor R., Davies R., Devlin K., Feltwell T., Gentles S.,
            Hamlin N., Holroyd S., Hornsby T., Jagels K., Krogh A., McLean J.,
            Moule S., Murphy L., Oliver K., Osborne J., Quail M.A.,
            Rajandream M.A., Rogers J., Rutter S., Seeger K., Skelton J.,
            Squares R., Squares S., Sulston J.E., Taylor K., Whitehead S.,
            Barrell B.G.
  TITLE     Deciphering the biology of Mycobacterium tuberculosis from the
            complete genome sequence
  JOURNAL   Nature 393(6685), 537-544(1998).
  PUBMED    9634230
  REMARK    Erratum:[Nature 1998 Nov 12;396(6707):190]
REFERENCE   2
  AUTHORS   Camus J.C., Pryor M.J., Medigue C., Cole S.T.
  TITLE     Re-annotation of the genome sequence of Mycobacterium
            tuberculosis H37Rv
  JOURNAL   Microbiology (Reading, Engl.) 148(Pt 10), 2967-2973(2002).
  PUBMED    12368430
REFERENCE   3
  AUTHORS   Lew J.M., Kapopoulou A., Jones L.M., Cole S.T.
  TITLE     TubercuList--10 years after
  JOURNAL   Tuberculosis (Edinb) 91(1), 1-7(2011).
  PUBMED    20980199
REFERENCE   4  (bases 1 to 4411529)
  AUTHORS   Parkhill J.
  JOURNAL   Submitted (11-JUN-1998) to the INSDC. Submitted on behalf of the
            Mycobacterium tuberculosis sequencing and mapping teams, Sanger
            Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA
            Unite de Genetique Moleculaire Bacterienne, Institut Pasteur, 28
            rue du Docteur Roux, 75724 Paris Cedex 15, France E-mail:
            parkhill@sanger.ac.uk
REFERENCE   5  (bases 1 to 4411532)
  AUTHORS   Lew J.M.
  JOURNAL   Submitted (18-DEC-2012) to the INSDC. Lew J., Ecole Polytechnique
            Federale de Lausanne, CH-1015, Lausanne, Switzerland, and the
            Swiss Institute of Bioinformatics, CMU - Rue Michel-Servet 1, 1211
            Geneva 4, SWITZERLAND
COMMENT     On or before Feb 1, 2013 this sequence version replaced
            gi:41352722, gi:38490165, gi:38490207, gi:41353619, gi:38490250,
            gi:38684030, gi:38490288, gi:41353667, gi:41353422, gi:41352756,
            gi:38490319, gi:41352785, gi:38490370, gi:41353971.
            Note: This annotation is from the TubercuList website, Release 26,
            Dec 2012 (URL: http://tuberculist.epfl.ch) (email:
            tuberculist@epfl.ch).
FEATURES             Qualifiers
     source          /organism="Mycobacterium tuberculosis H37Rv"
                     /strain="H37Rv"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:83332"
     protein         /transl_table=11
                     /gene="pepD"
                     /gene_synonym="mtb32b"
                     /locus_tag="Rv0983"
                     /note="Rv0983, (MTV044.11), len: 464 aa. Probable pepD
                     (alternate gene name: mtb32b), secreted or membrane
                     serine protease (see citation below), equivalent (but
                     longer 18 aa in N-terminus) to
                     AL035500|MLCL373_17|T45448 probable serine proteinase
                     from Mycobacterium leprae (452 aa), FASTA score: (74.2%
                     identity in 466 aa overlap); and highly similar to
                     others from Mycobacterium leprae. Also highly similar
                     (except in N-terminus) to other proteases e.g.
                     CAC01350.1|AL390975 putative protease from Streptomyces
                     coelicolor (542 aa); NP_440705.1|NC_000911|HtrA serine
                     protease from Synechocystis sp. (452 aa);
                     NP_346646.1|NC_003028 serine protease from Streptococcus
                     pneumoniae (393 aa); etc. Also similar in part to
                     members of the htrA-antigen family e.g.
                     U87242|MTU87242_3|HtrA serine protease from M.
                     tuberculosis (542 aa), FASTA scores: opt: 846, E():
                     2e-28, (40.6% identity in 392 aa overlap); and similar
                     to other hypothetical serine proteases e.g. Rv0983,
                     Rv0125, etc. Belongs to the serine protease family.
                     Conserved in M. tuberculosis, M. leprae, M. bovis and M.
                     avium paratuberculosis; predicted to be essential for in
                     vivo survival and pathogenicity (See Ribeiro-Guimaraes
                     and Pessolani, 2007)."
                     /db_xref="EnsemblGenomes-Gn:Rv0983"
                     /db_xref="EnsemblGenomes-Tr:CCP43733"
                     /db_xref="GOA:O53896"
                     /db_xref="InterPro:IPR001478"
                     /db_xref="InterPro:IPR001940"
                     /db_xref="InterPro:IPR009003"
                     /db_xref="InterPro:IPR036034"
                     /db_xref="PDB:1Y8T"
                     /db_xref="PDB:2Z9I"
                     /db_xref="UniProtKB/TrEMBL:O53896"
                     /experiment="EXISTENCE: identified in proteomics study"
BEGIN
        1 MAKLARVVGL VQEEQPSDMT NHPRYSPPPQ QPGTPGYAQG QQQTYSQQFD WRYPPSPPPQ
       61 PTQYRQPYEA LGGTRPGLIP GVIPTMTPPP GMVRQRPRAG MLAIGAVTIA VVSAGIGGAA
      121 ASLVGFNRAP AGPSGGPVAA SAAPSIPAAN MPPGSVEQVA AKVVPSVVML ETDLGRQSEE
      181 GSGIILSAEG LILTNNHVIA AAAKPPLGSP PPKTTVTFSD GRTAPFTVVG ADPTSDIAVV
      241 RVQGVSGLTP ISLGSSSDLR VGQPVLAIGS PLGLEGTVTT GIVSALNRPV STTGEAGNQN
      301 TVLDAIQTDA AINPGNSGGA LVNMNAQLVG VNSAIATLGA DSADAQSGSI GLGFAIPVDQ
      361 AKRIADELIS TGKASHASLG VQVTNDKDTL GAKIVEVVAG GAAANAGVPK GVVVTKVDDR
      421 PINSADALVA AVRSKAPGAT VALTFQDPSG GSRTVQVTLG KAEQ
//