LOCUS CCP45717.1 370 aa PRT BCT 27-FEB-2015 DEFINITION Mycobacterium tuberculosis H37Rv Conserved protein protein. ACCESSION AL123456-2995 PROTEIN_ID CCP45717.1 SOURCE Mycobacterium tuberculosis H37Rv ORGANISM Mycobacterium tuberculosis H37Rv Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae; Mycobacterium; Mycobacterium tuberculosis complex. REFERENCE 1 AUTHORS Cole S.T., Brosch R., Parkhill J., Garnier T., Churcher C., Harris D., Gordon S.V., Eiglmeier K., Gas S., Barry C.E.III., Tekaia F., Badcock K., Basham D., Brown D., Chillingworth T., Connor R., Davies R., Devlin K., Feltwell T., Gentles S., Hamlin N., Holroyd S., Hornsby T., Jagels K., Krogh A., McLean J., Moule S., Murphy L., Oliver K., Osborne J., Quail M.A., Rajandream M.A., Rogers J., Rutter S., Seeger K., Skelton J., Squares R., Squares S., Sulston J.E., Taylor K., Whitehead S., Barrell B.G. TITLE Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence JOURNAL Nature 393(6685), 537-544(1998). PUBMED 9634230 REMARK Erratum:[Nature 1998 Nov 12;396(6707):190] REFERENCE 2 AUTHORS Camus J.C., Pryor M.J., Medigue C., Cole S.T. TITLE Re-annotation of the genome sequence of Mycobacterium tuberculosis H37Rv JOURNAL Microbiology (Reading, Engl.) 148(Pt 10), 2967-2973(2002). PUBMED 12368430 REFERENCE 3 AUTHORS Lew J.M., Kapopoulou A., Jones L.M., Cole S.T. TITLE TubercuList--10 years after JOURNAL Tuberculosis (Edinb) 91(1), 1-7(2011). PUBMED 20980199 REFERENCE 4 (bases 1 to 4411529) AUTHORS Parkhill J. JOURNAL Submitted (11-JUN-1998) to the INSDC. Submitted on behalf of the Mycobacterium tuberculosis sequencing and mapping teams, Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA Unite de Genetique Moleculaire Bacterienne, Institut Pasteur, 28 rue du Docteur Roux, 75724 Paris Cedex 15, France E-mail: parkhill@sanger.ac.uk REFERENCE 5 (bases 1 to 4411532) AUTHORS Lew J.M. JOURNAL Submitted (18-DEC-2012) to the INSDC. Lew J., Ecole Polytechnique Federale de Lausanne, CH-1015, Lausanne, Switzerland, and the Swiss Institute of Bioinformatics, CMU - Rue Michel-Servet 1, 1211 Geneva 4, SWITZERLAND COMMENT On or before Feb 1, 2013 this sequence version replaced gi:41352722, gi:38490165, gi:38490207, gi:41353619, gi:38490250, gi:38684030, gi:38490288, gi:41353667, gi:41353422, gi:41352756, gi:38490319, gi:41352785, gi:38490370, gi:41353971. Note: This annotation is from the TubercuList website, Release 26, Dec 2012 (URL: http://tuberculist.epfl.ch) (email: tuberculist@epfl.ch). FEATURES Qualifiers source /organism="Mycobacterium tuberculosis H37Rv" /strain="H37Rv" /mol_type="genomic DNA" /db_xref="taxon:83332" protein /transl_table=11 /locus_tag="Rv2915c" /note="Rv2915c, (MTCY338.03c), len: 370 aa. Conserved protein, posssibly XAA-pro dipeptidase (prolidase), highly similar to CAC38796|SCI39.08c conserved hypothetical protein from Streptomyces coelicolor (363 aa), FASTA scores: opt: 1341, E(): 5.5e-76, (56.65% identity in 362 aa overlap); and similar to prolidases (XAA-pro dipeptidase) e.g. Q9ABC9|CC0300 putative XAA-pro dipeptidase from Caulobacter crescentus (428 aa), FASTA scores: opt: 327,E(): 7.4e-13, (30.2% identity in 374 aa overlap); Q97XD4 prolidase from Sulfolobus solfataricus (396 aa), FASTA scores: opt: 271, E(): 2.1e-09, (30.5% identity in 354 aa overlap); Q9WX55 prolidase from Microbacterium esteraromaticum (393 aa), FASTA scores: opt: 256, E(): 1.8e-08, (27.95% identity in 365 aa overlap); etc. Also similar to O53619|Rv0074|MTV030.18 conserved hypothetical protein from Mycobacterium tuberculosis (411 aa), FASTA scores: opt: 243, E(): 1.2e-07, (27.5% identity in 389 aa overlap)." /db_xref="EnsemblGenomes-Gn:Rv2915c" /db_xref="EnsemblGenomes-Tr:CCP45717" /db_xref="GOA:P9WL23" /db_xref="InterPro:IPR006680" /db_xref="InterPro:IPR032466" /db_xref="UniProtKB/Swiss-Prot:P9WL23" /experiment="EXISTENCE: identified in proteomics study" BEGIN 1 MKRVDTIRPR SRAVRLHVRG LGLPDETAIQ LWIVDGRIST EPVAGADTVF DGGWILPGLV 61 DAHCHVGLGK HGNVELDEAI AQAETERDVG ALLLRDCGSP TDTRGLDDHE DLPRIIRAGR 121 HLARPKRYIA GFAVELEDES QLPAAVAEQA RRGDGWVKLV GDWIDRQIGD LAPLWSDDVL 181 KAAIDTAHAQ GARVTAHVFS EDALPGLINA GIDCIEHGTG LTDDTIALML EHGTALVPTL 241 INLENFPGIA DAAGRYPTYA AHMRDLYARG YGRVAAAREA GVPVYAGTDA GSTIEHGRIA 301 DEVAALQRIG MTAHEALGAA CWDARRWLGR PGLDDRASAD LLCYAQDPRQ GPGVLQHPDL 361 VILRGRTFGP //