LOCUS       CCP43548.1 433 aa PRT BCT 27-FEB-2015
DEFINITION  Mycobacterium tuberculosis H37Rv Probable aminopeptidase PepC protein.
ACCESSION   AL123456-826
PROTEIN_ID  CCP43548.1
SOURCE      Mycobacterium tuberculosis H37Rv
  ORGANISM  Mycobacterium tuberculosis H37Rv
            Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae; Mycobacterium; Mycobacterium tuberculosis complex.
REFERENCE   1
  AUTHORS   Cole S.T., Brosch R., Parkhill J., Garnier T., Churcher C., Harris D., Gordon S.V., Eiglmeier K., Gas S., Barry C.E.III., Tekaia F., Badcock K., Basham D., Brown D., Chillingworth T., Connor R., Davies R., Devlin K., Feltwell T., Gentles S., Hamlin N., Holroyd S., Hornsby T., Jagels K., Krogh A., McLean J., Moule S., Murphy L., Oliver K., Osborne J., Quail M.A., Rajandream M.A., Rogers J., Rutter S., Seeger K., Skelton J., Squares R., Squares S., Sulston J.E., Taylor K., Whitehead S., Barrell B.G.
  TITLE     Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence
  JOURNAL   Nature 393(6685), 537-544(1998).
  PUBMED    9634230
  REMARK    Erratum:[Nature 1998 Nov 12;396(6707):190]
REFERENCE   2
  AUTHORS   Camus J.C., Pryor M.J., Medigue C., Cole S.T.
  TITLE     Re-annotation of the genome sequence of Mycobacterium tuberculosis H37Rv
  JOURNAL   Microbiology (Reading, Engl.) 148(Pt 10), 2967-2973(2002).
  PUBMED    12368430
REFERENCE   3
  AUTHORS   Lew J.M., Kapopoulou A., Jones L.M., Cole S.T.
  TITLE     TubercuList--10 years after
  JOURNAL   Tuberculosis (Edinb) 91(1), 1-7(2011).
  PUBMED    20980199
REFERENCE   4 (bases 1 to 4411529)
  AUTHORS   Parkhill J.
  JOURNAL   Submitted (11-JUN-1998) to the INSDC. Submitted on behalf of the Mycobacterium tuberculosis sequencing and mapping teams, Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA Unite de Genetique Moleculaire Bacterienne, Institut Pasteur, 28 rue du Docteur Roux, 75724 Paris Cedex 15, France E-mail: parkhill@sanger.ac.uk
REFERENCE   5 (bases 1 to 4411532)
  AUTHORS   Lew J.M.
  JOURNAL   Submitted (18-DEC-2012) to the INSDC.
            Lew J., Ecole Polytechnique Federale de Lausanne, CH-1015, Lausanne, Switzerland, and the Swiss Institute of Bioinformatics, CMU - Rue Michel-Servet 1, 1211 Geneva 4, SWITZERLAND
COMMENT     On or before Feb 1, 2013 this sequence version replaced gi:41352722, gi:38490165, gi:38490207, gi:41353619, gi:38490250, gi:38684030, gi:38490288, gi:41353667, gi:41353422, gi:41352756, gi:38490319, gi:41352785, gi:38490370, gi:41353971.
            Note: This annotation is from the TubercuList website, Release 26, Dec 2012 (URL: http://tuberculist.epfl.ch) (email: tuberculist@epfl.ch).
FEATURES             Qualifiers
     source          /organism="Mycobacterium tuberculosis H37Rv"
                     /strain="H37Rv"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:83332"
     protein         /transl_table=11
                     /gene="pepC"
                     /locus_tag="Rv0800"
                     /note="Rv0800, (MTCY07H7A.09c), len: 433 aa. Probable pepC, aminopeptidase I, highly similar (but shorter 17 aa) to Q50022|PEPX aminopeptidase from Mycobacterium leprae (443 aa), FASTA scores: opt: 2237, E(): 0, (78.3% identity in 433 aa overlap). Also highly similar to others from Eukaryotes and bacteria, e.g. T36482 probable aminopeptidase from Streptomyces coelicolor (432 aa), P14904|AMPL_YEAST vacuolar aminopeptidase I precursor from Saccharomyces cerevisiae (514 aa), FASTA scores: opt: 425, E(): 4.8e-21, (31.0% identity in 445 aa overlap); etc. Also similar to hypothetical proteins e.g. P38821|YHR3_YEAST hypothetical 54.2 kDa protein from Saccharomyces cerevisiae (490 aa), FASTA scores: opt: 429, E(): 2.5e-21, (34.8% identity in 443 aa overlap); etc. Conserved in M. tuberculosis, M. leprae, M. bovis and M. avium paratuberculosis; predicted to be essential for in vivo survival and pathogenicity (See Ribeiro-Guimaraes and Pessolani, 2007)."
                     /db_xref="EnsemblGenomes-Gn:Rv0800"
                     /db_xref="EnsemblGenomes-Tr:CCP43548"
                     /db_xref="GOA:P9WHT1"
                     /db_xref="InterPro:IPR001948"
                     /db_xref="InterPro:IPR022984"
                     /db_xref="InterPro:IPR023358"
                     /db_xref="UniProtKB/Swiss-Prot:P9WHT1"
                     /experiment="EXISTENCE: identified in proteomics study"
BEGIN
        1 MAATAHGLCE FIDASPSPFH VCATVAGRLL GAGYRELREA DRWPDKPGRY FTVRAGSLVA
       61 WNAEQSGHTQ VPFRIVGAHT DSPNLRVKQH PDRLVAGWHV VALQPYGGVW LHSWLDRDLG
      121 ISGRLSVRDG TGVSHRLVLI DDPILRVPQL AIHLAEDRKS LTLDPQRHIN AVWGVGERVE
      181 SFVGYVAQRA GVAAADVLAA DLMTHDLTPS ALIGASVNGT ASLLSAPRLD NQASCYAGME
      241 ALLAVDVDSA SSGFVPVLAI FDHEEVGSAS GHGAQSDLLS SVLERIVLAA GGTREDFLRR
      301 LTTSMLASAD MAHATHPNYP DRHEPSHPIE VNAGPVLKVH PNLRYATDGR TAAAFALACQ
      361 RAGVPMQRYE HRADLPCGST IGPLAAARTG IPTVDVGAAQ LAMHSARELM GAHDVAAYSA
      421 ALQAFLSAEL SEA
//