LOCUS CCP46453.1 431 aa PRT BCT 27-FEB-2015 DEFINITION Mycobacterium tuberculosis H37Rv Probable conserved integral membrane protein protein. ACCESSION AL123456-3731 PROTEIN_ID CCP46453.1 SOURCE Mycobacterium tuberculosis H37Rv ORGANISM Mycobacterium tuberculosis H37Rv Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae; Mycobacterium; Mycobacterium tuberculosis complex. REFERENCE 1 AUTHORS Cole S.T., Brosch R., Parkhill J., Garnier T., Churcher C., Harris D., Gordon S.V., Eiglmeier K., Gas S., Barry C.E.III., Tekaia F., Badcock K., Basham D., Brown D., Chillingworth T., Connor R., Davies R., Devlin K., Feltwell T., Gentles S., Hamlin N., Holroyd S., Hornsby T., Jagels K., Krogh A., McLean J., Moule S., Murphy L., Oliver K., Osborne J., Quail M.A., Rajandream M.A., Rogers J., Rutter S., Seeger K., Skelton J., Squares R., Squares S., Sulston J.E., Taylor K., Whitehead S., Barrell B.G. TITLE Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence JOURNAL Nature 393(6685), 537-544(1998). PUBMED 9634230 REMARK Erratum:[Nature 1998 Nov 12;396(6707):190] REFERENCE 2 AUTHORS Camus J.C., Pryor M.J., Medigue C., Cole S.T. TITLE Re-annotation of the genome sequence of Mycobacterium tuberculosis H37Rv JOURNAL Microbiology (Reading, Engl.) 148(Pt 10), 2967-2973(2002). PUBMED 12368430 REFERENCE 3 AUTHORS Lew J.M., Kapopoulou A., Jones L.M., Cole S.T. TITLE TubercuList--10 years after JOURNAL Tuberculosis (Edinb) 91(1), 1-7(2011). PUBMED 20980199 REFERENCE 4 (bases 1 to 4411529) AUTHORS Parkhill J. JOURNAL Submitted (11-JUN-1998) to the INSDC. Submitted on behalf of the Mycobacterium tuberculosis sequencing and mapping teams, Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA Unite de Genetique Moleculaire Bacterienne, Institut Pasteur, 28 rue du Docteur Roux, 75724 Paris Cedex 15, France E-mail: parkhill@sanger.ac.uk REFERENCE 5 (bases 1 to 4411532) AUTHORS Lew J.M. JOURNAL Submitted (18-DEC-2012) to the INSDC. Lew J., Ecole Polytechnique Federale de Lausanne, CH-1015, Lausanne, Switzerland, and the Swiss Institute of Bioinformatics, CMU - Rue Michel-Servet 1, 1211 Geneva 4, SWITZERLAND COMMENT On or before Feb 1, 2013 this sequence version replaced gi:41352722, gi:38490165, gi:38490207, gi:41353619, gi:38490250, gi:38684030, gi:38490288, gi:41353667, gi:41353422, gi:41352756, gi:38490319, gi:41352785, gi:38490370, gi:41353971. Note: This annotation is from the TubercuList website, Release 26, Dec 2012 (URL: http://tuberculist.epfl.ch) (email: tuberculist@epfl.ch). FEATURES Qualifiers source /organism="Mycobacterium tuberculosis H37Rv" /strain="H37Rv" /mol_type="genomic DNA" /db_xref="taxon:83332" protein /transl_table=11 /locus_tag="Rv3630" /note="Rv3630, (MTCY15C10.22c), len: 431 aa. Probable conserved integral membrane, highly similar to P71789|YF10_MYCTU|Rv1510|MTCY277.32 hypothetical 44.3 KDA protein from Mycobacterium tuberculosis (432 aa) FASTA scores: opt: 1940, E(): 2.3e-103, (70.75% identity in 424 aa overlap). Note that N-terminal end is highly similar to AAK45825|MT1558 hypothetical 18.1 KDA protein from Mycobacterium tuberculosis strain CDC1551 (172 aa) FASTA scores: opt: 649, E(): 4.2e-30, (61.65% identity in 167 aa overlap); and C-terminal end is highly similar to AAK45826|MT1560 hypothetical 25.8 KDA protein from Mycobacterium tuberculosis strain CDC1551 (256 aa), FASTA scores: opt: 1269, E(): 2.6e-65, (76.7% identity in 253 aa overlap). Contains PS00639 Eukaryotic thiol (cysteine) proteases histidine active site, so could be a protease." /db_xref="EnsemblGenomes-Gn:Rv3630" /db_xref="EnsemblGenomes-Tr:CCP46453" /db_xref="GOA:P9WKX9" /db_xref="UniProtKB/Swiss-Prot:P9WKX9" /inference="protein motif:PROSITE:PS00639" /experiment="EXISTENCE: identified in proteomics study" BEGIN 1 MAVGAAAVTE VGDTASPVGS SGASGGAIAS GSVARVGTAA AVTALCGYAV IYLAARNLAP 61 NGFSVFGVFW GAFGLVTGAA NGLLQETTRE VRSLGYLDVS ADGRRTHPLR VSGMVGLGSL 121 VVIAGSSPLW SGRVFAEARW LSVALLSIGL AGFCLHATLL GMLAGTNRWT QYGALMVADA 181 VIRVVVAAAT FVIGWQLVGF IWATVAGSVA WLIMLMTSPP TRAAARLMTP GATATFLRGA 241 AHSIIAAGAS AILVMGFPVL LKLTSNELGA QGGVVILAVT LTRAPLLVPL TAMQGNLIAH 301 FVDERTERIR ALIAPAALIG GVGAVGMLAA GVVGPWIMRV AFGSEYQSSS ALLAWLTAAA 361 VAIAMLTLTG AAAVAAALHR AYSLGWVGAT VGSGLLLLLP LSLETRTVVA LLCGPLVGIG 421 VHLVALARTD E //