LOCUS CCP46321.1 400 aa PRT BCT 27-FEB-2015 DEFINITION Mycobacterium tuberculosis H37Rv Mce-family protein Mce4A protein. ACCESSION AL123456-3599 PROTEIN_ID CCP46321.1 SOURCE Mycobacterium tuberculosis H37Rv ORGANISM Mycobacterium tuberculosis H37Rv Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae; Mycobacterium; Mycobacterium tuberculosis complex. REFERENCE 1 AUTHORS Cole S.T., Brosch R., Parkhill J., Garnier T., Churcher C., Harris D., Gordon S.V., Eiglmeier K., Gas S., Barry C.E.III., Tekaia F., Badcock K., Basham D., Brown D., Chillingworth T., Connor R., Davies R., Devlin K., Feltwell T., Gentles S., Hamlin N., Holroyd S., Hornsby T., Jagels K., Krogh A., McLean J., Moule S., Murphy L., Oliver K., Osborne J., Quail M.A., Rajandream M.A., Rogers J., Rutter S., Seeger K., Skelton J., Squares R., Squares S., Sulston J.E., Taylor K., Whitehead S., Barrell B.G. TITLE Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence JOURNAL Nature 393(6685), 537-544(1998). PUBMED 9634230 REMARK Erratum:[Nature 1998 Nov 12;396(6707):190] REFERENCE 2 AUTHORS Camus J.C., Pryor M.J., Medigue C., Cole S.T. TITLE Re-annotation of the genome sequence of Mycobacterium tuberculosis H37Rv JOURNAL Microbiology (Reading, Engl.) 148(Pt 10), 2967-2973(2002). PUBMED 12368430 REFERENCE 3 AUTHORS Lew J.M., Kapopoulou A., Jones L.M., Cole S.T. TITLE TubercuList--10 years after JOURNAL Tuberculosis (Edinb) 91(1), 1-7(2011). PUBMED 20980199 REFERENCE 4 (bases 1 to 4411529) AUTHORS Parkhill J. JOURNAL Submitted (11-JUN-1998) to the INSDC. Submitted on behalf of the Mycobacterium tuberculosis sequencing and mapping teams, Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA Unite de Genetique Moleculaire Bacterienne, Institut Pasteur, 28 rue du Docteur Roux, 75724 Paris Cedex 15, France E-mail: parkhill@sanger.ac.uk REFERENCE 5 (bases 1 to 4411532) AUTHORS Lew J.M. JOURNAL Submitted (18-DEC-2012) to the INSDC. Lew J., Ecole Polytechnique Federale de Lausanne, CH-1015, Lausanne, Switzerland, and the Swiss Institute of Bioinformatics, CMU - Rue Michel-Servet 1, 1211 Geneva 4, SWITZERLAND COMMENT On or before Feb 1, 2013 this sequence version replaced gi:41352722, gi:38490165, gi:38490207, gi:41353619, gi:38490250, gi:38684030, gi:38490288, gi:41353667, gi:41353422, gi:41352756, gi:38490319, gi:41352785, gi:38490370, gi:41353971. Note: This annotation is from the TubercuList website, Release 26, Dec 2012 (URL: http://tuberculist.epfl.ch) (email: tuberculist@epfl.ch). FEATURES Qualifiers source /organism="Mycobacterium tuberculosis H37Rv" /strain="H37Rv" /mol_type="genomic DNA" /db_xref="taxon:83332" protein /transl_table=11 /gene="mce4A" /gene_synonym="mce4" /locus_tag="Rv3499c" /note="Rv3499c, (MTV023.06c), len: 400 aa. Mce4A; belongs to 24-membered Mycobacterium tuberculosis Mce protein family (see citations below), highly similar to Mycobacterium tuberculosis proteins P72013|MCE1|Rv0169|MTCI28.09|mce1A (454 aa); O07789|MCE2|Rv0589|MTCY19H5.33c|mce2A (404 aa); and O53967|MCE3|Rv1966|MTV051.04|mce3A (425 aa). Also similar to others e.g. Q9F356|SC8A2.07c putative secreted protein from Streptomyces coelicolor (418 aa), FASTA scores: opt: 619, E(): 7.8e-30, (32.4% identity in 352 aa overlap); Q9S4U5|MCE1 mycobacterial cell entry protein from Mycobacterium bovis BCG (454 aa), FASTA scores: opt: 529,E(): 2.1e-24, (30.35% identity in 448 aa overlap); Q9CD14|MCE1A|ML2589 from Mycobacterium leprae (441 aa),FASTA scores: opt: 515, E(): 1.4e-23, (28.35% identity in 430 aa overlap); etc. Contains a possible N-terminal signal sequence. Note that previously known as mce4. Predicted to be an outer membrane protein (See Song et al., 2008)." /db_xref="EnsemblGenomes-Gn:Rv3499c" /db_xref="EnsemblGenomes-Tr:CCP46321" /db_xref="GOA:I6YC99" /db_xref="InterPro:IPR003399" /db_xref="InterPro:IPR005693" /db_xref="InterPro:IPR024516" /db_xref="UniProtKB/TrEMBL:I6YC99" /experiment="EXISTENCE: identified in proteomics study" BEGIN 1 MSGGGSRRTS VRVAAALLAG LMVGSAVLTY LSYTAAFTST DTVTVSSPRA GLVMEKGAKV 61 KYRGIQVGKV TDISYSGNQA RLKLAIDSGE MGFIPSNATV RIAGNTIFGA KSVEFIPPKT 121 PSPKPLSPNA HVAASQVQLE VNTLFQSLID LLHKIDPLET NATLSALSEG LRGHGDDLGA 181 LLSGLNTLTR QANPKLPALQ EDFRKAAVVA NVYADAAGDL NTVFDNLPTI NKTIVDQKDN 241 LNDTLLATIG LSNNAYETLA PAEQNFIDAI NRLRAPLKVT SDYSPVFGCL FKGIARGVKE 301 FAPLIGVRKA GLFTSSSFVL GAPSYTYPES LPIVNASGGP NCRGLPDIPT KQTGGSFYRA 361 PFLVTDNALI PYQPFTELQV DAPSTLQFLF NGAFAERDDF //