LOCUS CCP46713.1 619 aa PRT BCT 27-FEB-2015 DEFINITION Mycobacterium tuberculosis H37Rv ESX conserved component EccA2. ESX-2 type VII secretion system protein. Probable CbxX/CfqX family protein. protein. ACCESSION AL123456-3991 PROTEIN_ID CCP46713.1 SOURCE Mycobacterium tuberculosis H37Rv ORGANISM Mycobacterium tuberculosis H37Rv Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae; Mycobacterium; Mycobacterium tuberculosis complex. REFERENCE 1 AUTHORS Cole S.T., Brosch R., Parkhill J., Garnier T., Churcher C., Harris D., Gordon S.V., Eiglmeier K., Gas S., Barry C.E.III., Tekaia F., Badcock K., Basham D., Brown D., Chillingworth T., Connor R., Davies R., Devlin K., Feltwell T., Gentles S., Hamlin N., Holroyd S., Hornsby T., Jagels K., Krogh A., McLean J., Moule S., Murphy L., Oliver K., Osborne J., Quail M.A., Rajandream M.A., Rogers J., Rutter S., Seeger K., Skelton J., Squares R., Squares S., Sulston J.E., Taylor K., Whitehead S., Barrell B.G. TITLE Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence JOURNAL Nature 393(6685), 537-544(1998). PUBMED 9634230 REMARK Erratum:[Nature 1998 Nov 12;396(6707):190] REFERENCE 2 AUTHORS Camus J.C., Pryor M.J., Medigue C., Cole S.T. TITLE Re-annotation of the genome sequence of Mycobacterium tuberculosis H37Rv JOURNAL Microbiology (Reading, Engl.) 148(Pt 10), 2967-2973(2002). PUBMED 12368430 REFERENCE 3 AUTHORS Lew J.M., Kapopoulou A., Jones L.M., Cole S.T. TITLE TubercuList--10 years after JOURNAL Tuberculosis (Edinb) 91(1), 1-7(2011). PUBMED 20980199 REFERENCE 4 (bases 1 to 4411529) AUTHORS Parkhill J. JOURNAL Submitted (11-JUN-1998) to the INSDC. Submitted on behalf of the Mycobacterium tuberculosis sequencing and mapping teams, Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA Unite de Genetique Moleculaire Bacterienne, Institut Pasteur, 28 rue du Docteur Roux, 75724 Paris Cedex 15, France E-mail: parkhill@sanger.ac.uk REFERENCE 5 (bases 1 to 4411532) AUTHORS Lew J.M. JOURNAL Submitted (18-DEC-2012) to the INSDC. Lew J., Ecole Polytechnique Federale de Lausanne, CH-1015, Lausanne, Switzerland, and the Swiss Institute of Bioinformatics, CMU - Rue Michel-Servet 1, 1211 Geneva 4, SWITZERLAND COMMENT On or before Feb 1, 2013 this sequence version replaced gi:41352722, gi:38490165, gi:38490207, gi:41353619, gi:38490250, gi:38684030, gi:38490288, gi:41353667, gi:41353422, gi:41352756, gi:38490319, gi:41352785, gi:38490370, gi:41353971. Note: This annotation is from the TubercuList website, Release 26, Dec 2012 (URL: http://tuberculist.epfl.ch) (email: tuberculist@epfl.ch). FEATURES Qualifiers source /organism="Mycobacterium tuberculosis H37Rv" /strain="H37Rv" /mol_type="genomic DNA" /db_xref="taxon:83332" protein /transl_table=11 /gene="eccA2" /locus_tag="Rv3884c" /note="Rv3884c, (MTCY15F10.28), len: 619 aa. eccA2, esx conserved component, ESX-2 type VII secretion system protein. Probable CbxX/CfqX protein family, similar to hypothetical proteins from Mycobacterium leprae e.g. Q9CD28|Y282_MYCLE|ML2537 (640 aa), FASTA scores: opt: 725,E(): 2.9e-34, (28.95% identity in 587 aa overlap); O33089|Y2G8_MYCLE|ML0055|MLCB628.18c (belongs to the CbxX/CfqX family) (573 aa); Q9CBV5|ML1536 (610 aa) FASTA scores: opt: 648, E(): 7.4e-30, (31.5% identity in 549 aa overlap). Also similar to proteins belonging to the CbxX/CfqX family e.g. Q9RKZ2|SC6D7.05c putative CbxX/CfqX family protein from Streptomyces coelicolor (618 aa) FASTA scores: opt: 557, E(): 1.3e-24, (28.6% identity in 601 aa overlap); P27643|SP5K_BACSU|SPOVK|SPOVJ stage V sporulation protein K from Bacillus subtilis (322 aa) FASTA scores: opt: 485, E(): 1.1e-20, (35.0% identity in 280 aa overlap) (similarity only at C-terminus); Q9KAC6|BH2363 stage V sporulation protein K from Bacillus halodurans (315 aa),FASTA scores: opt: 462, E(): 2.2e-19, (36.05% identity in 244 aa overlap) (similarity only at C-terminus); etc. And similar to hypothetical proteins from Mycobacterium tuberculosis belonging to the CbxX/CfqX family e.g. O53687|Y282_MYCTU|Rv0282|MT0295|MTV035.10 hypothetical 68.1 KDA protein (631 aa), FASTA scores: opt: 743, E(): 2.6e-35,(29.9% identity in 612 aa overlap); O69733|Y2G8_MYCTU|Rv3868|MT3981|MTV027.03 hypothetical 62.4 KDA protein (573 aa), FASTA scores: opt: 678, E(): 1.3e-31,(31.25% identity in 589 aa overlap); O53947|YH98_MYCTU|Rv1798|MT1847|MTV049.20 (610 aa) FASTA scores: opt: 669, E(): 4.6e-31, (30.95% identity in 549 aa overlap); etc. Contains PS00017 ATP/GTP-binding site motif A (P-loop). Seems to belong to the CbxX/CfqX family." /db_xref="EnsemblGenomes-Gn:Rv3884c" /db_xref="EnsemblGenomes-Tr:CCP46713" /db_xref="GOA:P9WPH7" /db_xref="InterPro:IPR000641" /db_xref="InterPro:IPR003593" /db_xref="InterPro:IPR003959" /db_xref="InterPro:IPR011990" /db_xref="InterPro:IPR023835" /db_xref="InterPro:IPR027417" /db_xref="InterPro:IPR041627" /db_xref="UniProtKB/Swiss-Prot:P9WPH7" /inference="protein motif:PROSITE:PS00017" BEGIN 1 MSRMVDTMGD LLTARRHFDR AMTIKNGQGC VAALPEFVAA TEADPSMADA WLGRIACGDR 61 DLASLKQLNA HSEWLHRETT RIGRTLAAEV QLGPSIGITV TDASQVGLAL SSALTIAGEY 121 AKADALLANR ELLDSWRNYQ WHQLARAFLM YVTQRWPDVL STAAEDLPPQ AIVMPAVTAS 181 ICALAAHAAA HLGQGRVALD WLDRVDVIGH SRSSERFGAD VLTAAIGPAD IPLLVADLAY 241 VRGMVYRQLH EEDKAQIWLS KATINGVLTD AAKEALADPN LRLIVTDERT IASRSDRWDA 301 STAKSRDQLD DDNAAQRRGE LLAEGRELLA KQVGLAAVKQ AVSALEDQLE VRMMRLEHGL 361 PVEGQTNHML LVGPPGTGKT TTAEALGKIY AGMGIVRHPE IREVRRSDFC GHYIGESGPK 421 TNELIEKSLG RIIFMDEFYS LIERHQDGTP DMIGMEAVNQ LLVQLETHRF DFCFIGAGYE 481 DQVDEFLTVN PGLAGRFNRK LRFESYSPVE IVEIGHRYAT PRASQLDDAA REVFLDAVTT 541 IRNYTTPSGQ HGIDAMQNGR FARNVIERAE GFRDTRVVAQ KRAGQPVSVQ DLQIITATDI 601 DAAIRSVCSD NRDMAAIVW //