LOCUS CCP43455.1 787 aa PRT BCT 27-FEB-2015 DEFINITION Mycobacterium tuberculosis H37Rv Possible arylsulfatase AtsA (aryl-sulfate sulphohydrolase) (arylsulphatase) protein. ACCESSION AL123456-733 PROTEIN_ID CCP43455.1 SOURCE Mycobacterium tuberculosis H37Rv ORGANISM Mycobacterium tuberculosis H37Rv Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae; Mycobacterium; Mycobacterium tuberculosis complex. REFERENCE 1 AUTHORS Cole S.T., Brosch R., Parkhill J., Garnier T., Churcher C., Harris D., Gordon S.V., Eiglmeier K., Gas S., Barry C.E.III., Tekaia F., Badcock K., Basham D., Brown D., Chillingworth T., Connor R., Davies R., Devlin K., Feltwell T., Gentles S., Hamlin N., Holroyd S., Hornsby T., Jagels K., Krogh A., McLean J., Moule S., Murphy L., Oliver K., Osborne J., Quail M.A., Rajandream M.A., Rogers J., Rutter S., Seeger K., Skelton J., Squares R., Squares S., Sulston J.E., Taylor K., Whitehead S., Barrell B.G. TITLE Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence JOURNAL Nature 393(6685), 537-544(1998). PUBMED 9634230 REMARK Erratum:[Nature 1998 Nov 12;396(6707):190] REFERENCE 2 AUTHORS Camus J.C., Pryor M.J., Medigue C., Cole S.T. TITLE Re-annotation of the genome sequence of Mycobacterium tuberculosis H37Rv JOURNAL Microbiology (Reading, Engl.) 148(Pt 10), 2967-2973(2002). PUBMED 12368430 REFERENCE 3 AUTHORS Lew J.M., Kapopoulou A., Jones L.M., Cole S.T. TITLE TubercuList--10 years after JOURNAL Tuberculosis (Edinb) 91(1), 1-7(2011). PUBMED 20980199 REFERENCE 4 (bases 1 to 4411529) AUTHORS Parkhill J. JOURNAL Submitted (11-JUN-1998) to the INSDC. Submitted on behalf of the Mycobacterium tuberculosis sequencing and mapping teams, Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA Unite de Genetique Moleculaire Bacterienne, Institut Pasteur, 28 rue du Docteur Roux, 75724 Paris Cedex 15, France E-mail: parkhill@sanger.ac.uk REFERENCE 5 (bases 1 to 4411532) AUTHORS Lew J.M. JOURNAL Submitted (18-DEC-2012) to the INSDC. Lew J., Ecole Polytechnique Federale de Lausanne, CH-1015, Lausanne, Switzerland, and the Swiss Institute of Bioinformatics, CMU - Rue Michel-Servet 1, 1211 Geneva 4, SWITZERLAND COMMENT On or before Feb 1, 2013 this sequence version replaced gi:41352722, gi:38490165, gi:38490207, gi:41353619, gi:38490250, gi:38684030, gi:38490288, gi:41353667, gi:41353422, gi:41352756, gi:38490319, gi:41352785, gi:38490370, gi:41353971. Note: This annotation is from the TubercuList website, Release 26, Dec 2012 (URL: http://tuberculist.epfl.ch) (email: tuberculist@epfl.ch). FEATURES Qualifiers source /organism="Mycobacterium tuberculosis H37Rv" /strain="H37Rv" /mol_type="genomic DNA" /db_xref="taxon:83332" protein /transl_table=11 /gene="atsA" /locus_tag="Rv0711" /note="Rv0711, (MTCY210.30), len: 787 aa. Possible atsA,arylsulfatase, similar to others e.g. P51691|ARS_PSEAE arylsulfatase from Pseudomonas aeruginosa (532 aa), FASTA scores: opt: 439, E(): 2.9e-21, (30.8% identity in 552 aa overlap); etc. Also similar to other hypothetical arylsulfatases from Mycobacterium tuberculosis e.g. Rv3299c, Rv0663, etc. Contains PS00523 Sulfatases signature 1, and PS00149 Sulfatases signature 2. Belongs to the sulfatase family." /db_xref="EnsemblGenomes-Gn:Rv0711" /db_xref="EnsemblGenomes-Tr:CCP43455" /db_xref="GOA:P95059" /db_xref="InterPro:IPR000917" /db_xref="InterPro:IPR017850" /db_xref="InterPro:IPR024607" /db_xref="UniProtKB/TrEMBL:P95059" /inference="protein motif:PROSITE:PS00678" /inference="protein motif:PROSITE:PS00523" /inference="protein motif:PROSITE:PS00149" /experiment="EXISTENCE: identified in proteomics study" BEGIN 1 MAPEATEAFN GTIELDIRDS EPDWGPYAAP VAPEHSPNIL YLVWDDVGIA TWDCFGGLVE 61 MPAMTRVAER GVRLSQFHTT ALCSPTRASL LTGRNATTVG MATIEEFTDG FPNCNGRIPA 121 DTALLPEVLA EHGYNTYCVG KWHLTPLEES NMASTKRHWP TSRGFERFYG FLGGETDQWY 181 PDLVYDNHPV SPPGTPEGGY HLSKDIADKT IEFIRDAKVI APDKPWFSYV CPGAGHAPHH 241 VFKEWADRYA GRFDMGYERY REIVLERQKA LGIVPPDTEL SPINPYLDVP GPNGETWPLQ 301 DTVRPWDSLS DEEKKLFCRM AEVFAGFLSY TDAQIGRILD YLEESGQLDN TIIVVISDNG 361 ASGEGGPNGS VNEGKFFNGY IDTVAESMKL FDHLGGPQTY NHYPIGWAMA FNTPYKLFKR 421 YASHEGGIAD PAIISWPNGI AAHGEIRDNY VNVSDITPTV YDLLGMTPPG TVKGIPQKPM 481 DGVSFIAALA DPAADTGKTT QFYTMLGTRG IWHEGWFANT IHAATPAGWS NFNADRWELF 541 HIAADRSQCH DLAAEHPDKL EELKALWFSE AAKYNGLPLA DLNLLETMTR SRPYLVSERA 601 SYVYYPDCAD VGIGAAVEIR GRSFAVLADV TIDTTGAEGV LFKHGGAHGG HVLFVRDGRL 661 HYVYNFLGER QQLVSSSGPV PSGRHLLGVR YLRTGTVPNS HTPVGDLELF FDENLVGALT 721 NVLTHPGTFG LAGAAISVGR NGGSAVSSHY EAPFAFTGGT ITQVTVDVSG RPFEDVESDL 781 ALAFSRD //