LOCUS CCP46388.1 283 aa PRT BCT 27-FEB-2015 DEFINITION Mycobacterium tuberculosis H37Rv Arylamine N-acetyltransferase Nat (arylamine acetylase) protein. ACCESSION AL123456-3666 PROTEIN_ID CCP46388.1 SOURCE Mycobacterium tuberculosis H37Rv ORGANISM Mycobacterium tuberculosis H37Rv Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae; Mycobacterium; Mycobacterium tuberculosis complex. REFERENCE 1 AUTHORS Cole S.T., Brosch R., Parkhill J., Garnier T., Churcher C., Harris D., Gordon S.V., Eiglmeier K., Gas S., Barry C.E.III., Tekaia F., Badcock K., Basham D., Brown D., Chillingworth T., Connor R., Davies R., Devlin K., Feltwell T., Gentles S., Hamlin N., Holroyd S., Hornsby T., Jagels K., Krogh A., McLean J., Moule S., Murphy L., Oliver K., Osborne J., Quail M.A., Rajandream M.A., Rogers J., Rutter S., Seeger K., Skelton J., Squares R., Squares S., Sulston J.E., Taylor K., Whitehead S., Barrell B.G. TITLE Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence JOURNAL Nature 393(6685), 537-544(1998). PUBMED 9634230 REMARK Erratum:[Nature 1998 Nov 12;396(6707):190] REFERENCE 2 AUTHORS Camus J.C., Pryor M.J., Medigue C., Cole S.T. TITLE Re-annotation of the genome sequence of Mycobacterium tuberculosis H37Rv JOURNAL Microbiology (Reading, Engl.) 148(Pt 10), 2967-2973(2002). PUBMED 12368430 REFERENCE 3 AUTHORS Lew J.M., Kapopoulou A., Jones L.M., Cole S.T. TITLE TubercuList--10 years after JOURNAL Tuberculosis (Edinb) 91(1), 1-7(2011). PUBMED 20980199 REFERENCE 4 (bases 1 to 4411529) AUTHORS Parkhill J. JOURNAL Submitted (11-JUN-1998) to the INSDC. Submitted on behalf of the Mycobacterium tuberculosis sequencing and mapping teams, Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA Unite de Genetique Moleculaire Bacterienne, Institut Pasteur, 28 rue du Docteur Roux, 75724 Paris Cedex 15, France E-mail: parkhill@sanger.ac.uk REFERENCE 5 (bases 1 to 4411532) AUTHORS Lew J.M. JOURNAL Submitted (18-DEC-2012) to the INSDC. Lew J., Ecole Polytechnique Federale de Lausanne, CH-1015, Lausanne, Switzerland, and the Swiss Institute of Bioinformatics, CMU - Rue Michel-Servet 1, 1211 Geneva 4, SWITZERLAND COMMENT On or before Feb 1, 2013 this sequence version replaced gi:41352722, gi:38490165, gi:38490207, gi:41353619, gi:38490250, gi:38684030, gi:38490288, gi:41353667, gi:41353422, gi:41352756, gi:38490319, gi:41352785, gi:38490370, gi:41353971. Note: This annotation is from the TubercuList website, Release 26, Dec 2012 (URL: http://tuberculist.epfl.ch) (email: tuberculist@epfl.ch). FEATURES Qualifiers source /organism="Mycobacterium tuberculosis H37Rv" /strain="H37Rv" /mol_type="genomic DNA" /db_xref="taxon:83332" protein /transl_table=11 /gene="nat" /gene_synonym="nhoA" /locus_tag="Rv3566c" /note="Rv3566c, (MT3671, MTCY06G11.13c), len: 283 aa. Nat (alternate gene name: nhoA), arylamine N-acetyltransferase (see citations below), highly similar to O86309|NAT_MYCSM arylamine N-acetyltransferase from Mycobacterium smegmatis (see citation below) (275 aa), FASTA scores: opt: 1114,E(): 3e-66, (60.95% identity in 274 aa overlap). Also highly similar to others e.g. Q98D42|BAB51429|MLR4870 from Rhizobium loti (Mesorhizobium loti) (278 aa), FASTA scores: opt: 697, E(): 1.1e-38, (44.1% identity in 272 aa overlap); P77567|NHOA_ECOLI|B1463 from Escherichia coli strain K12 (281 aa), FASTA scores: opt: 537, E(): 4.4e-28, (38.85% identity in 273 aa overlap); Q00267|NHOA_SALTY from Salmonella typhimurium (281 aa), FASTA scores: opt: 507,E(): 4.3e-26, (34.8% identity in 273 aa overlap); etc. Belongs to the arylamine N-acetyltransferase family. Note that previously known as nhoA (332 aa) and that nucleotide 4007874 has been changed since first submission (G deleted)." /db_xref="EnsemblGenomes-Gn:Rv3566c" /db_xref="EnsemblGenomes-Tr:CCP46388" /db_xref="GOA:P9WJI5" /db_xref="InterPro:IPR001447" /db_xref="InterPro:IPR038765" /db_xref="UniProtKB/Swiss-Prot:P9WJI5" /experiment="EXISTENCE: identified in proteomics study" BEGIN 1 MALDLTAYFD RINYRGATDP TLDVLQDLVT VHSRTIPFEN LDPLLGVPVD DLSPQALADK 61 LVLRRRGGYC FEHNGLMGYV LAELGYRVRR FAARVVWKLA PDAPLPPQTH TLLGVTFPGS 121 GGCYLVDVGF GGQTPTSPLR LETGAVQPTT HEPYRLEDRV DGFVLQAMVR DTWQTLYEFT 181 TQTRPQIDLK VASWYASTHP ASKFVTGLTA AVITDDARWN LSGRDLAVHR AGGTEKIRLA 241 DAAAVVDTLS ERFGINVADI GERGALETRI DELLARQPGA DAP //