LOCUS CCP46621.1 643 aa PRT BCT 27-FEB-2015 DEFINITION Mycobacterium tuberculosis H37Rv Arabinofuranosyltransferase AftA protein. ACCESSION AL123456-3899 PROTEIN_ID CCP46621.1 SOURCE Mycobacterium tuberculosis H37Rv ORGANISM Mycobacterium tuberculosis H37Rv Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae; Mycobacterium; Mycobacterium tuberculosis complex. REFERENCE 1 AUTHORS Cole S.T., Brosch R., Parkhill J., Garnier T., Churcher C., Harris D., Gordon S.V., Eiglmeier K., Gas S., Barry C.E.III., Tekaia F., Badcock K., Basham D., Brown D., Chillingworth T., Connor R., Davies R., Devlin K., Feltwell T., Gentles S., Hamlin N., Holroyd S., Hornsby T., Jagels K., Krogh A., McLean J., Moule S., Murphy L., Oliver K., Osborne J., Quail M.A., Rajandream M.A., Rogers J., Rutter S., Seeger K., Skelton J., Squares R., Squares S., Sulston J.E., Taylor K., Whitehead S., Barrell B.G. TITLE Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence JOURNAL Nature 393(6685), 537-544(1998). PUBMED 9634230 REMARK Erratum:[Nature 1998 Nov 12;396(6707):190] REFERENCE 2 AUTHORS Camus J.C., Pryor M.J., Medigue C., Cole S.T. TITLE Re-annotation of the genome sequence of Mycobacterium tuberculosis H37Rv JOURNAL Microbiology (Reading, Engl.) 148(Pt 10), 2967-2973(2002). PUBMED 12368430 REFERENCE 3 AUTHORS Lew J.M., Kapopoulou A., Jones L.M., Cole S.T. TITLE TubercuList--10 years after JOURNAL Tuberculosis (Edinb) 91(1), 1-7(2011). PUBMED 20980199 REFERENCE 4 (bases 1 to 4411529) AUTHORS Parkhill J. JOURNAL Submitted (11-JUN-1998) to the INSDC. Submitted on behalf of the Mycobacterium tuberculosis sequencing and mapping teams, Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA Unite de Genetique Moleculaire Bacterienne, Institut Pasteur, 28 rue du Docteur Roux, 75724 Paris Cedex 15, France E-mail: parkhill@sanger.ac.uk REFERENCE 5 (bases 1 to 4411532) AUTHORS Lew J.M. JOURNAL Submitted (18-DEC-2012) to the INSDC. Lew J., Ecole Polytechnique Federale de Lausanne, CH-1015, Lausanne, Switzerland, and the Swiss Institute of Bioinformatics, CMU - Rue Michel-Servet 1, 1211 Geneva 4, SWITZERLAND COMMENT On or before Feb 1, 2013 this sequence version replaced gi:41352722, gi:38490165, gi:38490207, gi:41353619, gi:38490250, gi:38684030, gi:38490288, gi:41353667, gi:41353422, gi:41352756, gi:38490319, gi:41352785, gi:38490370, gi:41353971. Note: This annotation is from the TubercuList website, Release 26, Dec 2012 (URL: http://tuberculist.epfl.ch) (email: tuberculist@epfl.ch). FEATURES Qualifiers source /organism="Mycobacterium tuberculosis H37Rv" /strain="H37Rv" /mol_type="genomic DNA" /db_xref="taxon:83332" protein /transl_table=11 /gene="aftA" /locus_tag="Rv3792" /note="Rv3792, (MTCY13D12.26), len: 643 aa. aftA,arabinofuranosyltransferase (See Alderwick et al., 2006). Predicted to be in the GT-C superfamily of glycosyltransferases (See Liu and Mushegian, 2003). Probable conserved transmembrane protein, equivalent, but longer 21 aa, to Q9CDA6|ML0107 putative membrane protein from Mycobacterium leprae (632 aa), FASTA scores: opt: 1981, E(): 2.1e-110, (77.5% identity in 631 aa overlap). C-terminal end highly similar to C-terminus of O05765 putative product ORF 3 from Mycobacterium smegmatis (603 aa), FASTA scores: opt: 1261, E(): 1.4e-67, (70.7% identity in 266 aa overlap). A core mycobacterial gene; conserved in mycobacterial strains (See Marmiesse et al., 2004)." /db_xref="EnsemblGenomes-Gn:Rv3792" /db_xref="EnsemblGenomes-Tr:CCP46621" /db_xref="GOA:P9WN03" /db_xref="InterPro:IPR020959" /db_xref="InterPro:IPR020963" /db_xref="UniProtKB/Swiss-Prot:P9WN03" /experiment="EXISTENCE: identified in proteomics study" BEGIN 1 MPSRRKSPQF GHEMGAFTSA RAREVLVALG QLAAAVVVAV GVAVVSLLAI ARVEWPAFPS 61 SNQLHALTTV GQVGCLAGLV GIGWLWRHGR FRRLARLGGL VLVSAFTVVT LGMPLGATKL 121 YLFGISVDQQ FRTEYLTRLT DTAALRDMTY IGLPPFYPPG WFWIGGRAAA LTGTPAWEMF 181 KPWAITSMAI AVAVALVLWW RMIRFEYALL VTVATAAVML AYSSPEPYAA MITVLLPPML 241 VLTWSGLGAR DRQGWAAVVG AGVFLGFAAT WYTLLVAYGA FTVVLMALLL AGSRLQSGIK 301 AAVDPLCRLA VVGAIAAAIG STTWLPYLLR AARDPVSDTG SAQHYLPADG AALTFPMLQF 361 SLLGAICLLG TLWLVMRARS SAPAGALAIG VLAVYLWSLL SMLATLARTT LLSFRLQPTL 421 SVLLVAAGAF GFVEAVQALG KRGRGVIPMA AAIGLAGAIA FSQDIPDVLR PDLTIAYTDT 481 DGYGQRGDRR PPGSEKYYPA IDAAIRRVTG KRRDRTVVLT ADYSFLSYYP YWGFQGLTPH 541 YANPLAQFDK RATQIDSWSG LSTADEFIAA LDKLPWQPPT VFLMRHGAHN SYTLRLAQDV 601 YPNQPNVRRY TVDLRTALFA DPRFVVEDIG PFVLAIRKPQ ESA //