LOCUS CCP45840.1 414 aa PRT BCT 27-FEB-2015 DEFINITION Mycobacterium tuberculosis H37Rv Alpha (1->4) glucosyltransferase protein. ACCESSION AL123456-3118 PROTEIN_ID CCP45840.1 SOURCE Mycobacterium tuberculosis H37Rv ORGANISM Mycobacterium tuberculosis H37Rv Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae; Mycobacterium; Mycobacterium tuberculosis complex. REFERENCE 1 AUTHORS Cole S.T., Brosch R., Parkhill J., Garnier T., Churcher C., Harris D., Gordon S.V., Eiglmeier K., Gas S., Barry C.E.III., Tekaia F., Badcock K., Basham D., Brown D., Chillingworth T., Connor R., Davies R., Devlin K., Feltwell T., Gentles S., Hamlin N., Holroyd S., Hornsby T., Jagels K., Krogh A., McLean J., Moule S., Murphy L., Oliver K., Osborne J., Quail M.A., Rajandream M.A., Rogers J., Rutter S., Seeger K., Skelton J., Squares R., Squares S., Sulston J.E., Taylor K., Whitehead S., Barrell B.G. TITLE Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence JOURNAL Nature 393(6685), 537-544(1998). PUBMED 9634230 REMARK Erratum:[Nature 1998 Nov 12;396(6707):190] REFERENCE 2 AUTHORS Camus J.C., Pryor M.J., Medigue C., Cole S.T. TITLE Re-annotation of the genome sequence of Mycobacterium tuberculosis H37Rv JOURNAL Microbiology (Reading, Engl.) 148(Pt 10), 2967-2973(2002). PUBMED 12368430 REFERENCE 3 AUTHORS Lew J.M., Kapopoulou A., Jones L.M., Cole S.T. TITLE TubercuList--10 years after JOURNAL Tuberculosis (Edinb) 91(1), 1-7(2011). PUBMED 20980199 REFERENCE 4 (bases 1 to 4411529) AUTHORS Parkhill J. JOURNAL Submitted (11-JUN-1998) to the INSDC. Submitted on behalf of the Mycobacterium tuberculosis sequencing and mapping teams, Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA Unite de Genetique Moleculaire Bacterienne, Institut Pasteur, 28 rue du Docteur Roux, 75724 Paris Cedex 15, France E-mail: parkhill@sanger.ac.uk REFERENCE 5 (bases 1 to 4411532) AUTHORS Lew J.M. JOURNAL Submitted (18-DEC-2012) to the INSDC. Lew J., Ecole Polytechnique Federale de Lausanne, CH-1015, Lausanne, Switzerland, and the Swiss Institute of Bioinformatics, CMU - Rue Michel-Servet 1, 1211 Geneva 4, SWITZERLAND COMMENT On or before Feb 1, 2013 this sequence version replaced gi:41352722, gi:38490165, gi:38490207, gi:41353619, gi:38490250, gi:38684030, gi:38490288, gi:41353667, gi:41353422, gi:41352756, gi:38490319, gi:41352785, gi:38490370, gi:41353971. Note: This annotation is from the TubercuList website, Release 26, Dec 2012 (URL: http://tuberculist.epfl.ch) (email: tuberculist@epfl.ch). FEATURES Qualifiers source /organism="Mycobacterium tuberculosis H37Rv" /strain="H37Rv" /mol_type="genomic DNA" /db_xref="taxon:83332" protein /transl_table=11 /locus_tag="Rv3032" /note="Rv3032, (MTV012.47), len: 414 aa. Alpha (1->4) glucosyltransferase (See Stadthagen et al., 2007). Equivalent to Q9CBR3|ML1715 putative transferase from Mycobacterium leprae (438 aa), FASTA scores: opt: 2456,E(): 7.3e-145, (87.9% identity in 414 aa overlap). Also similar to hypothetical proteins and various transferases e.g. P73369|SLL1971 hypothetical 46.2 KDA protein from Synechocystis sp. strain PCC 6803 (404 aa), FASTA scores: opt: 584, E(): 7.3e-29, (34.5% identity in 400 aa overlap); Q9Z5B7|SC2G5.06 putative transferase from Streptomyces coelicolor (406 aa), FASTA scores: opt: 509, E(): 3.3e-24,(35.9% identity in 413 aa overlap); Q9UZA1|PAB0827 galactosyltransferase (LPS biosynthesis RFBU related protein) from Pyrococcus abyssi (371 aa), FASTA scores: opt: 381, E(): 2.6e-16, (26.75% identity in 404 aa overlap); etc." /db_xref="EnsemblGenomes-Gn:Rv3032" /db_xref="EnsemblGenomes-Tr:CCP45840" /db_xref="GOA:P9WMY9" /db_xref="InterPro:IPR001296" /db_xref="InterPro:IPR028098" /db_xref="UniProtKB/Swiss-Prot:P9WMY9" BEGIN 1 MRILMVSWEY PPVVIGGLGR HVHHLSTALA AAGHDVVVLS RCPSGTDPST HPSSDEVTEG 61 VRVIAAAQDP HEFTFGNDMM AWTLAMGHAM IRAGLRLKKL GTDRSWRPDV VHAHDWLVAH 121 PAIALAQFYD VPMVSTIHAT EAGRHSGWVS GALSRQVHAV ESWLVRESDS LITCSASMND 181 EITELFGPGL AEITVIRNGI DAARWPFAAR RPRTGPAELL YVGRLEYEKG VHDAIAALPR 241 LRRTHPGTTL TIAGEGTQQD WLIDQARKHR VLRATRFVGH LDHTELLALL HRADAAVLPS 301 HYEPFGLVAL EAAAAGTPLV TSNIGGLGEA VINGQTGVSC APRDVAGLAA AVRSVLDDPA 361 AAQRRARAAR QRLTSDFDWQ TVATATAQVY LAAKRGERQP QPRLPIVEHA LPDR //