LOCUS CCP45265.1 546 aa PRT BCT 27-FEB-2015 DEFINITION Mycobacterium tuberculosis H37Rv Probable alpha-glucosidase AglA (maltase) (glucoinvertase) (glucosidosucrase) (maltase- glucoamylase) (lysosomal alpha-glucosidase) (acid maltase) protein. ACCESSION AL123456-2543 PROTEIN_ID CCP45265.1 SOURCE Mycobacterium tuberculosis H37Rv ORGANISM Mycobacterium tuberculosis H37Rv Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae; Mycobacterium; Mycobacterium tuberculosis complex. REFERENCE 1 AUTHORS Cole S.T., Brosch R., Parkhill J., Garnier T., Churcher C., Harris D., Gordon S.V., Eiglmeier K., Gas S., Barry C.E.III., Tekaia F., Badcock K., Basham D., Brown D., Chillingworth T., Connor R., Davies R., Devlin K., Feltwell T., Gentles S., Hamlin N., Holroyd S., Hornsby T., Jagels K., Krogh A., McLean J., Moule S., Murphy L., Oliver K., Osborne J., Quail M.A., Rajandream M.A., Rogers J., Rutter S., Seeger K., Skelton J., Squares R., Squares S., Sulston J.E., Taylor K., Whitehead S., Barrell B.G. TITLE Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence JOURNAL Nature 393(6685), 537-544(1998). PUBMED 9634230 REMARK Erratum:[Nature 1998 Nov 12;396(6707):190] REFERENCE 2 AUTHORS Camus J.C., Pryor M.J., Medigue C., Cole S.T. TITLE Re-annotation of the genome sequence of Mycobacterium tuberculosis H37Rv JOURNAL Microbiology (Reading, Engl.) 148(Pt 10), 2967-2973(2002). PUBMED 12368430 REFERENCE 3 AUTHORS Lew J.M., Kapopoulou A., Jones L.M., Cole S.T. TITLE TubercuList--10 years after JOURNAL Tuberculosis (Edinb) 91(1), 1-7(2011). PUBMED 20980199 REFERENCE 4 (bases 1 to 4411529) AUTHORS Parkhill J. JOURNAL Submitted (11-JUN-1998) to the INSDC. Submitted on behalf of the Mycobacterium tuberculosis sequencing and mapping teams, Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA Unite de Genetique Moleculaire Bacterienne, Institut Pasteur, 28 rue du Docteur Roux, 75724 Paris Cedex 15, France E-mail: parkhill@sanger.ac.uk REFERENCE 5 (bases 1 to 4411532) AUTHORS Lew J.M. JOURNAL Submitted (18-DEC-2012) to the INSDC. Lew J., Ecole Polytechnique Federale de Lausanne, CH-1015, Lausanne, Switzerland, and the Swiss Institute of Bioinformatics, CMU - Rue Michel-Servet 1, 1211 Geneva 4, SWITZERLAND COMMENT On or before Feb 1, 2013 this sequence version replaced gi:41352722, gi:38490165, gi:38490207, gi:41353619, gi:38490250, gi:38684030, gi:38490288, gi:41353667, gi:41353422, gi:41352756, gi:38490319, gi:41352785, gi:38490370, gi:41353971. Note: This annotation is from the TubercuList website, Release 26, Dec 2012 (URL: http://tuberculist.epfl.ch) (email: tuberculist@epfl.ch). FEATURES Qualifiers source /organism="Mycobacterium tuberculosis H37Rv" /strain="H37Rv" /mol_type="genomic DNA" /db_xref="taxon:83332" protein /transl_table=11 /gene="aglA" /locus_tag="Rv2471" /note="Rv2471, (MTV008.27), len: 546 aa. Probable aglA,maltase (alpha-glucosidase), highly similar or similar to several e.g. Q60027|AGLA from Thermomonospora curvata (544 aa), FASTA scores: opt: 2071, E(): 4e-116, (57.7% identity in 525 aa overlap); Q9KZE3|AGLAE from Streptomyces coelicolor (534 aa), FASTA scores: opt: 1475, E(): 1.5e-80,(50.1% identity in 537 aa overlap); O86874|AGLA from Streptomyces lividans (534 aa), FASTA scores: opt: 1473,E(): 2e-80, (50.1% identity in 537 aa overlap); etc. Seems to belong to family 13 of glycosyl hydrolases, also known as the alpha-amylase family." /db_xref="EnsemblGenomes-Gn:Rv2471" /db_xref="EnsemblGenomes-Tr:CCP45265" /db_xref="GOA:O53198" /db_xref="InterPro:IPR006047" /db_xref="InterPro:IPR017853" /db_xref="UniProtKB/TrEMBL:O53198" /experiment="EXISTENCE: identified in proteomics study" BEGIN 1 MDQHQRPDPM GPGSPRASAR RPEPDPMGEP WWSRAVFYQV YPRSFADSNG DGVGDLDGLA 61 SRLDHLQQLG VDAIWINPVT VSPMADHGYD VADPRDIDPL FGGMPAFERL VAAAHRQGIK 121 VTMDVVPNHT SSAHPWFQAA LADLPGSPAR DRYFFRDGRG PDGSLPPNNW ESVFGGPAWT 181 RVREPDGNPG QWYLHLFDTE QPDLNWDNPE ILDDFEKTLR FWLDRGVDGF RIDVAHGMAK 241 PPGLPDSPDL GIEVLHHRDD DPRFNHPNVH AIHRDIRTVI DEYPGAVTVG EVWVHDNARW 301 AEYLRPDELH LGFNFRLART EFDAAEIRDA VANSLAAAAL QNATPTWTLA NHDVGREVSR 361 YGGGEIGLRR AKAMAVVMLA LPGVVFLYNG QELGLPDVDL PDEVLQDPTW ERSGRTERGR 421 DGCRVPIPWS GNIPPFGFST CPDTWLPMPP EWAALTAEKQ RADAGSTLSF FRLALRLRRE 481 RNEFDGDVDW LAAPDDALIF RRHGGGLVCA LNAAERPLAL PAGEPILASA PLTDATLPPN 541 AAAWLV //