LOCUS CCP43008.1 957 aa PRT BCT 27-FEB-2015 DEFINITION Mycobacterium tuberculosis H37Rv PE-PGRS family protein PE_PGRS3 protein. ACCESSION AL123456-286 PROTEIN_ID CCP43008.1 SOURCE Mycobacterium tuberculosis H37Rv ORGANISM Mycobacterium tuberculosis H37Rv Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae; Mycobacterium; Mycobacterium tuberculosis complex. REFERENCE 1 AUTHORS Cole S.T., Brosch R., Parkhill J., Garnier T., Churcher C., Harris D., Gordon S.V., Eiglmeier K., Gas S., Barry C.E.III., Tekaia F., Badcock K., Basham D., Brown D., Chillingworth T., Connor R., Davies R., Devlin K., Feltwell T., Gentles S., Hamlin N., Holroyd S., Hornsby T., Jagels K., Krogh A., McLean J., Moule S., Murphy L., Oliver K., Osborne J., Quail M.A., Rajandream M.A., Rogers J., Rutter S., Seeger K., Skelton J., Squares R., Squares S., Sulston J.E., Taylor K., Whitehead S., Barrell B.G. TITLE Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence JOURNAL Nature 393(6685), 537-544(1998). PUBMED 9634230 REMARK Erratum:[Nature 1998 Nov 12;396(6707):190] REFERENCE 2 AUTHORS Camus J.C., Pryor M.J., Medigue C., Cole S.T. TITLE Re-annotation of the genome sequence of Mycobacterium tuberculosis H37Rv JOURNAL Microbiology (Reading, Engl.) 148(Pt 10), 2967-2973(2002). PUBMED 12368430 REFERENCE 3 AUTHORS Lew J.M., Kapopoulou A., Jones L.M., Cole S.T. TITLE TubercuList--10 years after JOURNAL Tuberculosis (Edinb) 91(1), 1-7(2011). PUBMED 20980199 REFERENCE 4 (bases 1 to 4411529) AUTHORS Parkhill J. JOURNAL Submitted (11-JUN-1998) to the INSDC. Submitted on behalf of the Mycobacterium tuberculosis sequencing and mapping teams, Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA Unite de Genetique Moleculaire Bacterienne, Institut Pasteur, 28 rue du Docteur Roux, 75724 Paris Cedex 15, France E-mail: parkhill@sanger.ac.uk REFERENCE 5 (bases 1 to 4411532) AUTHORS Lew J.M. JOURNAL Submitted (18-DEC-2012) to the INSDC. Lew J., Ecole Polytechnique Federale de Lausanne, CH-1015, Lausanne, Switzerland, and the Swiss Institute of Bioinformatics, CMU - Rue Michel-Servet 1, 1211 Geneva 4, SWITZERLAND COMMENT On or before Feb 1, 2013 this sequence version replaced gi:41352722, gi:38490165, gi:38490207, gi:41353619, gi:38490250, gi:38684030, gi:38490288, gi:41353667, gi:41353422, gi:41352756, gi:38490319, gi:41352785, gi:38490370, gi:41353971. Note: This annotation is from the TubercuList website, Release 26, Dec 2012 (URL: http://tuberculist.epfl.ch) (email: tuberculist@epfl.ch). FEATURES Qualifiers source /organism="Mycobacterium tuberculosis H37Rv" /strain="H37Rv" /mol_type="genomic DNA" /db_xref="taxon:83332" protein /transl_table=11 /gene="PE_PGRS3" /locus_tag="Rv0278c" /note="Rv0278c, (MTV035.06c), len: 957 aa. PE_PGRS3, Member of the Mycobacterium tuberculosis PE family (see citation below), PGRS subfamily of gly-rich proteins, similar to many e.g. Z95890|MTCY28_25|Rv1759c from Mycobacterium tuberculosis (914 aa), FASTA scores: opt: 3849, E(): 0,(67.8% identity in 903 aa overlap). Contains PS00583 pfkB family of carbohydrate kinases signature 1." /db_xref="EnsemblGenomes-Gn:Rv0278c" /db_xref="EnsemblGenomes-Tr:CCP43008" /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/Swiss-Prot:P9WIG3" /inference="protein motif:PROSITE:PS00583" /experiment="EXISTENCE: identified in proteomics study" BEGIN 1 MSFVIAAPEV IAAAATDLAS LGSSISAANA AAAANTTALM AAGADEVSTA IAALFGAHGQ 61 AYQALSAQAQ AFHAQFVQAL TSGGGAYAAA EAAAVSPLLD PINEFFLANT GRPLIGNGAN 121 GAPGTGANGG DGGWLIGNGG AGGSGAAGVN GGAGGNGGAG GNGGAGGLIG NGGAGGAGGV 181 ASSGIGGSGG AGGNAMLFGA GGAGGAGGGV VALTGGAGGA GGAGGNAGLL FGAAGVGGAG 241 GFTNGSALGG AGGAGGAGGL FATGGVGGSG GAGSSGGAGG AGGAGGLFGA GGTGGHGGFA 301 DSSFGGVGGA GGAGGLFGAG GEGGSGGHSL VAGGDGGAGG NAGMLALGAA GGAGGIGGDG 361 GTLTAGGIGG AGGAGGNAGL LFGSGGSGGA GGFGFADGGQ GGPGGNAGTV FGSGGAGGNG 421 GVGQGFAGGI GGAGGTPGLI GNGGNGGNGG ASAVTGGNGG IGGTGVLIGN GGNGGSGGIG 481 AGKAGVGGVS GLLLGLDGFN APASTSPLHT LQQNVLNVVN EPFQTLTGRP LIGNGANGTP 541 GTGADGGAGG WLFGNGANGT PGTGAAGGAG GWLFGNGGNG GHGATNTAAT ATGGAGGAGG 601 ILFGTGGNGG TGGIATGAGG IGGAGGAGGV SLLIGSGGTG GNGGNSIGVA GIGGAGGRGG 661 DAGLLFGAAG TGGHGAAGGV PAGVGGAGGN GGLFANGGAG GAGGFNAAGG NGGNGGLFGT 721 GGTGGAGTNF GAGGNGGNGG LFGAGGTGGA AGSGGSGITT GGGGHGGNAG LLSLGASGGA 781 GGSGGASSLA GGAGGTGGNG ALLFGFRGAG GAGGHGGAAL TSIQQGGAGG AGGNGGLLFG 841 SAGAGGAGGS GANALGAGTG GTGGDGGHAG VFGNGGDGGC RRVWRRYRRQ RWCRRQRRAD 901 RQRRQRRQRR QSRGHARCRR HRRAAARRER TQRLAIAGRP ATTRGVEGIS CSPQMMP //