LOCUS CCP45432.1 778 aa PRT BCT 27-FEB-2015 DEFINITION Mycobacterium tuberculosis H37Rv PE-PGRS family protein PE_PGRS46 protein. ACCESSION AL123456-2710 PROTEIN_ID CCP45432.1 SOURCE Mycobacterium tuberculosis H37Rv ORGANISM Mycobacterium tuberculosis H37Rv Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae; Mycobacterium; Mycobacterium tuberculosis complex. REFERENCE 1 AUTHORS Cole S.T., Brosch R., Parkhill J., Garnier T., Churcher C., Harris D., Gordon S.V., Eiglmeier K., Gas S., Barry C.E.III., Tekaia F., Badcock K., Basham D., Brown D., Chillingworth T., Connor R., Davies R., Devlin K., Feltwell T., Gentles S., Hamlin N., Holroyd S., Hornsby T., Jagels K., Krogh A., McLean J., Moule S., Murphy L., Oliver K., Osborne J., Quail M.A., Rajandream M.A., Rogers J., Rutter S., Seeger K., Skelton J., Squares R., Squares S., Sulston J.E., Taylor K., Whitehead S., Barrell B.G. TITLE Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence JOURNAL Nature 393(6685), 537-544(1998). PUBMED 9634230 REMARK Erratum:[Nature 1998 Nov 12;396(6707):190] REFERENCE 2 AUTHORS Camus J.C., Pryor M.J., Medigue C., Cole S.T. TITLE Re-annotation of the genome sequence of Mycobacterium tuberculosis H37Rv JOURNAL Microbiology (Reading, Engl.) 148(Pt 10), 2967-2973(2002). PUBMED 12368430 REFERENCE 3 AUTHORS Lew J.M., Kapopoulou A., Jones L.M., Cole S.T. TITLE TubercuList--10 years after JOURNAL Tuberculosis (Edinb) 91(1), 1-7(2011). PUBMED 20980199 REFERENCE 4 (bases 1 to 4411529) AUTHORS Parkhill J. JOURNAL Submitted (11-JUN-1998) to the INSDC. Submitted on behalf of the Mycobacterium tuberculosis sequencing and mapping teams, Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA Unite de Genetique Moleculaire Bacterienne, Institut Pasteur, 28 rue du Docteur Roux, 75724 Paris Cedex 15, France E-mail: parkhill@sanger.ac.uk REFERENCE 5 (bases 1 to 4411532) AUTHORS Lew J.M. JOURNAL Submitted (18-DEC-2012) to the INSDC. 
Lew J., Ecole Polytechnique Federale de Lausanne, CH-1015, Lausanne, Switzerland, and the Swiss Institute of Bioinformatics, CMU - Rue Michel-Servet 1, 1211 Geneva 4, SWITZERLAND COMMENT On or before Feb 1, 2013 this sequence version replaced gi:41352722, gi:38490165, gi:38490207, gi:41353619, gi:38490250, gi:38684030, gi:38490288, gi:41353667, gi:41353422, gi:41352756, gi:38490319, gi:41352785, gi:38490370, gi:41353971. Note: This annotation is from the TubercuList website, Release 26, Dec 2012 (URL: http://tuberculist.epfl.ch) (email: tuberculist@epfl.ch). FEATURES Qualifiers source /organism="Mycobacterium tuberculosis H37Rv" /strain="H37Rv" /mol_type="genomic DNA" /db_xref="taxon:83332" protein /transl_table=11 /gene="PE_PGRS46" /locus_tag="Rv2634c" /note="Rv2634c, (MTCY441.04c), len: 778 aa. PE_PGRS46, Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below), highly similar to many, e.g. O53553|YZ08_MYCTU|Rv3508|MTV023.15 from Mycobacterium tuberculosis (1901 aa), FASTA scores: opt: 2553, E(): 2.2e-93, (53.8% identity in 866 aa overlap). Equivalent to AAK47026 from Mycobacterium tuberculosis strain CDC1551 (788 aa) but 10 aa shorter." 
/db_xref="EnsemblGenomes-Gn:Rv2634c" /db_xref="EnsemblGenomes-Tr:CCP45432" /db_xref="InterPro:IPR000084" /db_xref="UniProtKB/Swiss-Prot:P9WIE7" /experiment="EXISTENCE: identified in proteomics study" BEGIN 1 MSFVIAVPEA LTMAASDLAN IGSTINAANA AAALPTTGVV AAAADEVSAA VAALFGSYAQ 61 SYQAFGAQLS AFHAQFVQSL TNGARSYVVA EATSAAPLQD LLGVVNAPAQ ALLGRPLIGN 121 GANGADGTGA PGGPGGLLLG NGGNGGSGAP GQPGGAGGDA GLIGNGGTGG KGGDGLVGSG 181 AAGGVGGRGG WLLGNGGTGG AGGAAGATLV GGTGGVGGAT GLIGSGGFGG AGGAAAGVGT 241 TGGVGGSGGV GGVFGNGGFG GAGGLGAAGG VGGAASYFGT GGGGGVGGDG APGGDGGAGP 301 LLIGNGGVGG LGGAGAAGGN GGAGGMLLGD GGAGGQGGPA VAGVLGGMPG AGGNGGNANW 361 FGSGGAGGQG GTGLAGTNGV NPGSIANPNT GANGTDNSGN GNQTGGNGGP GPAGGVGEAG 421 GVGGQGGLGE SLDGNDGTGG KGGAGGTAGT DGGAGGAGGA GGIGETDGSA GGVATGGEGG 481 DGATGGVDGG VGGAGGKGGQ GHNTGVGDAF GGDGGIGGDG NGALGAAGGN GGTGGAGGNG 541 GRGGMLIGNG GAGGAGGTGG TGGGGAAGFA GGVGGAGGEG LTDGAGTAEG GTGGLGGLGG 601 VGGTGGMGGS GGVGGNGGAA GSLIGLGGGG GAGGVGGTGG IGGIGGAGGN GGAGGAGTTT 661 GGGATIGGGG GTGGVGGAGG TGGTGGAGGT TGGSGGAGGL IGWAGAAGGT GAGGTGGQGG 721 LGGQGGNGGN GGTGATGGQG GDFALGGNGG AGGAGGSPGG SSGIQGNMGP PGTQGADG //