LOCUS CCP43844.1 853 aa PRT BCT 27-FEB-2015
DEFINITION Mycobacterium tuberculosis H37Rv PE-PGRS family protein PE_PGRS22 protein.
ACCESSION AL123456-1122
PROTEIN_ID CCP43844.1
SOURCE Mycobacterium tuberculosis H37Rv
  ORGANISM Mycobacterium tuberculosis H37Rv
    Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae; Mycobacterium; Mycobacterium tuberculosis complex.
REFERENCE 1
  AUTHORS Cole S.T., Brosch R., Parkhill J., Garnier T., Churcher C., Harris D., Gordon S.V., Eiglmeier K., Gas S., Barry C.E.III., Tekaia F., Badcock K., Basham D., Brown D., Chillingworth T., Connor R., Davies R., Devlin K., Feltwell T., Gentles S., Hamlin N., Holroyd S., Hornsby T., Jagels K., Krogh A., McLean J., Moule S., Murphy L., Oliver K., Osborne J., Quail M.A., Rajandream M.A., Rogers J., Rutter S., Seeger K., Skelton J., Squares R., Squares S., Sulston J.E., Taylor K., Whitehead S., Barrell B.G.
  TITLE Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence
  JOURNAL Nature 393(6685), 537-544(1998).
  PUBMED 9634230
  REMARK Erratum:[Nature 1998 Nov 12;396(6707):190]
REFERENCE 2
  AUTHORS Camus J.C., Pryor M.J., Medigue C., Cole S.T.
  TITLE Re-annotation of the genome sequence of Mycobacterium tuberculosis H37Rv
  JOURNAL Microbiology (Reading, Engl.) 148(Pt 10), 2967-2973(2002).
  PUBMED 12368430
REFERENCE 3
  AUTHORS Lew J.M., Kapopoulou A., Jones L.M., Cole S.T.
  TITLE TubercuList--10 years after
  JOURNAL Tuberculosis (Edinb) 91(1), 1-7(2011).
  PUBMED 20980199
REFERENCE 4 (bases 1 to 4411529)
  AUTHORS Parkhill J.
  JOURNAL Submitted (11-JUN-1998) to the INSDC.
    Submitted on behalf of the Mycobacterium tuberculosis sequencing and mapping teams, Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA Unite de Genetique Moleculaire Bacterienne, Institut Pasteur, 28 rue du Docteur Roux, 75724 Paris Cedex 15, France
    E-mail: parkhill@sanger.ac.uk
REFERENCE 5 (bases 1 to 4411532)
  AUTHORS Lew J.M.
  JOURNAL Submitted (18-DEC-2012) to the INSDC.
    Lew J., Ecole Polytechnique Federale de Lausanne, CH-1015, Lausanne, Switzerland, and the Swiss Institute of Bioinformatics, CMU - Rue Michel-Servet 1, 1211 Geneva 4, SWITZERLAND
COMMENT On or before Feb 1, 2013 this sequence version replaced gi:41352722, gi:38490165, gi:38490207, gi:41353619, gi:38490250, gi:38684030, gi:38490288, gi:41353667, gi:41353422, gi:41352756, gi:38490319, gi:41352785, gi:38490370, gi:41353971.
  Note: This annotation is from the TubercuList website, Release 26, Dec 2012 (URL: http://tuberculist.epfl.ch) (email: tuberculist@epfl.ch).
FEATURES Qualifiers
  source
    /organism="Mycobacterium tuberculosis H37Rv"
    /strain="H37Rv"
    /mol_type="genomic DNA"
    /db_xref="taxon:83332"
  protein
    /transl_table=11
    /gene="PE_PGRS22"
    /locus_tag="Rv1091"
    /note="Rv1091, (MTV017.44), len: 853 aa. PE_PGRS22, Member of the Mycobacterium tuberculosis PE family, PGRS subfamily of gly-rich proteins (see citation below). Similar to Rv1087|AL021897|MTV017_39 Mycobacterium tuberculosis H37Rv (767 aa), FASTA scores: opt: 2819, E(): 0, (60.0% identity in 860 aa overlap). Predicted to be an outer membrane protein (See Song et al., 2008). This region is a possible MT-complex-specific genomic island (See Becq et al.,2007)."
    /db_xref="EnsemblGenomes-Gn:Rv1091"
    /db_xref="EnsemblGenomes-Tr:CCP43844"
    /db_xref="InterPro:IPR000084"
    /db_xref="UniProtKB/TrEMBL:Q79FS5"
    /experiment="EXISTENCE: identified in proteomics study"
BEGIN
  1 MSFVIAAPEA LVAVASDLAG IGSALAEANA AALAPTTALL AAGADEVSAA IAALFGAHGQ
 61 AYQTVSAQAS AFHAQFVQAL TGGGGAYAAA EAANVSAAQS TDQRLLDLIN GPTQALLGRP
121 LIGDGANGGP GQDGGPGGLL YGNGGNGGTS TTAGVAGGNG GAAGLIGNGG AGGGGGAGAA
181 GGNGGAGGWL YGNGGAGGAG GTSVIPGVAG GNGGAGGSAG LWGTGGAGGD GGNGRSGPVN
241 VAGSAGGNGG AGGAAGLFGD AGAGGNGGKG GAGGAAFSIN FTAGDGGAGG AGGSGGHALL
301 WGAGGAGGNG GSGGTGGAGG STAGAGGNGG AGGGGGTGGL LFGNGGAGGH GAAAGNGLAA
361 GNGVSSSGGG GAGGTGGAGG DGGAGGAGGN ARLWGVGGAG GAGGDGGAGG AGGKGGSGLS
421 GNANGGAGGD SGRGGTGGAG GEGGAAGLLV GTGGHGGDGG AGGAAVKGGD GGAAAGTGIA
481 GAGGRGGAGG SGGSGGDGGG GAAGPAGWLF GDGGAGGNGG AAAAGGAGGQ AGGGGGNGGN
541 GGNGGNGGNG GNGATGGWLY GNGGAGGQGA TAGAGGAGAN GVSSTNGGGT GGNGGIGGTG
601 GSGGAGGNAG LLGVGGAGGH GASGGAGDRG GAGGTGFISS DGGAGGDGGD GGNGGAGGTG
661 GLLFGAGGNG GPGGSGGAAD IGGNGGAGNG GGTDGNGGNG GSGGGAGSGG DGGGAGGNGA
721 WLFGNGGAGG GGGKGGNGAG GGLGGGSFGL PGLNGSGGDG GDGGNGAPGG VLYGNGGAGG
781 QGSSGGIGGP GATGGAGGKG GDGGDAQLIG DGGNGGNGGA GGTGGTPGPG GPGGSGGLGG
841 LLFGQTGTAG VSP
//