LOCUS       CCP43844.1               853 aa    PRT              BCT 27-FEB-2015
DEFINITION  Mycobacterium tuberculosis H37Rv PE-PGRS family protein
            PE_PGRS22 protein.
ACCESSION   AL123456-1122
PROTEIN_ID  CCP43844.1
SOURCE      Mycobacterium tuberculosis H37Rv
  ORGANISM  Mycobacterium tuberculosis H37Rv
            Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae;
            Mycobacterium; Mycobacterium tuberculosis complex.
REFERENCE   1
  AUTHORS   Cole S.T., Brosch R., Parkhill J., Garnier T., Churcher C.,
            Harris D., Gordon S.V., Eiglmeier K., Gas S., Barry C.E.III.,
            Tekaia F., Badcock K., Basham D., Brown D., Chillingworth T.,
            Connor R., Davies R., Devlin K., Feltwell T., Gentles S.,
            Hamlin N., Holroyd S., Hornsby T., Jagels K., Krogh A., McLean J.,
            Moule S., Murphy L., Oliver K., Osborne J., Quail M.A.,
            Rajandream M.A., Rogers J., Rutter S., Seeger K., Skelton J.,
            Squares R., Squares S., Sulston J.E., Taylor K., Whitehead S.,
            Barrell B.G.
  TITLE     Deciphering the biology of Mycobacterium tuberculosis from the
            complete genome sequence
  JOURNAL   Nature 393(6685), 537-544(1998).
   PUBMED   9634230
  REMARK    Erratum:[Nature 1998 Nov 12;396(6707):190]
REFERENCE   2
  AUTHORS   Camus J.C., Pryor M.J., Medigue C., Cole S.T.
  TITLE     Re-annotation of the genome sequence of Mycobacterium tuberculosis
            H37Rv
  JOURNAL   Microbiology (Reading, Engl.) 148(Pt 10), 2967-2973(2002).
   PUBMED   12368430
REFERENCE   3
  AUTHORS   Lew J.M., Kapopoulou A., Jones L.M., Cole S.T.
  TITLE     TubercuList--10 years after
  JOURNAL   Tuberculosis (Edinb) 91(1), 1-7(2011).
   PUBMED   20980199
REFERENCE   4  (bases 1 to 4411529)
  AUTHORS   Parkhill J.
  JOURNAL   Submitted (11-JUN-1998) to the INSDC. Submitted on behalf of the
            Mycobacterium tuberculosis sequencing and mapping teams, Sanger
            Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA
            Unite de Genetique Moleculaire Bacterienne, Institut Pasteur, 28
            rue du Docteur Roux, 75724 Paris Cedex 15, France E-mail:
            parkhill@sanger.ac.uk
REFERENCE   5  (bases 1 to 4411532)
  AUTHORS   Lew J.M.
  JOURNAL   Submitted (18-DEC-2012) to the INSDC. Lew J., Ecole Polytechnique
            Federale de Lausanne, CH-1015, Lausanne, Switzerland, and the Swiss
            Institute of Bioinformatics, CMU - Rue Michel-Servet 1, 1211 Geneva
            4, SWITZERLAND
COMMENT     On or before Feb 1, 2013 this sequence version replaced
            gi:41352722, gi:38490165, gi:38490207, gi:41353619, gi:38490250,
            gi:38684030, gi:38490288, gi:41353667, gi:41353422, gi:41352756,
            gi:38490319, gi:41352785, gi:38490370, gi:41353971.
            Note:
            This annotation is from the TubercuList website, Release 26, Dec
            2012 (URL: http://tuberculist.epfl.ch) (email:
            tuberculist@epfl.ch).
FEATURES             Qualifiers
     source          /organism="Mycobacterium tuberculosis H37Rv"
                     /strain="H37Rv"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:83332"
     protein         /transl_table=11
                     /gene="PE_PGRS22"
                     /locus_tag="Rv1091"
                     /note="Rv1091, (MTV017.44), len: 853 aa. PE_PGRS22, Member
                     of the Mycobacterium tuberculosis PE family, PGRS
                     subfamily of gly-rich proteins (see citation below).
                     Similar to Rv1087|AL021897|MTV017_39 Mycobacterium
                     tuberculosis H37Rv (767 aa), FASTA scores: opt: 2819, E():
                     0, (60.0% identity in 860 aa overlap). Predicted to be an
                     outer membrane protein (See Song et al., 2008). This
                     region is a possible MT-complex-specific genomic island
                     (See Becq et al.,2007)."
                     /db_xref="EnsemblGenomes-Gn:Rv1091"
                     /db_xref="EnsemblGenomes-Tr:CCP43844"
                     /db_xref="InterPro:IPR000084"
                     /db_xref="UniProtKB/TrEMBL:Q79FS5"
                     /experiment="EXISTENCE: identified in proteomics study"
BEGIN
        1 MSFVIAAPEA LVAVASDLAG IGSALAEANA AALAPTTALL AAGADEVSAA IAALFGAHGQ
       61 AYQTVSAQAS AFHAQFVQAL TGGGGAYAAA EAANVSAAQS TDQRLLDLIN GPTQALLGRP
      121 LIGDGANGGP GQDGGPGGLL YGNGGNGGTS TTAGVAGGNG GAAGLIGNGG AGGGGGAGAA
      181 GGNGGAGGWL YGNGGAGGAG GTSVIPGVAG GNGGAGGSAG LWGTGGAGGD GGNGRSGPVN
      241 VAGSAGGNGG AGGAAGLFGD AGAGGNGGKG GAGGAAFSIN FTAGDGGAGG AGGSGGHALL
      301 WGAGGAGGNG GSGGTGGAGG STAGAGGNGG AGGGGGTGGL LFGNGGAGGH GAAAGNGLAA
      361 GNGVSSSGGG GAGGTGGAGG DGGAGGAGGN ARLWGVGGAG GAGGDGGAGG AGGKGGSGLS
      421 GNANGGAGGD SGRGGTGGAG GEGGAAGLLV GTGGHGGDGG AGGAAVKGGD GGAAAGTGIA
      481 GAGGRGGAGG SGGSGGDGGG GAAGPAGWLF GDGGAGGNGG AAAAGGAGGQ AGGGGGNGGN
      541 GGNGGNGGNG GNGATGGWLY GNGGAGGQGA TAGAGGAGAN GVSSTNGGGT GGNGGIGGTG
      601 GSGGAGGNAG LLGVGGAGGH GASGGAGDRG GAGGTGFISS DGGAGGDGGD GGNGGAGGTG
      661 GLLFGAGGNG GPGGSGGAAD IGGNGGAGNG GGTDGNGGNG GSGGGAGSGG DGGGAGGNGA
      721 WLFGNGGAGG GGGKGGNGAG GGLGGGSFGL PGLNGSGGDG GDGGNGAPGG VLYGNGGAGG
      781 QGSSGGIGGP GATGGAGGKG GDGGDAQLIG DGGNGGNGGA GGTGGTPGPG GPGGSGGLGG
      841 LLFGQTGTAG VSP
//