LOCUS       AJF01470.1               487 aa    PRT              BCT 20-JUN-2017
DEFINITION  Mycobacterium tuberculosis H37RvSiena PE-PGRS family protein
            PE_PGRS2 protein.
ACCESSION   CP007027-126
PROTEIN_ID  AJF01470.1
SOURCE      Mycobacterium tuberculosis H37RvSiena
  ORGANISM  Mycobacterium tuberculosis H37RvSiena
            Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae;
            Mycobacterium; Mycobacterium tuberculosis complex.
REFERENCE   1  (bases 1 to 4410911)
  AUTHORS   Santoro,F., Guerrini,V., Lazzeri,E., Iannelli,F. and Pozzi,G.
  TITLE     Genomic polymorphisms in a Laboratory Isolate of Mycobacterium
            tuberculosis Reference Strain H37Rv (ATCC27294)
  JOURNAL   New Microbiol. 40 (1), 62-69 (2017)
   PUBMED   27819398
REFERENCE   2  (bases 1 to 4410911)
  AUTHORS   Guerrini,V., Santoro,F. and Pozzi,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (30-DEC-2013) Medical Biotechnology, University of Siena,
            viale Bracci, Policlinico Le Scotte, Siena 53100, Italy
REFERENCE   3  (bases 1 to 4410911)
  AUTHORS   Guerrini,V., Santoro,F. and Pozzi,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (20-FEB-2015) Medical Biotechnology, University of Siena,
            viale Bracci, Policlinico Le Scotte, Siena 53100, Italy
  REMARK    Protein update by submitter
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: http://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            
            Annotation modified by submitter.
            
            ##Assembly-Data-START##
            Assembly Method       :: Ray v. 2.3.1
            Coverage              :: 300
            Sequencing Technology :: Illumina
            ##Assembly-Data-END##
            
            ##Genome-Annotation-Data-START##
            Annotation Provider          :: NCBI
            Annotation Date              :: 12/30/2013 09:28:23
            Annotation Pipeline          :: NCBI Prokaryotic Genome Annotation
                                            Pipeline
            Annotation Method            :: Best-placed reference protein set;
                                            GeneMarkS+
            Annotation Software revision :: 2.3 (rev. 422554)
            Features Annotated           :: Gene; CDS; rRNA; tRNA; ncRNA;
                                            repeat_region
            Genes                        :: 4,146
            CDS                          :: 4,050
            Pseudo Genes                 :: 42
            CRISPR Arrays                :: 2
            rRNAs                        :: 3 (5S, 16S, 23S)
            tRNAs                        :: 45
            ncRNA                        :: 6
            Frameshifted Genes           :: 34
            ##Genome-Annotation-Data-END##
FEATURES             Qualifiers
     source          /organism="Mycobacterium tuberculosis H37RvSiena"
                     /mol_type="genomic DNA"
                     /strain="H37RvSiena"
                     /db_xref="taxon:1437856"
     protein         /gene="PE_PGRS2"
                     /locus_tag="Y980_0124"
                     /experiment="EXISTENCE: identified in proteomics study"
                     /note="Rv0124 (MTCI418B.06), PE_PGRS2. Member of the
                     Mycobacterium tuberculosis PE family, PGRS subfamily of
                     gly-rich proteins (see Brennan and Delogu, 2002); highly
                     similar to many, e.g. Y0DP_MYCTU|Q50615 from Mycobacterium
                     tuberculosis (498 aa)."
                     /transl_table=11
                     /db_xref="InterPro:IPR000084"
                     /db_xref="UniProtKB/TrEMBL:L0T2J5"
BEGIN
        1 MSFVSVAPEI VVAAATDLAG IGSAISAANA AAAAPTTAVL AAGADEVSAA IAALFSGHAQ
       61 AYQALSAQAA AFHQQFVQTL AGGAGAYAAA EAQVEQQLLA AINAPTQALL GRPLIGNGAD
      121 GAPGTGQAGG AGGILYGNGG NGGSGAAGQA GGAGGPAGLI GHGGSGGAGG SGAAGGAGGH
      181 GGWLWGNGGV GGSGGAGVGA GVAGGHGGAG GAAGLWGAGG GGGNGGNGAD ANIVSGGDGG
      241 LGGAGGGGGW LYGDGGAGGH GGQGAIGLGG GAGGDGGQGG AGRGLWGTGG AGGHGGQGGG
      301 TGGPPLPGQA GMGAAGGAGG LIGNGGAGGD GGVGASGGVA GVGGAGGNAM LIGHGGAGGA
      361 GGDSSFANGA AGGAGGAGGH LFGNGGSGGH GGAVTAGNTG IGGAGGVGGD ARLIGHGGAG
      421 GAGGDRAGAL VGRDGGPGGN GGAGGQLYGN GGDGAPGTGG TLQAAVSGLV TALFGAPGQP
      481 GDTGQPG
//