LOCUS       AJF01455.1               496 aa    PRT              BCT 20-JUN-2017
DEFINITION  Mycobacterium tuberculosis H37RvSiena PE-PGRS family protein
            PE_PGRS1 protein.
ACCESSION   CP007027-111
PROTEIN_ID  AJF01455.1
SOURCE      Mycobacterium tuberculosis H37RvSiena
  ORGANISM  Mycobacterium tuberculosis H37RvSiena
            Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae;
            Mycobacterium; Mycobacterium tuberculosis complex.
REFERENCE   1  (bases 1 to 4410911)
  AUTHORS   Santoro,F., Guerrini,V., Lazzeri,E., Iannelli,F. and Pozzi,G.
  TITLE     Genomic polymorphisms in a Laboratory Isolate of Mycobacterium
            tuberculosis Reference Strain H37Rv (ATCC27294)
  JOURNAL   New Microbiol. 40 (1), 62-69 (2017)
   PUBMED   27819398
REFERENCE   2  (bases 1 to 4410911)
  AUTHORS   Guerrini,V., Santoro,F. and Pozzi,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (30-DEC-2013) Medical Biotechnology, University of Siena,
            viale Bracci, Policlinico Le Scotte, Siena 53100, Italy
REFERENCE   3  (bases 1 to 4410911)
  AUTHORS   Guerrini,V., Santoro,F. and Pozzi,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (20-FEB-2015) Medical Biotechnology, University of Siena,
            viale Bracci, Policlinico Le Scotte, Siena 53100, Italy
  REMARK    Protein update by submitter
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: http://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            
            Annotation modified by submitter.
            
            ##Assembly-Data-START##
            Assembly Method       :: Ray v. 2.3.1
            Coverage              :: 300
            Sequencing Technology :: Illumina
            ##Assembly-Data-END##
            
            ##Genome-Annotation-Data-START##
            Annotation Provider          :: NCBI
            Annotation Date              :: 12/30/2013 09:28:23
            Annotation Pipeline          :: NCBI Prokaryotic Genome Annotation
                                            Pipeline
            Annotation Method            :: Best-placed reference protein set;
                                            GeneMarkS+
            Annotation Software revision :: 2.3 (rev. 422554)
            Features Annotated           :: Gene; CDS; rRNA; tRNA; ncRNA;
                                            repeat_region
            Genes                        :: 4,146
            CDS                          :: 4,050
            Pseudo Genes                 :: 42
            CRISPR Arrays                :: 2
            rRNAs                        :: 3 (5S, 16S, 23S)
            tRNAs                        :: 45
            ncRNA                        :: 6
            Frameshifted Genes           :: 34
            ##Genome-Annotation-Data-END##
FEATURES             Qualifiers
     source          /organism="Mycobacterium tuberculosis H37RvSiena"
                     /mol_type="genomic DNA"
                     /strain="H37RvSiena"
                     /db_xref="taxon:1437856"
     protein         /gene="PE_PGRS1"
                     /locus_tag="Y980_0109"
                     /experiment="EXISTENCE: identified in proteomics study"
                     /note="Rv0109, (MTV031.03c) PE_PGRS1, Member of the M.
                     tuberculosis PE family, PGRS subfamily of gly-rich
                     proteins (see Brennan and Delogu, 2002), highly similar to
                     many e.g. Q50615|Y0DP_MYCTU hypothetical glycine-rich 40.8
                     kDa protein from Mycobacterium tuberculosis (498 aa)."
                     /transl_table=11
                     /db_xref="InterPro:IPR000084"
                     /db_xref="UniProtKB/TrEMBL:L0T2H7"
BEGIN
        1 MSLLITSPAT VAAAATHLAG IGSALSTANA AAAAPTTALS VAGADEVSVL IAALFEAYAQ
       61 EYQALSAQAL AFHDQFVQAL NMGAVCYAAA ETANATPLQA LQTVQQNVLT VVNAPTQALL
      121 GRPIIGNGAN GLPNTGQDGG PGGLLFGNGG NGGSGGVDQA GGNGGAAGLI GNGGSGGVGG
      181 PGIAGSAGGA GGAGGLLFGN GGPGGAGGIG TTGDGGPGGA GGNAIGLFGS GGTGGMGGVG
      241 GMGGVGNGGN AGNGGTAGLF GHGGAGGAGG IGSADGGLGG GGGNGRFMGN GGVGGAGGYG
      301 ASGDGGNAGN GGLGGVFGDG GAGGTGGLGD VNGGLAGIGG NAGFVGNGGA GGNGQLGSGA
      361 VSSAGGMGGN GGLVFGNGGP GGLGGPGTSA GNGGMGGNAV GLFGQGGAGG AGGSGFGAGI
      421 PGGRGGDGGS GGLIGDGGTG GGAGAGDAAA SAGGNGGNAR LIGNGGDGGP GMFGGPGGAG
      481 GSGGTIFGFA GTPGPS
//