LOCUS       AJF01507.1               502 aa    PRT              BCT 20-JUN-2017
DEFINITION  Mycobacterium tuberculosis H37RvSiena PE family protein
            PE4 protein.
ACCESSION   CP007027-163
PROTEIN_ID  AJF01507.1
SOURCE      Mycobacterium tuberculosis H37RvSiena
  ORGANISM  Mycobacterium tuberculosis H37RvSiena
            Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae;
            Mycobacterium; Mycobacterium tuberculosis complex.
REFERENCE   1  (bases 1 to 4410911)
  AUTHORS   Santoro,F., Guerrini,V., Lazzeri,E., Iannelli,F. and Pozzi,G.
  TITLE     Genomic polymorphisms in a Laboratory Isolate of Mycobacterium
            tuberculosis Reference Strain H37Rv (ATCC27294)
  JOURNAL   New Microbiol. 40 (1), 62-69 (2017)
   PUBMED   27819398
REFERENCE   2  (bases 1 to 4410911)
  AUTHORS   Guerrini,V., Santoro,F. and Pozzi,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (30-DEC-2013) Medical Biotechnology, University of Siena,
            viale Bracci, Policlinico Le Scotte, Siena 53100, Italy
REFERENCE   3  (bases 1 to 4410911)
  AUTHORS   Guerrini,V., Santoro,F. and Pozzi,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (20-FEB-2015) Medical Biotechnology, University of Siena,
            viale Bracci, Policlinico Le Scotte, Siena 53100, Italy
  REMARK    Protein update by submitter
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: http://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            
            Annotation modified by submitter.
            
            ##Assembly-Data-START##
            Assembly Method       :: Ray v. 2.3.1
            Coverage              :: 300
            Sequencing Technology :: Illumina
            ##Assembly-Data-END##
            
            ##Genome-Annotation-Data-START##
            Annotation Provider          :: NCBI
            Annotation Date              :: 12/30/2013 09:28:23
            Annotation Pipeline          :: NCBI Prokaryotic Genome Annotation
                                            Pipeline
            Annotation Method            :: Best-placed reference protein set;
                                            GeneMarkS+
            Annotation Software revision :: 2.3 (rev. 422554)
            Features Annotated           :: Gene; CDS; rRNA; tRNA; ncRNA;
                                            repeat_region
            Genes                        :: 4,146
            CDS                          :: 4,050
            Pseudo Genes                 :: 42
            CRISPR Arrays                :: 2
            rRNAs                        :: 3 (5S, 16S, 23S)
            tRNAs                        :: 45
            ncRNA                        :: 6
            Frameshifted Genes           :: 34
            ##Genome-Annotation-Data-END##
FEATURES             Qualifiers
     source          /organism="Mycobacterium tuberculosis H37RvSiena"
                     /mol_type="genomic DNA"
                     /strain="H37RvSiena"
                     /db_xref="taxon:1437856"
     protein         /gene="PE4"
                     /locus_tag="Y980_0160c"
                     /experiment="EXISTENCE: identified in proteomics study"
                     /note="Rv0160c, (MTV032.03c) PE4, Member of the
                     Mycobacterium tuberculosis PE family (see citation below),
                     similar to many other PE proteins e.g. Z92770|MTCI5_26c
                     from Mycobacterium tuberculosis (525 aa); C-terminal
                     region of O06801|RV1768|MTCY28.34 from Mycobacterium
                     tuberculosis (618 aa). Also highly similar to downstream
                     ORF MTV032_2."
                     /transl_table=11
                     /db_xref="InterPro:IPR000084"
                     /db_xref="InterPro:IPR013228"
                     /db_xref="UniProtKB/TrEMBL:Q7DAC9"
BEGIN
        1 MSHLVTAPDM LATAAAHVDE IASTLRAANA AAAGPTCNLL AAAGDEVSAA TAALFSAYGR
       61 EYQAVVKQAA AFHSEFTRTL EAAGNAYAHA EAANAARVSH ALDTINAPIR TLLGRAPLSP
      121 NGSSGAGGLP AIAQLAAESP ITALIMGGTN NPLPDPEYVT DINKAFIQTL FPGAVSQGLF
      181 TPEQFWPVTP DLGNLTFNQS VTEGVALLNT AVNNQLALDN KVVAFGYSQS ATIINNYINS
      241 LMAMGSPNPD DISFVMIGSG NNPVGGLLAR FPGFYIPFLD VPFNGATPAN SPYPTHIYTA
      301 QYDGIAHAPQ FPLRILSDIN AFMGYFYVHN TYPELMATQV DNAVPLPTSP GYTGNTQYYM
      361 FLTQDLPLLQ PIRDIPYAGP PIADLFQPQL RVLVDLGYAD YGPGGNYADI PTPAGLFSIP
      421 NPFAVTYYLI KGSLQAPYGA IVEIGVEAGL IGPEWFPDSY PWVPSINPGL NFYFGQPQVT
      481 LLSLMSGGLG NILHLIPPPV FT
//