LOCUS       AJF01506.1               468 aa    PRT              BCT 20-JUN-2017
DEFINITION  Mycobacterium tuberculosis H37RvSiena PE family protein
            PE3 protein.
ACCESSION   CP007027-162
PROTEIN_ID  AJF01506.1
SOURCE      Mycobacterium tuberculosis H37RvSiena
  ORGANISM  Mycobacterium tuberculosis H37RvSiena
            Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae;
            Mycobacterium; Mycobacterium tuberculosis complex.
REFERENCE   1  (bases 1 to 4410911)
  AUTHORS   Santoro,F., Guerrini,V., Lazzeri,E., Iannelli,F. and Pozzi,G.
  TITLE     Genomic polymorphisms in a Laboratory Isolate of Mycobacterium
            tuberculosis Reference Strain H37Rv (ATCC27294)
  JOURNAL   New Microbiol. 40 (1), 62-69 (2017)
   PUBMED   27819398
REFERENCE   2  (bases 1 to 4410911)
  AUTHORS   Guerrini,V., Santoro,F. and Pozzi,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (30-DEC-2013) Medical Biotechnology, University of Siena,
            viale Bracci, Policlinico Le Scotte, Siena 53100, Italy
REFERENCE   3  (bases 1 to 4410911)
  AUTHORS   Guerrini,V., Santoro,F. and Pozzi,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (20-FEB-2015) Medical Biotechnology, University of Siena,
            viale Bracci, Policlinico Le Scotte, Siena 53100, Italy
  REMARK    Protein update by submitter
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: http://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            
            Annotation modified by submitter.
            
            ##Assembly-Data-START##
            Assembly Method       :: Ray v. 2.3.1
            Coverage              :: 300
            Sequencing Technology :: Illumina
            ##Assembly-Data-END##
            
            ##Genome-Annotation-Data-START##
            Annotation Provider          :: NCBI
            Annotation Date              :: 12/30/2013 09:28:23
            Annotation Pipeline          :: NCBI Prokaryotic Genome Annotation
                                            Pipeline
            Annotation Method            :: Best-placed reference protein set;
                                            GeneMarkS+
            Annotation Software revision :: 2.3 (rev. 422554)
            Features Annotated           :: Gene; CDS; rRNA; tRNA; ncRNA;
                                            repeat_region
            Genes                        :: 4,146
            CDS                          :: 4,050
            Pseudo Genes                 :: 42
            CRISPR Arrays                :: 2
            rRNAs                        :: 3 (5S, 16S, 23S)
            tRNAs                        :: 45
            ncRNA                        :: 6
            Frameshifted Genes           :: 34
            ##Genome-Annotation-Data-END##
FEATURES             Qualifiers
     source          /organism="Mycobacterium tuberculosis H37RvSiena"
                     /mol_type="genomic DNA"
                     /strain="H37RvSiena"
                     /db_xref="taxon:1437856"
     protein         /gene="PE3"
                     /locus_tag="Y980_0159c"
                     /note="Rv0159c, (MTV032.02c) PE3, Member of the
                     Mycobacterium tuberculosis PE family (see citation below),
                     similar to many other PE proteins e.g. O06828 from
                     Mycobacterium tuberculosis (528 aa). Also highly similar
                     to upstream MTV032_3, and to MTCI5_25,MTCI5_26, MTV049_
                     21, MTCY1A10_26, etc."
                     /transl_table=11
                     /db_xref="InterPro:IPR000084"
                     /db_xref="InterPro:IPR013228"
                     /db_xref="UniProtKB/TrEMBL:L0T2V2"
BEGIN
        1 MSYVIAAPEM LATTAADVDG IGSAIRAASA SAAGPTTGLL AAAADEVSSA AAALFSEYAR
       61 ECQEVLKQAA AFHGEFTRAL AAAGAAYAQA EASNTAAMSG TAGSSGALGS VGMLSGNPLT
      121 ALMMGGTGEP ILSDRVLAII DSAYIRPIFG PNNPVAQYTP EQWWPFIGNL SLDQSIAQGV
      181 TLLNNGINAE LQNGHDVVVF GYSQSAAVAT NEIRALMALP PGQAPDPSRL AFTLIGNINN
      241 PNGGVLERYV GLYLPFLDMS FNGATPPDSP YQTYMYTGQY DGYAHNPQYP LNILSDLNAF
      301 MGIRWVHNAY PFTAAEVANA VPLPTSPGYT GNTHYYMFLT QDLPLLQPIR AIPFVGTPIA
      361 ELIQPDLRVL VDLGYGYGYA DVPTPASLFA PINPIAVASA LATGTVQGPQ AALVSIGLLP
      421 QSALPNTYPY LPSANPGLMF NFGQSSVTEL SVLSGALGSV ARLIPPIA
//