LOCUS       AJF01497.1               588 aa    PRT              BCT 20-JUN-2017
DEFINITION  Mycobacterium tuberculosis H37RvSiena PE family protein
            PE1 protein.
ACCESSION   CP007027-153
PROTEIN_ID  AJF01497.1
SOURCE      Mycobacterium tuberculosis H37RvSiena
  ORGANISM  Mycobacterium tuberculosis H37RvSiena
            Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae;
            Mycobacterium; Mycobacterium tuberculosis complex.
REFERENCE   1  (bases 1 to 4410911)
  AUTHORS   Santoro,F., Guerrini,V., Lazzeri,E., Iannelli,F. and Pozzi,G.
  TITLE     Genomic polymorphisms in a Laboratory Isolate of Mycobacterium
            tuberculosis Reference Strain H37Rv (ATCC27294)
  JOURNAL   New Microbiol. 40 (1), 62-69 (2017)
   PUBMED   27819398
REFERENCE   2  (bases 1 to 4410911)
  AUTHORS   Guerrini,V., Santoro,F. and Pozzi,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (30-DEC-2013) Medical Biotechnology, University of Siena,
            viale Bracci, Policlinico Le Scotte, Siena 53100, Italy
REFERENCE   3  (bases 1 to 4410911)
  AUTHORS   Guerrini,V., Santoro,F. and Pozzi,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (20-FEB-2015) Medical Biotechnology, University of Siena,
            viale Bracci, Policlinico Le Scotte, Siena 53100, Italy
  REMARK    Protein update by submitter
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: http://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            
            Annotation modified by submitter.
            
            ##Assembly-Data-START##
            Assembly Method       :: Ray v. 2.3.1
            Coverage              :: 300
            Sequencing Technology :: Illumina
            ##Assembly-Data-END##
            
            ##Genome-Annotation-Data-START##
            Annotation Provider          :: NCBI
            Annotation Date              :: 12/30/2013 09:28:23
            Annotation Pipeline          :: NCBI Prokaryotic Genome Annotation
                                            Pipeline
            Annotation Method            :: Best-placed reference protein set;
                                            GeneMarkS+
            Annotation Software revision :: 2.3 (rev. 422554)
            Features Annotated           :: Gene; CDS; rRNA; tRNA; ncRNA;
                                            repeat_region
            Genes                        :: 4,146
            CDS                          :: 4,050
            Pseudo Genes                 :: 42
            CRISPR Arrays                :: 2
            rRNAs                        :: 3 (5S, 16S, 23S)
            tRNAs                        :: 45
            ncRNA                        :: 6
            Frameshifted Genes           :: 34
            ##Genome-Annotation-Data-END##
FEATURES             Qualifiers
     source          /organism="Mycobacterium tuberculosis H37RvSiena"
                     /mol_type="genomic DNA"
                     /strain="H37RvSiena"
                     /db_xref="taxon:1437856"
     protein         /gene="PE1"
                     /locus_tag="Y980_0151c"
                     /experiment="EXISTENCE: identified in proteomics study"
                     /note="Rv0151c, (MTCI5.25c) PE1, Member of the
                     Mycobacterium tuberculosis PE family (see citation below),
                     with N-terminal region similar to others e.g. MTV032_2
                     PE_PGRS family from Mycobacterium tuberculosis (468 aa);
                     MTCY493_24 from Mycobacterium tuberculosis. Also similar
                     to upstream ORF MTCI5.26c. Also shows similarity to
                     C-terminal part of some PPE family proteins e.g. MTV049_21
                     from Mycobacterium tuberculosis."
                     /transl_table=11
                     /db_xref="InterPro:IPR000084"
                     /db_xref="InterPro:IPR013228"
                     /db_xref="UniProtKB/TrEMBL:L0T5V5"
BEGIN
        1 MAPFGFTPKA RHNRGVALRS TYRLDGWVMG PVDKEGWGLS YVFAQPSVLA AAATDLAGIG
       61 SAINQATAAV AAPTTGLAAA AADEVSTALA TLFGAYGQQF QAISAQVAAF HNEFTQRLAA
      121 AANAFVNAEA TNTSALVQEA TAGLFKPTSP PVLPPMFNQN TAIIMGGTGS PIPTPSYVNA
      181 ITTLFIDPVV SNPVVKALVT PEELYPITGV KSLPFQTSVQ LGLQILDGAI WEQINAGNHV
      241 TVFGYSQSAV IASLEMQHLI SLGPNAPSPS QLNFILIGNE MNPNGGILAR IPGLNVTTLG
      301 LPFYGATPDN PYPTTTYTLE YDGFADFPRY PLNVLSDINA VFGILTVHTT YADLTPAQIA
      361 SATQLPTQGT TSNTYYIIET EHLPLLAPLR AIPVIGPPLA ALVEPNLEVI VNLGYGDPRF
      421 GYSTSPANVP TPFGLFPDVP ASVVADALVA GTQQGVNDFM VELPAALNTL PQTPMPAFPP
      481 YVPTLLPPPP PPQPATLINI ADTFASVVST GYSILLPTAD LGLAFVTILP AYDLTLFVNQ
      541 LAAGNLRAAI ELPLAATIGL AALGGMIEFI AIVVTLADIT QQLQSFSI
//