CAB36639.1

LOCUS       CAB36639.1               112 aa    PRT              BCT 23-OCT-2008
DEFINITION  Mycobacterium leprae hypothetical protein protein.
ACCESSION   AL035480-13
PROTEIN_ID  CAB36639.1
SOURCE      Mycobacterium leprae
  ORGANISM  Mycobacterium leprae
            Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae;
            Mycobacterium.
REFERENCE   1  (bases 1 to 34316)
  AUTHORS   Murphy L., Harris D.
  JOURNAL   Unpublished.
REFERENCE   2  (bases 1 to 34316)
  AUTHORS   James K.D., Parkhill J., Barrell B.G., Rajandream M.A.
  JOURNAL   Submitted (15-FEB-1999) to the INSDC. Mycobacterium leprae
            sequencing project, Sanger Centre, Wellcome Trust Genome Campus,
            Hinxton, Cambridge CB10 1SA E-mail: barrell@sanger.ac.uk Cosmids
            supplied by Dr. Stewart T. Cole, [3] Unite de Genetique Moleculaire
            Bacterienne, Institut Pasteur, 28 rue du Docteur Roux, 75724 Paris
            Cedex 15, France Requests for cosmids should be sent to Karin
            Eiglmeier (kei@pasteur.fr).
REFERENCE   3  (bases 1 to 34316)
  AUTHORS   Eiglmeier K., Honore N., Woods S.A., Caudron B., Cole S.T.
  TITLE     Use of an ordered cosmid library to deduce the genomic organization
            of Mycobacterium leprae
  JOURNAL   Mol. Microbiol. 7(2), 197-206(1993).
   PUBMED   8446027
COMMENT     Notes:
            
            The Sanger Centre is funded to complete the sequence of M. leprae
            by the Heiser Program for Research in Leprosy and Tuberculosis of
            The New York Community Trust.
            
            Work in Paris is supported by the Heiser Trust, the Association
            Francaise Raoul Follereau and the Groupement de Recherches et des
            Etudes des Genomes (GIP-GREG).
            
            Details of M. leprae sequencing at the Sanger Centre
            are available on the World Wide Web.
            (URL, http://www.sanger.ac.uk/Projects/)
            
            CDS are numbered using the following system eg MLCB33.01c.
            ML (M. leprae), cB33 (cosmid name), .01 (first CDS),
            c (complementary strand).
            
            The more significant matches with motifs in the PROSITE
            database are also included but some of these may be fortuitous.
            
            The length in codons is given for each CDS.
            
            Usually the highest scoring match found by fasta -o is given for
            CDS which show significant similarity to other CDS in the database.
            The position of possible ribosome binding site sequences are
            given where these have been used to deduce the initiation codon.
            
            All CDS over 100 codons have been analysed.  Gene prediction
            is based on positional base preference in codons especially
            where there is an increase in the observed/expected third
            position G + C.  CAUTION:  We may not have predicted the
            correct initiation codon.  Where possible we choose an
            initiation codon (atg, gtg, or ttg) which is preceded by an
            upstream ribosome binding site sequence (optimally 5-13bp
            before the initiation codon).  If this cannot be identified
            we choose the most upstream initiation codon.
            
            IMPORTANT: This sequence MAY NOT be the entire insert of
            the sequenced clone.  It may be shorter because we only
            sequence overlapping sections once, or longer, because we
            arrange for a small overlap between neighbouring submissions.
FEATURES             Qualifiers
     source          /organism="Mycobacterium leprae"
                     /mol_type="genomic DNA"
                     /clone="cosmid B12"
                     /db_xref="taxon:1769"
     protein         /transl_table=11
                     /gene="MLCB12.13c"
                     /note="MLCB12.13c, hypothetical protein, len: 112 aa; CDS
                     within RLEP element, highly similar other CDS in RLEP
                     elements e.g. TR:O07704 (MLCL383.30C), fasta scores;
                     opt:786 z-score: 926.0 E(): 0, 99.1% identity in 112 aa
                     overlap"
                     /db_xref="InterPro:IPR009418"
                     /db_xref="UniProtKB/TrEMBL:Q9R448"
BEGIN
        1 MTTPTPQGHD MHTKTPLPRG ANNYPHTHAC IDIAFSTAQV PSPWHHQHVD QAAPTTDMLT
       61 CAALIVSTAA KHTKPHRKQA VSHPPTKTPQ HSKTRQQDSG LPPTTTEHIG QS
//