LOCUS CCP45928.1 100 aa PRT BCT 27-FEB-2015 DEFINITION Mycobacterium tuberculosis H37Rv Conserved hypothetical protein SseC1 protein. ACCESSION AL123456-3206 PROTEIN_ID CCP45928.1 SOURCE Mycobacterium tuberculosis H37Rv ORGANISM Mycobacterium tuberculosis H37Rv Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae; Mycobacterium; Mycobacterium tuberculosis complex. REFERENCE 1 AUTHORS Cole S.T., Brosch R., Parkhill J., Garnier T., Churcher C., Harris D., Gordon S.V., Eiglmeier K., Gas S., Barry C.E.III., Tekaia F., Badcock K., Basham D., Brown D., Chillingworth T., Connor R., Davies R., Devlin K., Feltwell T., Gentles S., Hamlin N., Holroyd S., Hornsby T., Jagels K., Krogh A., McLean J., Moule S., Murphy L., Oliver K., Osborne J., Quail M.A., Rajandream M.A., Rogers J., Rutter S., Seeger K., Skelton J., Squares R., Squares S., Sulston J.E., Taylor K., Whitehead S., Barrell B.G. TITLE Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence JOURNAL Nature 393(6685), 537-544(1998). PUBMED 9634230 REMARK Erratum:[Nature 1998 Nov 12;396(6707):190] REFERENCE 2 AUTHORS Camus J.C., Pryor M.J., Medigue C., Cole S.T. TITLE Re-annotation of the genome sequence of Mycobacterium tuberculosis H37Rv JOURNAL Microbiology (Reading, Engl.) 148(Pt 10), 2967-2973(2002). PUBMED 12368430 REFERENCE 3 AUTHORS Lew J.M., Kapopoulou A., Jones L.M., Cole S.T. TITLE TubercuList--10 years after JOURNAL Tuberculosis (Edinb) 91(1), 1-7(2011). PUBMED 20980199 REFERENCE 4 (bases 1 to 4411529) AUTHORS Parkhill J. JOURNAL Submitted (11-JUN-1998) to the INSDC. Submitted on behalf of the Mycobacterium tuberculosis sequencing and mapping teams, Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA Unite de Genetique Moleculaire Bacterienne, Institut Pasteur, 28 rue du Docteur Roux, 75724 Paris Cedex 15, France E-mail: parkhill@sanger.ac.uk REFERENCE 5 (bases 1 to 4411532) AUTHORS Lew J.M. JOURNAL Submitted (18-DEC-2012) to the INSDC. 
Lew J., Ecole Polytechnique Federale de Lausanne, CH-1015, Lausanne, Switzerland, and the Swiss Institute of Bioinformatics, CMU - Rue Michel-Servet 1, 1211 Geneva 4, SWITZERLAND COMMENT On or before Feb 1, 2013 this sequence version replaced gi:41352722, gi:38490165, gi:38490207, gi:41353619, gi:38490250, gi:38684030, gi:38490288, gi:41353667, gi:41353422, gi:41352756, gi:38490319, gi:41352785, gi:38490370, gi:41353971. Note: This annotation is from the TubercuList website, Release 26, Dec 2012 (URL: http://tuberculist.epfl.ch) (email: tuberculist@epfl.ch). FEATURES Qualifiers source /organism="Mycobacterium tuberculosis H37Rv" /strain="H37Rv" /mol_type="genomic DNA" /db_xref="taxon:83332" protein /transl_table=11 /gene="sseC1" /gene_synonym="sseC" /locus_tag="Rv3118" /note="Rv3118, (MTCY164.28, O05794), len: 100 aa. SseC1, conserved hypothetical protein, equivalent to Q9CBC7|ML2199 hypothetical protein from Mycobacterium leprae (100 aa), FASTA scores: opt: 545, E(): 3.1e-30, (84.0% identity in 100 aa overlap). Also similar to hypothetical proteins e.g. Q50035 from Saccharopolyspora erythraea (Streptomyces erythraeus) (101 aa), FASTA scores: opt: 345, E(): 9.7e-17, (57.15% identity in 98 aa overlap); and Q9K4H3|SCD66.02 from Streptomyces coelicolor (95 aa), FASTA scores: opt: 249, E(): 2.8e-10, (48.5% identity in 99 aa overlap). Some weak similarity with Q9ZB84|PCAG protocatechuate 3,4-dioxygenase alpha-subunit from Pseudomonas marginata (196 aa), FASTA scores: opt: 109, E(): 1.4, (31.3% identity in 83 aa overlap); and other bacterial proteins. Identical second copy present as Rv0814c|AL022004|MTV043.06c|SSEC2 from Mycobacterium tuberculosis (100 aa) (100.0% identity in 100 aa overlap). Note: previously known as sseC. This region is a possible MT-complex-specific genomic island (see Becq et al., 2007)." 
/db_xref="EnsemblGenomes-Gn:Rv3118" /db_xref="EnsemblGenomes-Tr:CCP45928" /db_xref="InterPro:IPR008969" /db_xref="InterPro:IPR010814" /db_xref="UniProtKB/Swiss-Prot:P0CG96" BEGIN 1 MCSGPKQGLT LPASVDLEKE TVITGRVVDG DGQAVGGAFV RLLDSSDEFT AEVVASATGD 61 FRFFAAPGSW TLRALSAAGN GDAVVQPSGA GIHEVDVKIT //