LOCUS       CCP45888.1 275 aa PRT BCT 27-FEB-2015
DEFINITION  Mycobacterium tuberculosis H37Rv Conserved protein protein.
ACCESSION   AL123456-3166
PROTEIN_ID  CCP45888.1
SOURCE      Mycobacterium tuberculosis H37Rv
  ORGANISM  Mycobacterium tuberculosis H37Rv
            Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae; Mycobacterium; Mycobacterium tuberculosis complex.
REFERENCE   1
  AUTHORS   Cole S.T., Brosch R., Parkhill J., Garnier T., Churcher C., Harris D., Gordon S.V., Eiglmeier K., Gas S., Barry C.E.III., Tekaia F., Badcock K., Basham D., Brown D., Chillingworth T., Connor R., Davies R., Devlin K., Feltwell T., Gentles S., Hamlin N., Holroyd S., Hornsby T., Jagels K., Krogh A., McLean J., Moule S., Murphy L., Oliver K., Osborne J., Quail M.A., Rajandream M.A., Rogers J., Rutter S., Seeger K., Skelton J., Squares R., Squares S., Sulston J.E., Taylor K., Whitehead S., Barrell B.G.
  TITLE     Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence
  JOURNAL   Nature 393(6685), 537-544(1998).
  PUBMED    9634230
  REMARK    Erratum:[Nature 1998 Nov 12;396(6707):190]
REFERENCE   2
  AUTHORS   Camus J.C., Pryor M.J., Medigue C., Cole S.T.
  TITLE     Re-annotation of the genome sequence of Mycobacterium tuberculosis H37Rv
  JOURNAL   Microbiology (Reading, Engl.) 148(Pt 10), 2967-2973(2002).
  PUBMED    12368430
REFERENCE   3
  AUTHORS   Lew J.M., Kapopoulou A., Jones L.M., Cole S.T.
  TITLE     TubercuList--10 years after
  JOURNAL   Tuberculosis (Edinb) 91(1), 1-7(2011).
  PUBMED    20980199
REFERENCE   4 (bases 1 to 4411529)
  AUTHORS   Parkhill J.
  JOURNAL   Submitted (11-JUN-1998) to the INSDC.
            Submitted on behalf of the Mycobacterium tuberculosis sequencing and mapping teams, Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA
            Unite de Genetique Moleculaire Bacterienne, Institut Pasteur, 28 rue du Docteur Roux, 75724 Paris Cedex 15, France
            E-mail: parkhill@sanger.ac.uk
REFERENCE   5 (bases 1 to 4411532)
  AUTHORS   Lew J.M.
  JOURNAL   Submitted (18-DEC-2012) to the INSDC.
            Lew J., Ecole Polytechnique Federale de Lausanne, CH-1015, Lausanne, Switzerland, and the Swiss Institute of Bioinformatics, CMU - Rue Michel-Servet 1, 1211 Geneva 4, SWITZERLAND
COMMENT     On or before Feb 1, 2013 this sequence version replaced gi:41352722, gi:38490165, gi:38490207, gi:41353619, gi:38490250, gi:38684030, gi:38490288, gi:41353667, gi:41353422, gi:41352756, gi:38490319, gi:41352785, gi:38490370, gi:41353971.
            Note: This annotation is from the TubercuList website, Release 26, Dec 2012 (URL: http://tuberculist.epfl.ch) (email: tuberculist@epfl.ch).
FEATURES    Qualifiers
  source    /organism="Mycobacterium tuberculosis H37Rv"
            /strain="H37Rv"
            /mol_type="genomic DNA"
            /db_xref="taxon:83332"
  protein   /transl_table=11
            /locus_tag="Rv3079c"
            /note="Rv3079c, (MTCY22D7.02), len: 275 aa. Conserved protein, similar to other hypothetical mycobacterium proteins e.g.
            P71557|Y953_MYCTU|Rv0953c|MTCY10D7.21 possible oxidoreductase from Mycobacterium tuberculosis strain H37Rv (282 aa), FASTA scores: opt: 668, E(): 2.4e-34, (40.55% identity in 281 aa overlap);
            O06216|Rv2161c|MTCY270.07 from Mycobacterium tuberculosis strain H37Rv (288 aa), FASTA scores: opt: 595, E(): 8.5e-30, (40.9% identity in 274 aa overlap);
            O87779 from Mycobacterium paratuberculosis (166 aa), FASTA scores: opt: 464, E(): 7.2e-22, (41.55% identity in 166 aa overlap); etc.
            Also some similarity to other proteins e.g.
            Q9AH10 putative F420-dependent dehydrogenase from Rhodococcus erythropolis (295 aa), FASTA scores: opt: 401, E(): 9.6e-18, (30.2% identity in 288 aa overlap);
            Q9AE04|RIF17 RIF17 protein from Amycolatopsis mediterranei (356 aa), FASTA scores: opt: 298, E(): 2.8e-11, (35.0% identity in 203 aa overlap);
            AAK48081|MT3720 luciferase-related protein from Mycobacterium tuberculosis strain CDC1551 (395 aa), FASTA scores: opt: 223, E(): 1.4e-06, (29.4% identity in 211 aa overlap)."
            /db_xref="EnsemblGenomes-Gn:Rv3079c"
            /db_xref="EnsemblGenomes-Tr:CCP45888"
            /db_xref="GOA:I6XG43"
            /db_xref="InterPro:IPR011251"
            /db_xref="InterPro:IPR019921"
            /db_xref="InterPro:IPR036661"
            /db_xref="UniProtKB/TrEMBL:I6XG43"
            /experiment="EXISTENCE: identified in proteomics study"
BEGIN
        1 MQFGVLTFVT DEGIGPAELG AALEHRGFES LFLAEHTHIP VNTQSPYPGG GPIPEKYYRT
       61 LDPFVALAAA AATTQSLVLG TGIALIPERD PIVTAKEVAS LDLVSQGRFR FGVGVGWLRE
      121 EVANHGVDPA VRGRVIDERL RAIIEIWTQE QAEFHGTYVD FDPIYCWPKP VTKPYPPLYV
      181 GGGPANFPRI ARLNAGWIAI SPSPQRLSGP LQRLRAMAGG DVPVTVCQWG EAAAKDLEGY
      241 RHLGVERVLL ELPTEPRDPT LRYLDKLQAE LARLA
//