LOCUS CCP45337.1 403 aa PRT BCT 27-FEB-2015 DEFINITION Mycobacterium tuberculosis H37Rv Conserved hypothetical protein protein. ACCESSION AL123456-2615 PROTEIN_ID CCP45337.1 SOURCE Mycobacterium tuberculosis H37Rv ORGANISM Mycobacterium tuberculosis H37Rv Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae; Mycobacterium; Mycobacterium tuberculosis complex. REFERENCE 1 AUTHORS Cole S.T., Brosch R., Parkhill J., Garnier T., Churcher C., Harris D., Gordon S.V., Eiglmeier K., Gas S., Barry C.E.III., Tekaia F., Badcock K., Basham D., Brown D., Chillingworth T., Connor R., Davies R., Devlin K., Feltwell T., Gentles S., Hamlin N., Holroyd S., Hornsby T., Jagels K., Krogh A., McLean J., Moule S., Murphy L., Oliver K., Osborne J., Quail M.A., Rajandream M.A., Rogers J., Rutter S., Seeger K., Skelton J., Squares R., Squares S., Sulston J.E., Taylor K., Whitehead S., Barrell B.G. TITLE Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence JOURNAL Nature 393(6685), 537-544(1998). PUBMED 9634230 REMARK Erratum:[Nature 1998 Nov 12;396(6707):190] REFERENCE 2 AUTHORS Camus J.C., Pryor M.J., Medigue C., Cole S.T. TITLE Re-annotation of the genome sequence of Mycobacterium tuberculosis H37Rv JOURNAL Microbiology (Reading, Engl.) 148(Pt 10), 2967-2973(2002). PUBMED 12368430 REFERENCE 3 AUTHORS Lew J.M., Kapopoulou A., Jones L.M., Cole S.T. TITLE TubercuList--10 years after JOURNAL Tuberculosis (Edinb) 91(1), 1-7(2011). PUBMED 20980199 REFERENCE 4 (bases 1 to 4411529) AUTHORS Parkhill J. JOURNAL Submitted (11-JUN-1998) to the INSDC. Submitted on behalf of the Mycobacterium tuberculosis sequencing and mapping teams, Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA Unite de Genetique Moleculaire Bacterienne, Institut Pasteur, 28 rue du Docteur Roux, 75724 Paris Cedex 15, France E-mail: parkhill@sanger.ac.uk REFERENCE 5 (bases 1 to 4411532) AUTHORS Lew J.M. JOURNAL Submitted (18-DEC-2012) to the INSDC. Lew J., Ecole Polytechnique Federale de Lausanne, CH-1015, Lausanne, Switzerland, and the Swiss Institute of Bioinformatics, CMU - Rue Michel-Servet 1, 1211 Geneva 4, SWITZERLAND COMMENT On or before Feb 1, 2013 this sequence version replaced gi:41352722, gi:38490165, gi:38490207, gi:41353619, gi:38490250, gi:38684030, gi:38490288, gi:41353667, gi:41353422, gi:41352756, gi:38490319, gi:41352785, gi:38490370, gi:41353971. Note: This annotation is from the TubercuList website, Release 26, Dec 2012 (URL: http://tuberculist.epfl.ch) (email: tuberculist@epfl.ch). FEATURES Qualifiers source /organism="Mycobacterium tuberculosis H37Rv" /strain="H37Rv" /mol_type="genomic DNA" /db_xref="taxon:83332" protein /transl_table=11 /locus_tag="Rv2542" /note="Rv2542, (MTCY159.14c), len: 403 aa. Conserved hypothetical protein, highly similar to AAK46927|MT2616 hypothetical 28.0 KDA protein from Mycobacterium tuberculosis strain CDC1551 (265 aa), FASTA scores: opt: 1776, E(): 2.3e-94, (99.25% identity in 265 aa overlap). And similar to several hypothetical proteins from Mycobacterium tuberculosis (strain H37Rv and CDC1551) e.g. P71654|Rv2797c|MTCY16B7.46 (562 aa), FASTA scores: opt: 537, E(): 2.6e-23, (40.75% identity in 292 aa overlap); P71547|Y963_MYCTU|Rv0963c|MT0992|MTCY10D7.11 (266 aa),FASTA scores: opt: 357, E(): 2.6e-13, (34.6% identity in 234 aa overlap); Q10685|YK77_MYCTU|Rv2077c|MT2137|MTCY49.16c (323 aa), FASTA scores: opt: 261, E(): 9.5e-08, (32.7% identity in 211 aa overlap); etc. Also similar to Q9RDQ9|SC4A7.03 putative secreted protein from Streptomyces coelicolor (406 aa),FASTA scores: opt: 247, E(): 7.3e-07, (30.35% identity in 303 aa overlap)." /db_xref="EnsemblGenomes-Gn:Rv2542" /db_xref="EnsemblGenomes-Tr:CCP45337" /db_xref="GOA:P95011" /db_xref="InterPro:IPR010427" /db_xref="InterPro:IPR029058" /db_xref="UniProtKB/TrEMBL:P95011" /experiment="EXISTENCE: identified in proteomics study" BEGIN 1 MLDAVSDARR DGFAVGEDYT VTDRSTGGSR QQRAARLGQA QGHADFIRHR VGALLATDRD 61 IATRVSAATQ GLDELAFEDV PGVDTPAEDG VQAVDFRQAP PPGAPGGMSS GDIDAIDAAN 121 RALLQDMLAE YSRLPDGQVK TDRLADIAAI QEALRVPDSH LIYVARPDDP ADMIPAVTAV 181 GDPFTADHVS VTVPGVSGTT RQTIATMTQE TRGLREEARV IAHSVGESEN VATIAWVGYQ 241 PPPVLASWNT VDDDLAQAGA PKLEAFLRDL QAGSHNPGHT TALFGHSYGS LLSGIALKDG 301 ASSLVDNAVL YGSPGFDATS PAKLGMNDHN FFVMTTPDDP IRYPARLAPL HGWGSDGADT 361 IGTVGRQGTP ARVGIRPQRD HRRIPGPLPL HPSADRRGIH SAG //