LOCUS CCP46615.1 407 aa PRT BCT 27-FEB-2015 DEFINITION Mycobacterium tuberculosis H37Rv Unknown protein protein. ACCESSION AL123456-3893 PROTEIN_ID CCP46615.1 SOURCE Mycobacterium tuberculosis H37Rv ORGANISM Mycobacterium tuberculosis H37Rv Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae; Mycobacterium; Mycobacterium tuberculosis complex. REFERENCE 1 AUTHORS Cole S.T., Brosch R., Parkhill J., Garnier T., Churcher C., Harris D., Gordon S.V., Eiglmeier K., Gas S., Barry C.E.III., Tekaia F., Badcock K., Basham D., Brown D., Chillingworth T., Connor R., Davies R., Devlin K., Feltwell T., Gentles S., Hamlin N., Holroyd S., Hornsby T., Jagels K., Krogh A., McLean J., Moule S., Murphy L., Oliver K., Osborne J., Quail M.A., Rajandream M.A., Rogers J., Rutter S., Seeger K., Skelton J., Squares R., Squares S., Sulston J.E., Taylor K., Whitehead S., Barrell B.G. TITLE Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence JOURNAL Nature 393(6685), 537-544(1998). PUBMED 9634230 REMARK Erratum:[Nature 1998 Nov 12;396(6707):190] REFERENCE 2 AUTHORS Camus J.C., Pryor M.J., Medigue C., Cole S.T. TITLE Re-annotation of the genome sequence of Mycobacterium tuberculosis H37Rv JOURNAL Microbiology (Reading, Engl.) 148(Pt 10), 2967-2973(2002). PUBMED 12368430 REFERENCE 3 AUTHORS Lew J.M., Kapopoulou A., Jones L.M., Cole S.T. TITLE TubercuList--10 years after JOURNAL Tuberculosis (Edinb) 91(1), 1-7(2011). PUBMED 20980199 REFERENCE 4 (bases 1 to 4411529) AUTHORS Parkhill J. JOURNAL Submitted (11-JUN-1998) to the INSDC. Submitted on behalf of the Mycobacterium tuberculosis sequencing and mapping teams, Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA Unite de Genetique Moleculaire Bacterienne, Institut Pasteur, 28 rue du Docteur Roux, 75724 Paris Cedex 15, France E-mail: parkhill@sanger.ac.uk REFERENCE 5 (bases 1 to 4411532) AUTHORS Lew J.M. JOURNAL Submitted (18-DEC-2012) to the INSDC. Lew J., Ecole Polytechnique Federale de Lausanne, CH-1015, Lausanne, Switzerland, and the Swiss Institute of Bioinformatics, CMU - Rue Michel-Servet 1, 1211 Geneva 4, SWITZERLAND COMMENT On or before Feb 1, 2013 this sequence version replaced gi:41352722, gi:38490165, gi:38490207, gi:41353619, gi:38490250, gi:38684030, gi:38490288, gi:41353667, gi:41353422, gi:41352756, gi:38490319, gi:41352785, gi:38490370, gi:41353971. Note: This annotation is from the TubercuList website, Release 26, Dec 2012 (URL: http://tuberculist.epfl.ch) (email: tuberculist@epfl.ch). FEATURES Qualifiers source /organism="Mycobacterium tuberculosis H37Rv" /strain="H37Rv" /mol_type="genomic DNA" /db_xref="taxon:83332" protein /transl_table=11 /locus_tag="Rv3786c" /note="Rv3786c, (MTCY13D12.20), len: 407 aa. Unknown protein. Segment between aa 265-300 (approximately) is highly similar to part of O03937|RORF1608 minor capsid protein from Bacteriophage phig1e (1608 aa), FASTA scores: opt: 242, E(): 8.4e-07, (26.85% identity in 272 aa overlap); Q9ETT9|ORF36 putative peptidase from Corynebacterium equii (Rhodococcus equi) plasmid pREAT701 (p33701) and Plasmid virulence (546 aa), FASTA scores: opt: 231, E(): 1.6e-06, (34.15% identity in 167 aa overlap); O69910|SC2E1.40c hypothetical 22.8 KDA protein. from Streptomyces coelicolor (226 aa) FASTA scores: opt: 218,E(): 4.6e-06, (34.15% identity in 164 aa overlap); and others." /db_xref="EnsemblGenomes-Gn:Rv3786c" /db_xref="EnsemblGenomes-Tr:CCP46615" /db_xref="GOA:P9WKW9" /db_xref="InterPro:IPR011055" /db_xref="InterPro:IPR016047" /db_xref="InterPro:IPR029044" /db_xref="UniProtKB/Swiss-Prot:P9WKW9" /experiment="EXISTENCE: identified in proteomics study" BEGIN 1 MRILAMTRAH NAGRTLAATL DSLAVFSDDI YVIDDRSTDD TAEILANHPA VTNVVRARPD 61 LPPTPWLIPE SAGLELLYRM ADFCRPDWVM MVDADWLVET DIDLRAVLAR TPDDIVALMC 121 PMVSRWDDPE YPDLIPVMGT AEALRGPLWR WYPGLRAGGK LMHNPHWPAN ITDHGRIGQL 181 PGVRLVHSGW STLAERILRV EHYLRLDPDY RFNFGVAYDR SLLFGYALDE VDLLKADYRR 241 RIRGDFDPLE PGGRLPIDRE PRAIGRGYGP HAGGFHPGVD FATDPGTPVY AVASGAVSAI 301 DEVDGLVSLT IARCELDVVY VFRPGDEGRL VLGDRIAAGA QLGTIGAQGE SADGYLHFEV 361 RTQDGHVNPV RYLANMGLRP WPPPGRLRAV SGSYPPATPC TITAEDR //