LOCUS CCP46255.1 473 aa PRT BCT 27-FEB-2015 DEFINITION Mycobacterium tuberculosis H37Rv Conserved protein protein. ACCESSION AL123456-3533 PROTEIN_ID CCP46255.1 SOURCE Mycobacterium tuberculosis H37Rv ORGANISM Mycobacterium tuberculosis H37Rv Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae; Mycobacterium; Mycobacterium tuberculosis complex. REFERENCE 1 AUTHORS Cole S.T., Brosch R., Parkhill J., Garnier T., Churcher C., Harris D., Gordon S.V., Eiglmeier K., Gas S., Barry C.E.III., Tekaia F., Badcock K., Basham D., Brown D., Chillingworth T., Connor R., Davies R., Devlin K., Feltwell T., Gentles S., Hamlin N., Holroyd S., Hornsby T., Jagels K., Krogh A., McLean J., Moule S., Murphy L., Oliver K., Osborne J., Quail M.A., Rajandream M.A., Rogers J., Rutter S., Seeger K., Skelton J., Squares R., Squares S., Sulston J.E., Taylor K., Whitehead S., Barrell B.G. TITLE Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence JOURNAL Nature 393(6685), 537-544(1998). PUBMED 9634230 REMARK Erratum:[Nature 1998 Nov 12;396(6707):190] REFERENCE 2 AUTHORS Camus J.C., Pryor M.J., Medigue C., Cole S.T. TITLE Re-annotation of the genome sequence of Mycobacterium tuberculosis H37Rv JOURNAL Microbiology (Reading, Engl.) 148(Pt 10), 2967-2973(2002). PUBMED 12368430 REFERENCE 3 AUTHORS Lew J.M., Kapopoulou A., Jones L.M., Cole S.T. TITLE TubercuList--10 years after JOURNAL Tuberculosis (Edinb) 91(1), 1-7(2011). PUBMED 20980199 REFERENCE 4 (bases 1 to 4411529) AUTHORS Parkhill J. JOURNAL Submitted (11-JUN-1998) to the INSDC. Submitted on behalf of the Mycobacterium tuberculosis sequencing and mapping teams, Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA Unite de Genetique Moleculaire Bacterienne, Institut Pasteur, 28 rue du Docteur Roux, 75724 Paris Cedex 15, France E-mail: parkhill@sanger.ac.uk REFERENCE 5 (bases 1 to 4411532) AUTHORS Lew J.M. JOURNAL Submitted (18-DEC-2012) to the INSDC. Lew J., Ecole Polytechnique Federale de Lausanne, CH-1015, Lausanne, Switzerland, and the Swiss Institute of Bioinformatics, CMU - Rue Michel-Servet 1, 1211 Geneva 4, SWITZERLAND COMMENT On or before Feb 1, 2013 this sequence version replaced gi:41352722, gi:38490165, gi:38490207, gi:41353619, gi:38490250, gi:38684030, gi:38490288, gi:41353667, gi:41353422, gi:41352756, gi:38490319, gi:41352785, gi:38490370, gi:41353971. Note: This annotation is from the TubercuList website, Release 26, Dec 2012 (URL: http://tuberculist.epfl.ch) (email: tuberculist@epfl.ch). FEATURES Qualifiers source /organism="Mycobacterium tuberculosis H37Rv" /strain="H37Rv" /mol_type="genomic DNA" /db_xref="taxon:83332" protein /transl_table=11 /locus_tag="Rv3433c" /note="Rv3433c, (MTCY77.05), len: 473 aa. Conserved protein, member of YKL151c/yjeF family, equivalent to P37391|YY33_MYCLE|ML0373|U229G|B229_C2_201 hypothetical 47.2 KDA protein from Mycobacterium leprae (473 aa), FASTA scores: opt: 2650, E(): 5e-136, (84.55% identity in 473 aa overlap). Also similar to other hypothetical bacterial proteins e.g. Q9X3W3 from Zymomonas mobilis (484 aa), FASTA scores: opt: 700, E(): 1.2e-30, (33.7% identity in 484 aa overlap); O86783|SC6G4.20c from Streptomyces coelicolor (485 aa), FASTA scores: opt: 563, E(): 3.2e-23, (48.45% identity in 489 aa overlap); Q9LC81 from Arthrobacter sp. Q36 (313 aa), FASTA scores: opt: 553, E(): 7.9e-23, (44.2% identity in 303 aa overlap); etc. Contains Pfam match to entry PF01256 hypothetical UPFOO31 family signature and PF03853 YjeF-related protein N-terminus. Belongs to the UPF0031 family." /db_xref="EnsemblGenomes-Gn:Rv3433c" /db_xref="EnsemblGenomes-Tr:CCP46255" /db_xref="GOA:P9WF11" /db_xref="InterPro:IPR000631" /db_xref="InterPro:IPR004443" /db_xref="InterPro:IPR017953" /db_xref="InterPro:IPR029056" /db_xref="InterPro:IPR030677" /db_xref="InterPro:IPR036652" /db_xref="UniProtKB/Swiss-Prot:P9WF11" /inference="protein motif:PROSITE:PS01050" /inference="protein motif:PROSITE:PS01049" /experiment="EXISTENCE: identified in proteomics study" BEGIN 1 MRHYYSVDTI RAAEAPLLAS LPDGALMRRA AFGLATEIGR ELTARTGGVV GRRVCAVVGS 61 GDNGGDALWA ATFLRRRGAA ADAVLLNPDR THRKALAAFT KSGGRLVESV SAATDLVIDG 121 VVGISGSGPL RPAAAQVFAA VQAAAIPVVA VDIPSGIDVA TGAITGPAVH AALTVTFGGL 181 KPVHALADCG RVVLVDIGLD LAHTDVLGFE ATDVAARWPV PGPRDDKYTQ GVTGVLAGSS 241 TYPGAAVLCT GAAVAATSGM VRYAGTAHAE VLAHWPEVIA SPTPAAAGRV QAWVVGPGLG 301 TDEAGAAALW FALDTDLPVL VDADGLTMLA DHPDLVAGRN APTVLTPHAG EFARLAGAPP 361 GDDRVGACRQ LADALGATVL LKGNVTVIAD PGGPVYLNPA GQSWAATAGS GDVLSGMIGA 421 LLASGLPSGE AAAAAAFVHA RASAAAAADP GPGDAPTSAS RISGHIRAAL AAL //