LOCUS       CCP45429.1   432 aa   PRT   BCT   27-FEB-2015
DEFINITION  Mycobacterium tuberculosis H37Rv Conserved hypothetical protein
            protein.
ACCESSION   AL123456-2707
PROTEIN_ID  CCP45429.1
SOURCE      Mycobacterium tuberculosis H37Rv
  ORGANISM  Mycobacterium tuberculosis H37Rv
            Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae;
            Mycobacterium; Mycobacterium tuberculosis complex.
REFERENCE   1
  AUTHORS   Cole S.T., Brosch R., Parkhill J., Garnier T., Churcher C.,
            Harris D., Gordon S.V., Eiglmeier K., Gas S., Barry C.E.III.,
            Tekaia F., Badcock K., Basham D., Brown D., Chillingworth T.,
            Connor R., Davies R., Devlin K., Feltwell T., Gentles S.,
            Hamlin N., Holroyd S., Hornsby T., Jagels K., Krogh A.,
            McLean J., Moule S., Murphy L., Oliver K., Osborne J.,
            Quail M.A., Rajandream M.A., Rogers J., Rutter S., Seeger K.,
            Skelton J., Squares R., Squares S., Sulston J.E., Taylor K.,
            Whitehead S., Barrell B.G.
  TITLE     Deciphering the biology of Mycobacterium tuberculosis from the
            complete genome sequence
  JOURNAL   Nature 393(6685), 537-544(1998).
  PUBMED    9634230
  REMARK    Erratum:[Nature 1998 Nov 12;396(6707):190]
REFERENCE   2
  AUTHORS   Camus J.C., Pryor M.J., Medigue C., Cole S.T.
  TITLE     Re-annotation of the genome sequence of Mycobacterium
            tuberculosis H37Rv
  JOURNAL   Microbiology (Reading, Engl.) 148(Pt 10), 2967-2973(2002).
  PUBMED    12368430
REFERENCE   3
  AUTHORS   Lew J.M., Kapopoulou A., Jones L.M., Cole S.T.
  TITLE     TubercuList--10 years after
  JOURNAL   Tuberculosis (Edinb) 91(1), 1-7(2011).
  PUBMED    20980199
REFERENCE   4  (bases 1 to 4411529)
  AUTHORS   Parkhill J.
  JOURNAL   Submitted (11-JUN-1998) to the INSDC.
            Submitted on behalf of the Mycobacterium tuberculosis sequencing
            and mapping teams, Sanger Centre, Wellcome Trust Genome Campus,
            Hinxton, Cambridge CB10 1SA
            Unite de Genetique Moleculaire Bacterienne, Institut Pasteur,
            28 rue du Docteur Roux, 75724 Paris Cedex 15, France
            E-mail: parkhill@sanger.ac.uk
REFERENCE   5  (bases 1 to 4411532)
  AUTHORS   Lew J.M.
  JOURNAL   Submitted (18-DEC-2012) to the INSDC.
            Lew J., Ecole Polytechnique Federale de Lausanne, CH-1015,
            Lausanne, Switzerland, and the Swiss Institute of
            Bioinformatics, CMU - Rue Michel-Servet 1, 1211 Geneva 4,
            SWITZERLAND
COMMENT     On or before Feb 1, 2013 this sequence version replaced
            gi:41352722, gi:38490165, gi:38490207, gi:41353619, gi:38490250,
            gi:38684030, gi:38490288, gi:41353667, gi:41353422, gi:41352756,
            gi:38490319, gi:41352785, gi:38490370, gi:41353971.
            Note: This annotation is from the TubercuList website, Release
            26, Dec 2012 (URL: http://tuberculist.epfl.ch) (email:
            tuberculist@epfl.ch).
FEATURES             Qualifiers
     source          /organism="Mycobacterium tuberculosis H37Rv"
                     /strain="H37Rv"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:83332"
     protein         /transl_table=11
                     /locus_tag="Rv2631"
                     /note="Rv2631, (MTCY441.01, MTCY01A10.01c), len: 432 aa.
                     Conserved hypothetical protein, highly similar to several
                     conserved hypothetical proteins from various species e.g.
                     O29399|AF0862 conserved hypothetical protein from
                     Archaeoglobus fulgidus (482 aa), FASTA scores: opt:
                     1496,E(): 2.1e-80, (52.3% identity in 432 aa overlap)
                     (has its N-terminus longer 30 aa); O27634|MTH1597
                     conserved protein from Methanothermobacter
                     thermautotrophicus (488 aa), FASTA scores: opt: 1428,
                     E(): 2.1e-76, (50.9% identity in 432 aa overlap);
                     Q9YB37|APE1758 hypothetical 53.7 KDA protein APE1758 from
                     Aeropyrum pernix (483 aa), FASTA scores: opt: 1422, E():
                     4.6e-76, (49.3% identity in 432 aa overlap) (has its
                     N-terminus longer 30 aa); etc. Equivalent to AAK47022
                     from Mycobacterium tuberculosis strain CDC1551 (432 aa).
                     3' part extended since first submission (+175 aa)."
                     /db_xref="EnsemblGenomes-Gn:Rv2631"
                     /db_xref="EnsemblGenomes-Tr:CCP45429"
                     /db_xref="GOA:P9WGW5"
                     /db_xref="InterPro:IPR001233"
                     /db_xref="InterPro:IPR036025"
                     /db_xref="UniProtKB/Swiss-Prot:P9WGW5"
                     /experiment="EXISTENCE: identified in proteomics study"
BEGIN
        1 MQVVNVATLP GIVRASYAMP DVHWGYGFPI GGVAATDVDN DGVVSPGGVG FDISCGVRLL
       61 VGEGLDREEL QPRLPAVMDR LDRAIPRGVG TAGVWRLPDR NTLQEVLTGG ARFAVEQGHG
      121 VALDLERCED GGVMTGADAA KISDRALQRG LGQIGSLGSG NHFLEVQAVD RVYDPVAAAP
      181 MGLAEGTVCV MIHTGSRGLG HQICTDHVRQ MEQAMGRYGI AVPDRQLACV PVHSPDGQAY
      241 LAAMAAAANY GRANRQLLTE ATRRVFADAT GTPLDLLYDV SHNLAKIETH PIDGQLRSVC
      301 VHRKGATRSL PPHHHELPAE LAAVGQPVLI PGTMGTASYV LAGVTGNPAF FSTAHGAGRV
      361 LSRHQAARHT SGEAIRASLA KRGIIVRGTS RRGIAEEKPE AYKDVDEVIE ASHQSGLARK
      421 VARLVPLGCV KG
//