LOCUS       CCP46343.1               303 aa    PRT    BCT 27-FEB-2015
DEFINITION  Mycobacterium tuberculosis H37Rv Conserved hypothetical protein
            protein.
ACCESSION   AL123456-3621
PROTEIN_ID  CCP46343.1
SOURCE      Mycobacterium tuberculosis H37Rv
  ORGANISM  Mycobacterium tuberculosis H37Rv
            Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae;
            Mycobacterium; Mycobacterium tuberculosis complex.
REFERENCE   1
  AUTHORS   Cole S.T., Brosch R., Parkhill J., Garnier T., Churcher C.,
            Harris D., Gordon S.V., Eiglmeier K., Gas S., Barry C.E.III.,
            Tekaia F., Badcock K., Basham D., Brown D., Chillingworth T.,
            Connor R., Davies R., Devlin K., Feltwell T., Gentles S.,
            Hamlin N., Holroyd S., Hornsby T., Jagels K., Krogh A.,
            McLean J., Moule S., Murphy L., Oliver K., Osborne J.,
            Quail M.A., Rajandream M.A., Rogers J., Rutter S., Seeger K.,
            Skelton J., Squares R., Squares S., Sulston J.E., Taylor K.,
            Whitehead S., Barrell B.G.
  TITLE     Deciphering the biology of Mycobacterium tuberculosis from the
            complete genome sequence
  JOURNAL   Nature 393(6685), 537-544(1998).
  PUBMED    9634230
  REMARK    Erratum:[Nature 1998 Nov 12;396(6707):190]
REFERENCE   2
  AUTHORS   Camus J.C., Pryor M.J., Medigue C., Cole S.T.
  TITLE     Re-annotation of the genome sequence of Mycobacterium
            tuberculosis H37Rv
  JOURNAL   Microbiology (Reading, Engl.) 148(Pt 10), 2967-2973(2002).
  PUBMED    12368430
REFERENCE   3
  AUTHORS   Lew J.M., Kapopoulou A., Jones L.M., Cole S.T.
  TITLE     TubercuList--10 years after
  JOURNAL   Tuberculosis (Edinb) 91(1), 1-7(2011).
  PUBMED    20980199
REFERENCE   4  (bases 1 to 4411529)
  AUTHORS   Parkhill J.
  JOURNAL   Submitted (11-JUN-1998) to the INSDC.
            Submitted on behalf of the Mycobacterium tuberculosis
            sequencing and mapping teams, Sanger Centre, Wellcome Trust
            Genome Campus, Hinxton, Cambridge CB10 1SA Unite de Genetique
            Moleculaire Bacterienne, Institut Pasteur, 28 rue du Docteur
            Roux, 75724 Paris Cedex 15, France E-mail:
            parkhill@sanger.ac.uk
REFERENCE   5  (bases 1 to 4411532)
  AUTHORS   Lew J.M.
  JOURNAL   Submitted (18-DEC-2012) to the INSDC.
            Lew J., Ecole Polytechnique Federale de Lausanne, CH-1015,
            Lausanne, Switzerland, and the Swiss Institute of
            Bioinformatics, CMU - Rue Michel-Servet 1, 1211 Geneva 4,
            SWITZERLAND
COMMENT     On or before Feb 1, 2013 this sequence version replaced
            gi:41352722, gi:38490165, gi:38490207, gi:41353619,
            gi:38490250, gi:38684030, gi:38490288, gi:41353667,
            gi:41353422, gi:41352756, gi:38490319, gi:41352785,
            gi:38490370, gi:41353971.
            Note: This annotation is from the TubercuList website,
            Release 26, Dec 2012 (URL: http://tuberculist.epfl.ch)
            (email: tuberculist@epfl.ch).
FEATURES             Qualifiers
     source          /organism="Mycobacterium tuberculosis H37Rv"
                     /strain="H37Rv"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:83332"
     protein         /transl_table=11
                     /locus_tag="Rv3521"
                     /note="Rv3521, (MTV023.28), len: 303 aa. Conserved
                     hypothetical protein, similar to (although longer
                     than) other conserved hypothetical proteins e.g.
                     O29296|AF0966 from Archaeoglobus fulgidus (176 aa),
                     FASTA scores: opt: 286, E(): 5.4e-11, (31.15% identity
                     in 170 aa overlap); O30036|AF0203 from Archaeoglobus
                     fulgidus (149 aa) FASTA scores: opt: 259, E(): 2.3e-09,
                     (33.8% identity in 142 aa overlap); O29297|AF0965 from
                     Archaeoglobus fulgidus (154 aa), FASTA scores: opt:
                     241, E(): 3.2e-08, (31.4% identity in 137 aa overlap);
                     Q9Y995|APE2390 from Aeropyrum pernix (157 aa), FASTA
                     scores: opt: 204, E(): 6.8e-06, (27.45% identity in 153
                     aa overlap); BAB60424|TVG1322512 from Thermoplasma
                     volcanium (164 aa), FASTA scores: opt: 183,E():
                     0.00015, (29.75% identity in 148 aa overlap); etc.
                     Equivalent to AAK47982 from Mycobacterium tuberculosis
                     strain CDC1551 (334 aa) but shorter 31 aa."
                     /db_xref="EnsemblGenomes-Gn:Rv3521"
                     /db_xref="EnsemblGenomes-Tr:CCP46343"
                     /db_xref="InterPro:IPR002878"
                     /db_xref="InterPro:IPR012340"
                     /db_xref="UniProtKB/TrEMBL:O53566"
                     /experiment="EXISTENCE: identified in proteomics study"
BEGIN
        1 MGPTLSRFFT ALRARRIVGV RGSDGRVHVP PVEYDPVTYE PLSEMVPVSS VGTVASWTWQ
       61 PEPLAGQPLD RPFAWALIKL DGADTLLMHA VDVGTAGPSA IHTGARVHAH WADQPVGAIT
      121 DIACFALGET AEPVAAHKTE DARDPVTMIV TPIQLEIQHT ASHEESAYLR AIAQGKLVGA
      181 RTGKTGKVYF PPHGADPATG KPTSEFVELP DKGTVTTFAI VNIPFLGQRI KPPYVAAYVL
      241 LDGADIPFLH LVSDVDAHQV RMGMRVEAVW KPRERWGLGI DNIEYFRPTG EPDANYDTYK
      301 HHL
//