LOCUS CCP45190.1 356 aa PRT BCT 27-FEB-2015 DEFINITION Mycobacterium tuberculosis H37Rv Probable sulfate-binding lipoprotein SubI protein. ACCESSION AL123456-2468 PROTEIN_ID CCP45190.1 SOURCE Mycobacterium tuberculosis H37Rv ORGANISM Mycobacterium tuberculosis H37Rv Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae; Mycobacterium; Mycobacterium tuberculosis complex. REFERENCE 1 AUTHORS Cole S.T., Brosch R., Parkhill J., Garnier T., Churcher C., Harris D., Gordon S.V., Eiglmeier K., Gas S., Barry C.E.III., Tekaia F., Badcock K., Basham D., Brown D., Chillingworth T., Connor R., Davies R., Devlin K., Feltwell T., Gentles S., Hamlin N., Holroyd S., Hornsby T., Jagels K., Krogh A., McLean J., Moule S., Murphy L., Oliver K., Osborne J., Quail M.A., Rajandream M.A., Rogers J., Rutter S., Seeger K., Skelton J., Squares R., Squares S., Sulston J.E., Taylor K., Whitehead S., Barrell B.G. TITLE Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence JOURNAL Nature 393(6685), 537-544(1998). PUBMED 9634230 REMARK Erratum:[Nature 1998 Nov 12;396(6707):190] REFERENCE 2 AUTHORS Camus J.C., Pryor M.J., Medigue C., Cole S.T. TITLE Re-annotation of the genome sequence of Mycobacterium tuberculosis H37Rv JOURNAL Microbiology (Reading, Engl.) 148(Pt 10), 2967-2973(2002). PUBMED 12368430 REFERENCE 3 AUTHORS Lew J.M., Kapopoulou A., Jones L.M., Cole S.T. TITLE TubercuList--10 years after JOURNAL Tuberculosis (Edinb) 91(1), 1-7(2011). PUBMED 20980199 REFERENCE 4 (bases 1 to 4411529) AUTHORS Parkhill J. JOURNAL Submitted (11-JUN-1998) to the INSDC. Submitted on behalf of the Mycobacterium tuberculosis sequencing and mapping teams, Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA Unite de Genetique Moleculaire Bacterienne, Institut Pasteur, 28 rue du Docteur Roux, 75724 Paris Cedex 15, France E-mail: parkhill@sanger.ac.uk REFERENCE 5 (bases 1 to 4411532) AUTHORS Lew J.M. JOURNAL Submitted (18-DEC-2012) to the INSDC. Lew J., Ecole Polytechnique Federale de Lausanne, CH-1015, Lausanne, Switzerland, and the Swiss Institute of Bioinformatics, CMU - Rue Michel-Servet 1, 1211 Geneva 4, SWITZERLAND COMMENT On or before Feb 1, 2013 this sequence version replaced gi:41352722, gi:38490165, gi:38490207, gi:41353619, gi:38490250, gi:38684030, gi:38490288, gi:41353667, gi:41353422, gi:41352756, gi:38490319, gi:41352785, gi:38490370, gi:41353971. Note: This annotation is from the TubercuList website, Release 26, Dec 2012 (URL: http://tuberculist.epfl.ch) (email: tuberculist@epfl.ch). FEATURES Qualifiers source /organism="Mycobacterium tuberculosis H37Rv" /strain="H37Rv" /mol_type="genomic DNA" /db_xref="taxon:83332" protein /transl_table=11 /gene="subI" /locus_tag="Rv2400c" /note="Rv2400c, (MTCY253.21), len: 356 aa. Probable subI,sulfate-binding lipoprotein component of sulfate transport system (see citations below), equivalent to Q9CCN3|SUBI|ML0615 (alias Q49748|B1937_F1_11, 358 aa) putative sulphate-binding protein from Mycobacterium leprae (348 aa), FASTA scores: opt: 1775, E(): 2.3e-102, (76.45% identity in 340 aa overlap). Also similar to others and other substrate-binding proteins e.g. P27366|SUBI_SYNP7|SBPA sulfate-binding protein precursor from Synechococcus sp. strain PCC 7942 (Anacystis nidulans R2) (350 aa), FASTA scores: opt: 703, E(): 4.6e-36, (35.6% identity in 351 aa overlap); Q9I6K7|SBP|PA0283 sulfate-binding protein precursor from Pseudomonas aeruginosa (332 aa), FASTA scores: opt: 591, E(): 3.7e-29,(36.9% identity in 317 aa overlap); CAC49112|SMB21133 putative sulfate uptake ABC transporter periplasmic solute-binding protein precursor from Rhizobium meliloti (Sinorhizobium meliloti) (341 aa), FASTA scores: opt: 569,E(): 8.8e-28, (36.15% identity in 321 aa overlap); etc. Belongs to the prokaryotic sulfate binding protein family." /db_xref="EnsemblGenomes-Gn:Rv2400c" /db_xref="EnsemblGenomes-Tr:CCP45190" /db_xref="GOA:P71744" /db_xref="InterPro:IPR005669" /db_xref="PDB:6DDN" /db_xref="UniProtKB/TrEMBL:P71744" /experiment="EXISTENCE: identified in proteomics study" BEGIN 1 MLSLTLSEAS CIASASRWRH IIPAGVVCAL IAGIGVGCHG GPSDVVGRAG PDRAHTSITL 61 VAYAVPEPGW SAVIPAFNAS EQGRGVQVIT SYGASADQSR GVADGKPADL VNFSVEPDIA 121 RLVKAGKVDK DWDADATKGI PFGSVVTFVV RAGNPKNIRD WDDLLRPGIE VITPSPLSSG 181 SAKWNLLAPY AAKSDGGRNN QAGIDFVNTL VNEHVKLRPG SGREATDVFV QGSGDVLISY 241 ENEAIATERA GKPVQHVTPP QTFKIENPLA VVATSTHLGA ATAFRNFQYT VQAQKLWAQA 301 GFRPVDPAVA ADFADLFPVP AKLWTIADLG GWGSVDPQLF DKATGSITKI YLRATG //