LOCUS CCP45962.1 806 aa PRT BCT 27-FEB-2015 DEFINITION Mycobacterium tuberculosis H37Rv Probable NADH dehydrogenase I (chain G) NuoG (NADH-ubiquinone oxidoreductase chain G) protein. ACCESSION AL123456-3240 PROTEIN_ID CCP45962.1 SOURCE Mycobacterium tuberculosis H37Rv ORGANISM Mycobacterium tuberculosis H37Rv Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae; Mycobacterium; Mycobacterium tuberculosis complex. REFERENCE 1 AUTHORS Cole S.T., Brosch R., Parkhill J., Garnier T., Churcher C., Harris D., Gordon S.V., Eiglmeier K., Gas S., Barry C.E.III., Tekaia F., Badcock K., Basham D., Brown D., Chillingworth T., Connor R., Davies R., Devlin K., Feltwell T., Gentles S., Hamlin N., Holroyd S., Hornsby T., Jagels K., Krogh A., McLean J., Moule S., Murphy L., Oliver K., Osborne J., Quail M.A., Rajandream M.A., Rogers J., Rutter S., Seeger K., Skelton J., Squares R., Squares S., Sulston J.E., Taylor K., Whitehead S., Barrell B.G. TITLE Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence JOURNAL Nature 393(6685), 537-544(1998). PUBMED 9634230 REMARK Erratum:[Nature 1998 Nov 12;396(6707):190] REFERENCE 2 AUTHORS Camus J.C., Pryor M.J., Medigue C., Cole S.T. TITLE Re-annotation of the genome sequence of Mycobacterium tuberculosis H37Rv JOURNAL Microbiology (Reading, Engl.) 148(Pt 10), 2967-2973(2002). PUBMED 12368430 REFERENCE 3 AUTHORS Lew J.M., Kapopoulou A., Jones L.M., Cole S.T. TITLE TubercuList--10 years after JOURNAL Tuberculosis (Edinb) 91(1), 1-7(2011). PUBMED 20980199 REFERENCE 4 (bases 1 to 4411529) AUTHORS Parkhill J. JOURNAL Submitted (11-JUN-1998) to the INSDC. Submitted on behalf of the Mycobacterium tuberculosis sequencing and mapping teams, Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA Unite de Genetique Moleculaire Bacterienne, Institut Pasteur, 28 rue du Docteur Roux, 75724 Paris Cedex 15, France E-mail: parkhill@sanger.ac.uk REFERENCE 5 (bases 1 to 4411532) AUTHORS Lew J.M. JOURNAL Submitted (18-DEC-2012) to the INSDC. Lew J., Ecole Polytechnique Federale de Lausanne, CH-1015, Lausanne, Switzerland, and the Swiss Institute of Bioinformatics, CMU - Rue Michel-Servet 1, 1211 Geneva 4, SWITZERLAND COMMENT On or before Feb 1, 2013 this sequence version replaced gi:41352722, gi:38490165, gi:38490207, gi:41353619, gi:38490250, gi:38684030, gi:38490288, gi:41353667, gi:41353422, gi:41352756, gi:38490319, gi:41352785, gi:38490370, gi:41353971. Note: This annotation is from the TubercuList website, Release 26, Dec 2012 (URL: http://tuberculist.epfl.ch) (email: tuberculist@epfl.ch). FEATURES Qualifiers source /organism="Mycobacterium tuberculosis H37Rv" /strain="H37Rv" /mol_type="genomic DNA" /db_xref="taxon:83332" protein /transl_table=11 /gene="nuoG" /locus_tag="Rv3151" /note="Rv3151, (MTCY03A2.07c), len: 806 aa. Probable nuoG,NADH dehydrogenase I, chain G, similar to others e.g. Q9XAR0|NUOG_STRCO from Streptomyces coelicolor (843 aa),FASTA scores: opt: 1968 ,E(): 5.2e-107, (62.45% identity in 818 aa overlap); P56914|NUG2_RHIME from Rhizobium meliloti (853 aa), FASTA scores: opt: 964, E(): 1.6e-48, (30.6% identity in 840 aa overlap); etc. But also similarity with other proteins e.g. P77908|FDHA formate dehydrogenase,alpha subunit (formate dehydrogenase [NADP+]) from Moorella thermoacetica (Clostridium thermoaceticum) (893 aa), FASTA scores: opt: 928, E(): 2e-46, (28.65% identity in 865 aa overlap); and Q9UUU3|NUAM NUAM protein precursor from Yarrowia lipolytica (Candida lipolytica) (728 aa), FASTA scores: opt: 894, E(): 1.7e-44, (31.95% identity in 676 aa overlap). Equivalent to AAK47578 from Mycobacterium tuberculosis strain CDC1551 but longer 15 aa. Contains respiratory-chain NADH dehydrogenase 75 kDa subunit signature 2 (PS00642). Belongs to the complex I 75 KDA subunit family. Cofactor: may bind two 4FE-4S cluster and one 2FE-2S cluster." /db_xref="EnsemblGenomes-Gn:Rv3151" /db_xref="EnsemblGenomes-Tr:CCP45962" /db_xref="GOA:P9WIV9" /db_xref="InterPro:IPR000283" /db_xref="InterPro:IPR001041" /db_xref="InterPro:IPR006656" /db_xref="InterPro:IPR006657" /db_xref="InterPro:IPR006963" /db_xref="InterPro:IPR009010" /db_xref="InterPro:IPR010228" /db_xref="InterPro:IPR019574" /db_xref="InterPro:IPR036010" /db_xref="UniProtKB/Swiss-Prot:P9WIV9" /inference="protein motif:PROSITE:PS00642" /experiment="EXISTENCE: identified in proteomics study" BEGIN 1 MTQAADTDIR VGQPEMVTLT IDGVEISVPK GTLVIRAAEL MGIQIPRFCD HPLLEPVGAC 61 RQCLVEVEGQ RKPLASCTTV ATDDMVVRTQ LTSEIADKAQ HGVMELLLIN HPLDCPMCDK 121 GGECPLQNQA MSNGRTDSRF TEAKRTFAKP INISAQVLLD RERCILCARC TRFSDQIAGD 181 PFIDMQERGA LQQVGIYADE PFESYFSGNT VQICPVGALT GTAYRFRARP FDLVSSPSVC 241 EHCASGCAQR TDHRRGKVLR RLAGDDPEVN EEWNCDKGRW AFTYATQPDV ITTPLIRDGG 301 DPKGALVPTS WSHAMAVAAQ GLAAARGRTG VLVGGRVTWE DAYAYAKFAR ITLGTNDIDF 361 RARPHSAEEA DFLAARIAGR HMAVSYADLE SAPVVLLVGF EPEDESPIVF LRLRKAARRH 421 RVPVYTIAPF ATGGLHKMSG RLIKTVPGGE PAALDDLATG AVGDLLATPG AVIIVGERLA 481 TVPGGLSAAA RLADTTGARL AWVPRRAGER GALEAGALPT LLPGGRPLAD EVARAQVCAA 541 WHIAELPAAA GRDADGILAA AADETLAALL VGGIEPADFA DPDAVLAALD ATGFVVSLEL 601 RHSTVTERAD VVFPVAPTTQ KAGAFVNWEG RYRTFEPALR GSTLQAGQSD HRVLDALADD 661 MGVHLGVPTV EAAREELAAL GIWDGKHAAG PHIAATGPTQ PEAGEAILTG WRMLLDEGRL 721 QDGEPYLAGT ARTPVVRLSP DTAAEIGAAD GEAVTVSTSR GSITLPCSVT DMPDRVVWLP 781 LNSAGSTVHR QLRVTIGSIV KIGAGS //