LOCUS       CCP45962.1               806 aa    PRT              BCT 27-FEB-2015
DEFINITION  Mycobacterium tuberculosis H37Rv Probable NADH dehydrogenase
            I (chain G) NuoG (NADH-ubiquinone oxidoreductase chain
            G) protein.
ACCESSION   AL123456-3240
PROTEIN_ID  CCP45962.1
SOURCE      Mycobacterium tuberculosis H37Rv
  ORGANISM  Mycobacterium tuberculosis H37Rv
            Bacteria; Actinobacteria; Corynebacteriales; Mycobacteriaceae;
            Mycobacterium; Mycobacterium tuberculosis complex.
REFERENCE   1
  AUTHORS   Cole S.T., Brosch R., Parkhill J., Garnier T., Churcher C.,
            Harris D., Gordon S.V., Eiglmeier K., Gas S., Barry C.E.III.,
            Tekaia F., Badcock K., Basham D., Brown D., Chillingworth T.,
            Connor R., Davies R., Devlin K., Feltwell T., Gentles S.,
            Hamlin N., Holroyd S., Hornsby T., Jagels K., Krogh A., McLean J.,
            Moule S., Murphy L., Oliver K., Osborne J., Quail M.A.,
            Rajandream M.A., Rogers J., Rutter S., Seeger K., Skelton J.,
            Squares R., Squares S., Sulston J.E., Taylor K., Whitehead S.,
            Barrell B.G.
  TITLE     Deciphering the biology of Mycobacterium tuberculosis from the
            complete genome sequence
  JOURNAL   Nature 393(6685), 537-544(1998).
   PUBMED   9634230
  REMARK    Erratum:[Nature 1998 Nov 12;396(6707):190]
REFERENCE   2
  AUTHORS   Camus J.C., Pryor M.J., Medigue C., Cole S.T.
  TITLE     Re-annotation of the genome sequence of Mycobacterium tuberculosis
            H37Rv
  JOURNAL   Microbiology (Reading, Engl.) 148(Pt 10), 2967-2973(2002).
   PUBMED   12368430
REFERENCE   3
  AUTHORS   Lew J.M., Kapopoulou A., Jones L.M., Cole S.T.
  TITLE     TubercuList--10 years after
  JOURNAL   Tuberculosis (Edinb) 91(1), 1-7(2011).
   PUBMED   20980199
REFERENCE   4  (bases 1 to 4411529)
  AUTHORS   Parkhill J.
  JOURNAL   Submitted (11-JUN-1998) to the INSDC. Submitted on behalf of the
            Mycobacterium tuberculosis sequencing and mapping teams, Sanger
            Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA;
            Unite de Genetique Moleculaire Bacterienne, Institut Pasteur, 28
            rue du Docteur Roux, 75724 Paris Cedex 15, France. E-mail:
            parkhill@sanger.ac.uk
REFERENCE   5  (bases 1 to 4411532)
  AUTHORS   Lew J.M.
  JOURNAL   Submitted (18-DEC-2012) to the INSDC. Lew J., Ecole Polytechnique
            Federale de Lausanne, CH-1015, Lausanne, Switzerland, and the Swiss
            Institute of Bioinformatics, CMU - Rue Michel-Servet 1, 1211 Geneva
            4, SWITZERLAND
COMMENT     On or before Feb 1, 2013 this sequence version replaced
            gi:41352722, gi:38490165, gi:38490207, gi:41353619, gi:38490250,
            gi:38684030, gi:38490288, gi:41353667, gi:41353422, gi:41352756,
            gi:38490319, gi:41352785, gi:38490370, gi:41353971.
            Note:
            This annotation is from the TubercuList website, Release 26, Dec
            2012 (URL: http://tuberculist.epfl.ch) (email:
            tuberculist@epfl.ch).
FEATURES             Qualifiers
     source          /organism="Mycobacterium tuberculosis H37Rv"
                     /strain="H37Rv"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:83332"
     protein         /transl_table=11
                     /gene="nuoG"
                     /locus_tag="Rv3151"
                     /note="Rv3151, (MTCY03A2.07c), len: 806 aa. Probable
                     nuoG, NADH dehydrogenase I, chain G, similar to others
                     e.g. Q9XAR0|NUOG_STRCO from Streptomyces coelicolor (843
                     aa), FASTA scores: opt: 1968, E(): 5.2e-107, (62.45%
                     identity in 818 aa overlap); P56914|NUG2_RHIME from
                     Rhizobium meliloti (853 aa), FASTA scores: opt: 964, E():
                     1.6e-48, (30.6% identity in 840 aa overlap); etc. Also
                     similar to other proteins, e.g. P77908|FDHA formate
                     dehydrogenase, alpha subunit (formate dehydrogenase
                     [NADP+]) from Moorella thermoacetica (Clostridium
                     thermoaceticum) (893 aa), FASTA scores: opt: 928, E():
                     2e-46, (28.65% identity in 865 aa overlap); and
                     Q9UUU3|NUAM NUAM protein precursor from Yarrowia
                     lipolytica (Candida lipolytica) (728 aa), FASTA scores:
                     opt: 894, E(): 1.7e-44, (31.95% identity in 676 aa
                     overlap). Equivalent to AAK47578 from Mycobacterium
                     tuberculosis strain CDC1551 but 15 aa longer. Contains
                     respiratory-chain NADH dehydrogenase 75 kDa subunit
                     signature 2 (PS00642). Belongs to the complex I 75 kDa
                     subunit family. Cofactor: may bind two 4Fe-4S clusters
                     and one 2Fe-2S cluster."
                     /db_xref="EnsemblGenomes-Gn:Rv3151"
                     /db_xref="EnsemblGenomes-Tr:CCP45962"
                     /db_xref="GOA:P9WIV9"
                     /db_xref="InterPro:IPR000283"
                     /db_xref="InterPro:IPR001041"
                     /db_xref="InterPro:IPR006656"
                     /db_xref="InterPro:IPR006657"
                     /db_xref="InterPro:IPR006963"
                     /db_xref="InterPro:IPR009010"
                     /db_xref="InterPro:IPR010228"
                     /db_xref="InterPro:IPR019574"
                     /db_xref="InterPro:IPR036010"
                     /db_xref="UniProtKB/Swiss-Prot:P9WIV9"
                     /inference="protein motif:PROSITE:PS00642"
                     /experiment="EXISTENCE: identified in proteomics study"
BEGIN
        1 MTQAADTDIR VGQPEMVTLT IDGVEISVPK GTLVIRAAEL MGIQIPRFCD HPLLEPVGAC
       61 RQCLVEVEGQ RKPLASCTTV ATDDMVVRTQ LTSEIADKAQ HGVMELLLIN HPLDCPMCDK
      121 GGECPLQNQA MSNGRTDSRF TEAKRTFAKP INISAQVLLD RERCILCARC TRFSDQIAGD
      181 PFIDMQERGA LQQVGIYADE PFESYFSGNT VQICPVGALT GTAYRFRARP FDLVSSPSVC
      241 EHCASGCAQR TDHRRGKVLR RLAGDDPEVN EEWNCDKGRW AFTYATQPDV ITTPLIRDGG
      301 DPKGALVPTS WSHAMAVAAQ GLAAARGRTG VLVGGRVTWE DAYAYAKFAR ITLGTNDIDF
      361 RARPHSAEEA DFLAARIAGR HMAVSYADLE SAPVVLLVGF EPEDESPIVF LRLRKAARRH
      421 RVPVYTIAPF ATGGLHKMSG RLIKTVPGGE PAALDDLATG AVGDLLATPG AVIIVGERLA
      481 TVPGGLSAAA RLADTTGARL AWVPRRAGER GALEAGALPT LLPGGRPLAD EVARAQVCAA
      541 WHIAELPAAA GRDADGILAA AADETLAALL VGGIEPADFA DPDAVLAALD ATGFVVSLEL
      601 RHSTVTERAD VVFPVAPTTQ KAGAFVNWEG RYRTFEPALR GSTLQAGQSD HRVLDALADD
      661 MGVHLGVPTV EAAREELAAL GIWDGKHAAG PHIAATGPTQ PEAGEAILTG WRMLLDEGRL
      721 QDGEPYLAGT ARTPVVRLSP DTAAEIGAAD GEAVTVSTSR GSITLPCSVT DMPDRVVWLP
      781 LNSAGSTVHR QLRVTIGSIV KIGAGS
//