LOCUS       WQZ29727.1              1037 aa    PRT              BCT 29-DEC-2023
DEFINITION  Helicobacter pylori efflux RND transporter permease subunit
            protein.
ACCESSION   CP079244-1220
PROTEIN_ID  WQZ29727.1
SOURCE      Helicobacter pylori
  ORGANISM  Helicobacter pylori
            Bacteria; Campylobacterota; Epsilonproteobacteria;
            Campylobacterales; Helicobacteraceae; Helicobacter.
REFERENCE   1  (bases 1 to 1570870)
  AUTHORS   Thorell,K., Munoz-Ramirez,Z.Y., Wang,D., Sandoval-Motta,S., Boscolo
            Agostini,R., Ghirotto,S., Torres,R.C., Falush,D., Camargo,M.C. and
            Rabkin,C.S.
  CONSRTM   HpGP Research Network
  TITLE     The Helicobacter pylori Genome Project: insights into H. pylori
            population structure from analysis of a worldwide collection of
            complete genomes
  JOURNAL   Nat Commun 14 (1), 8184 (2023)
   PUBMED   38081806
  REMARK    Publication Status: Online-Only
REFERENCE   2  (bases 1 to 1570870)
  AUTHORS   Camargo,M.C. and Rabkin,C.S.
  TITLE     Direct Submission
  JOURNAL   Submitted (15-JUL-2021) IIB, National Cancer Institute, 9609
            Medical Center Dr., Rm. 6E110, Bethesda, MD 20892, USA
COMMENT     The annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (PGAP). Information about PGAP can be found here:
            https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            
            ##Genome-Assembly-Data-START##
            Assembly Method        :: HGAP v. 4
            Assembly Name          :: HpGP-TWN-021
            Genome Representation  :: Full
            Expected Final Version :: Yes
            Genome Coverage        :: 2661x
            Sequencing Technology  :: PacBio Sequel II
            ##Genome-Assembly-Data-END##
            
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 07/16/2021 08:12:09
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline (PGAP)
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS-2+
            Annotation Software revision      :: 5.2
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 1,497
            CDSs (total)                      :: 1,452
            Genes (coding)                    :: 1,391
            CDSs (with protein)               :: 1,391
            Genes (RNA)                       :: 45
            rRNAs                             :: 2, 2, 2 (5S, 16S, 23S)
            complete rRNAs                    :: 2, 2, 2 (5S, 16S, 23S)
            tRNAs                             :: 36
            ncRNAs                            :: 3
            Pseudo Genes (total)              :: 61
            CDSs (without protein)            :: 61
            Pseudo Genes (ambiguous residues) :: 0 of 61
            Pseudo Genes (frameshifted)       :: 44 of 61
            Pseudo Genes (incomplete)         :: 11 of 61
            Pseudo Genes (internal stop)      :: 18 of 61
            Pseudo Genes (multiple problems)  :: 12 of 61
            ##Genome-Annotation-Data-END##
FEATURES             Qualifiers
     source          /organism="Helicobacter pylori"
                     /mol_type="genomic DNA"
                     /strain="HpGP-TWN-021"
                     /isolation_source="Biopsy"
                     /host="Homo sapiens"
                     /db_xref="taxon:210"
                     /geo_loc_name="Taiwan"
                     /lat_lon="23.30 N 121.00 E"
                     /collected_by="Maria Camargo and Charles Rabkin"
     protein         /locus_tag="E5P95_06305"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_000570388.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /transl_table=11
BEGIN
        1 MIEKIIDLSV KNKLLTTLVT LLIFLASLWA IKSVRLDALP DLSPAQVVVQ ITYPNQSPKI
       61 VQEQVTYPLV STFMSIANID TVRGISSYES GLIYIIFKDG VNLYWARDRV LEQLNRVSNL
      121 PKDAKVEIGS DSTSIGWAYQ YALSSGSKNL SDLKVLQDFY YRYALLGVDG VSEVASVGGF
      181 VKDYEVTLQN DSLIRYNLSL EQVANAIKNS NNDTGGGVIL ENGFEKIIRS HGYIQSLKDL
      241 EEIVVKKEGA IPLKIKDIAS VRLVPKPRRG AANLNGDKEV VGGIVMVRYH ADTYKVLKAI
      301 KEKIATLQAS NPDVKITSVY DRSELIEKGI DNLIHTLIEE SVIVLVIIAI FLLHFRSALV
      361 VIITLPLSVC ISFLLMRYFN IEASIMSLGG IAIAIGAMVD AAIVMVENAH KHLQHIDTKD
      421 NAQRVNAIMQ GVKHVGGAIF FALMIIVVSF LPIFALTGQE EKLFAPLAYT KTFAMLVGAL
      481 LSITIVPVLM VWLIKGRILE ESKNPINAFF MKIYGVSLKV VLKFRYAFLI ASVLGLGGLY
      541 LAYKKLNWEF IPQINEGVVM YMPVTINGVG IDTALEYLKK SNSAIKRLDF VKQVFGKVGR
      601 ANTSTDAAGL SMIETYIELK PQNEWKEKLS YKEVRDKLEK TLQLKGLTNS WTYPIRGRTD
      661 MLLTGIRTPL GIKLYGNDTD KLQELAILME QQLKTLKESL SVFAERSNNG YYITLDLNDE
      721 NLARYGINKN AVLDTIKFAL GGATLTTMIK GVENYPISLR LEDTERNTIE KLQNLYVKTA
      781 YNYMPLRELA HVYYDNSPAV LKSEKGLNVN FIYIVPQNGI SSDTYRQLAK KALEKIQLPS
      841 GYYYEFSDES QYLEEAFKTL QYIVPVSVFI IFILIVFALK NFTNSLLCFF TLPFAFLGGL
      901 IFMNIMGFNM SVAALVGFLA LLGVASETAI VMIIYLEDAF QKFIKIPLKE QNSAALKEAI
      961 MHGAVLRVRP KLMTFFSILA SLIPIMYSHG TGSEIMKSIA APMLGGMISS VVLTLFIIPT
     1021 AYFVIKNARV KSNQTSF
//