LOCUS       QLY32249.1               679 aa    PRT              BCT 27-JUL-2020
DEFINITION  Nocardia huaxiensis CocE/NonD family hydrolase protein.
ACCESSION   CP059399-1630
PROTEIN_ID  QLY32249.1
SOURCE      Nocardia huaxiensis
  ORGANISM  Nocardia huaxiensis
            Bacteria; Actinobacteria; Corynebacteriales; Nocardiaceae;
            Nocardia.
REFERENCE   1  (bases 1 to 8339910)
  AUTHORS   Zhuang,K. and Ran,Y.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-JUL-2020) Dermatovenereology, West China Hospital,
            Sichuan University, Guoxue Xiang, Wuhou District, Chengdu, Sichuan
            610041, China
COMMENT     The annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (PGAP). Information about PGAP can be found here:
            https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            
            ##Genome-Assembly-Data-START##
            Assembly Method        :: FALCON v. AUGUST-2019
            Genome Representation  :: Full
            Expected Final Version :: No
            Genome Coverage        :: 250.0x
            Sequencing Technology  :: PacBio Sequel
            ##Genome-Assembly-Data-END##
            
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 07/22/2020 23:23:33
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline (PGAP)
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS-2+
            Annotation Software revision      :: 4.12
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 7,641
            CDSs (total)                      :: 7,551
            Genes (coding)                    :: 7,450
            CDSs (with protein)               :: 7,450
            Genes (RNA)                       :: 90
            rRNAs                             :: 4, 4, 4 (5S, 16S, 23S)
            complete rRNAs                    :: 4, 4, 4 (5S, 16S, 23S)
            tRNAs                             :: 75
            ncRNAs                            :: 3
            Pseudo Genes (total)              :: 101
            CDSs (without protein)            :: 101
            Pseudo Genes (ambiguous residues) :: 0 of 101
            Pseudo Genes (frameshifted)       :: 23 of 101
            Pseudo Genes (incomplete)         :: 76 of 101
            Pseudo Genes (internal stop)      :: 7 of 101
            Pseudo Genes (multiple problems)  :: 5 of 101
            CRISPR Arrays                     :: 2
            ##Genome-Annotation-Data-END##
FEATURES             Qualifiers
     source          /organism="Nocardia huaxiensis"
                     /mol_type="genomic DNA"
                     /strain="WCH-YHL-001"
                     /isolation_source="skin sample from patient"
                     /host="Homo sapiens"
                     /type_material="type strain of Nocardia huaxiensis"
                     /db_xref="taxon:2755382"
                     /country="China: Chengdu"
                     /collection_date="2013"
                     /collected_by="Kaiwen Zhuang"
     protein         /locus_tag="H0264_08265"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_014988583.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /transl_table=11
BEGIN
        1 MRLLLRSLVA VTAAIALTQF VAPASTAAES TGPDGGAAGA QWTAAEDGPQ TYPNVYIEWD
       61 VPITMSDGTV LKANVYHPAD AAGRPIADPT PTIVNFTPYT KLGSMIADSM LSIPGLSDMV
      121 VQLFRNIDLT GTPISGLTDL TKALGGGLLR TFTVDRQLIK SGYTQVVVDV RGTGFSQGIW
      181 DVLGPREQLD TVETIDWASR QSWSTGDIGL TGISYSGLNQ LQAAAKNPPA LQAIFPVVPS
      241 RNPFRDLVAP GGAVGVTFMP MWLAAVNGGK LVPDMTALLQ GRFDMAWLAS RLADPFTFVD
      301 ALLEIFLQPD IDQLSPKVRQ LLDSGGDLRR SWETDPAQIT VPTFITGGWH DVFVASQADV
      361 YRRLAVPPGQ KQILIGDGYH ISNGNESGRP GQPPRMDVLQ RAWFDKWLKG IDNGIDTYGP
      421 VTMKEQGGAW ITASGFPDED AIEHQRMYLS PVSSGTGGHS VHDGSLFTAP SEWPETLTVA
      481 PGLTGLCSRD AAQETIGIVS VIDACGKDSR IWESNGLTFT SAPVGSATTI SGPIAVHLNT
      541 VHDTADGYWV ATVNDVAPDG TSVTLSSGQL VASLRAIDES RSTRSPNGDY TDPVPDLSLD
      601 TRQRTEPGVP VTLDISIAGV DAVLQPGHRL RVDVYAGNFP KGLPPLAILL DSGLRPQHLL
      661 LDPAEPSYVN IPLRGNPGW
//