LOCUS       QLY31492.1               389 aa    PRT              BCT 27-JUL-2020
DEFINITION  Nocardia huaxiensis flavin-dependent monooxygenase protein.
ACCESSION   CP059399-760
PROTEIN_ID  QLY31492.1
SOURCE      Nocardia huaxiensis
  ORGANISM  Nocardia huaxiensis
            Bacteria; Actinobacteria; Corynebacteriales; Nocardiaceae;
            Nocardia.
REFERENCE   1  (bases 1 to 8339910)
  AUTHORS   Zhuang,K. and Ran,Y.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-JUL-2020) Dermatovenereology, West China Hospital,
            Sichuan University, Guoxue Xiang, Wuhou District, Chengdu, Sichuan
            610041, China
COMMENT     The annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (PGAP). Information about PGAP can be found here:
            https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            
            ##Genome-Assembly-Data-START##
            Assembly Method        :: FALCON v. AUGUST-2019
            Genome Representation  :: Full
            Expected Final Version :: No
            Genome Coverage        :: 250.0x
            Sequencing Technology  :: PacBio Sequel
            ##Genome-Assembly-Data-END##
            
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 07/22/2020 23:23:33
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline (PGAP)
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS-2+
            Annotation Software revision      :: 4.12
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 7,641
            CDSs (total)                      :: 7,551
            Genes (coding)                    :: 7,450
            CDSs (with protein)               :: 7,450
            Genes (RNA)                       :: 90
            rRNAs                             :: 4, 4, 4 (5S, 16S, 23S)
            complete rRNAs                    :: 4, 4, 4 (5S, 16S, 23S)
            tRNAs                             :: 75
            ncRNAs                            :: 3
            Pseudo Genes (total)              :: 101
            CDSs (without protein)            :: 101
            Pseudo Genes (ambiguous residues) :: 0 of 101
            Pseudo Genes (frameshifted)       :: 23 of 101
            Pseudo Genes (incomplete)         :: 76 of 101
            Pseudo Genes (internal stop)      :: 7 of 101
            Pseudo Genes (multiple problems)  :: 5 of 101
            CRISPR Arrays                     :: 2
            ##Genome-Annotation-Data-END##
FEATURES             Qualifiers
     source          /organism="Nocardia huaxiensis"
                     /mol_type="genomic DNA"
                     /strain="WCH-YHL-001"
                     /isolation_source="skin sample from patient"
                     /host="Homo sapiens"
                     /type_material="type strain of Nocardia huaxiensis"
                     /db_xref="taxon:2755382"
                     /country="China: Chengdu"
                     /collection_date="2013"
                     /collected_by="Kaiwen Zhuang"
     protein         /locus_tag="H0264_03860"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_011207003.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /transl_table=11
BEGIN
        1 MTQEVVERVE ALLPTLRERA QEAEDLRKLP EESVKDLQET GFFKLLQPRQ WGGYAGDPVV
       61 FYDTVRKIAS ACGSTGWVSG IVGVHNWHLA LFDQRAQEEV WGEDTDVRIS SSYAPMGAGV
      121 LTEDKSGYIV NGAWAWSSGS DHCDWVVVGG PVIKDGKPVD FGSFLIPRTE YAIDDVWHVV
      181 GLRGTGSNTI KVKDVFVPRH RFLSFRAMSE LSAPGLEQNT DPVYKMPWGT IHPTTISTPI
      241 VGMAYGAYEA HVEHQGKRLR AAYAGEKAKD DPFGKVRIAE AASDIDAAWR QLSGNVADEY
      301 ALLVAGKEVP FDLRASARRD QVRATGRAVA SIDRLFEASG ATALANGTPL QRFWRDAHAG
      361 RVHAANDAER AYVMYGTHAF GLPVTDTMV
//