LOCUS       QLY31949.1              1329 aa    PRT              BCT 27-JUL-2020
DEFINITION  Nocardia huaxiensis WD40 repeat domain-containing protein protein.
ACCESSION   CP059399-1284
PROTEIN_ID  QLY31949.1
SOURCE      Nocardia huaxiensis
  ORGANISM  Nocardia huaxiensis
            Bacteria; Actinobacteria; Corynebacteriales; Nocardiaceae;
            Nocardia.
REFERENCE   1  (bases 1 to 8339910)
  AUTHORS   Zhuang,K. and Ran,Y.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-JUL-2020) Dermatovenereology, West China Hospital,
            Sichuan University, Guoxue Xiang, Wuhou District, Chengdu, Sichuan
            610041, China
COMMENT     The annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (PGAP). Information about PGAP can be found here:
            https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            
            ##Genome-Assembly-Data-START##
            Assembly Method        :: FALCON v. AUGUST-2019
            Genome Representation  :: Full
            Expected Final Version :: No
            Genome Coverage        :: 250.0x
            Sequencing Technology  :: PacBio Sequel
            ##Genome-Assembly-Data-END##
            
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 07/22/2020 23:23:33
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline (PGAP)
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS-2+
            Annotation Software revision      :: 4.12
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 7,641
            CDSs (total)                      :: 7,551
            Genes (coding)                    :: 7,450
            CDSs (with protein)               :: 7,450
            Genes (RNA)                       :: 90
            rRNAs                             :: 4, 4, 4 (5S, 16S, 23S)
            complete rRNAs                    :: 4, 4, 4 (5S, 16S, 23S)
            tRNAs                             :: 75
            ncRNAs                            :: 3
            Pseudo Genes (total)              :: 101
            CDSs (without protein)            :: 101
            Pseudo Genes (ambiguous residues) :: 0 of 101
            Pseudo Genes (frameshifted)       :: 23 of 101
            Pseudo Genes (incomplete)         :: 76 of 101
            Pseudo Genes (internal stop)      :: 7 of 101
            Pseudo Genes (multiple problems)  :: 5 of 101
            CRISPR Arrays                     :: 2
            ##Genome-Annotation-Data-END##
FEATURES             Qualifiers
     source          /organism="Nocardia huaxiensis"
                     /mol_type="genomic DNA"
                     /strain="WCH-YHL-001"
                     /isolation_source="skin sample from patient"
                     /host="Homo sapiens"
                     /type_material="type strain of Nocardia huaxiensis"
                     /db_xref="taxon:2755382"
                     /country="China: Chengdu"
                     /collection_date="2013"
                     /collected_by="Kaiwen Zhuang"
     protein         /locus_tag="H0264_06510"
                     /inference="COORDINATES: ab initio
                     prediction:GeneMarkS-2+"
                     /note="Derived by automated computational analysis using
                     gene prediction method: GeneMarkS-2+."
                     /transl_table=11
BEGIN
        1 MLVGDQPNIE PSSVRETADR QSRSPRGLFA QRFTELYAAA GNPTLRRVAT AAETRMRAAS
       61 GNRPTGASAQ RISDWKAGRN VPARFESLLP VVLTLIDLAR KQDTPLPRHL AEPKEWQRLW
      121 HAATTWNPEE EDEAACPYLG LNSYRRENRD LFFGRTRAAA DLTALVREAT GIIAVVGASG
      181 AGKSSLLAAG LVPALPDWEV VSLTPGPHPL PALLHAATPP DSATPAPAEP ETEPGRPRRM
      241 LVIDQFEELF TLCADEAERE QVLDLLHVCA TRPVDPLAVV IAIRADFYAH CLAHPALQEA
      301 LEHRSFLLGP MRIDELAQAI TGPARSAGLE LEPGLEELVV TELCGVGDRP SRTYDPGALP
      361 LLSHVMAATW QHREGRRLTI AGYRKAGGVV GSVAETAEQA WNELAPAQQH AAKAILLGLV
      421 AVGQDSRDTR RTAPRRELLH RAPDPEDAAA ALELLSRTRL ITLDADTVGL THEIVLSAWP
      481 RLHGWIDEDR VGYLVRQRLD ADASEWAAQD RDSSLLYHGT RLQNALDNVD PPPASPLARE
      541 FLTASGLARN RIRRRSSRTR AVLAALGVVL LVLGFATYTQ TQLAQQRRDD KNFTTVLAEA
      601 DRLAATDQSR AAQLYLIAER LRPNDSEVRA RILQTQNVPL MQPILGQPAD ISGVAYRADG
      661 VLAAIGFDDA LRLWDVQDAR HPRQLGTQID GVRMAGFSAD GSRMVTAGYS EDIRLWDVRD
      721 PSAPRETARL PGLSAKTMNE PVFVGNDLAV LSTTQFTLWD LSNPSVPVRG PSHRLFDDTR
      781 PDESGVVVSR FEASPDGSLL AIISSPGTDV TVKTIQLWDI RNRAAPIKLT ERLVADQTAI
      841 GDMVFNPSGT LLAISNEVQI SRQFIGSRAT VELWDVADRA HVSPLGTPLK AEDGDVPALT
      901 FSPDGTTLVV SSGSRTALWN VTEPADPVLV TDQLVFDSGS CRYPDNSTYL CSSSASDLGF
      961 SPDGRTLYAR DSSGKLVVWS LPPSVLTGHS GYLTTPQFDA TGDRMVTGSA DGRIIVWDIS
     1021 TRQRPTRVGE YRMPADYYSM SLAPDGRTLL VSTARTVKTS VLDLSDPTRI RSRGDWSLPP
     1081 QDSSPPYPIG DWSSMARIDE SGALQIWSLA DPMRPVLLTT VPVDPRYSWV SIDIAGQTLI
     1141 AQQRDLASNG ELIVTRWDIS NPANPTALGE AFRHHEGVAH FSPDQRVMVI TAAEKLQSWD
     1201 ISDPARPQPL ADAFAVHTST TWTVDFTPDG RTMLTASVDG APQLWDYTDP ADPRRMGGPL
     1261 MDVGKEPWDA RFHPDGRFVV GNGSNGALRF WDLDEQHMID RICTATGNPW TPELWRRYLP
     1321 DLDYDPPCD
//