LOCUS       QLY29883.1              1202 aa    PRT              BCT 27-JUL-2020
DEFINITION  Nocardia huaxiensis chromosome segregation protein SMC.
ACCESSION   CP059399-6426
PROTEIN_ID  QLY29883.1
SOURCE      Nocardia huaxiensis
  ORGANISM  Nocardia huaxiensis
            Bacteria; Actinobacteria; Corynebacteriales; Nocardiaceae;
            Nocardia.
REFERENCE   1  (bases 1 to 8339910)
  AUTHORS   Zhuang,K. and Ran,Y.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-JUL-2020) Dermatovenereology, West China Hospital,
            Sichuan University, Guoxue Xiang, Wuhou District, Chengdu, Sichuan
            610041, China
COMMENT     The annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (PGAP). Information about PGAP can be found here:
            https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            
            ##Genome-Assembly-Data-START##
            Assembly Method        :: FALCON v. AUGUST-2019
            Genome Representation  :: Full
            Expected Final Version :: No
            Genome Coverage        :: 250.0x
            Sequencing Technology  :: PacBio Sequel
            ##Genome-Assembly-Data-END##
            
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 07/22/2020 23:23:33
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline (PGAP)
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS-2+
            Annotation Software revision      :: 4.12
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 7,641
            CDSs (total)                      :: 7,551
            Genes (coding)                    :: 7,450
            CDSs (with protein)               :: 7,450
            Genes (RNA)                       :: 90
            rRNAs                             :: 4, 4, 4 (5S, 16S, 23S)
            complete rRNAs                    :: 4, 4, 4 (5S, 16S, 23S)
            tRNAs                             :: 75
            ncRNAs                            :: 3
            Pseudo Genes (total)              :: 101
            CDSs (without protein)            :: 101
            Pseudo Genes (ambiguous residues) :: 0 of 101
            Pseudo Genes (frameshifted)       :: 23 of 101
            Pseudo Genes (incomplete)         :: 76 of 101
            Pseudo Genes (internal stop)      :: 7 of 101
            Pseudo Genes (multiple problems)  :: 5 of 101
            CRISPR Arrays                     :: 2
            ##Genome-Annotation-Data-END##
FEATURES             Qualifiers
     source          /organism="Nocardia huaxiensis"
                     /mol_type="genomic DNA"
                     /strain="WCH-YHL-001"
                     /isolation_source="skin sample from patient"
                     /host="Homo sapiens"
                     /type_material="type strain of Nocardia huaxiensis"
                     /db_xref="taxon:2755382"
                     /country="China: Chengdu"
                     /collection_date="2013"
                     /collected_by="Kaiwen Zhuang"
     protein         /gene="smc"
                     /locus_tag="H0264_32500"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_007296878.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /transl_table=11
BEGIN
        1 MHLKSLTLKG FKSFASATTL RFEPGITCVV GPNGSGKSNV VDALTWVMGE QGAKALRGGK
       61 MQDVIFAGTT GRAPLGRAEV TLTIDNSDGV LPIEYNEVSI TRRMFRDGAG EYEINGSTCR
      121 LMDVQELLSD SGIGREMHVI VGQGQLSAIL ESRPEDRRSF IEEAAGVLKH RRRKEKAVRK
      181 LDAMQANLAR LTDLTTELRR QLKPLGRQAE VARRAATVQS ELRDARLRLA ADDLVSRRTE
      241 LESQQSKEAY AREQQVNMQA ELDAANAALA QQEFELSRLT PSAEAAAQTW FQLSALTERV
      301 NATIRIARDR ARNLTIEQPS GAGRDPDQLE REAERVEAEE AELLAAVEIA TETLEAARDA
      361 LHEREQAAKA AEQAHLAAVR AIADRREGLA RLSGQVDTLR TRAQSVDGEI SRLSTALAEA
      421 RRRGEDADAE FESVQGELSE LDAGEEGLDT QYEHAAQALE LADQRVTELR EKDREASKRV
      481 ASLSARIEAL TMGLARKDGA AWLLEHRTDG LLGPLSGLIR VHGGYEAAVA AALGPLADAV
      541 AADTGPSAHA AVRALKESDG GRAALVYGHA TQPDSRPNGQ LPGSARWLAD IVDCPDHLRG
      601 AIGALTAGIA VADDLAVAAQ VIAARPELRV VTREGDITGS GWLLGGSDRA PSQLEVQAEI
      661 DAAKAELVSW QRQAEELEAA LAGALAEQTD RKEAVDHALL ALHESDQALV AIYDRLGRLG
      721 QTARRAQQEC ERILAQRADT EAGREETLAK LAELEDRLRH AEDEQSEMGS DAAGTETAGY
      781 AREEAAAALA EARTMEVEAR LAVRTAEERA ESVRGKADSL RRMARAERET RARAERAQAA
      841 RRQASAVAAV VADSAERVAA QLETVVAEAS ARRDELVRRR SEIAAQVEQT KERSRALTTQ
      901 LSQIVDAVHR DEVARAQAAL RIEQLETTIA ETFGVALEDL IAEYGPDTPM PPTALEMMEY
      961 EQAKERGENV SEPQPMPYDR ATQERRAKRA EKDLTTLGKV NPLALEEFAA LEERYTFLNT
     1021 QLEDVKKARQ DLLDVVAEVD ARILQVFTEA YDDVEREFTH VFSKLFPGGE GRILLTDPSD
     1081 MLTTGVEVEA RPPGKKVKRL SLLSGGEKSL AAVAFLVAIF RARPSPFYVM DEVEAALDDT
     1141 NLRRLIGLFE QLREKSQLIV ITHQKPTMEI ADALYGVSMR GDGITQVISQ RLRGENLVGA
     1201 AS
//