LOCUS       QLY31851.1              8579 aa    PRT              BCT 27-JUL-2020
DEFINITION  Nocardia huaxiensis hypothetical protein protein.
ACCESSION   CP059399-1173
PROTEIN_ID  QLY31851.1
SOURCE      Nocardia huaxiensis
  ORGANISM  Nocardia huaxiensis
            Bacteria; Actinobacteria; Corynebacteriales; Nocardiaceae;
            Nocardia.
REFERENCE   1  (bases 1 to 8339910)
  AUTHORS   Zhuang,K. and Ran,Y.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-JUL-2020) Dermatovenereology, West China Hospital,
            Sichuan University, Guoxue Xiang, Wuhou District, Chengdu, Sichuan
            610041, China
COMMENT     The annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (PGAP). Information about PGAP can be found here:
            https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            
            ##Genome-Assembly-Data-START##
            Assembly Method        :: FALCON v. AUGUST-2019
            Genome Representation  :: Full
            Expected Final Version :: No
            Genome Coverage        :: 250.0x
            Sequencing Technology  :: PacBio Sequel
            ##Genome-Assembly-Data-END##
            
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 07/22/2020 23:23:33
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline (PGAP)
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS-2+
            Annotation Software revision      :: 4.12
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 7,641
            CDSs (total)                      :: 7,551
            Genes (coding)                    :: 7,450
            CDSs (with protein)               :: 7,450
            Genes (RNA)                       :: 90
            rRNAs                             :: 4, 4, 4 (5S, 16S, 23S)
            complete rRNAs                    :: 4, 4, 4 (5S, 16S, 23S)
            tRNAs                             :: 75
            ncRNAs                            :: 3
            Pseudo Genes (total)              :: 101
            CDSs (without protein)            :: 101
            Pseudo Genes (ambiguous residues) :: 0 of 101
            Pseudo Genes (frameshifted)       :: 23 of 101
            Pseudo Genes (incomplete)         :: 76 of 101
            Pseudo Genes (internal stop)      :: 7 of 101
            Pseudo Genes (multiple problems)  :: 5 of 101
            CRISPR Arrays                     :: 2
            ##Genome-Annotation-Data-END##
FEATURES             Qualifiers
     source          /organism="Nocardia huaxiensis"
                     /mol_type="genomic DNA"
                     /strain="WCH-YHL-001"
                     /isolation_source="skin sample from patient"
                     /host="Homo sapiens"
                     /type_material="type strain of Nocardia huaxiensis"
                     /db_xref="taxon:2755382"
                     /country="China: Chengdu"
                     /collection_date="2013"
                     /collected_by="Kaiwen Zhuang"
     protein         /locus_tag="H0264_05955"
                     /inference="COORDINATES: ab initio
                     prediction:GeneMarkS-2+"
                     /note="Derived by automated computational analysis using
                     gene prediction method: GeneMarkS-2+."
                     /transl_table=11
BEGIN
        1 MAELPPYLLP LEWVAGSDWP AGEPEGMWGI GGDWRTAASG LREILEDIDA AKSASLKAYP
       61 SGEGVAEMTR AFDELRSGDN SLEKLAESFD SIAGAADDVG TEIEYAQLMM ITSLALLLAE
      121 IIAAWVFPPT APAVEAVAIG VTRVGIRMLA QRLMNRVVQA VAKLVGERAA GFLVREISID
      181 TILGVVQEIA VRKYQGRDVN WETVLVTAVS SAVGAGAASP FGDWLGKKFD DDIIKPWVKG
      241 AIIGTGAGLV GAGAGFLASV GTQYLIDGLK TGDWDKAAQN ARNALTNIDP RMFFAGASNG
      301 AASGMNRARA DAFYQNRHPE WYRPPTPTPT PTPAPDDDGA RIGFRPPPDP AAPTSPGEPR
      361 AGAPGLGPGG TPDGGATQPT GDGSRSGAPQ ANTPVGDGSG ATNPGGTQSG ANQNPGGTQV
      421 SGAPHSSGGG QGAGVPQAGG GAQSAVGPHT GGGAQVSGAS QTGGGTQGSA SGGQSGGSQS
      481 TSQSGGANQS TGNTQQQSGA AANAARDAGR PQGSPTADTA GDGSTGRPET SSDAGRAGTP
      541 DAPRQPPLAD SPAAAPSDTT GTGNTATTDR SGLPAAGPAA TPGIGGPAQA GGLDAGVQAA
      601 PGQAALGAST PQLGEQLPLA VSPESLAATP ESPAAVPPGN PAAAAPGAPA SPGDAVRSAD
      661 APNSPDAPGT ANGTTPPSTA EIGRTTTDSD STQPNSPASD TSRSALDADS QDSRTSADDR
      721 PTDDTASDHP LAPAALADAA RAASGNPARP PADSTPAQIR AGRPGLRPAP AELPPAEPLR
      781 PAPRAEAMGD GSATRRPPSE NRAGLTESPQ VGGADLTAGL HTDTGEITSL RLGDNESHAR
      841 EGEQSPQTEV ADPQRRPPAA EPGIPESPRG VDNSGLPGRG ALDPAHPGAN HAPDPLQGEP
      901 SSAANRTSED EPVQMAGPRV VADPDRADGS RPDAEPGSGS RTRDDSDNAG RHFAADADSA
      961 AGRRTDDESD PGGRRVDATE PVDGRREGET SLRPEDSEAA ARRVSDEQNR GECGRLSLLE
     1021 IREGTGSRVV RVPEEPVGPA GMSRAELEEA AGARLTRVPR VAAGVPPHDP VGGALLALGE
     1081 GAHALVVDEY AGPTDEFGVG AHAYVVRVVR DPVSGELMLE VVDRAAGIRQ GFPPEVPREL
     1141 RSVSVVLYDA QGDPVPPRAG AEPGEFPFPR VGTSLDPYPA QVRALLEETE VGRDALALME
     1201 RYGVRTRYAD RAPGGQSADY RAHELLATVY TAGSPQLQQA ISVVHEAVHA EFANTGRTVL
     1261 AHELPAMSRE SYIDAMVREE AAAHARQYEF ARQLRSAGFD IAVERNETTY FRAYDGDLAG
     1321 RVDAGLSPDE ARIHAREAGI RAIVQAVGTA AWGPGADVAN YRDYYGRGYD RFLADSGPHP
     1381 DPTLRTITDD SALDTVRDLA NQRRMSWLRA ADLADSTGEL ARALRLPDYD PESLPSAVDA
     1441 ALARVERNRT ALAAQLGIDP ADLTRDPLTG PEADRRANLL RREQRLLELS DLLDAHRIEQ
     1501 ANATRLADDL TAAAAYDVLD NELFLAGGDG RIVTPRVAVI EGAPPRVYIV DVRGDYDTPL
     1561 QRAAELDSGV ITAMADPAVE VHYLNLTVDG AGAVRVTRTP ESVVEANALY GGQAEHLPDP
     1621 GVARHRADEA GMQRTEQINR ILDRTELGRR TLELLERNGV RFRFEAGAEP RYEPDARVVV
     1681 LDIGVSDVEQ ALAVVHATAH VEDSAQGDSA SNSRDRVKLS RADYIELMLR EEARAHAMEI
     1741 ELRKQLRQLD HDLPVDELER AYDEGFHRAR NAAYRADPTD LEVLWDAGFL GGTAQLHAFL
     1801 EGAPATDGQR AYAELYGDAW DAAAGLAGKV ETSDPATLPA DGAGPSPARA GDPAAAPPMG
     1861 PPSEPPNSPP AAASPDGPDD GDNSGRADSD PNAVWDAMAR NDDHLDALSA RLDALAESLR
     1921 SSAGAADRAA QTARFEELLQ QHRADFDEID TRLEAIAAQA RGEAAPEQPD HPAPGNRTAD
     1981 DSAPGNGGSG DVSPGDRTDA DAAARREAYD ELVKQREELA REREFQRAKR NDRAERFGIT
     2041 DPDTVLGGGN HEGAVLGLYD RAGYDRVETP GGLEQAATVE RRPVSPEELA ARREAIRKLE
     2101 LAAERVYELD QQIAEVDRRL TELERAGVEG RRVPSEEVGA QLDRLAGERA QELRRIKPVR
     2161 EMRDDLAHRL GIDPARLDVD PDRLEPELRR MYAELDARDA TPEQRAEIDR LADGLADAAR
     2221 DVSRAHNRIG RVQDEMARVA GAYQRLAEAV GARMVTSRVA VVDGPNPRVI VFGPRAEAGS
     2281 PRADHDAALA EALRESRAAA QAFVRPGADI RMIRVIADRD GGWRAERMDP PQLGRLAAAP
     2341 VNGRGLDVVL WRDGAGEWHP VDPTRPDWQP GRAGDAELKR WSPKDPPDGV SGWAMADVVN
     2401 DAALPTEDIP AGAVPETVLP VGVPGAKEQN VTGGLPPGTE LFGQHIGDDA YNIVRLILMA
     2461 AQAPQHPAVK AWIQRHPAIG EWVRAHPWLR NVPYFGTAFY GYEWTASPGR NVQPMFRLFD
     2521 PAEHSPQVDR SHIPEGLHAA FDADYAAWQQ VQDWADGKYE LFLLNDSEVG VIADNVAVHR
     2581 LLRQLEQAQA LIGRVRELPD GSNPDELAQA VDRLAEQVAK QFDDPSPDAV RQALRDIHDL
     2641 LRTDADPDRL AQTVAQFMDV GYPELTRDQI LQIKNHLMRD EHRVRDFTDP AGAFVRRPMD
     2701 RLADVAEAWI RLTAGNPVPQ DILLLRDALA ESDYLRTHPD ATWHDANAHA IGLGLHWDAD
     2761 RPPLPGRRAG IPYAVPPLDA RSEWNSPRRD PDDDDWIDIY WFGGGDPGGS PAMPPANPST
     2821 GGSPAALDEP STTGGQPNPD RTRSIDGSAQ PPSTAAADSE TPGAQQLRPE PEAERPEAPK
     2881 SRLRRLADKL NFWQPDARTG DGESPDPATG ADGNSPEDPA GPGNSASDGE SSRSDRAEVL
     2941 ARQREHLLAQ ARQLAGALEV DADGLNGSEL VDAAGAAVAR LAGEEGPGAG LRADWAGRVT
     3001 GEIGELVRLA EVAAAAESTL RSGDSIPEAD DGRQWQVVES RRVHEFTYPT AELESARQLL
     3061 AAGGDPSDSP VIARDEHPVA ADARARVEEA AVRALLDAQS GTPLGYGLRR VPGVGDRPDS
     3121 VVLVGVEPHP DRAMDPVVRS ALAPEGTDFT YLRVIVDGDG VVQVERWRHI APEPPAVADV
     3181 TVPPQVSRPE VADLVDFTAP ERALLETAQR LHDQLTESVH EGVRSFGVDP AGRSARELAD
     3241 AVRAEYERRL SDIEAAQHDS SGPPAGHRVR LDAEVDVLLH DLALLDAADI TEQRLRAHNF
     3301 DAPRFTAPSE WRLSGPDRDF IPADPAVARE LAALGRQVEV FDSLAAHARQ LGIGDVDGRS
     3361 VRDLADAIRD RTAPRPDQRA PERRRNARIR ADIDRLTRQY ERSGGDPDRI GLRAHAEAEA
     3421 VRAVLAEAGG MHRADGVVRL PDGSGLLAVG SAPHPDSVVP VALRSRVAAL GDEFAYRRVQ
     3481 FDSTGRVRIT DMAVRPTGSD HAEDPAPRRS LWQRINERFT TPTSDMPWPP STRITGEWGG
     3541 YSHEDLARVL SERHGVRVRR FNAAGVNPDA VRLFAMALDD LLTRYPSANL HVVTINRLFS
     3601 DRRGDEPPLA WVAGGGSPFV WRSGTTGFAS TMELDVRHAS DHESALPPRS SGGFHPPSAD
     3661 PDAYQTAVHE FGHVLDLTGD GRARRLAMSA LLEHFGATRP HVDRAEFDTW LRQLSGYSFH
     3721 EDGRFNPPEA LAEAFHDVES NGVDATEPAR VLYVVLLRAA GIDPAPEILA SLNPAQDSAP
     3781 LTNDPARQPI TDGPATPDAP AAAPRAGDDD SPGNFRRFLR GLLPGAPEAN IAPGNGLDGR
     3841 SDGSADDGSR ARASTPEAPA RRSMWQRIQD VLNGESSTPP WPPSTLLTDE WADRSHSEIK
     3901 RILEERHGIR VRYLGRAGAD PDTVRVFAAT LDEMLTRYPE SHVRTVLIGR LPEFANAAVL
     3961 PGGSPFATRP DREGFVTTIV LGLRYAGDID APDTPDSPEP LGFRPRRSSS LIRMSTIHEF
     4021 GHTLHATGSN RAFARAYASL VEHYSATRGH IDRAEFDTWL NQLSGYSFNR RGGLRTAEAL
     4081 ADAFFDVEEN GVDAAEPAKV LYALLLREAG IEPDPHIMAS LHPNGPPTPG IADLPPAQAD
     4141 PIAAPDDAAS RPGADGDGST DHVGESETPD TPRTPEPAPQ PQGEAPRTSD VRARLDAATS
     4201 PHAELRANTY AAARSWGVDT DGLDPRQVAR AIHEHRVRVF EELRAGLRDL AGQPESQLRA
     4261 ARVQAQERLR AVYEAADALI HDLNVVSAAE RAMQGLRAGQ SDTTVPGERS REDTDSPALD
     4321 ERSRRPDAGA QDLGMSVEPD SPVVAERVAA LTRQEEQFLA DARRLAVALA IHPIPESGRE
     4381 LGDAVRAVMD GLHARVDAAR TALANPDEPS VELGDAEHQV FTAGRVADEI AELLRAADRA
     4441 AQVRAAVESG DPVPARERAR WRRSPERALE SRWDAEALAE AGRRLVAFER MAALVRDAGV
     4501 DPANRSAEQL VAALRDADPA TRAEMDRHLA EFTDAGGDPA MVDVRAAAER LALRSMSGGS
     4561 GGRTLGFTIT RMGGTTAMAP RMLVVGTQAH PDLALDPAVR DLLAEPGTVV EYHRVRFDED
     4621 GIARVESWGW DATDYDDLSP PPDADGDAEQ PGSTRRPTPT SDEPWPPSTE LTGEWADYSH
     4681 ADIVRIFAQR HGIRVLRFNA EGVNPDAVRL FAQGLDGLLT RYPGIEVAQM TISALGSVAA
     4741 DLRPAAWVMH RTRPFSWTGG APSRSVSLVD LDVEFASRPD AHRPVRPDLA TAEPGFHPPS
     4801 RLPTALYTVI HEFAHMLDNA GEKRAAEQAL QTLLDHFGAT RGQVDRAEFE TWLAQLSGYS
     4861 FSVGPDGDYS TRGPFNPTEA LAEAFHDVEE NGVYATEPAK VLYALLLREA GFEPDPTIMA
     4921 SLAADGSGPD GTARPHVSEE PDDARPDRTD SGNARADDRD SSGGRFRRFL RELLSGGGRP
     4981 EIPAGPDPRG ADGVDDGSRA PRSDSPAQPE GSGPLTPAHQ ARLESAERLR RQLADTVHDA
     5041 VRVFGLDPAG LRTPELAEAL RTEYDRRTAE FGANARLEAE VGALIGDLAL LDAAETAENR
     5101 LRAGDVSESF LDNTHRWQQL GRTHDFTPSD PALARTLATF GDQVAAYDRV VAHARRLGID
     5161 DVDGRSVREL AAAVRDRTAP DPGRSGRERR RNIRARNEMH RLTSRFEAAG GDPNRTGLRA
     5221 DAEATAIRAV LAEAGGQRLA DNMVRFPADS QGRSRILVVD SAGHPDLVVH PEVRRRVVAM
     5281 QDEIGYQRVD FDDRGRIRLL DMTVVRSSDY TPAGDPDRSL WRRISERFTV PTSDLPWPPS
     5341 TTMTDEWGQR SHADIVRVLE QRHGIRVLRF NAEGVDPDAV RQAAKALDDL LTRFPRVNLQ
     5401 VVDINALQPQ RKGGRKPAAW LADRSSIFVR RPGTAGIGTT MELDVGYYSR RETGLPPSEH
     5461 ARADTFHPPT HYPDGYRTVV HEFGHALAST GNHRALSRAL DALLAHYAAT RPEVGRAEFD
     5521 VWLRQLSGYS FHEDGRFNPT EAVAEAFTDV EANGVNATEP ARVLYVLLLT EASITPDPAI
     5581 LASLETTTVH PESTPRTPPP AQPETRGPEV PVTTSDGSEP GPGAEHPPEG ETPDPPRSAP
     5641 EQPKSPPAND GSTARPQPEP ENVQPTQPET DETGSQTPPL ASGGRGAGAP PRPPADGGVD
     5701 DPTPSGDNPP SRIDSPQRRH ERLAQAVWHA RMMAEHAAGA RRMADLYAGT DTPHADWTRL
     5761 RAERAELLAR LAADHVIRLQ EQADLADSDH TTPDAGSSDE GDAGAALPVA PGGDDSGGGR
     5821 GGVTPPVAPG ADDPDGGDGQ ATQPLSGDGV PQTSGEVEPV GRPGQLESTD PETGVVARQQ
     5881 DSVAVPVRDV ADPVREVLRF FDEVAQAQRA AAEGGEFQPT VLTESQRAYL RALEAALGLD
     5941 GVLADHPDAV RALRELAEIA RVRGFSGAPE SGVEPGAMRY PEDFAALDID EVHGDEEYWR
     6001 FQDDPRRVAE AEAEVNRLDV PGPAAMRAGD DQARLTGPEQ VRDALARRLR VDAADLTPER
     6061 MGRTMAELRY RNLLRAGAVE ALADAVARHR AATDDVQRAQ IGATRDQWAR ALGIPEAVVE
     6121 PGRAPAALAA LRGGILARAR EIADLADAVQ AARQAEAAEP GGTRRLTVEV DGQRLPVRLV
     6181 ADADGTWRVE NTDRLRAPEP AESTERLPVP QEDDAPESKL RKFWKWFKDA WQHHDDVPKY
     6241 PSGSGIDGQG QTLLIEYYGD VDLGGTFNPA RILKEIATMW KKRGQIRDFL RGYRAGYNDD
     6301 FQPPRDRDGR DYEYWLLEAD PELVRQRGEI PEHLLLHWAE LDEARAKARE QAELEAGTTP
     6361 DPAQRPGLES DVEAPVPDSE LSDPQGPAAD AINDLADRLR DALSERNRLG DDLYWAANEL
     6421 GRPLADLTPG ELHRAHRELQ YRNLRRAGAI EGLADVAQRY LDENAYIPFL RELNHFEQNP
     6481 LARFLGDVAR SEGHSTAMLD WEGVNNGGEP GRDWGDLTYS DQPGQDDGTR EWFINALRRE
     6541 ELRTERGHWA HMLGVARADL SPHTALGEAL AELRADNTER AADIAEFDAA AIRYSELDAL
     6601 VDQLSRQLAE LAIPEHIAAQ GGVMLEPGVA LFDGDPMRLA IVAADLTHES RLADLLAQPN
     6661 GIAESLDDGS LTLEFRVVWA DAQGAIHAGR IAVPEVRHIT TEVDGRPVSL TLLRDGDGPW
     6721 RVAPNSATDP NDTDRQSDPN TAVQPSPRSP QETATARVDL AEQLEIPLAD LDPVRLHATI
     6781 DDLNHANQRR AARLEALADY ARALHAIDDY HRRSPEERAF DPRVPDDDPP IHDDTAPAFL
     6841 REIITDPAAF GERLRGPGPE PRPGDPDDRP EPDADWAALL GVNLRNPTPE QLADAYEKFR
     6901 DGKIDEYEAL TPEELTRVHA ELRDEIRARA AAIAELESVA RQYNRLAVDA AATHARAEVR
     6961 RLLVAQEDLL RARDEAQAAV REAVTGLPVA DAEVAPARID QALDRLRALV APDAETPTRL
     7021 AALAAAATRL NEADAAVRQG FSDLAAAMER AALLGQGARP IAENFGVIGD RPPVIVVVGW
     7081 GVFPDLAGLT AAEARHPILR DLRAHPGVTV RRVLLDLDEH GRMRMRVEGE DPPPDDLPTD
     7141 PDTPQPPATP APDSGPTRPA GPRPVAPVAE PRPDASESAQ PQPITSAHGP REPVQSGSAD
     7201 ADATDSVDGL PAAGRSETDA GRSGPGEVAA VDPARAALIR QHAAAIAQRE ALAGMVRELA
     7261 RRLRLRVDDA ALSPERVLGT LAEVPTTPLD SAHSALRGDL QDLAQQLVRA AREAGELAEE
     7321 LGLVAQYDEL LQQHRDTVRE REFVAAKAND RARRLGLVGA SDEWVRMDPD ARERAITAAF
     7381 EAAGSDIVHR PGGLDEASRD TRVPVSDWER AQREAVLRKL IEATERHGDL DGEVRELERR
     7441 MRELADAGVV RFRAPLDDVA GELDRLARQR GELLQEIKPR RRTRDELAIR LGVVDDGRPN
     7501 EHALGRHLDA TLARLRAQLD ADFWAGRADV AEVRTRTREI DELAEAAADV NRAHLRIGQL
     7561 QDEMARVAGV WRGQIRDEGG FMVTPRVGVL PGDPSRIVVF GSRSDADAPR TVYEDALADA
     7621 LRNSPIAAQL LVREGATIEF RRVLADREGN WRSEPLPPPA VRRLVSEPFN AVRLNVTMWQ
     7681 DADGEWHPVD ATRPSWHVNR PDAGGDRNYG KFKTKPLPDG VSSWGMEDVV DDGLVVFQDI
     7741 PPGQMPKSAL PVVRPPGSDE AYGLHHLPPG VELFEQHWGA DSYNIVRLIL MAVQAPSLPM
     7801 VVRWIQRHPE IGEWVRARPW LADVPYFGLA FMDYKWFARP ETNVQPMHRP WDPDEHGDRS
     7861 GDTEIPESLR AQWEAEREAW QRVQAWADTQ YERFLSDDTD IAQITRELAR YRRDQYADTA
     7921 RDLVDLVRER LITGDPGLDR TGDIAAHQEQ LQSRIDHLAK EIADKFPAGH RPEIAEALTQ
     7981 IRELLLAGAD PDLIARTLTG YLDLDIPDLT ESQIAQIKNH LMRDTHLVRD YTDPAGGFVH
     8041 RPMDRLADVA EAWIRLTTGT PLPEDLILLQ DALAESDYLR RHPDATWNEA NAHAIGLGHH
     8101 WDAHRPPLTD WRKGIRYAPT PYTPPTPAST ESNTTPESAP PQPDPTPPGD EADDSRGAPT
     8161 GSPTARSSTD AEDAGTPDEP DRGQIPDLSP AGEDPSKSTA PELDPDTDDP AENLSPGALS
     8221 PTVTDGTGAP HEPTHQGDTG GPARSSDGEG DDSDGEAGGS ARTPGDDDDD DDGGAAPLND
     8281 GRAGGGLPGR AYRSHIPHPP REFELEPMPE RMDVPVYDPG PWPVPDFEEE PAEPGPEQGR
     8341 ANPIGTQQQS SGTSSNIDSQ GRTAAASSST GGQASGNGTA APEPSMPAWL RDAAAQMGDP
     8401 YQAAQPSERP SQGNSFGAGH QDDGGGSGQP GGVNGSGQQG GTDGSGLEAG ATRLPVRPFN
     8461 GVGAAVAFDP ASGVGWGMAG SPEAYAGVYS DTGGGVLVFF RDGERLGVSV NGVSIDVNDP
     8521 AVRVEWVPGR DGFTRFSVFL GQVLGGWMVY PNVGAELDLG RLVRDVCADP QRKATVFTR
//