LOCUS QLY31851.1 8579 aa PRT BCT 27-JUL-2020 DEFINITION Nocardia huaxiensis hypothetical protein. ACCESSION CP059399-1173 PROTEIN_ID QLY31851.1 SOURCE Nocardia huaxiensis ORGANISM Nocardia huaxiensis Bacteria; Actinobacteria; Corynebacteriales; Nocardiaceae; Nocardia. REFERENCE 1 (bases 1 to 8339910) AUTHORS Zhuang,K. and Ran,Y. TITLE Direct Submission JOURNAL Submitted (21-JUL-2020) Dermatovenereology, West China Hospital, Sichuan University, Guoxue Xiang, Wuhou District, Chengdu, Sichuan 610041, China COMMENT The annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP). Information about PGAP can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: FALCON v. AUGUST-2019 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 250.0x Sequencing Technology :: PacBio Sequel ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 07/22/2020 23:23:33 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline (PGAP) Annotation Method :: Best-placed reference protein set; GeneMarkS-2+ Annotation Software revision :: 4.12 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 7,641 CDSs (total) :: 7,551 Genes (coding) :: 7,450 CDSs (with protein) :: 7,450 Genes (RNA) :: 90 rRNAs :: 4, 4, 4 (5S, 16S, 23S) complete rRNAs :: 4, 4, 4 (5S, 16S, 23S) tRNAs :: 75 ncRNAs :: 3 Pseudo Genes (total) :: 101 CDSs (without protein) :: 101 Pseudo Genes (ambiguous residues) :: 0 of 101 Pseudo Genes (frameshifted) :: 23 of 101 Pseudo Genes (incomplete) :: 76 of 101 Pseudo Genes (internal stop) :: 7 of 101 Pseudo Genes (multiple problems) :: 5 of 101 CRISPR Arrays :: 2 ##Genome-Annotation-Data-END## FEATURES Qualifiers source /organism="Nocardia huaxiensis" /mol_type="genomic DNA" /strain="WCH-YHL-001" /isolation_source="skin sample from patient" /host="Homo sapiens" /type_material="type 
strain of Nocardia huaxiensis" /db_xref="taxon:2755382" /country="China: Chengdu" /collection_date="2013" /collected_by="Kaiwen Zhuang" protein /locus_tag="H0264_05955" /inference="COORDINATES: ab initio prediction:GeneMarkS-2+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS-2+." /transl_table=11 BEGIN 1 MAELPPYLLP LEWVAGSDWP AGEPEGMWGI GGDWRTAASG LREILEDIDA AKSASLKAYP 61 SGEGVAEMTR AFDELRSGDN SLEKLAESFD SIAGAADDVG TEIEYAQLMM ITSLALLLAE 121 IIAAWVFPPT APAVEAVAIG VTRVGIRMLA QRLMNRVVQA VAKLVGERAA GFLVREISID 181 TILGVVQEIA VRKYQGRDVN WETVLVTAVS SAVGAGAASP FGDWLGKKFD DDIIKPWVKG 241 AIIGTGAGLV GAGAGFLASV GTQYLIDGLK TGDWDKAAQN ARNALTNIDP RMFFAGASNG 301 AASGMNRARA DAFYQNRHPE WYRPPTPTPT PTPAPDDDGA RIGFRPPPDP AAPTSPGEPR 361 AGAPGLGPGG TPDGGATQPT GDGSRSGAPQ ANTPVGDGSG ATNPGGTQSG ANQNPGGTQV 421 SGAPHSSGGG QGAGVPQAGG GAQSAVGPHT GGGAQVSGAS QTGGGTQGSA SGGQSGGSQS 481 TSQSGGANQS TGNTQQQSGA AANAARDAGR PQGSPTADTA GDGSTGRPET SSDAGRAGTP 541 DAPRQPPLAD SPAAAPSDTT GTGNTATTDR SGLPAAGPAA TPGIGGPAQA GGLDAGVQAA 601 PGQAALGAST PQLGEQLPLA VSPESLAATP ESPAAVPPGN PAAAAPGAPA SPGDAVRSAD 661 APNSPDAPGT ANGTTPPSTA EIGRTTTDSD STQPNSPASD TSRSALDADS QDSRTSADDR 721 PTDDTASDHP LAPAALADAA RAASGNPARP PADSTPAQIR AGRPGLRPAP AELPPAEPLR 781 PAPRAEAMGD GSATRRPPSE NRAGLTESPQ VGGADLTAGL HTDTGEITSL RLGDNESHAR 841 EGEQSPQTEV ADPQRRPPAA EPGIPESPRG VDNSGLPGRG ALDPAHPGAN HAPDPLQGEP 901 SSAANRTSED EPVQMAGPRV VADPDRADGS RPDAEPGSGS RTRDDSDNAG RHFAADADSA 961 AGRRTDDESD PGGRRVDATE PVDGRREGET SLRPEDSEAA ARRVSDEQNR GECGRLSLLE 1021 IREGTGSRVV RVPEEPVGPA GMSRAELEEA AGARLTRVPR VAAGVPPHDP VGGALLALGE 1081 GAHALVVDEY AGPTDEFGVG AHAYVVRVVR DPVSGELMLE VVDRAAGIRQ GFPPEVPREL 1141 RSVSVVLYDA QGDPVPPRAG AEPGEFPFPR VGTSLDPYPA QVRALLEETE VGRDALALME 1201 RYGVRTRYAD RAPGGQSADY RAHELLATVY TAGSPQLQQA ISVVHEAVHA EFANTGRTVL 1261 AHELPAMSRE SYIDAMVREE AAAHARQYEF ARQLRSAGFD IAVERNETTY FRAYDGDLAG 1321 RVDAGLSPDE ARIHAREAGI RAIVQAVGTA AWGPGADVAN YRDYYGRGYD RFLADSGPHP 1381 DPTLRTITDD SALDTVRDLA NQRRMSWLRA 
ADLADSTGEL ARALRLPDYD PESLPSAVDA 1441 ALARVERNRT ALAAQLGIDP ADLTRDPLTG PEADRRANLL RREQRLLELS DLLDAHRIEQ 1501 ANATRLADDL TAAAAYDVLD NELFLAGGDG RIVTPRVAVI EGAPPRVYIV DVRGDYDTPL 1561 QRAAELDSGV ITAMADPAVE VHYLNLTVDG AGAVRVTRTP ESVVEANALY GGQAEHLPDP 1621 GVARHRADEA GMQRTEQINR ILDRTELGRR TLELLERNGV RFRFEAGAEP RYEPDARVVV 1681 LDIGVSDVEQ ALAVVHATAH VEDSAQGDSA SNSRDRVKLS RADYIELMLR EEARAHAMEI 1741 ELRKQLRQLD HDLPVDELER AYDEGFHRAR NAAYRADPTD LEVLWDAGFL GGTAQLHAFL 1801 EGAPATDGQR AYAELYGDAW DAAAGLAGKV ETSDPATLPA DGAGPSPARA GDPAAAPPMG 1861 PPSEPPNSPP AAASPDGPDD GDNSGRADSD PNAVWDAMAR NDDHLDALSA RLDALAESLR 1921 SSAGAADRAA QTARFEELLQ QHRADFDEID TRLEAIAAQA RGEAAPEQPD HPAPGNRTAD 1981 DSAPGNGGSG DVSPGDRTDA DAAARREAYD ELVKQREELA REREFQRAKR NDRAERFGIT 2041 DPDTVLGGGN HEGAVLGLYD RAGYDRVETP GGLEQAATVE RRPVSPEELA ARREAIRKLE 2101 LAAERVYELD QQIAEVDRRL TELERAGVEG RRVPSEEVGA QLDRLAGERA QELRRIKPVR 2161 EMRDDLAHRL GIDPARLDVD PDRLEPELRR MYAELDARDA TPEQRAEIDR LADGLADAAR 2221 DVSRAHNRIG RVQDEMARVA GAYQRLAEAV GARMVTSRVA VVDGPNPRVI VFGPRAEAGS 2281 PRADHDAALA EALRESRAAA QAFVRPGADI RMIRVIADRD GGWRAERMDP PQLGRLAAAP 2341 VNGRGLDVVL WRDGAGEWHP VDPTRPDWQP GRAGDAELKR WSPKDPPDGV SGWAMADVVN 2401 DAALPTEDIP AGAVPETVLP VGVPGAKEQN VTGGLPPGTE LFGQHIGDDA YNIVRLILMA 2461 AQAPQHPAVK AWIQRHPAIG EWVRAHPWLR NVPYFGTAFY GYEWTASPGR NVQPMFRLFD 2521 PAEHSPQVDR SHIPEGLHAA FDADYAAWQQ VQDWADGKYE LFLLNDSEVG VIADNVAVHR 2581 LLRQLEQAQA LIGRVRELPD GSNPDELAQA VDRLAEQVAK QFDDPSPDAV RQALRDIHDL 2641 LRTDADPDRL AQTVAQFMDV GYPELTRDQI LQIKNHLMRD EHRVRDFTDP AGAFVRRPMD 2701 RLADVAEAWI RLTAGNPVPQ DILLLRDALA ESDYLRTHPD ATWHDANAHA IGLGLHWDAD 2761 RPPLPGRRAG IPYAVPPLDA RSEWNSPRRD PDDDDWIDIY WFGGGDPGGS PAMPPANPST 2821 GGSPAALDEP STTGGQPNPD RTRSIDGSAQ PPSTAAADSE TPGAQQLRPE PEAERPEAPK 2881 SRLRRLADKL NFWQPDARTG DGESPDPATG ADGNSPEDPA GPGNSASDGE SSRSDRAEVL 2941 ARQREHLLAQ ARQLAGALEV DADGLNGSEL VDAAGAAVAR LAGEEGPGAG LRADWAGRVT 3001 GEIGELVRLA EVAAAAESTL RSGDSIPEAD DGRQWQVVES RRVHEFTYPT AELESARQLL 3061 AAGGDPSDSP VIARDEHPVA ADARARVEEA AVRALLDAQS 
GTPLGYGLRR VPGVGDRPDS 3121 VVLVGVEPHP DRAMDPVVRS ALAPEGTDFT YLRVIVDGDG VVQVERWRHI APEPPAVADV 3181 TVPPQVSRPE VADLVDFTAP ERALLETAQR LHDQLTESVH EGVRSFGVDP AGRSARELAD 3241 AVRAEYERRL SDIEAAQHDS SGPPAGHRVR LDAEVDVLLH DLALLDAADI TEQRLRAHNF 3301 DAPRFTAPSE WRLSGPDRDF IPADPAVARE LAALGRQVEV FDSLAAHARQ LGIGDVDGRS 3361 VRDLADAIRD RTAPRPDQRA PERRRNARIR ADIDRLTRQY ERSGGDPDRI GLRAHAEAEA 3421 VRAVLAEAGG MHRADGVVRL PDGSGLLAVG SAPHPDSVVP VALRSRVAAL GDEFAYRRVQ 3481 FDSTGRVRIT DMAVRPTGSD HAEDPAPRRS LWQRINERFT TPTSDMPWPP STRITGEWGG 3541 YSHEDLARVL SERHGVRVRR FNAAGVNPDA VRLFAMALDD LLTRYPSANL HVVTINRLFS 3601 DRRGDEPPLA WVAGGGSPFV WRSGTTGFAS TMELDVRHAS DHESALPPRS SGGFHPPSAD 3661 PDAYQTAVHE FGHVLDLTGD GRARRLAMSA LLEHFGATRP HVDRAEFDTW LRQLSGYSFH 3721 EDGRFNPPEA LAEAFHDVES NGVDATEPAR VLYVVLLRAA GIDPAPEILA SLNPAQDSAP 3781 LTNDPARQPI TDGPATPDAP AAAPRAGDDD SPGNFRRFLR GLLPGAPEAN IAPGNGLDGR 3841 SDGSADDGSR ARASTPEAPA RRSMWQRIQD VLNGESSTPP WPPSTLLTDE WADRSHSEIK 3901 RILEERHGIR VRYLGRAGAD PDTVRVFAAT LDEMLTRYPE SHVRTVLIGR LPEFANAAVL 3961 PGGSPFATRP DREGFVTTIV LGLRYAGDID APDTPDSPEP LGFRPRRSSS LIRMSTIHEF 4021 GHTLHATGSN RAFARAYASL VEHYSATRGH IDRAEFDTWL NQLSGYSFNR RGGLRTAEAL 4081 ADAFFDVEEN GVDAAEPAKV LYALLLREAG IEPDPHIMAS LHPNGPPTPG IADLPPAQAD 4141 PIAAPDDAAS RPGADGDGST DHVGESETPD TPRTPEPAPQ PQGEAPRTSD VRARLDAATS 4201 PHAELRANTY AAARSWGVDT DGLDPRQVAR AIHEHRVRVF EELRAGLRDL AGQPESQLRA 4261 ARVQAQERLR AVYEAADALI HDLNVVSAAE RAMQGLRAGQ SDTTVPGERS REDTDSPALD 4321 ERSRRPDAGA QDLGMSVEPD SPVVAERVAA LTRQEEQFLA DARRLAVALA IHPIPESGRE 4381 LGDAVRAVMD GLHARVDAAR TALANPDEPS VELGDAEHQV FTAGRVADEI AELLRAADRA 4441 AQVRAAVESG DPVPARERAR WRRSPERALE SRWDAEALAE AGRRLVAFER MAALVRDAGV 4501 DPANRSAEQL VAALRDADPA TRAEMDRHLA EFTDAGGDPA MVDVRAAAER LALRSMSGGS 4561 GGRTLGFTIT RMGGTTAMAP RMLVVGTQAH PDLALDPAVR DLLAEPGTVV EYHRVRFDED 4621 GIARVESWGW DATDYDDLSP PPDADGDAEQ PGSTRRPTPT SDEPWPPSTE LTGEWADYSH 4681 ADIVRIFAQR HGIRVLRFNA EGVNPDAVRL FAQGLDGLLT RYPGIEVAQM TISALGSVAA 4741 DLRPAAWVMH RTRPFSWTGG APSRSVSLVD LDVEFASRPD AHRPVRPDLA 
TAEPGFHPPS 4801 RLPTALYTVI HEFAHMLDNA GEKRAAEQAL QTLLDHFGAT RGQVDRAEFE TWLAQLSGYS 4861 FSVGPDGDYS TRGPFNPTEA LAEAFHDVEE NGVYATEPAK VLYALLLREA GFEPDPTIMA 4921 SLAADGSGPD GTARPHVSEE PDDARPDRTD SGNARADDRD SSGGRFRRFL RELLSGGGRP 4981 EIPAGPDPRG ADGVDDGSRA PRSDSPAQPE GSGPLTPAHQ ARLESAERLR RQLADTVHDA 5041 VRVFGLDPAG LRTPELAEAL RTEYDRRTAE FGANARLEAE VGALIGDLAL LDAAETAENR 5101 LRAGDVSESF LDNTHRWQQL GRTHDFTPSD PALARTLATF GDQVAAYDRV VAHARRLGID 5161 DVDGRSVREL AAAVRDRTAP DPGRSGRERR RNIRARNEMH RLTSRFEAAG GDPNRTGLRA 5221 DAEATAIRAV LAEAGGQRLA DNMVRFPADS QGRSRILVVD SAGHPDLVVH PEVRRRVVAM 5281 QDEIGYQRVD FDDRGRIRLL DMTVVRSSDY TPAGDPDRSL WRRISERFTV PTSDLPWPPS 5341 TTMTDEWGQR SHADIVRVLE QRHGIRVLRF NAEGVDPDAV RQAAKALDDL LTRFPRVNLQ 5401 VVDINALQPQ RKGGRKPAAW LADRSSIFVR RPGTAGIGTT MELDVGYYSR RETGLPPSEH 5461 ARADTFHPPT HYPDGYRTVV HEFGHALAST GNHRALSRAL DALLAHYAAT RPEVGRAEFD 5521 VWLRQLSGYS FHEDGRFNPT EAVAEAFTDV EANGVNATEP ARVLYVLLLT EASITPDPAI 5581 LASLETTTVH PESTPRTPPP AQPETRGPEV PVTTSDGSEP GPGAEHPPEG ETPDPPRSAP 5641 EQPKSPPAND GSTARPQPEP ENVQPTQPET DETGSQTPPL ASGGRGAGAP PRPPADGGVD 5701 DPTPSGDNPP SRIDSPQRRH ERLAQAVWHA RMMAEHAAGA RRMADLYAGT DTPHADWTRL 5761 RAERAELLAR LAADHVIRLQ EQADLADSDH TTPDAGSSDE GDAGAALPVA PGGDDSGGGR 5821 GGVTPPVAPG ADDPDGGDGQ ATQPLSGDGV PQTSGEVEPV GRPGQLESTD PETGVVARQQ 5881 DSVAVPVRDV ADPVREVLRF FDEVAQAQRA AAEGGEFQPT VLTESQRAYL RALEAALGLD 5941 GVLADHPDAV RALRELAEIA RVRGFSGAPE SGVEPGAMRY PEDFAALDID EVHGDEEYWR 6001 FQDDPRRVAE AEAEVNRLDV PGPAAMRAGD DQARLTGPEQ VRDALARRLR VDAADLTPER 6061 MGRTMAELRY RNLLRAGAVE ALADAVARHR AATDDVQRAQ IGATRDQWAR ALGIPEAVVE 6121 PGRAPAALAA LRGGILARAR EIADLADAVQ AARQAEAAEP GGTRRLTVEV DGQRLPVRLV 6181 ADADGTWRVE NTDRLRAPEP AESTERLPVP QEDDAPESKL RKFWKWFKDA WQHHDDVPKY 6241 PSGSGIDGQG QTLLIEYYGD VDLGGTFNPA RILKEIATMW KKRGQIRDFL RGYRAGYNDD 6301 FQPPRDRDGR DYEYWLLEAD PELVRQRGEI PEHLLLHWAE LDEARAKARE QAELEAGTTP 6361 DPAQRPGLES DVEAPVPDSE LSDPQGPAAD AINDLADRLR DALSERNRLG DDLYWAANEL 6421 GRPLADLTPG ELHRAHRELQ YRNLRRAGAI EGLADVAQRY LDENAYIPFL RELNHFEQNP 
6481 LARFLGDVAR SEGHSTAMLD WEGVNNGGEP GRDWGDLTYS DQPGQDDGTR EWFINALRRE 6541 ELRTERGHWA HMLGVARADL SPHTALGEAL AELRADNTER AADIAEFDAA AIRYSELDAL 6601 VDQLSRQLAE LAIPEHIAAQ GGVMLEPGVA LFDGDPMRLA IVAADLTHES RLADLLAQPN 6661 GIAESLDDGS LTLEFRVVWA DAQGAIHAGR IAVPEVRHIT TEVDGRPVSL TLLRDGDGPW 6721 RVAPNSATDP NDTDRQSDPN TAVQPSPRSP QETATARVDL AEQLEIPLAD LDPVRLHATI 6781 DDLNHANQRR AARLEALADY ARALHAIDDY HRRSPEERAF DPRVPDDDPP IHDDTAPAFL 6841 REIITDPAAF GERLRGPGPE PRPGDPDDRP EPDADWAALL GVNLRNPTPE QLADAYEKFR 6901 DGKIDEYEAL TPEELTRVHA ELRDEIRARA AAIAELESVA RQYNRLAVDA AATHARAEVR 6961 RLLVAQEDLL RARDEAQAAV REAVTGLPVA DAEVAPARID QALDRLRALV APDAETPTRL 7021 AALAAAATRL NEADAAVRQG FSDLAAAMER AALLGQGARP IAENFGVIGD RPPVIVVVGW 7081 GVFPDLAGLT AAEARHPILR DLRAHPGVTV RRVLLDLDEH GRMRMRVEGE DPPPDDLPTD 7141 PDTPQPPATP APDSGPTRPA GPRPVAPVAE PRPDASESAQ PQPITSAHGP REPVQSGSAD 7201 ADATDSVDGL PAAGRSETDA GRSGPGEVAA VDPARAALIR QHAAAIAQRE ALAGMVRELA 7261 RRLRLRVDDA ALSPERVLGT LAEVPTTPLD SAHSALRGDL QDLAQQLVRA AREAGELAEE 7321 LGLVAQYDEL LQQHRDTVRE REFVAAKAND RARRLGLVGA SDEWVRMDPD ARERAITAAF 7381 EAAGSDIVHR PGGLDEASRD TRVPVSDWER AQREAVLRKL IEATERHGDL DGEVRELERR 7441 MRELADAGVV RFRAPLDDVA GELDRLARQR GELLQEIKPR RRTRDELAIR LGVVDDGRPN 7501 EHALGRHLDA TLARLRAQLD ADFWAGRADV AEVRTRTREI DELAEAAADV NRAHLRIGQL 7561 QDEMARVAGV WRGQIRDEGG FMVTPRVGVL PGDPSRIVVF GSRSDADAPR TVYEDALADA 7621 LRNSPIAAQL LVREGATIEF RRVLADREGN WRSEPLPPPA VRRLVSEPFN AVRLNVTMWQ 7681 DADGEWHPVD ATRPSWHVNR PDAGGDRNYG KFKTKPLPDG VSSWGMEDVV DDGLVVFQDI 7741 PPGQMPKSAL PVVRPPGSDE AYGLHHLPPG VELFEQHWGA DSYNIVRLIL MAVQAPSLPM 7801 VVRWIQRHPE IGEWVRARPW LADVPYFGLA FMDYKWFARP ETNVQPMHRP WDPDEHGDRS 7861 GDTEIPESLR AQWEAEREAW QRVQAWADTQ YERFLSDDTD IAQITRELAR YRRDQYADTA 7921 RDLVDLVRER LITGDPGLDR TGDIAAHQEQ LQSRIDHLAK EIADKFPAGH RPEIAEALTQ 7981 IRELLLAGAD PDLIARTLTG YLDLDIPDLT ESQIAQIKNH LMRDTHLVRD YTDPAGGFVH 8041 RPMDRLADVA EAWIRLTTGT PLPEDLILLQ DALAESDYLR RHPDATWNEA NAHAIGLGHH 8101 WDAHRPPLTD WRKGIRYAPT PYTPPTPAST ESNTTPESAP PQPDPTPPGD EADDSRGAPT 8161 
GSPTARSSTD AEDAGTPDEP DRGQIPDLSP AGEDPSKSTA PELDPDTDDP AENLSPGALS 8221 PTVTDGTGAP HEPTHQGDTG GPARSSDGEG DDSDGEAGGS ARTPGDDDDD DDGGAAPLND 8281 GRAGGGLPGR AYRSHIPHPP REFELEPMPE RMDVPVYDPG PWPVPDFEEE PAEPGPEQGR 8341 ANPIGTQQQS SGTSSNIDSQ GRTAAASSST GGQASGNGTA APEPSMPAWL RDAAAQMGDP 8401 YQAAQPSERP SQGNSFGAGH QDDGGGSGQP GGVNGSGQQG GTDGSGLEAG ATRLPVRPFN 8461 GVGAAVAFDP ASGVGWGMAG SPEAYAGVYS DTGGGVLVFF RDGERLGVSV NGVSIDVNDP 8521 AVRVEWVPGR DGFTRFSVFL GQVLGGWMVY PNVGAELDLG RLVRDVCADP QRKATVFTR //