LOCUS       BEK93123.1              1228 aa            PRT       BCT 26-JUL-2023
DEFINITION  Nocardia seriolae hypothetical protein protein.
ACCESSION   AP028459-1029
PROTEIN_ID  BEK93123.1
SOURCE      Nocardia seriolae
  ORGANISM  Nocardia seriolae
            Bacteria; Bacillati; Actinomycetota; Actinomycetes;
            Mycobacteriales; Nocardiaceae; Nocardia.
REFERENCE   1  (bases 1 to 8113213)
  AUTHORS   Umeda,K., Matsuura,Y., Shimahara,Y., Takano,T. and Matsuyama,T.
  TITLE     Direct Submission
  JOURNAL   Submitted (29-JUN-2023) to the DDBJ/EMBL/GenBank databases.
            Contact:Kousuke Umeda
            Fisheries Technology Institute, Japan Fisheries Research and
            Education Agency, Aquaculture Research Department; 422-1
            Nakatsuhamaura, Minami-ise, Mie 516-0193, Japan
REFERENCE   2
  AUTHORS   Umeda,K., Matsuura,Y., Shimahara,Y., Takano,T. and Matsuyama,T.
  TITLE     Complete genome sequences of alpha-glucosidase-positive/negative
            strains of Nocardia seriolae from Seriola species in Japan.
  JOURNAL   Unpublished (2023)
COMMENT     Annotated by DFAST https://dfast.ddbj.nig.ac.jp/

            ##Genome-Assembly-Data-START##
            Assembly Method       :: Unicycler v. 0.5.0
            Genome Coverage       :: 220x
            Sequencing Technology :: BGI DNBseq; Nanopore MinION
            ##Genome-Assembly-Data-END##
FEATURES             Qualifiers
     source          /collection_date="2002-10-24"
                     /db_xref="taxon:37332"
                     /geo_loc_name="Japan:Oita, Tsukumi, Tsukumi Bay"
                     /host="Seriola quinqueradiata"
                     /isolation_source="muscle isolate from a diseased fish in
                     a fish farm"
                     /lat_lon="33.10 N 131.89 E"
                     /mol_type="genomic DNA"
                     /organism="Nocardia seriolae"
                     /strain="024013"
     protein         /inference="COORDINATES:ab initio
                     prediction:MetaGeneAnnotator"
                     /locus_tag="NSER024013_10290"
                     /note="frameshifted, insertion/deletion at around 1126622"
                     /note="possible pseudo due to internal stop codon"
                     /transl_table=11
BEGIN
        1 MALYLPPELE WLGWLVGVEW PEGNEDHMWE LATHWSDAAK GLRGQVSKMD DAKYATLSAY
       61 VDGEGRDAMA KVFDTMAGVG GKQDGNTSIL DLAEFYDQIA DSVHETGTEI ESTKLMFYSS
      121 LAILAFEMIA AWAFPPTAPA AEAAAQAATR VAVRFIARRA IAAIERQVAK LVGSTLAKFM
      181 VRHFLLDAGL GVLQEAGIEE YQKTVGHRKD LDWGKIGVTA VGAAAGGIAG GKVGKALGEY
      241 FEHSGMNHYL TAAITGTTAG LAGGGAGFLA STGAQFGLDA YQHGLGDAWK NTVNAFSHFD
      301 PRMLTGGMAN GGLAGINHTA AHNFWEPRVA RMKAGVSTDM PHVGSRPDMG APGNTGHTGA
      361 TDVGGAGGSD SGHAGGSDTG RAGGGDTGHA GGADTGGPRT HPAGFSGESA DSTAGTRSTG
      421 DGSTPHPQSH DGGGIGVDSR GNSGRDTPTG EQPAAPGEHG QSGTSQPAAH APADTGAPSD
      481 THGTQPHAGT PDNAGTPASG VQAGKHENAA PVADSADHGS GPVVQNSGPA TQSPTASDSR
      541 PATQDSRPTA SDSRPATSDS RPATSDSRST GSDSRSAGSD SRSTGSDSRS TDFDSRSAGV
      601 ESGSAEPELH SASADESGST TSASADSGAR DQSATAAQSD SGTSRSGGAA DAGASTTSGP
      661 HPAQGGPAAA PHESPAAAAA PSATPVDSRM ALSANANGPA LTANGGEASA AAAARPGVPA
      721 GAPTTGTPSA AHAQHGGPTP ESRTATPDTR TSPTRAGDAT GTRSDATRPE SARPQSGTEA
      781 RPQPGAEPRP APAAHPRPTG PDTTRPGSEA ARPSAEEPAA SHESAAPSEL EGCSAISEPI
      841 VDQRESATPG TPAPQSIDSP ATRESASNPT ETGNRAGEAP ADHPPAHHEP PAPTETTPHE
      901 AELPDSTHPS DESTTTSGES DSHPESTPAR AEPLPREEAP SSAEATPRNE AAPRDETALR
      961 GEVTLHEDTT PREDVTPRDE TVPREEVVPR DEATLRDETA PQDESAPRDA TAPRDEGLPH
     1021 EEAPPRDEAA PHDEFVPRDE PLPRDEASPR DDTGSHDETA PRDEAMPRDG VGLGDEAGSR
     1081 LGEAPVDSEG RGRGDEAVVD GGNRRPVEES VREPGDRAEG EPVPVGDRSE PVDGRGAPVE
     1141 ERDGGAGAPE DGLLIAPVPV GEHPVSHGGE GERGASGGER GREVGEGQRG GERPVPVDRR
     1201 GAGDRAGERR AGAPERGDGA VAHPGSVR
//