LOCUS UCS39493.1 2497 aa PRT PLN 13-OCT-2021 DEFINITION Nakaseomyces glabratus uncharacterized protein protein. ACCESSION CP060155-463 PROTEIN_ID UCS39493.1 SOURCE Nakaseomyces glabratus (Candida glabrata) ORGANISM Nakaseomyces glabratus Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; Saccharomycetes; Saccharomycetales; Saccharomycetaceae; Nakaseomyces. REFERENCE 1 (bases 1 to 1453064) AUTHORS Xu,Z., Green,B., Benoit,N., Sobel,J., Schatz,M., Wheelan,S. and Cormack,B. TITLE Genome dynamics and non-allelic homologous recombination in Candida glabrata JOURNAL Unpublished REFERENCE 2 (bases 1 to 1453064) AUTHORS Xu,Z., Sobel,J., Schatz,M., Wheelan,S. and Cormack,B. TITLE Direct Submission JOURNAL Submitted (12-AUG-2020) Molecular Biology and Genetics, Johns Hopkins Medical Institution, 725 N Wolfe St, Baltimore, MD 21205, USA COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Canu v. 1.5 Genome Representation :: Full Expected Final Version :: Yes Genome Coverage :: 40.0x Sequencing Technology :: PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /organism="Nakaseomyces glabratus" /mol_type="genomic DNA" /strain="BG3994" /isolation_source="vaginal" /host="Homo sapiens" /db_xref="taxon:5478" /chromosome="M" /geo_loc_name="USA: Detroit" /collection_date="2003-12-02" /collected_by="Jack Sobel" protein /locus_tag="HLK62_M10483" /note="CAGL0M10527g; Ortholog(s) have nucleolus localization" BEGIN 1 MKQKTTTKTT KRYRYSSFKS RIDDLRIEPA RNLEKRAHDY VESSHLLASF EHWKDINLSA 61 SFTALVPQLE PLVQTLPQIL FHSKQVCALL IEAIDKHDEL SLQPELDMLA QFCHDLGPDF 121 MPFYKQAMDS LIALISDAGS LESPQVLEWA FNCLAYLFKY LSRLLTADLA QTSELLFPLL 181 SHHREYLSRF SAEALSFLIR KTSAKNLPAL LNYFFAKLSE NEEEGHYYEG LLTLFTESLV 241 STQGSLHSKS NIILTAFITK VLTPQAVDPV CTTLFCDVWM NISKHTAAEN LAPVYQLIFQ 301 HITEKLGPEN TNTVVQIMAT LVFSESGKKI PHWETVVSIT KSILESCNAE NCSSSNLAFY 361 SAALFRNADV RSMTQLHKLL FSTYATKFTD DYFAFLRYCM DLCPDKISSY NGDKYANVFL 421 EENWASQGQN LSLFLLELEQ NTQLQNRLRI KIPSGLVNSL LDTLRTINDK EITEEQLADL 481 YWMSVILKKC DVQESDPVIN VILNLISDTG NPSDMKKDLL GNLILAIPSD ENNILINIME 541 KLVHSFGAYR DSVFFVKSIT YLLRKTGNTD SVKTIIDSLL TDNLENFQQN LVLPDSKIRY 601 ETLVLISTLY ELRSSEVPQL INECKIIEEI PLSLDNGRAL TARIRAMSAP FLKIQRGSPE 661 IKLVISHMFG LLTIRFSPIW DGVNDFLSAT TGKCPDLVWK LILQFINVLE HPIISSYPQT 721 FMDIDSPVSL WDSRVDRLTN TINNFREIWN RFFNKNESIF ELSKDLRGSF QYPGQIRNQT 781 LKVMLLVPHL AEQHFADIIT YFFNQVEYEE LFDNDFNGEK TAKNWTEADR NVLLKVLSKF 841 KNIKNVYKSD ELHNRLLTLL GSKNTEVQKL ALDAIFAYKE PAVNKYKDNL SNLLNDTLFK 901 DEITIFFANK EKNQLEEQHE IKLMPYILRI LYGRAQTPIT SGSKKSGKYA VVSVLPNFKR 961 KYIIDFLSLT YNGLAFEKFF DKKYAVDKDD LTQGTLKRMS GFVTLMSGVI GTLGSKFSIE 1021 LATVLKPLLF VISSSYYICG SDANLKDSDQ SHLLKIGSTL RQHSLKCLSE FFDTLGDDLN 1081 WDPYIKDIYE CAFKPRLANF SVENAQQISS LMRIMVQWSG NESLYRFLYY NSFSCTKALV 1141 ELLHNPHTKE PVLVSILTAC NDVITRPAQD AEYVELVTII STSTLKSLPT LYERLGNSDS 1201 ISIAIEVLVN LTENGYVQDD ETISYLLSSL TLILESNKNV NNPKIITKML NVLKTLIPVS 1261 TMPFSDLESL FRTLSGFYQT CADKETRLGV NDVLSAFSQR FDGLEKVASL MKSLNSYSNR 1321 RIQEYDFPVM LSAFKKFTEE DYCNYSEIQW FPVVHTCLFL INDKEELAVR TNATHTLTVF 1381 VKYINEKSSF EEAKPGITIL KSIILPQIKS GLRKYNDEIQ TEYIALLEYI VQNSKYYNDM 1441 ADMQVLSFGD DEEASFFKNI AHVQLHRRQR AIRRLKEVAS ELSDNSISHY LIPIAEQYVF 1501 SDEEKFRNIA NESLITIGEL ANFMSWNQYK ALMRRYIHLL KTKDTALKQS VLLITSVSVA 1561 LKNTLTAKRN SSDEEINSRT MRKFPNNFND AESFIKHELY PTLSKILGTR NDDTIVARMP 1621 LSEGIINILL GLDEDDKITL LPGVLTSICQ VLRSKSEELR DAVRQSLSKI VVILGAKYIV 1681 FIIKELVSAL QRGSQVHVLS YTVHHVLRTI VDDLNHGDLD DSAHLIVRVI MEDVFGAVGE 1741 EKDSDNYHTK MKEVKVNRSY DTGEVLAANI SLQEFSTILK PIKMLLMERV SFKSQNKLQE 1801 LLRHYALGIN HNTEAAEINS LRLCYEIFNQ DVEQKKYNRP KATVTEQEEF FLVNLNAKKE 1861 RVVTEYSLLS HTFQKFSLDL LRTVITRHKS LMQAKYLQGF VPLLQQSLTS DDENVLISTL 1921 KVLIILVKID FEMSTENLFK NCVKKALNIV KDSPSTGSEL CQMAIKYLSA AIKHKEMKLK 1981 NVALSYVLQR ILPDLNEPNK QGLAFNFLKS LISKQVMLPE LYDIMTTVRE IMITNHSKDI 2041 RNVARSVFYH FLMEYDQSKG RLEKQFKFMV DNLQYPAPDG KQSVMELINL IVTKANPELL 2101 SKLASSFFIG LANVSVNDDS PRCREMATII LTNMLPRLDA TALRTIVKYI SAWLKQLEND 2161 AFLNLGLRIY KVYLEGLGIG HSAELDDQAT KAIIKTLHNT DENSKTQWDL IYSALAVFTV 2221 FTNKNPEVFN SEYKNIWDSV IKCLLYPHIW VRQASGKLVC DLFNHLEKDN WMYENSDIQI 2281 ITSKIIHQLR APSIPEALAE TAIKTLLKVS SYWNKNNVSY IPFGETTESI NRYSTAIEYV 2341 VSSIGGIIRS EENRNDTFFS KKFAIQYFYL LTQMLNAEAL ESVLEIILFS LYIYLEKSDV 2401 SRLTDEESEL VTSSQECMKQ LEDSVSVSAF SKAYANVKQM VYRRRLERKN KRSVLAVTAP 2461 DVAAAKKLKK HARSREKRKH ERDENGYYQR KNKKKRI //