LOCUS UCS39493.1 2497 aa PRT PLN 13-OCT-2021
DEFINITION Nakaseomyces glabratus uncharacterized protein.
ACCESSION CP060155-463
PROTEIN_ID UCS39493.1
SOURCE Nakaseomyces glabratus (Candida glabrata)
ORGANISM Nakaseomyces glabratus
Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina;
Saccharomycetes; Saccharomycetales; Saccharomycetaceae;
Nakaseomyces.
REFERENCE 1 (bases 1 to 1453064)
AUTHORS Xu,Z., Green,B., Benoit,N., Sobel,J., Schatz,M., Wheelan,S. and
Cormack,B.
TITLE Genome dynamics and non-allelic homologous recombination in Candida
glabrata
JOURNAL Unpublished
REFERENCE 2 (bases 1 to 1453064)
AUTHORS Xu,Z., Sobel,J., Schatz,M., Wheelan,S. and Cormack,B.
TITLE Direct Submission
JOURNAL Submitted (12-AUG-2020) Molecular Biology and Genetics, Johns
Hopkins Medical Institution, 725 N Wolfe St, Baltimore, MD 21205,
USA
COMMENT ##Genome-Assembly-Data-START##
Assembly Method :: Canu v. 1.5
Genome Representation :: Full
Expected Final Version :: Yes
Genome Coverage :: 40.0x
Sequencing Technology :: PacBio RSII
##Genome-Assembly-Data-END##
FEATURES Qualifiers
source /organism="Nakaseomyces glabratus"
/mol_type="genomic DNA"
/strain="BG3994"
/isolation_source="vaginal"
/host="Homo sapiens"
/db_xref="taxon:5478"
/chromosome="M"
/geo_loc_name="USA: Detroit"
/collection_date="2003-12-02"
/collected_by="Jack Sobel"
protein /locus_tag="HLK62_M10483"
/note="CAGL0M10527g; Ortholog(s) have nucleolus
localization"
BEGIN
1 MKQKTTTKTT KRYRYSSFKS RIDDLRIEPA RNLEKRAHDY VESSHLLASF EHWKDINLSA
61 SFTALVPQLE PLVQTLPQIL FHSKQVCALL IEAIDKHDEL SLQPELDMLA QFCHDLGPDF
121 MPFYKQAMDS LIALISDAGS LESPQVLEWA FNCLAYLFKY LSRLLTADLA QTSELLFPLL
181 SHHREYLSRF SAEALSFLIR KTSAKNLPAL LNYFFAKLSE NEEEGHYYEG LLTLFTESLV
241 STQGSLHSKS NIILTAFITK VLTPQAVDPV CTTLFCDVWM NISKHTAAEN LAPVYQLIFQ
301 HITEKLGPEN TNTVVQIMAT LVFSESGKKI PHWETVVSIT KSILESCNAE NCSSSNLAFY
361 SAALFRNADV RSMTQLHKLL FSTYATKFTD DYFAFLRYCM DLCPDKISSY NGDKYANVFL
421 EENWASQGQN LSLFLLELEQ NTQLQNRLRI KIPSGLVNSL LDTLRTINDK EITEEQLADL
481 YWMSVILKKC DVQESDPVIN VILNLISDTG NPSDMKKDLL GNLILAIPSD ENNILINIME
541 KLVHSFGAYR DSVFFVKSIT YLLRKTGNTD SVKTIIDSLL TDNLENFQQN LVLPDSKIRY
601 ETLVLISTLY ELRSSEVPQL INECKIIEEI PLSLDNGRAL TARIRAMSAP FLKIQRGSPE
661 IKLVISHMFG LLTIRFSPIW DGVNDFLSAT TGKCPDLVWK LILQFINVLE HPIISSYPQT
721 FMDIDSPVSL WDSRVDRLTN TINNFREIWN RFFNKNESIF ELSKDLRGSF QYPGQIRNQT
781 LKVMLLVPHL AEQHFADIIT YFFNQVEYEE LFDNDFNGEK TAKNWTEADR NVLLKVLSKF
841 KNIKNVYKSD ELHNRLLTLL GSKNTEVQKL ALDAIFAYKE PAVNKYKDNL SNLLNDTLFK
901 DEITIFFANK EKNQLEEQHE IKLMPYILRI LYGRAQTPIT SGSKKSGKYA VVSVLPNFKR
961 KYIIDFLSLT YNGLAFEKFF DKKYAVDKDD LTQGTLKRMS GFVTLMSGVI GTLGSKFSIE
1021 LATVLKPLLF VISSSYYICG SDANLKDSDQ SHLLKIGSTL RQHSLKCLSE FFDTLGDDLN
1081 WDPYIKDIYE CAFKPRLANF SVENAQQISS LMRIMVQWSG NESLYRFLYY NSFSCTKALV
1141 ELLHNPHTKE PVLVSILTAC NDVITRPAQD AEYVELVTII STSTLKSLPT LYERLGNSDS
1201 ISIAIEVLVN LTENGYVQDD ETISYLLSSL TLILESNKNV NNPKIITKML NVLKTLIPVS
1261 TMPFSDLESL FRTLSGFYQT CADKETRLGV NDVLSAFSQR FDGLEKVASL MKSLNSYSNR
1321 RIQEYDFPVM LSAFKKFTEE DYCNYSEIQW FPVVHTCLFL INDKEELAVR TNATHTLTVF
1381 VKYINEKSSF EEAKPGITIL KSIILPQIKS GLRKYNDEIQ TEYIALLEYI VQNSKYYNDM
1441 ADMQVLSFGD DEEASFFKNI AHVQLHRRQR AIRRLKEVAS ELSDNSISHY LIPIAEQYVF
1501 SDEEKFRNIA NESLITIGEL ANFMSWNQYK ALMRRYIHLL KTKDTALKQS VLLITSVSVA
1561 LKNTLTAKRN SSDEEINSRT MRKFPNNFND AESFIKHELY PTLSKILGTR NDDTIVARMP
1621 LSEGIINILL GLDEDDKITL LPGVLTSICQ VLRSKSEELR DAVRQSLSKI VVILGAKYIV
1681 FIIKELVSAL QRGSQVHVLS YTVHHVLRTI VDDLNHGDLD DSAHLIVRVI MEDVFGAVGE
1741 EKDSDNYHTK MKEVKVNRSY DTGEVLAANI SLQEFSTILK PIKMLLMERV SFKSQNKLQE
1801 LLRHYALGIN HNTEAAEINS LRLCYEIFNQ DVEQKKYNRP KATVTEQEEF FLVNLNAKKE
1861 RVVTEYSLLS HTFQKFSLDL LRTVITRHKS LMQAKYLQGF VPLLQQSLTS DDENVLISTL
1921 KVLIILVKID FEMSTENLFK NCVKKALNIV KDSPSTGSEL CQMAIKYLSA AIKHKEMKLK
1981 NVALSYVLQR ILPDLNEPNK QGLAFNFLKS LISKQVMLPE LYDIMTTVRE IMITNHSKDI
2041 RNVARSVFYH FLMEYDQSKG RLEKQFKFMV DNLQYPAPDG KQSVMELINL IVTKANPELL
2101 SKLASSFFIG LANVSVNDDS PRCREMATII LTNMLPRLDA TALRTIVKYI SAWLKQLEND
2161 AFLNLGLRIY KVYLEGLGIG HSAELDDQAT KAIIKTLHNT DENSKTQWDL IYSALAVFTV
2221 FTNKNPEVFN SEYKNIWDSV IKCLLYPHIW VRQASGKLVC DLFNHLEKDN WMYENSDIQI
2281 ITSKIIHQLR APSIPEALAE TAIKTLLKVS SYWNKNNVSY IPFGETTESI NRYSTAIEYV
2341 VSSIGGIIRS EENRNDTFFS KKFAIQYFYL LTQMLNAEAL ESVLEIILFS LYIYLEKSDV
2401 SRLTDEESEL VTSSQECMKQ LEDSVSVSAF SKAYANVKQM VYRRRLERKN KRSVLAVTAP
2461 DVAAAKKLKK HARSREKRKH ERDENGYYQR KNKKKRI
//