LOCUS DS999421 291194 bp DNA linear CON 24-JUL-2016 DEFINITION Thalassiosira pseudonana CCMP1335 chromosome 19 THAPSchr_19c_29 genomic scaffold, whole genome shotgun sequence. ACCESSION DS999421 AAFD02000000 VERSION DS999421.1 DBLINK BioProject: PRJNA191 BioSample: SAMN02744045 KEYWORDS WGS. SOURCE Thalassiosira pseudonana CCMP1335 ORGANISM Thalassiosira pseudonana CCMP1335 Eukaryota; Sar; Stramenopiles; Ochrophyta; Bacillariophyta; Coscinodiscophyceae; Thalassiosirophycidae; Thalassiosirales; Thalassiosiraceae; Thalassiosira. REFERENCE 1 (bases 1 to 291194) AUTHORS Armbrust,E.V., Berges,J.A., Bowler,C., Green,B.R., Martinez,D., Putnam,N.H., Zhou,S., Allen,A.E., Apt,K.E., Bechner,M., Brzezinski,M.A., Chaal,B.K., Chiovitti,A., Davis,A.K., Demarest,M.S., Detter,J.C., Glavina,T., Goodstein,D., Hadi,M.Z., Hellsten,U., Hildebrand,M., Jenkins,B.D., Jurka,J., Kapitonov,V.V., Kroger,N., Lau,W.W., Lane,T.W., Larimer,F.W., Lippmeier,J.C., Lucas,S., Medina,M., Montsant,A., Obornik,M., Parker,M.S., Palenik,B., Pazour,G.J., Richardson,P.M., Rynearson,T.A., Saito,M.A., Schwartz,D.C., Thamatrakoln,K., Valentin,K., Vardi,A., Wilkerson,F.P. and Rokhsar,D.S. TITLE The genome of the diatom Thalassiosira pseudonana: ecology, evolution, and metabolism JOURNAL Science 306 (5693), 79-86 (2004) PUBMED 15459382 REFERENCE 2 (bases 1 to 291194) AUTHORS Bowler,C., Allen,A.E., Badger,J.H., Grimwood,J., Jabbari,K., Kuo,A., Maheswari,U., Martens,C., Maumus,F., Otillar,R.P., Rayko,E., Salamov,A., Vandepoele,K., Beszteri,B., Gruber,A., Heijde,M., Katinka,M., Mock,T., Valentin,K., Verret,F., Berges,J.A., Brownlee,C., Cadoret,J.P., Chiovitti,A., Choi,C.J., Coesel,S., De Martino,A., Detter,J.C., Durkin,C., Falciatore,A., Fournet,J., Haruta,M., Huysman,M.J., Jenkins,B.D., Jiroutova,K., Jorgensen,R.E., Joubert,Y., Kaplan,A., Kroger,N., Kroth,P.G., La Roche,J., Lindquist,E., Lommer,M., Martin-Jezequel,V., Lopez,P.J., Lucas,S., Mangogna,M., McGinnis,K., Medlin,L.K., Montsant,A., Oudot-Le Secq,M.P., Napoli,C., Obornik,M., Parker,M.S., Petit,J.L., Porcel,B.M., Poulsen,N., Robison,M., Rychlewski,L., Rynearson,T.A., Schmutz,J., Shapiro,H., Siaut,M., Stanley,M., Sussman,M.R., Taylor,A.R., Vardi,A., von Dassow,P., Vyverman,W., Willis,A., Wyrwicz,L.S., Rokhsar,D.S., Weissenbach,J., Armbrust,E.V., Green,B.R., Van de Peer,Y. and Grigoriev,I.V. TITLE The Phaeodactylum genome reveals the evolutionary history of diatom genomes JOURNAL Nature 456 (7219), 239-244 (2008) PUBMED 18923393 REFERENCE 3 (bases 1 to 291194) AUTHORS Grigoriev,I., Grimwood,J., Kuo,A., Otillar,R.P., Salamov,A., Detter,J.C., Schmutz,J., Lindquist,E., Shapiro,H., Lucas,S., Glavina del Rio,T., Bruce,D., Pitluck,S., Rokhsar,D. and Armbrust,V. CONSRTM Diatom Consortium TITLE Direct Submission JOURNAL Submitted (18-SEP-2008) US DOE Joint Genome Institute, 2800 Mitchell Drive B100, Walnut Creek, CA 94598-1698, USA COMMENT URL -- http://www.jgi.doe.gov/thalassiosira JGI Project ID: 2662235 Contacts: E. Virginia Armbrust (armbrust@u.washington.edu) Igor Grigoriev (ivgrigoriev@lbl.gov) The clone of P. pseudonana that was sequenced is CCMP1335 and is available from the Center for Culture of Marine Phytoplankton (http://ccmp.bigelow.org). This clone was collected in 1958 from Moriches Bay (Long Island, New York) and has been maintained continuously in culture. Annotation was done by the JGI Annotation team and the Diatom Consortium. Chromosomes 7 and 18 are complete and present in GenBank records CP001160 and CP001159, respectively. The JGI and collaborators endorse the principles for the distribution and use of large scale sequencing data adopted by the larger genome sequencing community and urge users of this data to follow them. It is our intention to publish the work of this project in a timely fashion and we welcome collaborative interaction on the project and analysis. (http://www.genome.gov/page.cfm?pageID=10506376) Annotated scaffolds were added in January 2009. FEATURES Location/Qualifiers source 1..291194 /organism="Thalassiosira pseudonana CCMP1335" /mol_type="genomic DNA" /submitter_seqid="THAPSchr_19c_29" /strain="CCMP1335" /db_xref="taxon:296543" /chromosome="19" gene <68..>2515 /locus_tag="THAPSDRAFT_11315" mRNA join(<68..292,381..1386,1473..1574,1667..1971,2053..2424, 2462..>2515) /locus_tag="THAPSDRAFT_11315" /product="predicted protein" CDS join(68..292,381..1386,1473..1574,1667..1971,2053..2424, 2462..2515) /locus_tag="THAPSDRAFT_11315" /codon_start=1 /product="predicted protein" /protein_id="EED86470.1" /translation="MPPRLHLHLFFISVAIAAITIFSVILGYPPILSSTLSNPATVLG KTGLRSSSFGDFPDQEDSAPSIDLRRVDVVEQPSESEPESIDVNGREAVSGNNEQNIK PYSLDDILPVAKAYRNQFAVFAYIPTEDQFICLESEANGLFFGTEHRRAKMIIYSLSV FLRALFPQHLNKDSTELVFALSTGDSPMIHYDRCRKNELVCDSSVPVLQFSSVLRDAS EFLPNRLAMPVPQGNHLGCFIHLIEHGDVCKPFLPRSPTNQVGLVFGETVSLTWEDLI PQVVWRGTDHAFLEALLSPRLRRPNFEKDVAREMSDISDNMTVALDGMRKVYDELVPR WKGVVWTVEAESEAMNKTGSLPWADMKFSSVMYKGAKNDTAYYEQFKNYGISAVGNKM SLEELAKYKYHIDIGGGGGTTWSGTLEKLALPGLLFHHETETRDYYYNRLEPYVHFVP VKQDLSDLKEKYDWAEQNPEKAKEISERATALIKSFGTTEGMESLFKEYIEEPLNKVI SAYQPVDGSFQEVLEQFSAAADFHISSTCTSDGCNTSMALEKHTANTTTDTANITTDI VIQSPSPSSVHPSCFGLGNAISNNITVLPASPKYLIFYPKMGMGNNIMGYASAVMYAC LSGRVLKIAPQKVRDKSVFKEGFQCGEFFASSPGSICDGLEMDGELVQKRMVTLLVVA TDFKT" gap 2887..4909 /estimated_length=2023 gene complement(<5838..6664) /locus_tag="THAPSDRAFT_264634" /pseudo mRNA complement(join(<5838..5842,6266..6664)) /locus_tag="THAPSDRAFT_264634" /note="predicted protein" /pseudo CDS complement(join(<5838..5842,6266..6663)) /locus_tag="THAPSDRAFT_264634" /note="predicted protein" /pseudo /codon_start=1 gene complement(<7592..>12363) /locus_tag="THAPSDRAFT_11316" mRNA complement(join(<7592..7911,7982..10676,10711..>12363)) /locus_tag="THAPSDRAFT_11316" /product="predicted protein" CDS complement(join(7592..7911,7982..10676,10711..12363)) /locus_tag="THAPSDRAFT_11316" /codon_start=1 /product="predicted protein" /protein_id="EED86517.1" /translation="MHEGLHSDHTMQFVDFDQQRLFGNKSFTPIIGQERQFTLKNAIK KRAFLTKLREIHQHQKIGERVQALQLAFQKEGKTEALEQRYNRVDYEIQCSMLAAANA QARKSFGYQWSPALVTAGRMKRFWQIVTSSKRRHMKVAPAALHLAETLQIPTDNLDDQ SLTVCHRSLHEAAKRLRQVQRNDAEERLTWLEELAQEATRDDPSSDWQQILKRLVTAT KTKALHRKLSAVLKPERVGMDHIEVPQEGWYYSTSLDELYEFDNGIFRAHTRVDGDLA MVAEITHENGGIRMSQPSPTMPPTWRKITNMEEMEQWLLRRNKRHLQQMYLEDSPPTL PSFAAMMGEHGTSTTVDAILDGTFDIDSIELPEQMKAWLKTMKRTPAERDLTVVTAMT SKQYQEAFKAVDERTSSSPSGLHYTLWKAIAGEDDLCAYFSVMMSLPFTFGFVNERWT KEIDVMLAKEKGVTQIHKNRLIGLLEADFNTALKWYYPVQIMGNAESSGINVNQWGGR SNRTATMCATRKLLLWEYARLTRKTTATFKYILHDIAEIWNAKSMERFVRTAAGTSKQ SYKQEEMDIPLSGEIQGKGDIMALWTLQSHTMLETHNQQCPGVILQHAADDTVSERTI DAYVDDADNYADAPETNGAEEAIGRLQTSAQIWADIVAATGGLMAFHKCNWQILAFTP VGGYVLPQSRNRFSRCDIHLRNHKGMSSKIEYKRHTDANKGLGVTMCPTGDQKPKFMR RLTQMRECVARIATSSLSLTEAYLALKTRVLPMITYSFPVTSFTAKQLKSLAVLIDTA FIPKLGMSSKMKRIEVYAPLELGGANVPSIESIHDQMGIQHFVRSVQWGKELASDIRI VLSRAQLYSGVCTPFLEDCMTNLRHMEEGWLLHLRKRLSALNGSIWVENAWTPKLQRV NDVSLMEALSQLDLVKKADLITANNCRMYLRVITLADITTMDGRAIDKRLIDGSTRAE STLRWPMQPIPTDRMWTAFRQLPKRAFCTKRGFNQRDKMILDTPMQDWLQVERHVKHT MYRTPRKMYLRETVDDLDAYIVQNQGGDVGAIEQLDTGIIWQYSEHPTGNYFLREKTV LFIPPQAQPVTGYFQEERLYPDIAYTVQPTVHNPPTQHPAVIDDEAALRNAPRLTVVS DGSMDPISGRAAFAWVITGPDRIGHIKRSKPIRTNPRYMSSFRAELEGVHDVISYLTT NHYTGQRIDLWCDNKWCIDALSNPHDAIDELGRAEGALIKATRTLLREFTGITLHHIY GHQDDIRTYDDLTMESQLNVDCDTEAKEQMKKSTLSGRTEAEPGTGAMLYLGDDMVTS HMAEQIQYAGQAPSMFQYNRDRFEWTDQQCTAINWKGIGVAKKRLTRPKSHRTTQMMY GWLNVGHQKVKLEQEGLCPCCGKEEETQIHLYRCMNSTMRESLASGIREMEKTLYKSG MAAQVYIGFIDQIFRRPSVERLCYGAFITSNGHIHYNLRGYPLKDMMMEPSREKETPL NSQQCLLARPGNYSSNSGRHGMLYSTALIVTSTESSSSNLTNVFWITKGTEHFYLDIK TIT" gene complement(<13217..>18439) /locus_tag="THAPSDRAFT_11317" mRNA complement(join(<13217..13725,13877..14178,14323..15349, 15421..15656,15692..16126,16235..16367,16400..17175, 17264..17416,17444..17550,17574..17662,17736..17817, 17858..18073,18200..>18439)) /locus_tag="THAPSDRAFT_11317" /product="predicted protein" CDS complement(join(13217..13725,13877..14178,14323..15349, 15421..15656,15692..16126,16235..16367,16400..17175, 17264..17416,17444..17550,17574..17662,17736..17817, 17858..18073,18200..18439)) /locus_tag="THAPSDRAFT_11317" /codon_start=1 /product="predicted protein" /protein_id="EED86518.1" /translation="MNEFAKKPFGIFRELHSNHKKGSPMHLLSCGRLQQHVASPVTQM NSPYMLHDFIARVERSGGDIVVIVGGGGVDASRWDDLRPLKLLTGTQSTHHPDAAMED TSASTSTASAPTPTVTNLTNLTSTSIAGRTRNASKQAALATDLDNEFSISDWFTLARE EVMEEVGLEEIMAFGGNSDNRKGDWCTAAMVGRWSSVLTPASPAAPTPTFHNPYHCRG KGVCYHPDCTFFLLGLHQLQQDGVFHQNVGSSTLSSMAAASSIGCKHEDPTIWEQLSS AYNAPGLPNYNWCKNSGLKRVRITLPATHGRGTGERECCPRGAFPPRNGRGDGDNRYA LLADVNNTRAPQPPTLASQQYQSRSNGDGRADNAPGRQREMISRGRGRNAGRGGRNGR GRGRGYDWRNASSLHGTRPLPPSRNMEYTTNFIRVLVPLLQLQADSPVTLEGDAFDEL SSNEYTLLLFHLAAERGHRYWPGDKHALADRWTFTHPDEILAAINDPTEMRRIVSRFS AETDLFPQFDDRTAPLQYTSLMGDLLKKLGTARQRDNTLMTKSSQNYCWETDQMRGSK FEMLRSLPPLQLVPLLTSPGAFHYYLYQRGMENPLPSAGVQTDNTVGETRKKGGALTT EFPSDAEFDEEADAIASRIEFERTLREPNKLLQLDLLTIAQMTDEDAWKRVVYPALEQ YLMKYYPAHHTQILTRVKCWSLPELLCYLREPGALTETIRGMGLERLRAIAQGTKDSH RVKMALRRLVDRQTRWTWNSSGGTRLGRFRPPGMGHVMSIETTTPGHTAAAIFDVSAL PTESEDLEGRYTRLCGDLKSTKRTQSRLYCDWARKTLRMSMEVIAIPSTNFVPGIMIL YSSYYDVEDDVRGEISQQILENCSYDLIPTDCHISLRTKRVTRPTMESPEVQQVQLLC IDFVSAKADELRTLFLQLNACEKTRLNARATYDFVYFPAQTSATFSDTEYDECLGKQL DYLAGRLVATVKGVPASVDLCHLVPPRHKDGQTVWDMGSLLISEEKPLFYSRKSGDHV AALFDKITPFYNSGGRRTGKYVFIGRKQRYTTMKEYLIEHLHRGLVEDFPDLDWSKLV ITLGGPSQASGPAETIFTLPTSTPPRRPDPGLTAASSAHHPGAVTPATNAAEPKDVGH VGQAGTSPPPLLSTSHVIEMEYRDRQWQHDFMPKAVKIISRSIISSLDTATRSQYELP TDVDGEHADPALVARGVVNAFWKTLDPDAEGSEDDSAASQDDIYASPQENKFAEEEGC KTRGSSMQPLSDHTGTTDATPVGKTDRDSDYDTMTDYHRRRRHAWDSDSSEEDDEQQR PESPWKNKDGVPLVLEAVNRKTRKDVSPSTHSRPQSQARKHLLPYTVPQAADSSTTQT GDSTISSVASTPTITNLTPLTSTGIASRTRNASKKIKSTSMLHNEFSVCANSEDEI" gene <18877..>22231 /locus_tag="THAPSDRAFT_25525" mRNA join(<18877..19101,19190..20207,20294..20394,20456..20791, 20873..>22231) /locus_tag="THAPSDRAFT_25525" /product="predicted protein" CDS join(18877..19101,19190..20207,20294..20394,20456..20791, 20873..22231) /locus_tag="THAPSDRAFT_25525" /codon_start=1 /product="predicted protein" /protein_id="EED86471.1" /translation="MPPRLHLHLFFISVAIAAITIFSVILGYPPILSSTLSNPATVLG KTGLRSSSFGDFPDQEDSAPSIDLRRVDVVEQPSESEPESIDVNGREAVSGNNEQNIK PYSLDDILPVAKAYRNQFAVFAYIPTEDQFICLESEANGLFFGQKNTHQIRHAKMIIY SLSVFLRALFPQHLNKDSTEFVLALSAGDSPLIHYDRCRKNELVCDSSVPVLQFSSVL RDASEFLPNRLAMPVPQSNHLGCFIHWIEHGDVCKPFLPRSPTNQVGLVFGETVSLTW EDLIPQVVWRGTDHAFLEALLSPRLRRPNFEKDVAREMSDISDNMTVALDGMRKVYDE LVPRWKGVVWTVEAESEAMHKSGSLPWADMKFSSVMQRGVKNDTAYYEQFKNYGISAV GNKMSLEELAKYKYHIDIGGGGGTTWSGTLEKLALPGLLFHHETETRDTTTISWSLSD PIISLFSPDVHFVPVKQDLSDLKEKYDWAEQNPEKAKEISERATALIKSFGTTEGMES LFKEYIEEPLNKVISAYQPVDGSFQEVLEQFSAAADFHISSSCTSDGCNTSMALEKRT ANNATNTANTTTDIVIQSPSPRSVHPSCFGLGNAISNNITVLPASPKYLIFYPKMGMG NNIMGYASAVMYACLSGRVLKIAPQKVRDKSVFKEVFECGEFFASSPGSICDGLEMDG ELVQKYYAANVTILGPEAYGDPPCGGNRLQDLKYFLCNDGMGEDEFVAIKSCQFYGDL FHRNPHFQHRLSATPYRDIVQAKLRPSVKVQEKMIQKDGPFHVCVHVRMDEEKTRNTL GKDWLGDLSKCVSNLMLDHTAAASNEILLFTMHADIRRDVKGTLEAGGANYVQFASET SPEGPQGNSDDLHQGVADMFTMATKCTNILASKADSTYVLLSANLMNETRVFPGNQWK QGCMPGSEVVDFEPTGDFWSRRDLCGIRNITCDTSKEIWKEGSSLLPVSSPSLSSYQE KFSTLQETSIIDASIERQAKGLHKNSLGSKKNEGARMVMS" gene complement(<22643..>26602) /locus_tag="THAPSDRAFT_11319" mRNA complement(<22643..>26602) /locus_tag="THAPSDRAFT_11319" /product="predicted protein" CDS complement(22643..26602) /locus_tag="THAPSDRAFT_11319" /codon_start=1 /product="predicted protein" /protein_id="EED86519.1" /translation="MSAYARRSLVSFSQSKRRTSTNNDSHQHHRDGNSTSNAPANHRD ITTTTANNNNSNNSNRDGGGIQRFRNGPPQTLLSQRSRHHNDHQQQQQSSQHDNDNDL PIHGSIIAHEEEEELAFSQITMSQGMDTTNGGECGLFLSLMSTSGGTGGVTTAPRNVS NEGCNSNRMTSGKIMSGGVGGKYSRSVAWNTNGRAMGGTIMSSTAASSGGGGMRQHQP APKQRQHQQVQEQSSSRFGGAQRGDTNRQGSSSQIGGRGGERQKQQQQTSNNDNMSQR SSRPSSHRREDAQVALCASGKGMMHTSATAANTTSASSSSVHNSINNNFNNNHDDDNF TVLSSQPLLTPLPISHTKQQQFNLSQSSKSSKHGRGDNDSFTPRRDKRARMSNCSDNG GGGGGGQQDNRQQYSTNCQVVPSRKQPPHSSTITTSITYPSLSSSRRNTFTKTAVSTL GSLSSQLRTPIIILRAASTSTTAVATGTIGMGTGSTTASSLARTGVVASVVRSVTNVM TPRRFRSGAALGRGFVGSISGKRTAGTALTTVNLPHSLLGNGLGAQMEVEAQPSPLSR PSSSRIWGIGSALKSSSRVAVAATTTTTPAVADKSTMQAEENEDGGSIVECSMRPHNA IMMSTKPHVESQVSGEEEGSLSHFSTESSNQSSLKLSQHLSQSLKRGDIHQDEEDDNK SKMSASTYDPTMLSSFLKRGDFSFQQAEDLLSSIQSAQKEMETVRKEIGEQRKLLQQK GEEIANQNADLDKKQCTILEKLEELDRKRECTLSDTQKEHKQARKCKSVMEGITSKFE SDSEELLLQICQLVNSSKDDIQSEKRQALELIETHAQELRTQAVKLANDSIDDVESAR DCAIETVESHSCSARLAVNGLIDSFKGIVANAKSDFQSWVGGTNGGFQSNADGVVDAT SIKESGCFHRSPELSTVGGRNSKHMHIMRLAVATSRWDEKDASSMSEQSGVGVDESVL HRDNVAATATLFSRRALSLQNNYGNNRSSSVQSSEPHHVIDLKAGLRRDMSISTTPFS QFSKKSTPLIRNSYGRINSRPSVKSSTAVGRTGKVAVSVLTKSSGTVSSRANKDSMHS LLIESKCKSSGDPPKFVSLTEREEKSPARPIKDKTKENISASLRSRSEKVEMATTKSV SEPKPRGQSKRDRSGFNAVKSPRRSKRLKEITRVERQSALNVLPLKTFTDKQTHPKQV TSKENEATPLAASSYHSSIVDTRRSSITEASQALSDGYDSLLGTSVVVDEGTKDVVED VGLDGSENDHQHSGRPLRVPWGNKAVKNRRNKSYYNGRKCDRKTFSETLSSIFDFNF" gene <28026..>29393 /gene="DPHA1" /locus_tag="THAPSDRAFT_11320" mRNA <28026..>29393 /gene="DPHA1" /locus_tag="THAPSDRAFT_11320" /product="2-dehydro-3-deoxyphosphoheptonate aldolase" CDS 28026..29393 /gene="DPHA1" /locus_tag="THAPSDRAFT_11320" /EC_number="2.5.1.54" /note="Reaction catalyzed:2-dehydro-3-deoxy-D-arabino-heptonate 7-phosphate + phosphate = phosphoenolpyruvate + D-erythrose 4-phosphate + H2O. Note, cadmium cofactor.; putative; GO_component: GO:9507 - chloroplast; GO_function: GO:16829; 3-deoxy-7-phosphoheptulonate synthase activity - lyase activity [PMID 3849]; GO_process: GO:16089; aromatic amino acid family biosynthesis - aromatic amino acid family biosynthesis, shikimate pathway [PMID 9073]" /codon_start=1 /product="2-dehydro-3-deoxyphosphoheptonate aldolase" /protein_id="EED86472.1" /db_xref="InterPro:IPR002480" /translation="MEQSNWTPHSWRDIVAAKQLPVYENQDELNQAIQTLNRVSPLVF AGEVRSLHEQLARVSQGRGFVLMGGDCAESFEEFHVNHVRDTFRVLLQMALIMTFGGS MPVVKIGRIAGQFAKPRSSPNEVVDGVTLPSYRGDIVNRKEFTPDARRNDPHRMVDAY YQSAQTLNILRAFSTGGYADITRLHAWNLDFVKNTEEGSRYRLLASKVEESLRFMKAI GVDTSRPEFTSVDFYTAHECLLLPYEQALTRQDSITGKWYDCSAHMLWVGERTRNLDG AHLEFTEGICNPLGVKISDNCTPEDLIQTIERMNPHNTPGRLSLIVRMGSDKLKQKLP SLIQAVQDSRKEVVWISDPVHGNTRKTASGYKTRDFDSIVAELRAFFDVHDEMGTHPG GLHLEMTGEDVTECTGGVSAVTEDTLKKHYNTACDPRLNGSQALELAFLIAERMTLRS GLPPI" gene complement(30432..>32785) /locus_tag="THAPSDRAFT_25526" mRNA complement(30432..>32785) /locus_tag="THAPSDRAFT_25526" /product="predicted protein" CDS complement(30488..32785) /locus_tag="THAPSDRAFT_25526" /codon_start=1 /product="predicted protein" /protein_id="EED86520.1" /translation="MAILFDQSNARNKRVREVDFGGKKRFKCAALVAKDTISPKETTE PKIVIDKASSSSETSSPVEEAPILPLTPNETPLSQLHTTNANNDVLRERRRKIMLQER RMEKVLSQKSAEKSAIEDGASTVALGGTLEEGASVVAAKVEVSKTESVALGAMHQQSK VQRQATVNITLQPSHDDSTPSQSFTMTSRVLLFALALFEVQQHCTTTSNLLAAILQYV SSCFGYVLSWFDTALMQSDNGLFFRVVLEFRQASVAWFRGVVFPWRNALHVAFVLAFM IRGFTNALASVRALQRPTPSFDDKKRHQLKCVVLFVTIASCWMMVLPLFHTCNHYKNG KSTQFCFFVEASKNVISAPRYSTIDIIVIKLYKMMHLVVKRTIKGHVIKQLYRAFINP FRFHGRLKKLFTIIRWAKFLAPLIGTCNKLRGHILDMSQKKRQHSTSKTARRMWKEVL DALSTQTKSERAALLIQKSFRERRENKAKRRYELLVTNRKNVEVTHQIRKKLKEERTL SKSKLAKFEVLDNQRELRRQVSQDERMNITKYNEAKRKEKKRLLLSPKTAFAVLWKCV AISCVVLEISQILFAPVLSGELGKMPLDKFISAILFASPCESKKPRKNATPSIMFPAI DDIDWYLCTNSSLKQNWLVAVHILASILVLFTNTVFFLDVLVTFLTGELTSSGKLVPK PFFARYILPGIGLQLIVNPTMVELSGLFKRAVNYANAVGPSLCSQLLVTCFPMVSHCY DCLLDIVFDFVERQNKILLRRPIFD" gene 33633..>34545 /locus_tag="THAPSDRAFT_25527" mRNA join(33633..34200,34242..34431,34495..>34545) /locus_tag="THAPSDRAFT_25527" /product="predicted protein" CDS join(34007..34200,34242..34431,34495..34545) /locus_tag="THAPSDRAFT_25527" /codon_start=1 /product="predicted protein" /protein_id="EED86473.1" /translation="MEQKEEGTKKDRFCKSLSHHHQQRRVQTVQFHERTVREDIVPSE EGTVPCCWGWVAAALVQEITDISSNGVPRMPPRHGSGASLSSFVNEAGGGDFSALDAT IGGGTLELDGGDEVLEVEGEWVGVKKCNNAAVPKMRVVWLDD" gene <34948..35914 /locus_tag="THAPSDRAFT_25528" mRNA <34948..35914 /locus_tag="THAPSDRAFT_25528" /product="predicted protein" CDS 34948..35850 /locus_tag="THAPSDRAFT_25528" /codon_start=1 /product="predicted protein" /protein_id="EED86474.1" /translation="MRSLGAAAAFAVEGEGGDGKGRVRGLSVGEEFGALAAMAVEPMI VPLSDVISVDQEVPSSSSRSRTSSVGSDFFSSQASDTGSSTPLSPPPPSNSLHGKSAL YRIFLHTSSNGYIEFSFDNPNSHDILMAYLAAHLKPNQIPHKTENNAPVGGALQTMVL TPSKTEHRAPSSLSSTPRLLRTNSSSSCTLEKLQTKIINQRLQQESTPLEKVKENVAS WMSSIVDCACCQDTTVAPDPDDKSTIEGRHSMKSSGIYANSIGKHNVSPASAKLKSRG IGGLSFEESSMGTSGSGSPKLTVE" gene complement(<36970..>38966) /locus_tag="THAPSDRAFT_38351" mRNA complement(join(<36970..37181,37265..37627,37743..38045, 38126..38401,38555..>38966)) /locus_tag="THAPSDRAFT_38351" /product="arylsulfatase" CDS complement(join(<36970..37181,37265..37627,37743..38045, 38126..38401,38555..38966)) /locus_tag="THAPSDRAFT_38351" /EC_number="3.1.6.1" /note="based on sequence similarity to arylsulfatase from bacteria and fungi; GO_function: GO:16787; arylsulfatase activity - hydrolase activity [Evidence 8484] [PMID 4065]; GO_process: GO:8152 - metabolism" /codon_start=1 /product="arylsulfatase" /protein_id="EED86521.1" /db_xref="InterPro:IPR000917" /translation="MKHKPNTANLPSKPNIIIILADDLGFSDVGCFGSEISTPNIDSL AYGTTSNNSVGSNNTPTRGGMRFTQMYNCARCCPSRASLLTGMYPHQAGIGHMVYDAG VGEEYQGYLRKEVPTIAEMLRGSGYKTFMSGKWHVDAMGDETHPIPTQRGFDKFYGTL GGGGSYFQPPSLVRDEEVIREVMPEGFYYTDAINDEACRMIESTSKEGDDPFFLYVAH CAPHWPLHAPKDDIDQYRGKYMQGWDKLRRDRLAKLIEQRLIPSIWPCSPRDEHAPLW NKEHVPNQEWEDARMACYAAQISIMDRGIGRIIDTLHRTGLYDNTAIFFLSDNGEEGN WPEFYGGLTRNREKITVGNRPELQPGGEETFMSYDLPWANASNAPFRLFKSWVHEGGI ATPFVVHWPALDKQRNAKDSQICHTPWVLMDLVATCCEIGGADVSADKLEGESFLSIL QGHSVERSKPIFWEHQGNCAVRDGKYKLVYRRCEVVQPQLGWELYDMELDRTELNNIA EQNHMKVEQMKQIW" gene complement(<39602..>40774) /locus_tag="THAPSDRAFT_11325" mRNA complement(<39602..>40774) /locus_tag="THAPSDRAFT_11325" /product="amino acid/polyamine transporter family II protein-like protein" CDS complement(39602..40774) /locus_tag="THAPSDRAFT_11325" /note="Similarity to amino acid/polyamine transporter family II protein. No EST data; GO_component: GO:16020 - membrane; GO_function: GO:5279 - amino acid-polyamine transporter activity; GO_process: GO:6865 - amino acid transport" /codon_start=1 /product="amino acid/polyamine transporter family II protein-like protein" /protein_id="EED86522.1" /db_xref="InterPro:IPR002422" /translation="MTACAALATSTMQSINRSRGVFEEDGNTTGDCSVSSVPTSYVQL ANLSLGNTASTLVFTLTLAASLGVCSTYIAFIGQTLASLSVDAESNNIVYSILPNVEE TTWELWTAGAVLPLSLVRNYGVFAFTSALGVTAVLGGILVTLTYGVTIDPGGGIVDAL SAVSHLKMWPESLADAFGGSFGTIAYLFCVNFLTFPIMNSMANPREDYNESVSYAVSA VYLVNIIFAIVCLGFYEDATQDLVLQNLDNGPYLSSLKILLCVDLLFTFPVVFSSGRQ ILENALLKETTTEENNATTSLILSRTAITAGAVSACFGLSQLGGFGVVANLVGGVAQG TLAFIVPPAIDVALSRRRNSGEFDVKEIPQWLVGAFGVAVVSSVTYFTLNESLNWI" gene <44766..>45694 /locus_tag="THAPSDRAFT_15285" mRNA join(<44766..45110,45197..45271,45368..>45694) /locus_tag="THAPSDRAFT_15285" /product="predicted protein" CDS join(<44766..45110,45197..45271,45368..>45694) /locus_tag="THAPSDRAFT_15285" /note="GO_function: GO:4672; ATP binding - protein kinase activity [PMID 5524]; GO_process: GO:6468 - protein amino acid phosphorylation" /codon_start=1 /product="predicted protein" /protein_id="EED86475.1" /db_xref="InterPro:IPR000719" /translation="QIVKERTRSNWPEYAVKIVSTQKIEELGYEQSINREIAILRTLS HPGISRLISSFRFRDGAYLVLEYASGGDLHTLLKKNGSLDHESTRFVVGSVAAALGSI HERGFVYADCKPENILITETGHIKVTDFGACRPVTEEACVDTNNNETTEEDLRIEGTT AYLPPEVVVGGYPTAAADVWALGCVLFQCISGRPPILEDNDDLTALRIVTFHLNSDPQ DFFGECDPSTFCDDTKSLIQRMLNREVNERP" gene <47557..>48135 /locus_tag="THAPSDRAFT_11327" mRNA <47557..>48135 /locus_tag="THAPSDRAFT_11327" /product="predicted protein" CDS 47557..48135 /locus_tag="THAPSDRAFT_11327" /codon_start=1 /product="predicted protein" /protein_id="EED86476.1" /translation="MRRRAEEAAEPCKEDLSGARSAYQDCIALTVSRSEQPWPLQPAT LKICICIDSLTRGGRRCDHQPPPNVQKDLSVPPISRVPPLLGLPSRLTPLSGATTFKV LFVQCVSKSIPASRQTTSQSFANKTGYPSKRESFLMDTFQLQPVYLSARTLLQPKPTT RRRPTSANALSVGVLNLHWYHQWGDHETAHVI" gene <49342..>50785 /locus_tag="THAPSDRAFT_14374" mRNA join(<49342..49695,49779..49937,50019..50616, 50694..>50785) /locus_tag="THAPSDRAFT_14374" /product="predicted protein" CDS join(<49342..49695,49779..49937,50019..50616, 50694..>50785) /locus_tag="THAPSDRAFT_14374" /codon_start=1 /product="predicted protein" /protein_id="EED86477.1" /translation="KSKNIWIVTTAALPWMTGTAVNPLLRAAYLSTGRKAEGGSVTLM LPWVEREADQERIYGKTKMFERPEIQEEFIRGWLRDAANMKEASEDLEIRWYTAWQEV AENSLYSMGDIIGLIPEEACDICVLEEPEHLNWYRAPGENWTAKYKHVVGIVHTNYFV YATEQPAAFIRAPGMRLLCSWMCRAHCHRLIKLSGTLGNFAPEKELVENVHGVRRTFL DVGDELRSKLTAPDAASDPIFSADADPTVYFIGKMLWSKGLASLMDLMKYAEESAGLK VKVDMYGGGPNKDEASAKATKMGLDMPFHGAIDHAELGWSHKIFINPSTSEVLCTTVA EALAMGKFVVLPSHPSNDFFAQFPNCLPYSNKEEFVGNLYYALTHAPEPLSDEYSYAL SWEAATERF" gene <52336..55850 /locus_tag="THAPSDRAFT_269997" mRNA join(<52336..52609,52717..53738,53781..53815,53902..54153, 54342..54658,54770..54834,54910..54946,55025..55113, 55189..55850) /locus_tag="THAPSDRAFT_269997" /product="hypothetical protein" CDS join(52336..52609,52717..53738,53781..53815,53902..54153, 54342..54658,54770..54834,54910..54946,55025..55113, 55189..55806) /locus_tag="THAPSDRAFT_269997" /note="No reasonable similarity to any known sequences; hypothetical gene" /codon_start=1 /product="hypothetical protein" /protein_id="EED86478.1" /translation="MARKKDDRGTQLPHVLKLEVAALVREKGSIPLSRIKSEYALRYR KPINPKQEISDRWIEGIPGIFVKNSFAVRDDTWRGAQLNSNSVLADRFIPFKDKQLNM VEPAARWICPGACAGSGRDEPYKLDGVIASLAKDFIPSIPCVCPRGVNDAPPHGEYFF KPTLPRTVFPNLQFGLQSNSVQKYGETSKSAPPGVVDLDETTSITEYCSTFQHALRLE YEEHLRLYEHYSLYKMKVHPLEDAPPPGAPSISRHLAMTSRARIFVDGISDARPSLQI GDIVLLRPIQPVINFYRGNYGEMRSTQMIVEIESRILGVVRGRGDQKDQVLFSWGLDP QQTSLLRDPSWHRQYNIRFVPSAVTVERCQSGLDWVQLVSKITPGVLDDILFPVTAPK VKPLGIENQSIHLNAQHATNVENSDVSKPLNELQSSFVRMPPMVLSGPAGTGKTKTLM AAIADVLGLLQPAQSIELNTNRVLVCAPSHAACDVITRRLSVFLKRTEIFRMYDSARP SNTVPGDIVPFTCQLPKSDRQCFRSFLLLKSNNLGTAASGPFEYMDNPFFSHLFIDEA AQATEPEILCPLSCVVDPYPGGKLVEVGFIGDPRQLSPQIFSDESAKYGLGRSFMERL LRRPVECMGGQDESLTIFLTENYRGHPAFLMMPSSFFYYDRLGLMPKEDTRSLNDFFR VVKQTTWPIHFLGVAGNDKSAALESFTGTDSWQNLPEAEMTVEVITNLVESGVEPKQV GVMSPFRGQVTLIRQLLRQRGFHDTNVGTIDNYQGVEQDVIIFSLTRSNKEFVSHDVK RRLGVFGQGREKQTNVAMTRAENLFIVIGEPDIMWSDPLWRQWLMFCFRNGLWYGKGV DDSNIEMMSSSDLAFCSMQEQVAAKHEFGIVTSAVVSTLEKVHRHH" gene <56269..>57336 /locus_tag="THAPSDRAFT_11330" mRNA <56269..>57336 /locus_tag="THAPSDRAFT_11330" /product="predicted protein" CDS 56269..57336 /locus_tag="THAPSDRAFT_11330" /codon_start=1 /product="predicted protein" /protein_id="EED86479.1" /translation="MMMMIPHRHLCSAFQSCSHLVTHRSRRIHTSTTHQVSTLIPSDL DSEDSDNAVNSNIDTFNPIGPPEALLRLSVGETRFIGSHSITRLSQSPDVFIVKNFVS VADRETMMQQATLQGMEVAGTRKSDTNTIRRHSYITWIDPYSILGLDDAVTREAVRVA KEMVAQSASLFAHESLHGKLDVAEVDYIFAEDVQVAKYNYGGMFQYHHDGFSRYLTVL SYLNGVGGTYFPFALSDTQSTHEIDTANEEDAAEIAKRRIVGRDGVLLVGKEGPESYL AISKTATNSIVTIEAGDAVVFYNYKANGDRDWRSLHCSLTVPQEKWIATNWFRSEALT GPFSWQKKASLLEDMMKGGMH" gene complement(<57532..>58610) /locus_tag="THAPSDRAFT_11331" mRNA complement(join(<57532..57821,57970..58306,58542..>58610)) /locus_tag="THAPSDRAFT_11331" /product="predicted protein" CDS complement(join(57532..57821,57970..58306,58542..58610)) /locus_tag="THAPSDRAFT_11331" /note="GO_function: GO:16597 - amino acid binding; GO_process: GO:8152 - metabolism" /codon_start=1 /product="predicted protein" /protein_id="EED86523.1" /db_xref="InterPro:IPR002912" /translation="MFSLAIRTVISPSPRLVAPIGVKLSASTTNATNTSTTRPFSSPS NKKHLVINTVGTDRPGIVADVTRIITNHGGNVGESRAQLLGGHFSLMMLVEIAEGDMK SLHGDLERGVEGMSTTCFDAVDPRLVEVSPKIGFAGHFKLSGADNPGLVHKLTSALAR NSLTIGSMQTFQEEAPFGGTELFTMEGKAVAYQPLASNFDWKKIKEELVEMGESMNCD VEFSDVTGDSIRN" gene <58895..>60175 /locus_tag="THAPSDRAFT_11332" mRNA join(<58895..59366,59461..59614,59689..59741,59833..60097, 60157..>60175) /locus_tag="THAPSDRAFT_11332" /product="predicted protein" CDS join(58895..59366,59461..59614,59689..59741,59833..60097, 60157..60175) /locus_tag="THAPSDRAFT_11332" /note="GO_function: GO:3824 - catalytic activity; GO_process: GO:8152 - metabolism" /codon_start=1 /product="predicted protein" /protein_id="EED86480.1" /db_xref="InterPro:IPR001753" /translation="MMYSSTTRCLAASKWHPSSAGSARRLSVVTSLMYPSASARHHYR FVSLSLYKPQVICPITTFNAIVQRSHSSSSLNEQPPSATYSKLSNTTIQAITTTSKTD DETRVKILPLGDGIVHVLLSRPKKMNSLDMSMFESIAEAAMMLKEDRDVRVVIVSGEG RAFCTGLDAKSVALSGPTKSLNRLLERPSGYGGECGLGNLAQDVGYLWRQLPVPVIAV LHGMCFGGGMQLALGADMRFSTPDCRLSIMEARWGLIPDMSASITLRELVRIDVAKEL TMTGRIISGLEGEKIGLVTRCVDQPMEEAMKVAKEIVESLHNIT" gene complement(61352..62753) /locus_tag="THAPSDRAFT_25531" mRNA complement(join(61352..61675,61767..62753)) /locus_tag="THAPSDRAFT_25531" /product="predicted protein" CDS complement(join(61431..61675,61767..62745)) /locus_tag="THAPSDRAFT_25531" /codon_start=1 /product="predicted protein" /protein_id="EED86524.1" /db_xref="InterPro:IPR001611" /translation="MAKLNDVTTPPDAAPTTMGDDGSEPTVTGAADANGTTPGARAVF PTSSAARNDSTMDDPPGGSVEFVLGGNILEPPPTMVADDGDGVRIPCDVAHVEDAYAV EIEDAAVELMSSSNVMSAEPVKTMRFMGREIDSSNVKWATCLSVIAAVAVVLAVSVPT AMHYHRKGSAEEASVEGMKESEQKREELVQSLGQFGEFTVEERAEELRSRLASISNDE ELNDPSTPQGMAFNLMISDGNSRIDSKMYHPSIPISKAQERYSLLVFYFSTGGDEWTR TDDFLTIGNHCDWSDMIECVGEFESSDGQNNCITGPASCVFQERVVGLYFDDNNLRGY LPSEIKNFQSLKYLSLDRNFIEGTLPQGLETLVRLETINLQDTNITGSVDFLCQGTDV DLKVDLDEVDCGCCS" gene <67345..>68903 /locus_tag="THAPSDRAFT_15093" mRNA join(<67345..67393,67547..67873,67979..68732, 68846..>68903) /locus_tag="THAPSDRAFT_15093" /product="serine carboxypeptidase" CDS join(<67345..67393,67547..67873,67979..68732, 68846..>68903) /locus_tag="THAPSDRAFT_15093" /note="Putative serine carboxypeptidase with similarity to gi|1705669|sp|P52710|CBPY_PICPA Carboxypeptidase Y precursor (Carboxypeptidase YSCY) (model%: 98, hit%: 73, score: 748, %id: 40) [Pichia pastoris]; EST support; GO_function: GO:16787; carboxypeptidase activity - hydrolase activity [Evidence 4185; carboxypeptidase C activity] [PMID 4180]; GO_process: GO:6508 - proteolysis and peptidolysis" /codon_start=1 /product="serine carboxypeptidase" /protein_id="EED86481.1" /db_xref="InterPro:IPR001563" /translation="NNDKNLFFWMFEKRTTKGETPLVIWLTGGPGCSSSLALLTENGP CSVNQDGATTTVNPHSWTESAHVLWLDQPANVGYSYGQDNDTNEEMISEDAYYFLQAF FQSEEGEKYKDAPLFIVGESYGGHYAPAIAHRIWKGNNDLQDGLLKLNLAGLAVGNGL TDPEEQYKHYSEMAFKNSHGIQVIDESTYNAMKSAEPMCTEGIAKCNSGDGMLSSFAC QAAFLYCNTALTTPYRATGLNPYDIRKPCGDNPLCYDFSHVETFMNSDATKKALHVDS HNPTWQTCNMMINMSFHTDWMKDFAPYVADLLNAGIPSLIYAGDVDFICNYLGNKAWT LNLDWDHSAEFKAAEEHDWNSGAGLARTANGLTFLQVYDAGHMVPSDQPEHALTMITQ FLNG" gene <69623..>70500 /locus_tag="THAPSDRAFT_38360" mRNA join(<69623..70319,70412..>70500) /locus_tag="THAPSDRAFT_38360" /product="predicted protein" CDS join(<69623..70319,70412..>70500) /locus_tag="THAPSDRAFT_38360" /note="GO_function: GO:8234 - cysteine-type peptidase activity; GO_process: GO:6508 - proteolysis and peptidolysis" /codon_start=1 /product="predicted protein" /protein_id="EED86482.1" /db_xref="InterPro:IPR000668" /translation="EVPESFTWDNVDGVSYLTKHLNQHIPHYCGSCWAHGAISALSDR IKIARKNQGHDINLSIQWVLNCGAEKAGSCHGGYHTGVYELIKEFGYIPFDTCQPYLA CSAESEEGFCPQVDTTCSMKNTCRTCSGFSDSGGECVELDYFPNATVAEYGEIEVGWL DSYEDVAHKIRAEIYARGPVATTINADPLRDYEGGILDDETAGTNTNHIVSIVGYGKD ETSGKDYWIIRNSWGEYWGEMGFAKIAAGKNMLGMEDNVAWVTP" gene complement(<71003..>72597) /gene="GDH_1" /locus_tag="THAPSDRAFT_38359" mRNA complement(join(<71003..71815,71897..72063,72155..72282, 72386..>72597)) /gene="GDH_1" /locus_tag="THAPSDRAFT_38359" /product="glutmatae dehydrogenase, nadp-gdh" CDS complement(join(71003..71815,71897..72063,72155..72282, 72386..>72597)) /gene="GDH_1" /locus_tag="THAPSDRAFT_38359" /EC_number="1.4.1.4" /note="NADP specific; GO_function: GO:16491 - oxidoreductase activity; GO_process: GO:6520 - amino acid metabolism" /codon_start=1 /product="glutmatae dehydrogenase, nadp-gdh" /protein_id="EED86525.1" /db_xref="InterPro:IPR006096" /db_xref="InterPro:IPR006097" /translation="KYPHQPVFLQAVEEMALSIEPLFSDPVNGEFYKRAFLYMTEPER MISFHVPWMDDGGVLHVNRGWRVEFSSALGPYKGGLRFHPTVDDGILKFLGFEQIFKN ALTGLPLGGGKGGSDFDPKGKSEAEIRRFCESFMTQLCRYIDASTDVPAGDIGVGGRE IGFMYGQYKRLSNKHGEGVLTGKSPLFGGIHLRPEATGYGTVYMAQHAIQDKLNKSLS GARCAVSGSGNVSQYACKMLMELGANVISVSDSNGVLVFEHGMTKDDWNKIIEAKQVK RVRLGSLDGNISGKYVANASPWNLPSDLQTIDFAFPCATQNEIDEHGATLLLQKGCKG VFEGANLPTTAKGQEVLRAKREVLYVPGKAANAGGVGVSGLEMSQNMSKTYWKKEKVD EMLKDMMEGIYNQMKKGAGEDGTLEEGCNRAGFLKVASALKELGWVW" gene <72671..>73988 /locus_tag="THAPSDRAFT_11337" mRNA join(<72671..72906,72987..73378,73489..>73988) /locus_tag="THAPSDRAFT_11337" /product="predicted protein" CDS join(72671..72906,72987..73378,73489..73988) /locus_tag="THAPSDRAFT_11337" /codon_start=1 /product="predicted protein" /protein_id="EED86483.1" /translation="MAFDVVDDICLLNEVVAIVVRSCSRMKVMSRSIRSGRRRDGDGI VLDVDTCIKNVNRVILGDPGVFVRVVNVDSFPKLRLKRIFLSLLHLQPHCSPFLANRP HTTPTMKNLLLALVAVAVSTTTAFLTPTQTAGRTISSSARDVILPTDFVSSTADAFHD GATSLLLSDEALSPAVEAARQKFWFYFFAGSGAGGIGIAQLPAIFRDASAARNAANTG SSLGGEALDAGPLLRVYYNNEISAKDVGNAIAKAPSSEYISSRSQSKNYMASKGYIER RDFIKEMEEKGCNPLANYVLYDAISAGKGDVVSPVVYDDKLAAYREGSLSDGSVAGSF VGDLNGFLAVKVGAFLGLVFCLLVDFGFIANAGIEGFLSQP" gene complement(<74199..>74965) /locus_tag="THAPSDRAFT_11338" mRNA complement(join(<74199..74256,74319..>74965)) /locus_tag="THAPSDRAFT_11338" /product="predicted protein" CDS complement(join(74199..74256,74319..74965)) /locus_tag="THAPSDRAFT_11338" /codon_start=1 /product="predicted protein" /protein_id="EED86526.1" /translation="MNMGATFNEAKHLNLKGSSSAAFDKPSDEAASTTMTMSVPPPSN YQLVPSTCDSPSRGHFDEIYLKGVWGKSTRSASDFYSDAAWPTKAMRASSASGPGSNL GYATETSLKIIKDAIAKYHVQSMIDVPCGDVNWVFDSLETDTLPIYLGLDVTSAVIDV NNVRFHHHNNKFFSFWDATECVLPKFKNGTAVELSSFELVHVRDVIQHLTLVQGVKTK EHEHRRGKLVQKQFTT" gene 77459..81410 /locus_tag="THAPSDRAFT_270000" mRNA join(77459..77579,77793..77856,77954..78269,78516..78846, 78907..79454,79469..80543,80907..81410) /locus_tag="THAPSDRAFT_270000" /product="signal peptidase" CDS join(77492..77579,77793..77856,77954..78269,78516..78846, 78907..79454,79469..80543,80907..81391) /locus_tag="THAPSDRAFT_270000" /note="Putative signal peptidase; similar to gi|15240582|ref|NP_199804.1| putative protein; protein id: At5g49930.1 [Arabidopsis thaliana] (model%: 100, hit%: 98, score: 1267, %id: 32) [Arabidopsis thaliana]; GO_function: GO:8233 - peptidase activity; GO_process: GO:6508 - proteolysis and peptidolysis" /codon_start=1 /product="signal peptidase" /protein_id="EED86484.1" /db_xref="InterPro:IPR008532" /translation="MTTHLKRSMLGFKLANVYDGSALGIMPAADAEQAKRAMLLIESG VRFHPTTHYSQSSSSSSSMPSAFAMKLRKHLRNLRLENVTQLGNLDRVVDFRFGSGSL THHLLLELYSLGNLILCDGQYRILGLLRTHEYEDGGGDEGKGEEVKVRVGNIYPGKKK VNGKGGGKKKTIDESVALKALLLKPNSGVYHYGPSLIEHCITTAGVDPMVKLTHDNIE YTLPEASWNDLVSSLCGEGAKVIENLSSGESGGYILYKPKQTDDKNDYNKTLLEFQPH LLHQHKNQHALSYTTFATATDEFFSHLSSQRIAQRADAAEAAARERLSKIQLDQQRRV DGLVAEQEKSRDCARLVEMHAEDVDRVLGVINSALESGMNWDALEQLVLVEQGNENPI ALLIFKLELCKDQVVLALPDIDDWDDSDPDRPPKLHYVTVSIKESAHGNARNMFATIK QSKTLEASTTALKAAEAKAKQQLAEAQKKKQRIQVMPNRKTYWFEKFAWFITSDNYLV VAGQDAQQNEQLVKKYLRPGDAYLHAEVHGAATCILRAKRRRRSDGKTQVIPLSDQAL REAGTFTTCRSSAWSSKMVCSAYWVESHQVSKTAPTGEYLTVGSFMIRGRKNFLPPSS LEMGMGVLFRLGDDASVARHANERRDFALMEHEEIFARQDALREKNKVSVEVEDESEP IPLDSYEKEHDDVCPTGHTNAIDGNAGDEAIEDTENNVEVTPDAEESTEQPNSDNESS DGKQSDGDEVPTADTKKKQKELSRGKRSKNRRAKKKYSEQDAEDRELAMMALQGGESS KKKRNGKGSRDGKQRKTKAEKQAEKEAWQEILAEDGIIDDDGDDDGGAVDDTAELSKL TGKPSPDDVLLCAIPVCAPYQVLNQYKYRVKLTPGSVKRGKASKQCVELFLHNDDKKM IVDEGTKRDCALIKLINENEWVKTIIGDVRITSAGASKLTKKQKGGGGKGKSNK" gene <82552..84694 /locus_tag="THAPSDRAFT_25535" mRNA <82552..84694 /locus_tag="THAPSDRAFT_25535" /product="predicted protein" CDS 82552..84666 /locus_tag="THAPSDRAFT_25535" /codon_start=1 /product="predicted protein" /protein_id="EED86485.1" /translation="MTTLSSALLTGLCIICSGISSVEANNITPHRILATTSYDLANAN SAERFFSDSAVEATLFLGDGARIEWTATTPVAVACGSSVLLTDESSQGGEFKYVIHAP TFAYRFAHLHALLALFFSIADSPYFWQHIYSESMDPSIPSVDHIVSISLDTTSLQSIV DISTQASVYDSLDESSNDVALMSFRLFAFMPYAKGVMVSHMYALELPLDIVQSLIDGT GDSTTTSLTIDALDFSGGQSTYNGDFASYSAPNFLFGLWGEDSNAFSKGPCYESVKLS QGNSDVIVIGGSNSDEGACVCEVEDSGFRQRELQMSGSVFCNEYSSPLVATTLSEGVV GFQMSVANEERGVASSFHYGLTNPVVSDDGSVLYSADAFDKTNGKEGNVTMYHGLVRA DEVSANPSAVVALTSFPLNTGDSSSDESLSVGGVWIDSQMKVGYSRETQISVSEMARY VSDTVEVDVTSTGDVVIRESSLCPAGSIRNLALLSEGADILEVSSFYSSVYGATKAID GDSSTEWSSAYDGNNAFISIQLPYPSNVVYVEFHTRTMTSSAQIYEYLVEAGNEAPNN YVVASSCFVPDATKLYECDLDLMTDGDLNSMGVRDVTVVTFRVVDSSGGNTGAIDIGV YGCSLADEELALDMTTESAGSTSSASGSVSGNDTIVDSSVGGTNSTQIGKNAPKSAAK AVLKITDTFLLAVIIAVCNYVL" gene complement(<85498..>86679) /locus_tag="THAPSDRAFT_11341" mRNA complement(<85498..>86679) /locus_tag="THAPSDRAFT_11341" /product="predicted protein" CDS complement(85498..86679) /locus_tag="THAPSDRAFT_11341" /note="GO_function: GO:8483 - transaminase activity; GO_process: GO:9058 - biosynthesis" /codon_start=1 /product="predicted protein" /protein_id="EED86527.1" /db_xref="InterPro:IPR004839" /translation="MNEPWSKRHKREFKGCQYSLSNSFAQPLTQPELVQFTKDGGHTE LLDLYHNHDLEYVPNGGSIDLRRDIARVVYHDKLSAENILVFPGGQVAIQTTSLLFAK GCHSIVFTPGYQSTVESPGWSLGNEGVTKIERRAENDWQIDPQKVREAIRENTKYLIL NEPYNPGGIVMSLEQQSEIIEICRQHDIVILCDEVYRLLEHDPSQTRIPPMANAYERG VSCVTKSKPWGGCGITIGWLACSDVSMIQRLVDVQYFGTACVSRASEIQGRMVLASSD AILEDRRTIILRNKALLQAFIEVRYKEWFAWRRPNAGAIAFVEFRGPWTSKKLGVHLN EADISIKPAYCFTDTVTSEVDQYFRVGFGEKKFPLALEALAKFVDRHEAAWRVVEEDV K" gene <86917..>88470 /locus_tag="THAPSDRAFT_11342" mRNA <86917..>88470 /locus_tag="THAPSDRAFT_11342" /product="predicted protein" CDS 86917..88470 /locus_tag="THAPSDRAFT_11342" /note="GO_function: GO:3677; DNA-directed RNA polymerase activity - DNA binding [PMID 3899]; GO_process: GO:6350; tRNA metabolism - transcription [PMID 6399]" /codon_start=1 /product="predicted protein" /protein_id="EED86486.1" /db_xref="InterPro:IPR007811" /translation="MPPKAPSQAGRFKPLKRPAKKPEASTAAEGGAASSANTSSSDRA GGGGRGQRPSRGDANDRGSGRGGRGDRSKSPGGRGRGARGGGRAGSGRGGRGGRFIAP TGAAFFTGACAKRSDTAGGFASVAAAGGGGDVNGDDPRTEVKIAAGQDGAIFIPGSSS ASGAVGSRGSSGINKVSNSAAESMAAAARAKLGEGEEIVVAEMELEDGETADDGKKKS VLDGPSRSERFDGMPSLFDDENDVPMISDHATLADAFVYDSDSSEEERRVKRKQQQRG GGGGKNSNSGVAPTQLPFPVGVNQQPMYNCQEGMFDDEKKIEAEPIASSSTTQASSKL NDPPIQSPFLDVTYASEEMKQAENNSWFLMKFPTRLPHLDNNSMVSSTKSGKKAVKAE VDEDGLELVGSANVDTSMDIPTAATSVSSSATVGGALGYDDTLKDCAPGKYGKIVVYK SGRTELVVGGGDSGRPEVRMLIHEGLQCGFRQEAVSIDPEEATFVPMGDVGKALVVVP DVERAFAHS" gene complement(<88743..>89462) /locus_tag="THAPSDRAFT_11343" mRNA complement(<88743..>89462) /locus_tag="THAPSDRAFT_11343" /product="predicted protein" CDS complement(88743..89462) /locus_tag="THAPSDRAFT_11343" /codon_start=1 /product="predicted protein" /protein_id="EED86528.1" /translation="MVHLLTEGLIVGSINIILMLAFEFGLSYTDVLRLRKRDANLYNA GMRATVFNSTVLGAITYFATITYCCVPGPLTVWQQVSSIFIFILVENVWYYVAHYLMH RPQLFWIHRFHHKFNTIVIPSSASAVSIPEFLLAYMAPIAIGSWVGGCDKASALTSAG IVAACNLIIHTPTLEEKMSCLPWIFVMPSDHVDHHQKLNCNYGAPVLHLDRIIAYLQD VSFDSVSLVYEMISGANKKLN" gene <89682..>90544 /locus_tag="THAPSDRAFT_38370" mRNA join(<89682..89739,89883..90061,90197..>90544) /locus_tag="THAPSDRAFT_38370" /product="predicted protein" CDS join(89682..89739,89883..90061,90197..>90544) /locus_tag="THAPSDRAFT_38370" /note="GO_function: GO:8080 - N-acetyltransferase activity" /codon_start=1 /product="predicted protein" /protein_id="EED86487.1" /db_xref="InterPro:IPR000182" /translation="MKKNYEICLSGSNILLVPYRTEHLTNYHKWMQDPSLLDATASEP LTMEEEVDMQISWRDDERKCTFIILARDLLDCSSGEPDTEGEEQPYNTGNKQSTTCTE QLSQAELDIMIAESSHRHKGLGVELALTMMHYGAFHLHIRRFFVKIKNTNNASLKLFR EKLGFVQCAYAECFGEYELECKCERWEEMVKLIED" gene complement(90574..>92857) /locus_tag="THAPSDRAFT_25537" mRNA complement(90574..>92857) /locus_tag="THAPSDRAFT_25537" /product="predicted protein" CDS complement(90926..92857) /locus_tag="THAPSDRAFT_25537" /note="GO_component: GO:16021 - integral to membrane; GO_function: GO:16758 - transferase activity, transferring hexosyl groups" /codon_start=1 /product="predicted protein" /protein_id="EED86529.1" /db_xref="InterPro:IPR008630" /translation="MGVALLCCILSLMLNSYYLLLTTSSHRPTVHDAPYAIIVSTDDG NGNSNSLQSTHNPNERPSLAVVSVCISGPRFTPEYINASLSNKRLFCNRWGAMCVLPS ERLDNGDMENYHAKWEKLVYINQTLHMENVDWVLWMDCDAAFTNLEVDWRTHVPLNKS KLMVVSEDKNGINLGVFLVPNTLQSRGFVQQLYEKRHYVEQMQFKWKDQSALIELIKD DPNIKTKIEIVPQRKINSFLKDERNKDGKKWQAYDWIMHQVLCREEAMCTSSFIWTLD SVAGREPDYSAFVGSEKRGLQAKETEAHSSLLIPTQSMCNDTHGDIRSAGVRWYDQAT NVNIEMYPIGKKTVGVQIQALSSKSATLLRKHIGRKALRTQQLDHLMNTSELLAESFP KMNKIYHVEKAPNYDHYLGCDNEVQSPRGTALLVTQNFGCNNLWHALANHHGVWTLFK VLGISPKDVTRVLPTEYGTPFASPPRTVADVLWPLYIGTNVINEPQQTNCFERLVFIE TNLNRPGPYWEQYQVKEQCSSDTPYMQRHEHFHTEAREIAIQVISNTSDAGSIMASKP EPQVICYMSRRLRDERVRYFSEEFAPIIEDLLDIWAAKHNVDFKRLVFDDKVPFSEQI EQTASCSILFGTHGLDLDI" gene <93302..>97981 /locus_tag="THAPSDRAFT_11346" mRNA <93302..>97981 /locus_tag="THAPSDRAFT_11346" /product="predicted protein" CDS 93302..97981 /locus_tag="THAPSDRAFT_11346" /codon_start=1 /product="predicted protein" /protein_id="EED86488.1" /translation="MESDSQQQGSALPDSSSIDDGHGSRLNLRGPLHRDQPSNTRAAS SSHRSVEHPIPSTQQHVSTPTRSMNHGQQPNPRNNLDPTGMFRNRGRHVVARQYGGQK MSLKNISQHGHGGVNGGSGGQRVSSPQLLSHHTTDVVANQMDDAQLEQTVHQPAEACI GEIIATRSDEQIDAQTDPAPTKFVSSPECAGPPDDVSVKDGDPLNEESAEPDGNVQIE TSLSQDSNDSAPSAIGSTISQMGSVADDDDDDDNHETKSGPYSSAYNKMHKVNEARWL EVKRRMMQDVQEVDDDDFDEGAIEAETENVVGGDTDNNTGSISNAKLPHALSNDSLQS MPDYQQHGKNDAVNISMECNIDSDTSLLDDSRPLFCDDHSEVMEEGDATTTRSNVVKV ADFSSMNGNYDNNYVHDEDDNNNDLSSPATYQPSAASADEEQESPLRAAFRRKAQDVS NSLSPFSSSFLSAPTAESLKRITPRFQVVGDAPTAAVPNANTSNTPSSTNERAITTNS LLPLPLQLARKCFSFDDTTLDDDNFVAMPYQPQLDTVGERGGQHPQQSFLPPPQPQST KYYGLSAVASFPSDYDNSLTARNNATKMSPATLASKSGILRPKWNTEPESPFRIIRHS QTWNHTQQRSNLVGRETARQQTSSFDPWSSYKGQVYPLSNTNGRVEPFPLYGEDKRGE DDENKEQTVSDILTLCWIGMSLSNLTYLLMLLQNIQTERDDAIDLLACIVEQSLNFVR KEPIGEDSEPKSADSIDSILCETCQAMRGSNNPSQIQSCLNCESCMDKMQQLVSSIKD ISASHVDQLGDAHSSPSNHHVRSIAIDTLMQSYKYSVEMKRASLSAHKWLDSLGRAKH VALDATNDKSDQSLDGVAHKARLHAAEFTISKQQKEIHRLNEEVVHYRAEVGRLRNTS LSKQIRGVSSTANRSILSDSSSEDSLDAIIQKGRSLVLGSPRAVIKLSESRDESFALF ERKMESDMNLEARKEVLLLKAALEKANRKIASLERLNTVSEKKEPFPSEIEPITTSVD DLETELFLKIADEIEESMSALLSEETEGARESEGTTNSTIQLGDPALEEELEKYRAAL IETLQIDASHNKIHSTNSMSMDDANLAQDVSKSHSSELTSEQRMVNVHMIDGENFVTE WENVSALPPPPDHGLRSPIVDAILKRWTDDEGTRVALTNWVEGILNGADPESILPLKI AGLDHQIKDGFLMHVLPLLLRRKDVHVQVTSRATRTTQYDIAVAVSPSADSNHDNDKH DASTQQHQTVGESKHHLMAFRATRSGSSMKESVYTSQVIEGYPTASSYLSSIGRKVPS LVRSGSNAGSISTAVTSPISNRTPTKVRPFFNAQLHGVNRHTPHHLNSSNASFPRNDN AFPIMSVASPALGDDLSVGSSVNDDDGSKQSFQSSRQGSIMNSLGDAFGGFLSRTKPP KPSSPANDSPPVFLTPHFVSTPKASQHDEEHPYHRVVSAPAGKIGMTFVEYRGHCMVS HVAEDSPLSGWVFPSDILIAINDVQVSGLKTRDIVKLLTAKKGQRRDLRMISSHDMNE LIKPGGI" gene complement(<98162..>98617) /locus_tag="THAPSDRAFT_11347" mRNA complement(<98162..>98617) /locus_tag="THAPSDRAFT_11347" /product="predicted protein" CDS complement(98162..98617) /locus_tag="THAPSDRAFT_11347" /codon_start=1 /product="predicted protein" /protein_id="EED86530.1" /translation="MCLMAILLLRNYVKYHIRSTIPIRLNDGFEVWAQNYRRFDKSIG VQCRASCSSSSKRHSAVLETLLLKLFGMNSLLVLDHRKDDSWSTKCYFVIVGEASASR PLGKSFSSHSHQASTDIQEAINWRSENAVALYDDEFSSGDEGIHMIQSK" gene complement(<100454..>102578) /locus_tag="THAPSDRAFT_264646" mRNA complement(join(<100454..100841,100918..101166, 101255..101865,101942..102011,102045..102048, 102094..>102578)) /locus_tag="THAPSDRAFT_264646" /product="hypothetical protein" CDS complement(join(<100454..100841,100918..101166, 101255..101865,101942..102011,102045..102048, 102094..>102578)) /locus_tag="THAPSDRAFT_264646" /note="Function unknown, but may hydrolyze 1,4-linked beta-glucosyl residues.; hypothetical protein containing glycosyl hydrolase family 9 domain, with weak similarity to endo-1,4-beta-glucanase; GO_process: GO:5975 - carbohydrate metabolism" /codon_start=1 /product="hypothetical protein" /protein_id="EED86531.1" /db_xref="InterPro:IPR001701" /db_xref="InterPro:IPR004197" /translation="RVNQVGYLKFATKIGVVVDSSTSPIDWQIQDASGSVLLSGYTSV YGEDPASGDHLHHADFSSLKDIGSYKLVVDGIGSSLSFRVAPSLYPSLPHESMNYFYF HRMGEEILGKHLVDDRYARAALHPGDTSVPPYTGWCESCENFDLFGSWADAGDFGIYT VNHAISAWTLLNLHELFPEAFADGDLNIPESGNTFPDILDEVDFGSRFVRGMLPSNGG LASHKAHNHVWSDFTTTVNGENSQQSSRSAMGSSTPATYAVARVNAQLARIWDSKGGA SEYVAELWNAAVDAWDRADGTSKTYNANEASPGPAIGGGDYPDGEVNDDRYAAAVEMY LSAFAFGDQSTQAYKDAMMASSYFKAVGQWDWASVNAAGTLSLYAVDNDLSSSVRLLD IEANIVSFADEIRTALDGEGYPTNLQFTGQYPWGSNSFVVNRAIALAYAYEITGNTSY QDYILRTMDYIMGVNAMDLSYVTGYGEKAETDTHDRWAWTVGQASFWPKGWLSGGPNN GLINDYETPGGVAAAKSYAGPNTAPHAWGSKENTVNWNAPLAWVAWYCENKIVPNFGG CEGNCNPVASSLIGVKVLMNTPFFLELSGECYMMYA" gene <104964..113871 /locus_tag="THAPSDRAFT_25539" mRNA join(<104964..105015,105091..105504,105694..106135, 106267..106539,106741..106803,106892..107683, 107770..107879,107974..108734,108831..109058, 109143..109952,110030..110659,110743..111070, 111158..112117,112183..112578,112665..113871) /locus_tag="THAPSDRAFT_25539" /product="predicted protein" CDS join(104964..105015,105091..105504,105694..106135, 106267..106539,106741..106803,106892..107683, 107770..107879,107974..108734,108831..109058, 109143..109952,110030..110659,110743..111070, 111158..112117,112183..112578,112665..113800) /locus_tag="THAPSDRAFT_25539" /codon_start=1 /product="predicted protein" /protein_id="EED86489.1" /translation="MLAHASASVRPSSSLARPFLLFIIRSFLSRRLPSFLGGGNDNIS SSAVNNIHLADTTTTNNSGVSEGYHCPASPKRSAPVIKLTALGRDVSAEVFRAMALSE RGGGDDTNYNNNDSDDEKKSAREGEDESIDLQQEEAKSDQAVKTSDMEGSNNCDNNNE EQLSHLLTIQTSPLQSLSLDCYPYSFSPLLLFTSNGYFDRGYSNYNAVAGFFEGVLRE LVVTSVDEVGGEGAGGGGENGASDANINGGMEDGANNGWERRVNNGSPRRNGNNETDN GNNDEVEGGRGGIDMLQGVLYDRKNMTYDELFNHIIATKMLVTCCIDAHFTAFQIISP KYLVYYDPLSPNLRVAVGENDVQTAAVFLLLKCNYGDNQHVQENKAYYTGPTSSGLQR RMQNNSSMSTQLTGNTCYFQTYLFAILCKVGKLAISGDGRSIEVENVDLLEEATIAIA RFMLTFFVDTERGIMRPLTNNNFVLDFGRYESSSYYTAFVNYPRHKKVSVPNYQAQFD GVMKFFVFNKTLHGYDKFTLEGATSSTPNTKSLQYIRGTDGASRKLARGDYYKYRAEN LMFGCNAGAMIGISCFSEFNAWRKNQLLSFYDAMRTMIGGISKDIGAKGLSKYRDYYF MPQFEVGQQELVDIHHYTYLMDMCSLGKTPNGLEDVVHAINKFLVQHIYFSTQGRNNY HKMMDLQSFMRTSKFMNPFQKNFLTVEWFHDYIGLGFSEINPKEKDINSLTQTVFYSS DLARTIAHRQEYEFEKECINQMARSNFRKFLALVDEKHNATKKYEASIKIGHGFTYSK YNTLMHFLGAIESYWSNPDVNNIQMFGKDIRTLLAISCQKIFFEPSHCGYYHYGPMEY RANYRNEFDLAVASCTGRAGASISEATSTYGSGCNYLAITDRVYEYSYVQGILERLFS SVNSTRLKSDNGVLNLLLLSLMLDFGLYDEYAKLLNIPFMKDLQHEADTKELQVEVAN LIYDFDRKHTADPVTRSKVEDLIFEVSYKFLVNKNFNISSSQFKLIQMLNGDSDYQEY VLLCKIYMSLCQINKSVEVDYYKIKCNGIFHIIIPHNFSKVTSEYLEEITVHHSFSEK EGIMRYDNIEVFDLRPHQPEINLYRVRLESSTEVQSMVKYIEINNVFRALSVEEQYLV FIGDNSILIDVEEGGKMIIRINQVYAEIATIFFNDSISFIPCFKYTEGEDVIIFTSPN IHYLVDKGGQFCPDYYGMKHELMGCINSDEIFVDLNDDVNFKREKLSELVTESKVVVY FPDYLLLVSSRQQLINLLDYAIYIRNLSMFILVLFYLRRTSVELQYISRSKDKSPCPP LIRSPWKEAILYVLGMSKNESYSEIFAQQFNDLDQQKDLPLREFIDVLCDNFTRYQRY TEDGQYQIIPTAKQKQFLERIICGDECFHFSEVGTGKTKVILPLLCQTFLSSNKETHR YLARGGKEKDTLVILVPEHLVADACTQVFRHCINLNSPHQEYRVFDDIFALLHDNVQL NPEQRNKYYTVNPNQKQRRRMKQIFVTSFNQFKKALTYDKICRKIRPHREHVLIITDE VDDFLDRDKLVFNICSNKANSFDRDTMEYFHEASNAAYHGRQFSKEFFQSSPNPAYWK ELYEKFVAIHTEIQDASKSLNKSFGIFNEQTLRHCTTSISHDIEGYKALIARPYESVN RAMPGSYYSDVERTIFLTFVILSEDIAKYNELFQSERKFITFEYWKEFLAELDYDELV YGHDKLSEIAEKHPNNRSGLIHFLYVIILKRMEVRDGSRSVNSVDIIFNFDCIGFTGT PFLDNYPTANYIRNQRKDNIPPTIDRSFYAYTSENLSTEEFESRFARFQGQNSNVQAE YVSSDFMQETLRAGEMKTLKAIFEREQSNAGATKESKTTVAPFNAIVDLCGIFKLSNI YDVRSLVLKHFGPDCFHYIYHIDQTDGSDRMLCIRTKNDVNFDEETYKFMCRTYGASL REKIFFFIDNRNYIGKDVPYQLIYQRQYKLPLFAKSVVIAHDVDDFSKIWQAMGRSRT MNDTSFSIYKNNIPEGMVNGNSGPGDIKKLPLTRLLYARNCDCKMAGNLSSIYQTLIA LYNLSEESFYYSDEIVNTFIDKMEKTISSKVARLEEKLSRTVLGDPVPAQILQHIFLD KFRRSSNQAVSQNELTLQMANSLLRQIVQQKFEQRLPSGNIYDDYIRFLSGEQISLME ISYTKQQQKQKQKQKAKSQDNDTMDVFAKANQLTLTSKMENYFDSTIKGDVDGVKQTL SLPLSVPIFTAHYINSGKKHAIHVYPTVQFLYSHHIKPLYIDEEIHSLMRNADNPRQF FSDFVETVCNVKSLEYGEMESDKFHAEVKFSCIRQNPQYSLVGIQPGVYIIGMKDQFN SHDCATHPLNDFMQYAADEIGFVLFDKTNSKSVDEFGPYFIEQYILLDALSKQEVAQN VITYYCNHKETLERCLEKYDEKQGKGFICWRFLMNQATLALE" gene complement(114033..116382) /locus_tag="THAPSDRAFT_25540" mRNA complement(join(114033..114675,114776..116305, 116338..116382)) /locus_tag="THAPSDRAFT_25540" /product="predicted protein" CDS complement(join(114184..114675,114776..116170)) /locus_tag="THAPSDRAFT_25540" /codon_start=1 /product="predicted protein" /protein_id="EED86532.1" /translation="MMRPFATALRQSADQKSQEDDLSDDNGSSLSPLLADSNSHNSTS LTAADTTTLPSQEETFSLRVKLNDGHSQDYTLDNVNAKLGTVGMLKERILGCYFGEES GANSGSVGGGGLRELGSLLDTSSNNHPSFHASKNRYLRLIVRGRMMAPDSSTLEKFSI TKNDVIHAVLAKEGVRGGQQARMLRRLNAGNTVSGRSNYYGGSGGDGENNSSAVVGGG ALRQFTTAATSTLTPPLSDNSLTNRLMRRIGIDANGVVISQSQDDDDSEDDEDFDDVH GELDEEMGGQHHQQQRHHEDRSGSRRRIRERRGFDRLRATGMSRDEVNTIRLYFARSV DRYIQRRRVMIRASQMIRNRRNGRSVNADVATDGEGATRSTSGDEHHLSALTSSRQRS ETGESANSLLDDGEGGSGANVVESEHGADVLTANAGNAGEDDTANNENNTPAVEGEEI LLDRRRMEDEWMSTQGPYSEFRMNLNTSNPLLLAAISGGNTPTNNNANTAGRSFASGL FFRRGGTLSNNPTGMTLDGEEEDEEDLMFGGTIGPNGTFVRTTNPNGPFHPYSMGPVP SAGTEKDFLWGFILGFFVGFIMLFWVWMPTVPHKQKIGIISGISFQLGLNLLRKSGQE GVAM" gene complement(116814..>119259) /locus_tag="THAPSDRAFT_25541" mRNA complement(116814..>119259) /locus_tag="THAPSDRAFT_25541" /product="predicted protein" CDS complement(117223..119259) /locus_tag="THAPSDRAFT_25541" /codon_start=1 /product="predicted protein" /protein_id="EED86533.1" /translation="MPPFRIAVGSCSHPSLPQHLWSIVHQRHPAAFVWGGDAIYADRY SGLNWTAVGLAYVAYGDDSAVDGNSYGAASVGDGNIDTSHAPQKKSGEWRFTFPPPSI HIDATPEVIRKWYQKQWNIDSYRQFVEGWDWNEYHPDLQLPDGNNNEAINVRPLIFGT IDDHDYGANNGDLTYRYKRESNLEFLEFIYSGVDGSVDSANGSCNAHVQDDGGQICID GSQKSSNRRNKHNDPMYQRAIDGKGVFGVQLFDFASTKVSPKSMSDNILWGGGYWIPE EEALIDPDVIAKQTTHNDSINATQHILQSNYSTTHSVAIFLLDVRSNKTPWPKGKHHH HQSSTGDDDNNVPVLDFLGEEQWKWFQSALSNSKAAVNLIVSGLQVHPERFPNDGNIV EEWSKFPESQAKLYNLILNSGVKSPILVSGDVHMSQILRKDCIRSSDIPEEDENSRRP LPTKRPLVEVTTSGMTHSWGTSFSSQPKHHTLPLWPYSYFVSRTFMTICHFVCPWRDL VIRTADMRREEEAMRAMQPDSVGGARGKVGKQYDLGLNFGEFEFQFNEEGGGVVSFRV FGKGKDQPPKLQMTWSLDQLSGKSDLAGMTAKYPQDFLTMKRRGTKSKVSSSKDEWIC VPYRGIAPALHEYTANAILFVTFCSLFFSPHVLLIAVFVVVRRRWTRRKEVEVS" gene complement(<120266..>121108) /locus_tag="THAPSDRAFT_11352" mRNA complement(<120266..>121108) /locus_tag="THAPSDRAFT_11352" /product="predicted protein" CDS complement(120266..121108) /locus_tag="THAPSDRAFT_11352" /codon_start=1 /product="predicted protein" /protein_id="EED86534.1" /translation="MKKYRELKASVADATAYADVDATISQTKKDKSGKKLKSSSPDYA CTNPTKKNGKSSVNLVMEESPDPVPSPTVTNALPVRPSPSSMMSTAIPRFDLQDLEED AGNTSKATATWSMMVSAASTYSFSTPSTAPTTNDTEKVLVNLYLAHLERESKSNGIAT DTNITHFQAVNAPDNYIRSLWNNYHTTTEGNINTEMDGSSSTTSTANSKTNSITEMDS QNVDDDDELKKFLNTLDLGDAENEADQEEHLRDELASMPFHDVEDTTTKATSKTIERT MTPV" gene <122582..>124017 /locus_tag="THAPSDRAFT_25543" mRNA join(<122582..122878,122983..>124017) /locus_tag="THAPSDRAFT_25543" /product="predicted protein" CDS join(122582..122878,122983..124017) /locus_tag="THAPSDRAFT_25543" /codon_start=1 /product="predicted protein" /protein_id="EED86490.1" /translation="MDQHQPKKQRGGRKAKAKKDEPKIPRPYTSWLIYFQLEREWILQ KKLGVASSLDPEKAFVAADPRYNGPPLPSRYQDLTLPSDWWMPGKGDRRKRLHRKSHG LVSFHELSRMIGDSWTQVDTETRSFCDRLSEIGMNSYREAKKELKRQELFDDESTTAV TEANANANDNDRKHKKNGELAADREQPAVLPQAHGKPSNTNTVGVHVQGDAIRPSCAN DERTNGELDVDRDSKPAALPINANVSLPVPPTANASTGATTAMSMSASHHINFNSQGQ ILGHTNLTSRTSEAAARSVIHPSLPNQTLEQRYRQMFFATVQAPNTETEEALIKMYMS HLETKVKTAHEESNGNHNRLRTVDMADNGIRRVWDGRTNSTVKSDDNDSERLLGSTDD INAKTEETDVKESENTEEAKEEEENTEGAEANEVGDEFKNFLDTLHLDGVR" gene <124844..>129577 /locus_tag="THAPSDRAFT_11354" mRNA join(<124844..125426,125449..128718,128805..>129577) /locus_tag="THAPSDRAFT_11354" /product="predicted protein" CDS join(124844..125426,125449..128718,128805..129577) /locus_tag="THAPSDRAFT_11354" /codon_start=1 /product="predicted protein" /protein_id="EED86491.1" /db_xref="InterPro:IPR001841" /translation="MTSSPSSSSSELLLSQSPPKLLSNKHHQSSSDTNNSEVRRPSAI RASSSSSRAGGGVAAANTSHSNRQPNERLSSHASTAAFTATRPSNVRSAGAVWRSLRK RERETEGGDVGGGGASTPASLAKKIKVNPGTWRRESKGYHGGGITSGEKRKNDKCKGP PDISNDVNNSCPRKLNETLDEMQQSRDSYRNAKTTPQPHDDDDNNHSNHYDSIIIINN KEEEEVSPKRPSDQFLQSYLPRNIQNIRNVQNIQSIANDIEGSLLLGMEVGADVGGGE RSEGGVYGGGGVEVMSQLTVGIGTQQYMSARDEESEMEEEKEEVGGTVAKEVEVLAGA GDTVSNGGEGVASPNEGGDAATQKKQQSAKAVGTEKKQSSAMNQDNNSDDDYEKDDFV SSDDDADEAANNARKKTALAAAVEETSNATSFNPINTKANAKKRGRKFKGPTFKALAE ADNGSGSEEDDQMDHGDDAFEEEEDLLETDDGAPNAAEAAGPNAMMKEEEDAKPSEST TLPIEVFIDNIARDFENMGKALTCPICQHSLRKATILPCRHAFCHSCLTQAFNPSSSG AATKGKKKNSPTHVKLECSVCREKVTRRSMTRVEQLDDLVRAYKMTARDFGFAPHVHN ANVTMTQLDPEENAFTFDDESEEDERKMPAEGGNERHYSSGKKKFDVTETTQHLQVTR VVRDAITSKSKQLQSSPETKTAANQNLVRTKEDFERQAMARRYQFLARDADAVVRADE EALEKATRRKQMIVSATAAVLGVEDVAKKSAPVESSGNLKHAKIASTTTTEMAEEPLQ EKDLGKATQETFQTCRADLESLEYATAKEGSHQANDDVLNTTAPSPTVLHDGNTAKKA NRKSSRASSLEDVGGRDDGKAAKEANGTTGATKRRNHSFATFATGDASPIVLHDGNTL RKSRVETSNNHPTPRRSGKIPSMAESRVLRQKEEAESEHDANDDVDFGGEHDFADDDE DRSPVPEGAAACVGVKSSQDSSVVSEESEGKPPANAKTEELTAWEEIVKGSVVMVKSR TWPGINKHGGVGRVTKVNVVGNATKYDVTYVLGGREKDVDASFVSLHGEMASIEASSH GESKVSQRTSRSTEKIAVRPMRQRKVAETIQPAPVPIYNDEVLKHIPPETLQWAGIKP LESRSKGSNSKGSKSRKKAKKAKVVVPKEASDGKKKRGLAEHNSNTTMPAKTKKQKGS SSSHPKEASTATNPTEPVHDAESFRTIIDALSTEEVVRRADVRYASLLLNAEILNVVT SSLEEKDSDVLYSLSKMLKSKNVTLKVMKDYKPGKTQLCITATSSSTSQHSGMVSKSR TLKVMRSALAGLPVLTPLWMEACLKDETIVPPTKDMCIRTLPTKQTSKEDGAADFGVA KYAGAFQSGEFPFTNHVLSGVSVLMCGAWKSGMAKELKILLQDAGATIVSSVSVASKT LTYMSTAESKTGSLVFLCDDSHTNKDCGISGSLLKEAKAAISAKDENTQKILAVHFNW LFDCISCATVLTGNAYEPMSPRAKELWSLGVESDGSECAKVAKSQAY" gene <132081..>132829 /locus_tag="THAPSDRAFT_264647" mRNA join(<132081..132461,132591..>132829) /locus_tag="THAPSDRAFT_264647" /product="hypothetical protein" CDS join(<132081..132461,132591..>132829) /locus_tag="THAPSDRAFT_264647" /note="No EST evidence, but hsp groupings and InterPro domain hits as well as similarities to cytochrome P450 sequences from other organisms support this annotation.; hypothetical protein similar to cytochrome P450; GO_component: GO:16020; mitochondrion - membrane [PMID 5739]; GO_function: GO:16491; monooxygenase activity - oxidoreductase activity [Evidence 8395] [PMID 4497]; GO_process: GO:6118 - electron transport" /codon_start=1 /product="hypothetical protein" /protein_id="EED86492.1" /translation="TFLMAGYETTAISMSFVVYFLSKYKRCQERCAEEARRVLGRCGV HGTDIDDDELVYCRAVFMETIRLHLPVMFTTRVTEKEMSFDTGLEEGHNVTIPKGTRC VVCPTVVHMDERNFERAEEFLPERWNNSASSISAADPHNFFSFSDGARNCVGKRLAIM ESTILIAVLLRDVCVDFAEEGFEMKKVRRFVTCGPESLPVVFWRRE" gene complement(134214..>137101) /gene="PGP" /locus_tag="THAPSDRAFT_25544" mRNA complement(join(134214..135139,135306..135367, 137089..>137101)) /gene="PGP" /locus_tag="THAPSDRAFT_25544" /product="phosphoglycolate phosphatase" CDS complement(join(134318..135139,135306..135367, 137089..137101)) /gene="PGP" /locus_tag="THAPSDRAFT_25544" /note="Likely functions to cleave phosphate off of P-glycolate formed during photorespiration. High similarity to p-nitrophenylphosphatase which occurs in the same superfamily of hydrolases; GO_function: GO:3824 - catalytic activity; GO_process: GO:8152 - metabolism" /codon_start=1 /product="phosphoglycolate phosphatase" /protein_id="EED86535.1" /db_xref="InterPro:IPR005834" /translation="MSMRPKDLLPGVDVFIFDCDGVIWRGDSVIPGIPQTLEKLRALG KKMYFVTNNSTKSRAGYKKKFDSLGLNVPAEEIFSSSFAAAAYLEQSKFKETGKKVYV VGEVGIQEELDLIGVPHFGGPEDANKQPDMGPGCMVEHDEDVGAVVVGFDRNINYYKI QYAQLCINENPGCEFIATNTDAVTHLTDAQEWAGNGSMVGAIKGCTGREPTVVGKPSP LMIDYLCDKLGLDRGRICMVGDRLDTDILFGSDNGLKSLLVLSGVTTEEKLLSQENVI TPDYYADSIVDFFVDENAKVGA" gene complement(<139417..>140929) /locus_tag="THAPSDRAFT_11357" mRNA complement(join(<139417..139961,140170..140827, 140888..>140929)) /locus_tag="THAPSDRAFT_11357" /product="predicted protein" CDS complement(join(139417..139961,140170..140827, 140888..140929)) /locus_tag="THAPSDRAFT_11357" /codon_start=1 /product="predicted protein" /protein_id="EED86536.1" /translation="MPTPIERDAPKPSDSPQDSSSKSPTKPSLLPVNPAPPNDTDSPT SVSPSLQPSSSPTAIGALPTTKRPSSSAPSPSNTLTSPPVGKSSVSSDPPTPPKAPPV PSQTVPVCLEPTNNASTKNLDSSFRSYRLFGKSDKTTRVFQYQSKTLKNDPFSDDCLA NSGRYRTCFRRATNLSSCDAFARAKPFLSSDETMASDTMHLTLIQDTIIATCRNLIQV EKALLTFLADNIGSDTSDSSGRVLEQIRLCVALKMQSTMRLANTVQTLLVTSLGVGVD DALDAILAIYQENLAKVPSLRSTTLENQGSPVRQAVGKGLFSTKPVQVNTCSWYGLLN RDDFNTVIRRYSEFKSEMTRAILDATDTEAVAKCCANRFSIDKYGTPSLTCDEYDNED CPNNEDIKGSTTLVGLKPNNGL" gene complement(<149026..>151454) /locus_tag="THAPSDRAFT_11360" mRNA complement(join(<149026..149348,149430..149580, 149799..149822,149905..149928,150066..150089, 150188..150247,150338..150957,151108..151258, 151374..>151454)) /locus_tag="THAPSDRAFT_11360" /product="predicted protein" CDS complement(join(149026..149348,149430..149580, 149799..149822,149905..149928,150066..150089, 150188..150247,150338..150957,151108..151258, 151374..151454)) /locus_tag="THAPSDRAFT_11360" /codon_start=1 /product="predicted protein" /protein_id="EED86537.1" /db_xref="InterPro:IPR006970" /translation="MKVTTSIITLLFASCGAADVQRVLEDVTEPAVTTPAATPAPITP EPATPAPTICEGRNFYRDDDTRKCSNEATGGIYGTLIECCVAISGSDSCPYVDICNTL QPSPSPETNEPSAKPITAAPISSAPVSAAPVTSAPVAAPVETTSMTGPTTIVASIVST NAPSSTNAPSSSLEAVVTRIPVETTNTASPTTTAASIVSTNAPSSSPEAVVTPRPTFR PSPKGTESNTSPASIASDVMFGPPRTATPTSTPTSSSHPSSSEPTLSPSVSKEPTRYP TSSPSHSPTKSPSKSPSSSPTTSPSASPTETPTETPTESPTELPTLSPTEFPTLSPTW SPTGYPTLAPSPSPSISSAPSVSSAPSSPPSISSAPSVSSAPSKNFGFLPGLTEMPTI SPTEDHYFFGKSHKSHKSHKSKATKTLKVSKSGKSAKSSKSSGRRPLFGVSQLSEGIA VGYAKSSGRSSQQAVGSWMPVAAACILGALSFILN" gene complement(151953..156499) /locus_tag="THAPSDRAFT_25547" mRNA complement(join(151953..152690,152775..152873, 152949..153005,153101..153163,153247..153309, 153387..153476,153560..153646,153723..153755, 153845..153977,154054..155243,155320..155839, 156068..156499)) /locus_tag="THAPSDRAFT_25547" /product="predicted protein" CDS complement(join(152128..152690,152775..152873, 152949..153005,153101..153163,153247..153309, 153387..153476,153560..153646,153723..153755, 153845..153977,154054..155243,155320..155839, 156068..156454)) /locus_tag="THAPSDRAFT_25547" /codon_start=1 /product="predicted protein" /protein_id="EED86538.1" /translation="MKNSKEEEVTKLSTRIQELEAQITQMQVGGDMANQDVEMLQREH EQEREMMNQRIMELEAMQDGSWQPEGAYELQQERDALSHRVAELEENIEAIKNELTER DKLLRTSNKATDILLDQMEAQKLDFEKEREARLAAEREVEKLTAMMNENRRSGQGATE PKTKKSKSFFEQMFGARGDSEKDDVMSEQVYSEEIISEEYTMEYGGRDDYALLNDVLT PDAVAPRRTTDYKVARESSMPRESLGRRSNDARYGAESAPGGESPYYNSMTPSPAVAR EPPVQVPTPQLEFERRLAENPIVPAGAFGGSRPTAPFFSKTSPPTIDSSASQYETPSS ASYEEPALPPAASFAQRPTSPSPRVQARTGISDNIYAGAGSAIKSDNVYAGAGGVGKS DDVYAGAEQRVGGGRGGAGKSDNVYSGAGRGSSDNIYAGVERGRGGGASDNVYAGAGR GAGSMAPSPSVAYEPPMQVPTPQLEFERRLAENPIVPSGGKLIYVVNSISLALASLFL IIVFIKAFGGSKPTAHFFGRNNAPPTGASSRQQTGNESGDDESAKWQSLDENEKKRVA AEAYKAFEKQLADRRSSQERKPKEAQGGGGGGGARSPAAVNPGRKTMNVKSPERTSAG VPSDNPARTTMQQSQSDQVQMERRKHQEMVQQASRDQQQKKEGLEESKPKQSTEELRQ QAAEASAKRAKKAEEAQMELQKKKQMEEAARKDQDSRSKADAIRREQEQRRAAVEAKQ KAEAEAIAAKKKAQAEAAEEAKRREVAEANASRAREEEEAAKKAAEIEARVREAEEKE ALRRQVEARLAEEKKAAAIEAKRREEEERCRDDMKKAAESREREEEARRAKEEQEKKA AIDAKRREDEARQEKAKMAAAIEAKRKEEEASRRGGKRQEEEAMRQESAKKAAALEAE RREGEARAREEEQIRARQAAEAERIQRVEAERIQREREEHEAARVQEAQRKFEEEARL RMEHEAARRQEEAAVNSNDDAKAKETEAKHAEQARKKARLSSEVHKAEEIRKAQQSKL EQQRNAKAESEARAGKRLKDIAANANPPEVVVLKEGNKSELGGSLKDLLGAPKEPPVK KRETNY" gene complement(<160653..>161918) /locus_tag="THAPSDRAFT_11364" mRNA complement(<160653..>161918) /locus_tag="THAPSDRAFT_11364" /product="predicted protein" CDS complement(160653..161918) /locus_tag="THAPSDRAFT_11364" /codon_start=1 /product="predicted protein" /protein_id="EED86539.1" /translation="MAPEKLHVGILTTILGTFFYSNPFLQQHTQWITSEEHRNTIKPK ATLNVNRHNQNWTHQWLVDRSYDPARIYFVHVGKTGGITLERGVPIETNNKRIAIPCI MEKMEEGTSVSLSLEDAVEKCSRQTKPKKRRRQTPALSKHILSHKHLFSALFTKEEMD WILARVNTFLITTRNPVDRIVSAFNYHRNELLQLQQKNPMKKRFYSECFRDVKDMANQ LAVEYNLSQQIIVTTGDQQRTFNASSAMTSKKASTAQQPTCVQLAQAILGDDGEWGHF AYNYHYYKEVTMDKRPDVPVLVIRTEKLWQDATQLELALGGDPNRFSAANQTMSHGSE GYEVTAKLSTAGEKQAICCAIYNDVQAYQSIIMAALNLNDDEKLDMMRLVWKDCGVDV NDGEEEELLLSETNFWEHWFNVACSRPKR" gene <162343..167759 /locus_tag="THAPSDRAFT_25548" mRNA <162343..167759 /locus_tag="THAPSDRAFT_25548" /product="predicted protein" CDS 162343..167667 /locus_tag="THAPSDRAFT_25548" /codon_start=1 /product="predicted protein" /protein_id="EED86493.1" /translation="MSSDEHRRRASNDEDSDTTDTDTAIMDLDILRSLAESVDCSNQM SGFVSTFRLPPMMADLDDDDDDENMQNSMDGAREHQLPYASGITSSPPRSSINVRPRL NSNEIFRSSPSSSTTAASPLASPLSGHCIMSDGSCLPSPRQTPQQTTSQNRTIQLNQQ PQLITTPRLTPLQREWGLNLLSPQSVASDATDAVCNRTRKTTTNNGAKGRRGLPKMLI PPNISDVGRVEDGNNVGSNVVRSNSGDCSPDNTQGERSKNNNTAQVHMNEATPPRGNR QSSNNLKSTTITTSNNLKKKKTKPSSRKNNLVILPNPHQMQPHLFHRSKSHGIHPLLS TSPTSTSTIGSGPLSPSKLPPPHPMSHVQSYPPMHHVHHQHISPTYHHHNHHNQQQQH HRFAHHAHSQSYTSDASSSEATPITNPARCRACSFSSDISDSVAYSISVKGGLDNSLR SGGGGTTVGEQHSIHAGSKASDADLNGLRYFNDFDDDLSKDSVGDPLMGVNIHLENIA NNDAVPTLVDNVPIVDVVPAFMSTNSLLGLSIPYSPKQTPTVENYYEEPLEGLGDDYR GSGVAVANGYCSSTDITPPASPNPQLRQQDQQRHHRKQSSLVSFSESCNRSHRSQHSS NSLNSISNILLNYPNPDPTYTEEQGSLTPTTPQTVDSSSYYVDDNRSLYEDEKIFVQQ IHNSGLQSLSFEHVEDTLENTSMEVEMELVYCDSFGSPGADDMSLGSVSKGGSVNSLG SLMSVEEIKEMERRIHCYRKFRKRNSAEVAGVGGGDGSAAVCKNKEDVGASKRDAGLP SLFAKPLSERVIITADVKGDIVDGEVRSHGNFGHDTPVASNKASADLLDIIMNIPKNR NDGGADAVEEDSYENIDIEMAPFSNQIGFSADDVDDDDNAGVNRHISFGETKVHPNNN PGWRHYYHLRNLLWWWQLQPSRPVAYSQIQRDENRKTKSSNHRFDDVDFDRNSSSCCS CFYRCLCGDDQGKCNGNSRGSSCCWYMSSTWRKLLAISCLFMLLAWMFSDNGLGQHRS NDVQQYDPLAKENFTVTSKIIDDADDFDGYFHNPMNKPEDDDEFQYTSHSAQSEQGHY LQETYLDEDDVFRRFELEDDVFDAPTMKGYTSFVSDSNDTTGDNESKVSYDASSLIYE PQVITKDSIDTIVVLGERHVGIDWLVGRLKVLYPDVAVSSGFPEENGVGRAGIWFQDN GARRALSSEANDTTTRKHVIVIALWLNPYDWTELMRVDPINSPYHKGLEWEEFVEAEW APRDTTPSKTMTALKADDAGDWDDVLSNQDEEELESRHLSRGRVLTAAKTKAPVCQHN FSSSSVSPCWPTHDGKAPQSLATANANNKEEPVYELNTNRNGQKYYSVLELRADKIRS EVKGSSLHSSVQSVIPVRYEDLLIPCCNKGSSSDMPGIVGLMDQIEARSGLTADPSVG WDETPKSENQFWPDPIGCTGHVCFPSINQMSKDVKYVKYLNDHIDWHAEQLVGYQKRS LPKPSVNQIVVLGERHSGAEWLVDRLSRCFPYIDINYGFERERPGKYFQAPPSDEIPS TLVIATFLHPYDWLELMRERPINAPTHKDMVWSEFVNSPWKRMRSNLDNSISDPSSAT CSFGFSYHEIIPCQTQRDPESDSFPLYELRHSPNGYNNDADEAYSNILDLRSDKILNF LSTAEYPGVVDLISLRYEDLIWDDGTYADDASVLSLPFPGIAGLLEKIRDRTTLVPDV NAGWILDEKDVFKAEHLRGNGTDFDPSFVEWIEGHLNWDVEKRIGYGP" gene complement(<170283..>171889) /locus_tag="THAPSDRAFT_11366" mRNA complement(join(<170283..170294,170333..170392, 170483..171024,171172..171322,171439..171546, 171830..>171889)) /locus_tag="THAPSDRAFT_11366" /product="predicted protein" CDS complement(join(170283..170294,170333..170392, 170483..171024,171172..171322,171439..171546, 171830..171889)) /locus_tag="THAPSDRAFT_11366" /codon_start=1 /product="predicted protein" /protein_id="EED86540.1" /translation="MIDSLLMHRMTLLIFLMEGIAPAIILQMQFAGDIEIQEEEEPQF VAAMPEDIPIAITEPAVTTPAATPAPIIPEPTTPAPTICEGRNFYRDDDTGKCSNEVT GGTYGSLIDCCFAIFGSVSCPYVDICITTQPSPSPETNEPSAKPITTAPISSAPVSAA PVTLAPGSAPVETTSTSSSTTTSSEEVFSTTTSTEQVIQTTVASIVSTNAPSSSPEAV VTPRPTFRPSPKGTESNTVPASIALEEKLGPATSKPSFSSHPSSSEPTLSPSVSKEPT RYPTSSPSDSPTKSPSKSPSSSPTTSPSASPEIF" gene <172258..>175928 /locus_tag="THAPSDRAFT_11367" mRNA join(<172258..174985,175029..175777,175899..>175928) /locus_tag="THAPSDRAFT_11367" /product="predicted protein" CDS join(172258..174985,175029..175777,175899..175928) /locus_tag="THAPSDRAFT_11367" /codon_start=1 /product="predicted protein" /protein_id="EED86494.1" /translation="MVQWIRCVIPPSAPNTETSQKEDNKSEGTASSRCNVIRSDVDAH ADGSATNLNHATHHTYYGPGELELYRHVQLQQDAKAAKLNVYYPGRWHRHSPRACFRV VFGDCFQLPFGGDDGCGNVDDTVVLKFRSNARIISARWLQQKKSCSLRWESNAALTEN GTVYDHTLLLSSSVVDIDETEQLPEYSSESMLKLELDSQFPDMSDEEDIYGKLYPPPC VTLQTTYRGSQNQRASVLTLAADWGNTVWEWKCTTANAEVANEEWVPITMYWAYSADV ADELESSPLTLPHQMELSHVDTIQPVRAIPLAEIEVEDTNCSVLYDFGKELLGKVCLI TSSATTDISNVQLRVGETLSEAMNDAEEHFEQSTELSYGNHGLVSAHLLAFRYVRIIF NNSNIPPDATVECRASMPPLAKCGFFSSSNDCPNSELDNQIWRSAADTLQLCIHNNFI LDGIKRDRLPWAGDLAVSIMANAYSFRDEECIRWTLVVSGRCGIDRLHGSDDTANDTT SGYDATRAPVEESHVNGILDYSLWYIISHWLYQRYFEDASFLHQEWRVVEMRLVNLIQ FCCDKDEGWLSMSEDDWVFIDWVSDINKNASLQTLWWWALECGTLMAEKMDLPENNTT KRLICDTQSRLEESFLAMQDIQNGYSRHAHILRVISGLYNRLDDKASEGDWSNPDSSI ERWQVLAKIRKLQDASREALLGEELINVGTPYFKHLECLAICRLQDRYIALERIRRYW GGMLSSGATTFYEAYEEGETSEEVANYYDRPFGRSLCHVWGSGPVTLLPEIVLGLRPL SDGWNEWACDPLEDFVSTVSATTMTKFGLIEVHLNPTELRVSVPEGTTMLLMEKVYAA GSYCIPRNLFMSTETIHRWSMKYRGWQYHPDHVIPSNPKIPGYEDIQMVDVPTVNLLP ARRIQVYQLPDDDTFYMSFIGFDGNGYQSFVAESIDLLDWTDHRLAMGYGEEGSFDFG GAVLGAYLYENYDVDANRILKKIDGRFYSLYGAYSKKGSYEIDPGCQGLASSEDGLVW KREKEESVLSILGPGNVKEWEKDSIYQPWLVEHEGKYYNFYNAKQMPQWVEQLGLATS TDLHDWKRHKDNPILCVGTDKTIYNGYDTQFCSDAKVFWDKDVSSWVMFYFGVGKGGA HIMIAFSKDLVHWSEEMEEVLG" gene <176183..177587 /locus_tag="THAPSDRAFT_270004" mRNA join(<176183..176498,176610..177587) /locus_tag="THAPSDRAFT_270004" /product="hypothetical protein" CDS join(176183..176498,176610..177538) /locus_tag="THAPSDRAFT_270004" /note="Very good alignment with ESTs from diatoms. But no other strong matches to other known sequences. Best hits to thioredoxins.; hypothetical protein" /codon_start=1 /product="hypothetical protein" /protein_id="EED86495.1" /translation="MTTKTIPLLLAAYCLQQQPTVTSAWSPSKIDVCGRSVSPISAQL FTSPIVTAEPSIDAQPPKSFGKGVWVPPSQNVAQRKGNKVFAINHPQDLLDFVVEDER LSVVKVYASWCKTCKVFDVRYRKLASQLGDKYDASDATQIAQKGRARFAEMQYDNPAN EEMCKLLNATKLPYILLYKGSKGKVDEFQCGPADVQKLIDAVNEFADSEEEIANGAGG AQRKTQTIVPLLNKEDDVPVNATTVEQPSHEDVSKLKEQLAAVYKEKLEIFEVMKAQI EYDKEQMQLLTADLDQQKKEYKALLESKDNEIKNVTNIFTQQRKEYEKEAAELSKQLT DLTAKLAQSKKTITSLELEASFQQKAANAALRETELRTREWEASKAQYERERNSLRKL SGLAVNRVRRGVRSLISRVRGR" gene complement(<177665..>177954) /locus_tag="THAPSDRAFT_264651" mRNA complement(join(<177665..177835,177872..>177954)) /locus_tag="THAPSDRAFT_264651" /product="hypothetical protein" CDS complement(join(<177665..177835,177872..>177954)) /locus_tag="THAPSDRAFT_264651" /note="hypothetical protein with thioredoxin type domain" /codon_start=1 /product="hypothetical protein" /protein_id="EED86541.1" /translation="VKVYASWCKTCQIFDVRYRKLASQIGDKARFAEMQFDDPANEEM CQLLNANRLPYILMYKGNHGKIAEFKCGPAEFARLNDAVD" gene complement(<178866..182700) /locus_tag="THAPSDRAFT_25550" mRNA complement(join(<178866..178871,178957..179460, 179505..179528,179631..180320,180455..181411, 181520..182074,182151..182222,182314..182337, 182421..182434,182571..182604,182678..182700)) /locus_tag="THAPSDRAFT_25550" /product="predicted protein" CDS complement(join(178866..178871,178957..179460, 179505..179528,179631..180320,180455..181411, 181520..182074,182151..182186)) /locus_tag="THAPSDRAFT_25550" /codon_start=1 /product="predicted protein" /protein_id="EED86542.1" /translation="MPTEESPPQPTPPPTPKPSPPPVIPPTPKPSPPPVIPPTPKPSP PPVIPPTPKPSPPPVISPSPATPSPVNQAPELTTNAPTKLGQGMPTSSIKPTPQKVSP GFTQSPSLIGLVCLQPNGSSPESSNAISQQNSAKVTEATVSRAFGGKSGKGSKAGAYQ SKTYKTDALMDASSCAYTGRYRTCFQRSSNLSTCDAIIEKALLTFLADNIGSDDTFEP ACVFTSGDAFASQVVPNSGGQFVESTTLEMEVMFIQKNEARRELAIDVVEEEHVTLDV IDPGSLFERRLQSNKCTPLGRAMCCSQTAINSNVGEYCSNLGCDFARCGSGRRPRRSP RELEVESNPDIELSPENAGERRAYSTKSGKSSKGSKASLFAQGKASKASRPVVVKPIE VQVNTCPWYGLMNGDDFTNIVRRYTLFQPQMTRAILDADDTEAVATCSANRFSIDTFG TPSLTCDLFISEDCPNNEDLPVEDDPSVLLSSTKLLSRTRMSETQQVGGVKIAADATK LWSVGFSIMSLLDSTNLPRKTASATSATAVSHSVRTSNAVVTTTSDKENIIPSPTQQR TNLSNDNDLADFLKGIWWGDDMFDETWQLDVTATSVDAQEPVQQWCEDDALLYWFEEF DKLGNDKSVVEVFQCIQQSETIHSLLRGFVNNDDEWEYFIDILKMIHRADGKYKFLPV YVGKCAVRDADGGGGRCKRCMSSQYFDAFRNCASVDMLLTLHCAISSMTDGVFSTEAM LVPAVVDDGSPYQGAVIVAKGLNLFGKLARELDCMPMFINVAGTSGKIVMSKLDNALF FVHDEMCHMSNISTRRKSYGTGEEQTTKVTQLLDGLEERVESLFEELATLKLNGAIES AFVDGRFKYYGSDATIKERVRALEEKSEAEHAIKEEKRLKWIRAQEEKRKAELAIKEE RLQRKRELI" gene 185018..185596 /locus_tag="THAPSDRAFT_25551" mRNA 185018..185596 /locus_tag="THAPSDRAFT_25551" /product="predicted protein" CDS 185020..185538 /locus_tag="THAPSDRAFT_25551" /codon_start=1 /product="predicted protein" /protein_id="EED86496.1" /translation="MKSSASANTNADNGSLSSPNGRQAADDDIGNSDPKPKTSLAKET ASDTSGSGNTTKLFGTMTSEEQHQYQMLKNLATGKADAFDFTVRKAEEASSSSPNEAS PSLPAQHGQRQAHASKATKTKSNSTESSSSSHADTNEFRTHTWTQEQVDGLVAFTNRL LMPLYKRRRVGK" gene <186066..>188558 /locus_tag="THAPSDRAFT_11371" mRNA <186066..>188558 /locus_tag="THAPSDRAFT_11371" /product="predicted protein" CDS 186066..188558 /locus_tag="THAPSDRAFT_11371" /codon_start=1 /product="predicted protein" /protein_id="EED86497.1" /translation="MITTSSNDKATASSSKEGGQHKPLDELICRHGRQKEVQLLRSCL DRLLEEDLDVESGDGAAISGEYDAAAGTTEVEELASDKVDSSKKDGTDEGATKGDKDE HEHDGSDQEKGDSNSKSLSEDVPDKGNNDDENTDSTKEQSAKSPQDQIKTSTNLNPLP IKRQPLTREQRLQRLRQRRERRTRSKRRRRRHYVYTQNMNDVTLTIDPVPLKRECVLI YGQQCMPLVESALREYVGKRFHGWFVRGDFESDCHVFGGSGFRGGRKKSERKLNAKDK VTDGNTAAAAAVKRKIGVRFEDELCNEDEPSLGGEGIPFSGITSVVRQICIKLLAMQA ESSAADSPNAVPNNDDSKAEGTVPQRRGGVRFNMSHLFDATSGELMQALSVEEQKMLV SVTGLTELACIFGLDAQDEALHEQSNQNILQYRNKLHYAFQKLISVICQSHGPLVICF NNMQLVDNASLDLLDALLLDRENSKLMVVGCIQTPPLAKNNYFSDTSPTPESLRKALK AKIDRWRVHGDLFGLTLTEIAVGNLTVDEIKTILLDSLSMENKGMDSLADVCFEETKG DVFYLTRFVEMLHKKKLIRQNSKAQWTWDAAKIASKTSESPIEDLVLRESISKLSKQA RSLLLLASCLGSTTITEKLLYLVWDKFEYKSHLNHGDITKYRLLINEVLNRGAFVVVE SEYSSQKAYRWAHEAILKEIIVNVAPADLAALRYEIGSTITNSMNGAQAECSIFVVAN LANSGGHSVLGSLDASRRVHWAEKNLIAAKKAVELTAFNDAIRYAEAGIEYLPSNKRW SDYRPLMLDLSSVLAEVAGLLGKEVLSLFVFW" gene <189258..>190229 /locus_tag="THAPSDRAFT_11372" mRNA <189258..>190229 /locus_tag="THAPSDRAFT_11372" /product="predicted protein" CDS 189258..190229 /locus_tag="THAPSDRAFT_11372" /codon_start=1 /product="predicted protein" /protein_id="EED86498.1" /translation="MKDQFRVGAMNAEAAIALLGTLNNQADKAQAVFVIFYMVMPWVR HVRDSVKPLLAGYNFGLQSGDIASAMNCIFGKLVVDLVSGTPLKHIEEACRTFVPLME HANVQGLTLITRILWQTTLNLMGECDNLSLLEGEAFDESDFLRQPKTPQLIMNYFQAF KSYLYLFTGDYTRGSELALVRMDQFLEEVSCSALGLVDLITRGIPLFDRLHATKRRQY LKATSRIRATSNSWRSKGVCNMLHLGFLLDAEQASYDGKLDAASEMFKQAINFASRAG FIQDVALANDRYSSFLVSHQMSEIEATYHYERALFFYEEWGAGYAGL" gene complement(<190368..>192266) /locus_tag="THAPSDRAFT_11373" mRNA complement(join(<190368..190699,190779..191006, 191051..>192266)) /locus_tag="THAPSDRAFT_11373" /product="predicted protein" CDS complement(join(190368..190699,190779..191006, 191051..192266)) /locus_tag="THAPSDRAFT_11373" /codon_start=1 /product="predicted protein" /protein_id="EED86543.1" /translation="MSPTDDDAAFADSSYEALRRKFGLSDSSTTAATAGDVGVSEDME VDDNSEKEASTQTMPRRRRRTAVAEEPTIQTEAQNEPESETQSEIEPPTAEPYEEDTR VQSACPIASFSRTFPRYTINLANKSNSSKDPESEARRLRRMQRGVITKLTKAGGDANV GGLNPFGFVNGLVKGVWNGGSPSSSSSASDPNLRKSIEGVYSKEIDEGSFRWIATAND ATTLPSLVDEEFVAAASFWRMASDLSLHSNTLSAQEPQHHMKWYLALSDTTMTVASNL CDVLNWYAEYLERGDDKRARVVRAQLDTTRSTSIPIIQFTATSQNGEETAQSQQQLEQ IRNQLPNAQDTESQTKAWVKRVLVQLGICPFTKSDVKSGQGLGDLGVPVANILYGHSS ALSGGGGLYLLMAARLTLYLIFSDTWETINQMVSAGASGKNGVSSILLSAPGFDDDFS LWAGPVFAILEAGVGAIQAEEIIGVVCFHPEYETPDGTTWAGFGHMHSLPRLRQWYNK YKPQSASSLSDNEIAAGGAWQRRSPHAVINVLRAEQLEAAEGRRSTGVLYERNIRVLM GREGGIGLETLERDLRQERCLKATN" gene <193021..>195641 /locus_tag="THAPSDRAFT_38363" mRNA join(<193021..193152,193233..194082,194200..>195641) /locus_tag="THAPSDRAFT_38363" /product="predicted protein" CDS join(<193021..193152,193233..194082,194200..>195641) /locus_tag="THAPSDRAFT_38363" /note="GO_component: GO:16459 - myosin; GO_function: GO:3774; ATP binding - motor activity [PMID 5524]" /codon_start=1 /product="predicted protein" /protein_id="EED86499.1" /db_xref="InterPro:IPR000048" /db_xref="InterPro:IPR001609" /translation="GATDLAELDHLNEAAMLYNIKDRHCRRKPYTRVGDIIVAVNPYQ WLPDLYSEEIQNEYADRIIWKQQSTTTTQQPTWEPHIYETSSNAYLNLATKRQDQTII TTGILGAGKSTSNKILLTHLSVLEMTAPSREDNDSSFNRLSFLDRSEVVTKIIESNPI LEAFGNAETTWNYNSSRFGKVQRLQFDVRKNGADEHGSVPIARMTGSCCETYLLEKTR VTEHSVGERGFHIFYQLLSSPESFRREVWKDGFVVQERMAIDSDFLYLATSSDVTVTK TIDTKNWNTTVRALELFGIVGDELRSIVRALCAVLQLGNISFATDSSPECDISSPLDL TKLANLLGVPESTLEAAFTQQTVNIKGLTATKKLRVRTAKDICEVFAREIYCRIFEYV VNRMNDATDATRNCSPEGATQLSCVNLLDFFGFENFRVNRFEQLYINYASERIKNKYI TGKFKVIEEYLSEGIHEFRNHRVAGNSDVLDLLDGRIGVIVALNEECIRHNGSNNTFV YKVKVVHESHPSLVSDRLFAKNQFAIRHYSGDVKYTADSMMERNLDCLPQSLIDIGCQ SSNHVIQSEFRRLKATMSANHETPKQKTVLSKFRSQLNNLMTYLDRTRIHYICCIKPN NTKKPQTTNQRETATQLESASLVAAALISRENLSKSLTHIDVLERFGILCHDNVQSVI KKSDNKKRECLEYILRVMLPMDSNAGQHKYPWRITKSKVFFKAGALEQMESVLDSYVK SFAVRIQAWIRRILSRQTYLLMKHNATKIQSVHRGNAARASYRMQIDKVLNIQCIFRR AIAQNSLQKL" gene <203384..>205714 /locus_tag="THAPSDRAFT_11375" mRNA join(<203384..204654,204729..205524,205574..>205714) /locus_tag="THAPSDRAFT_11375" /product="predicted protein" CDS join(203384..204654,204729..205524,205574..205714) /locus_tag="THAPSDRAFT_11375" /note="GO_component: GO:16020 - membrane; GO_function: GO:16491 - oxidoreductase activity; GO_process: GO:6118 - electron transport" /codon_start=1 /product="predicted protein" /protein_id="EED86500.1" /db_xref="InterPro:IPR002916" /translation="MHSPHYYFTPHLVPFPIFTLLILPRRRDDYNKMTMLRRLAVIIT LQCVSVIQLYLYFGGTSYGRHWFRDISSVIIGKDDNANGGGLILLMLAIPIFISGAAC TLLLINNKGNQRIRNDNTKVSAEDATSLLLADAIVVPRGFDDDEDNLVQAKNVTLLND WAAKQLSFISSSIKRYCTRGLLRPRPFALAFGILPCMIFFTCSIHRHKQAAVLSYPTL ESAVDAPNHYSYNSANSSDTTNTYIEISNWRRTLTLHIANDSAILSLVAFAHLLIPVS KHSPLVTLLKWSPSEAIVVHKYEGRLAIFGVVLHGGLHLVCGYWRWWNAFMLNDANDA RDSSWMKRSFWYGYIPPVSCWKNVLYRSSSDNEYDFEPNFGEGCIHDGSPCKCYDFFV NLTGLLGLVALLILLFSSVGYVRRHFYRVFYVLHITTAAFFLLMATLHYNRTILYLCP SLLYYASTSIPTDIQAWLYRHRGQDTRILRITKVPCPTNERPDGSVLDITFEASLKAI QQYQPGNHCTVHIPQLSLVAHPFTVNKVLGYSNRLRLLIRETGPFTTQLGKLLDGKDE DNLESQTLPQITLPTIQINGFHGTAHRMEQLQKHDAVVVVAGGIGITPYLTMLMEIAS KKSPLTKSVELHWICRERSLVRYIYDEYFSTMKESCIANSVKIIVHFTGDEAEEYSSS SRPMTFSEPERLYLTTCRRLQRFSPSSLLDSRRFGIVSLVSKVQSPWQRDHSS" gene complement(206040..207793) /locus_tag="THAPSDRAFT_25552" mRNA complement(206040..207793) /locus_tag="THAPSDRAFT_25552" /product="predicted protein" CDS complement(206075..207697) /locus_tag="THAPSDRAFT_25552" /codon_start=1 /product="predicted protein" /protein_id="EED86544.1" /translation="MPSSKPRGSPLKFSAGGSKSASPHPSSSKQFHTSDAKANPQLPS QNIKPPSSSTPQSESNALTILQTGINIKRSPLQLKNWIACVLTLDGRDKFTKVLQYAS RMLSWYFGVLGAGASVKGGTSEAILKQQLYLAISQRFQSLYKSLVDSRKAFRIGRSVI EMDKLKSMGWGEYLSYMMRHPLAGGVACGDRGMHFGNEATASKNGHVNSLHQYATHSI PEHHDEDDESGSWNEEEASDDELLGDDEEKKDDANGNVDKAIARPGRPVLPSRISSNI GWGPNTTAIASASSSTRKPSSKQLSHIPPPRTVSEMGRQMYRPFPSRSSSMGSYKQLK DSTQQLSVPIAPPTPAWKLVGGTLKLIGLMGFWAFDNVSFLTGTGFLDPIKFNANTNE VAKSSADSAYADRMRRRQYASEWATRFYFVGVMGGLYYTSRSLWQHRYGALKEARETL QGITSTQAKGDNDTEREEKARQALKKVEGKHFELFLALLKSVCDFMVFSNNPGVDFHL KLRGKKNHEGLHCLCGLTSASTVLYNNFPNAM" gene <208119..>209978 /locus_tag="THAPSDRAFT_11377" mRNA <208119..>209978 /locus_tag="THAPSDRAFT_11377" /product="predicted protein" CDS 208119..209978 /locus_tag="THAPSDRAFT_11377" /codon_start=1 /product="predicted protein" /protein_id="EED86501.1" /translation="MSIDTNDDMARASSPTKNTAGPDNNINHPTMARPPKHYYPTCHI NGGGVASLMLSCCAPGLGDFTAPAVSPLSYDHAATAAAAGSDAANKDALGDNVKTSSS NGENEGDGANNSRGNFWSDKEKKTNQLQLSNQQQMKPNNNAASDFPNLGMPQSKQQKK QNKPFASTTLLDRADSLVHQSSLLSLIATESEVGSPSSLDDDDTATVHSALPTENITY ASFINTPERTSQPITNHTPTQQQLPIPPPPSPALELSMARLEANTIANINQSGKKNRR LNCSGGSVGSASKINAAVAALARTLPTHSENIKSSTANSGIAEGLEKNYSMGSNENEK SVVSGTVSLASVALSTLNSNSNTNNISMHTAFGKSDPPPTWTSEGVNSELNSTYRYPR LYLKDPVDVMEGLAASAASNATTQPDSVEGIKEVWWFHLARKMDTTLDHVLDNHWKSI AKQQLHQYKRHSSSSDSISAASSNLEGNGLSVGQNGCNADALVDLCSSATQQPSQPQH KKSKSWSSYKSKTSNAPLTYTTESATATTPINCWSEPSASTLKVRGPTYSQDGVKVES DVALFACLGVDSFVNGSDRNENDSSKLGAGTKSFLERWERACEEVGLEKAPFL" gene complement(210592..212699) /locus_tag="THAPSDRAFT_25553" mRNA complement(join(210592..211006,211112..211556, 211641..211709,211807..212699)) /locus_tag="THAPSDRAFT_25553" /product="predicted protein" CDS complement(join(210706..211006,211112..211556, 211641..211709,211807..212506)) /locus_tag="THAPSDRAFT_25553" /note="GO_function: GO:5509 - calcium ion binding" /codon_start=1 /product="predicted protein" /protein_id="EED86545.1" /db_xref="InterPro:IPR002048" /translation="MSPSAMKPPSSRKYSSADVPPVLGVARQVMDDDTEDSSDEDNHA APAEQQSLKQRKKNRLTKVSAMYDLDGDGELDEIELAMRKYDRDGDGQLSKNEIYKIV QEQLKEKKDASQLRKVTAGLVCFVFVLALSNLGTSFASAILAKETKADSNSATMILKG TGDALGTQTSGENFDMEPLSPEENRRRRELVVKRLIDEPHGHHAHHRRMANKNKKGGS SKNTNKKNKPSKQRPGGCKDASTICDTSNADEIYFDQGVIPIAIAEKILKKCDGSRTV NLLRKYDDGSTDSHTLCMPGTSVVKKNLGNGMDSKNIKKNKREGENENQMISFVTGRE ETHFECDGKECAVYGGGLLQRAGEPCDISRGNADCEDGLVCIVPPGNVQSSWTGICSM LAANDPWYVDWENHNCVQDCDARLGGNCGGVVSDKVQNYDELYASHDMCCSLMLSWKG KECVRDAEALMDLVDDSSSPYLSKSQCWQSGSDCYNRGGQFTCCSGICYDKMCQ" gene complement(217745..220302) /locus_tag="THAPSDRAFT_25555" mRNA complement(217745..220302) /locus_tag="THAPSDRAFT_25555" /product="predicted protein" CDS complement(218469..220112) /locus_tag="THAPSDRAFT_25555" /codon_start=1 /product="predicted protein" /protein_id="EED86546.1" /translation="MMNRFCQTTTSGTFPKAVPSNETAESDSPFLYSLPTRVDETQEG NEIVSVSAGNVHSVALTNEGLIMTAGSLDDEVLGMGRAIGGNATTPFEPITEAYFLSN ITTSHDEAPTTPPKFTQVHASQYFTLALDEDGNVWATGSNTQGNLCLNDTEDRDRFYM VDPRFYNAVDEEQRKITSIALGERHTLLLRQDGKAFGCGWNMYSQLGTGVAGDNVLAP TEIVIDVPDDDFNNTEIGTNGATSTGYEVVTQVAAGRGSSYFLTQMGRIYSVGTNFNG QLCLGHREDRSLPTLLTTVADGLAFTNFSDVTIAGNTEDTSVTSIASGKSSLYILFSN GLVWACGDNAQGQLGDSGTLSDDSTDVPVQVQGVSNVIKVFSGPLSFSSFFVQRNGIV YAVGYNGSGQLAVGDELNRNTPTVVACSDEGDEIAWHGGIVVSSGNDHSLFVGAKNTF SCPDYGASLSPTFSAIPSVAPTLSSQPTQSNMPSTETASPTTSPFVNETDATLSPTPR PTDGGDRNVPNPGNAVEKYSTANAVASLVVVVTLGFILQ" gene <221345..>224868 /locus_tag="THAPSDRAFT_264653" mRNA join(<221345..221761,221843..222782,222877..223452, 223530..223606,223686..223967,224061..224752, 224792..>224868) /locus_tag="THAPSDRAFT_264653" /product="Hypothetical protein" CDS join(<221345..221761,221843..222782,222877..223452, 223530..223606,223686..223967,224061..224752, 224792..>224868) /locus_tag="THAPSDRAFT_264653" /note="Homology to ATP binding domain of cyanobacterial serine/threonine kinase; GO_component: GO:5622 - intracellular; GO_function: GO:3677; transcription factor activity - DNA binding [Evidence 5524] [PMID 3700]; GO_process: GO:160; regulation of transcription, DNA-dependent - two-component signal transduction system (phosphorelay) [PMID 6355]" /codon_start=1 /product="Hypothetical protein" /protein_id="EED86502.1" /translation="DKLNFGDYIFGRSEQLETLLRQAVMIRNEVGRQINRLLLIAGES GSGKSYLVQRTAETLAPSGWRFISAKFDRLQRQPFSVVVSSFDAFFRSTLANSHERNY INAVVASLSHHLSVSAIVDLCNLIPILRGMFPNILQREESSRRNRLHYIFRVLLTAIS SVERPLMLFLDDLQWCDQNSLDLLTTLSTQQLDGGMEDAGCVMFVGSYRSNEVDANHV LSSYLTNFDRSTAVIVTKLRVDPVSLKDTNKMLSTVLRLPLRLTRYLTIVVHSKTLGN PFHIKAFLQSLVNEKVLTYSLVERRFVWDILAVKAASIHKDIVGFLKRKLLSLPRSVQ ESLKVISCFGSQVDTDIIEKLHFSETLQSFMEDLKCAVNESILETSGRVYYFTHDSIQ QAAYELMSLEEQRECHCYIGSELLKNISASREPTFLFMFDAVDQLNISKAMGVTVPAL AQVLSSLNSRAGDRSMEVGDFVTALSYAEFGISYLDESKWTSNYELCLHLYEIGCVSC YVNGQYDKIHLFLDELLENARCLQDMLKGYYVRIQSLGVLGKVSEAIDKSFDIIRQLG QSFPPLEEITPELLHSELASISLDDFSLDEIVAAPKLPDTKKCWIMKYMTNVMTYLVF LRQQYLPLLAVRMIMFSQKYGYCNESAVGLSAYAHAQLHLLRNVDEGFRWSKNAILLL EGFKAKELLPRLMCFHHSHFTFWKEPIQSSCELFLEVHRQGLLIGDVEIASISRANFG IRCFVCGHELTIVEKEFSSHAIDMVQLKQLRHLSLHLPYQQLILKLMGIDKNPYAVSA GVFVDEDSLLQHAVSTKESGLLHTIYFTRLFLAYWFERYEEAAEMAACYRGATAMRFS DVYHALYEGLSAFCLARSSVDEPKWMVLGENAIATYRSWVKHSSWNFENKLLLLEAER FFAKGEIEAAKEKYETSIESARRHRFVHEEGLAMELLGRLYRDSGRLEEAKEQIALAH VCYDKWGAKGVLDRLGASWPEVSIPPIHRHTISLKGPDVIVTSNIFNL" gene complement(224838..225780) /locus_tag="THAPSDRAFT_25556" mRNA complement(224838..225780) /locus_tag="THAPSDRAFT_25556" /product="predicted protein" CDS complement(224928..225770) /locus_tag="THAPSDRAFT_25556" /codon_start=1 /product="predicted protein" /protein_id="EED86547.1" /translation="MNQYNDEDRQQHQLRRVSDYDTALTPDASAAAFSASASSSMSMS TSASDQDQRLQIQDLQVKAMKPSVEANAEDAPRKPPSAAGDADSESKSVAATVGSAST PKKLSLNLNEQFKSTCASLLAKYKPPLLSNNCLSLENAMSTAEKQKLVELKKIIADIK EKETQTQAKPLLWDKPSFGHGSEADPNASDDTLRKRQRVDEQPYSCLSIAERQVQEHT RTLFHNPTALSHCNHGSDGIGPFNWSQEQVDGLVAFSNTWDDYEPAHKRRRLNKKGGE PCRR" gene <226034..>226760 /locus_tag="THAPSDRAFT_11383" mRNA join(<226034..226099,226263..226285,226373..>226760) /locus_tag="THAPSDRAFT_11383" /product="predicted protein" CDS join(226034..226099,226263..226285,226373..226760) /locus_tag="THAPSDRAFT_11383" /codon_start=1 /product="predicted protein" /protein_id="EED86503.1" /translation="MSTRVFPIFRSLQRAVGGYGAFRYEPSLATSQKYHGTSFKEDEQ ESDDVECIGRVLPDGHPNPDEMNYDNEINTNPNTWPINSEEACNARREFVRDGFRRLY ATALGQNRVLEMGSAFGVGERVGLQVNVTNQNTPSDEDRPSVPLHVAEEEWMYYAQ" gene complement(226729..>229786) /locus_tag="THAPSDRAFT_25557" mRNA complement(join(226729..227038,227130..227841, 227969..228521,228654..>229786)) /locus_tag="THAPSDRAFT_25557" /product="predicted protein" CDS complement(join(226863..227038,227130..227841, 227969..228521,228654..229786)) /locus_tag="THAPSDRAFT_25557" /codon_start=1 /product="predicted protein" /protein_id="EED86548.1" /db_xref="InterPro:IPR007577" /translation="MKAAASVTLLALLLSVVSSASAANSDESRRASNANGGATSVHNA NGEQQQQPQRRRASDNSEELEDVVQEFAPPKLSFTTEDYFRDLVAADSSDNREPSWTV FQGSSSTNEQPASSSSLLTLVHNQPRAKHVVLTPDEQRAYLVEHAKVCLPGNNSEEAT TSSSLLAQYDLLSASSSTSHLAHELWKYCSLYNEGGVYVDGESVMLVGLGDVLGWTPK NNNNANYAVLASSHSSGISPSLLKGASISTPVTYASSYAYDNLAVDGAAGVPGTEDDA ATTAVNGGTTLGTNIITTPLLAISQKHNPIPLSMVEKIVNSSVRELQEDALLLPRALM GLIAADEEKAGKVAAGESSSGKWNFFRQRCQGVEVAGGDSVDSRSLRHCPASQGYCCE IIDPRRKFVFMLSRYSLLPNQMLPTYDSLNKPYSYHHHKDGDTTTVLDSATISQLPFI STIHLDPSTSPTDQIKQGNSETTPNAYQIMSSLNSLPNQEKNKQQCMDCLREKGGADC NPDSEHSCIAHCPRFCGKLCDVEVAEKPVSKVVVVKAPRFRKDPERLIPRIIHQTWFE PVTPEKYPNMSRLIESWKRSGWEYIFYDDDAASEFLSLHFPPEVREAYESIIPGAFKA DLFRYCVLLIKGGIYSDMDVILEINLDAAVPPDVGFMTPVDAPGSKPDHRMCLWNGLI ASAPAHPFLARAIEHVVNNIRNRFTVVDYDKMLCPAPELSVQHAFSTLFTAGPCILGL TLNEVMGRDLQQTFVEGDLEMNGGGNPDGVPGRTIILQQNKWDMGAHRFTWVENNLVV AATDMPDYDDRQKLEGEDEEQAAHYSKTLTRTDLYGETKVYVNRDIAHERIVIRTESM E" gene <231573..>233741 /locus_tag="THAPSDRAFT_25558" mRNA join(<231573..232860,232954..>233741) /locus_tag="THAPSDRAFT_25558" /product="predicted protein" CDS join(231573..232860,232954..233741) /locus_tag="THAPSDRAFT_25558" /note="GO_component: GO:5634 - nucleus; GO_function: GO:3677 - DNA binding" /codon_start=1 /product="predicted protein" /protein_id="EED86504.1" /db_xref="InterPro:IPR001005" /translation="MTDEHDYTPTEGSELNLKTAVNDADVLNPPPEEGQDESGANNMN TTVNEEEEEGQYVSTSVPVPPVTNTMALHSPIVIHQSLAPPGDDDESSSPLLAAAAAA PLPPSLPKNSNDSMPPPLYANNSHHQQQLGEDVSTISNSIDNNPTHYLNPDGAIHRQL LHQTTNTSTSNSNFWLQEEEERFLLGLRLYGWGQWKRIQSVVQTRTNKQIKSHAQKRE KVNPQIKVKYAKGKAKRGRIASRDAILPGRGVLIHNPEEQGDGGHSFDEMWTDVYGTN NGVGPNSRLRRCRNSVLHQQWLDTVQNNPPIVHEVGRTREQHIENDHGVDTSSSSNNS KQQPIGKQMQPIKASHIPLPPLSAASSYQIAPVVSVPPPVRHSQRQGGGSSSSHPPMQ FYNSHMGMYHAPPPGYGQPNMPYGYYPPPYHYGPPPPGHHDPYAVYPPYPNAPGSKEP LHPGMEIYARKEDGFTWTPGVIYSAKVEVNKDSEKEEESSTIIYHVQYEGGSENPNVR EEFVLSKPVYDRAVYDLERYYDLPIYNSARGANRDAPLEGGTPVFAQWMDRLNPNSHA KWLPGTIHSAQQGEGGNLYCVLFDNETEKDDVPDYAVLKRPEYSELVKTMQQHQPSWN EAIAELYKIFSCGDDDNNVGNGQGIDLLCTVSRIKRKAPVDDGEGGDPLNEKQHEAKR SKINKKGDV" gene complement(<234529..>235641) /locus_tag="THAPSDRAFT_11386" mRNA complement(join(<234529..234704,234843..>235641)) /locus_tag="THAPSDRAFT_11386" /product="predicted protein" CDS complement(join(234529..234704,234843..235641)) /locus_tag="THAPSDRAFT_11386" /codon_start=1 /product="predicted protein" /protein_id="EED86549.1" /translation="MSQRYTDHNPSNSWNDNNNRNSRHSSHDNRHSKQNGNPIGKWIQ KRRRHRNNSSSSTSTPSGPVGTEARALQNIQRNAQHFHYPSQGFVTPVEHSLLYAMIQ YPELYPESAVRKEIEEEEFDVYLEVRDDCDGGGGDGRKATTTATTTTTTTTTTDSSLQ PARESYAQLSALLALSAEKNPAKATPEQSERSSNNSNTSSVEPSSQQLSIGQILCRKI LRTITQYSDAHSLDRNHEIRELLNPVEKRCQLRNEQAALKQLLPAYAGYTLSLMTGNP LPLLIGAAALTGKDPMMEENTNVSGFRGMGGRTGNLETAGLLDECEDE" gene <236144..>238591 /locus_tag="THAPSDRAFT_11387" mRNA join(<236144..237703,237797..>238591) /locus_tag="THAPSDRAFT_11387" /product="predicted protein" CDS join(236144..237703,237797..238591) /locus_tag="THAPSDRAFT_11387" /codon_start=1 /product="predicted protein" /protein_id="EED86505.1" /db_xref="InterPro:IPR001251" /translation="MGGHFNGSHRNDADSSGAPSATNGGVGGNTSNGSSDKRRKRKNK KRVSFNPSMEIPASPSATNNELGVTSSSAAGASTSCLSRFSHGRRNGATNTGGAASTH RGKRQQQYLQSKYSKHSKQQNTKTQTFLLLIAVFITVPKLLLGGVGRYCGREGVALVV QVATHDTDGLDNNAGVGSLLQEVIEWKRQFIIHREQQQQQQQHTQTHDRKQYNQQRQQ QHRQLKEQTCQVASLALGNYAYYMPFGRFFLGLQQRKSTAGGTNNDESKQATTSELQQ MEKEATTKKKRGILQRVVDSHRSRREERKAKKMQKRQQSQQPLVVEEMERQRHPLRHI LEEALAFSHWKKMQKELHHRERIEQQQKQQLAGELQPPKEEIEVDWNKWKYAMSDDYD LPSDQSSLVKELAARVLLKAKLMNGASNNNNKGVEQVDGKGGSSDNHGGDSTTTTSTT SLSGGTNVPNKPFSERVDNVPWGGVYNVDATRWWPREDNGGKVVVATKISGLSEGGRL LAGYLKIMKWPKDLFVKFPFKLCSNGCNSEVAILHTLEWREKYKPWCMSPSAIQYNKD GFIYFRGHSKAGPKQRLEIGGGGGGSSNDDELNNAGHSMIWYRPALSSLQDPELYVRT MIHTLDSAVADSLLRNQGTIGRFNVVLDCKGVGSKSIPSMTMVKKLFGFLQDHFPDRL GVLLVANLSGMAQMIMKMVLPFVTEDVRAKIHILPGNDEDRRRMLLQFIEEENVPVYF GGNDEYVFDIDEYYSDSGGVGAGEKCVLSEDEIRGYLETMPYHA" gene complement(<238907..240562) /locus_tag="THAPSDRAFT_270005" mRNA complement(join(<238907..240098,240177..240303, 240380..240428,240516..240562)) /locus_tag="THAPSDRAFT_270005" /product="hypothetical protein" CDS complement(join(<238909..240098,240177..240195)) /locus_tag="THAPSDRAFT_270005" /note="Putative protein of unknown function containing leucine-rich repeats, with some similarities to At4g20140-68296.m02098 F1C12.60 leucine rich repeat-like protein Cf-2.2, Lycopersicon pimpinellifolium, PIR:T10515 (model%: 73, hit%: 54, score: 442, %id: 17) [Arabidopsis thaliana]; putative protein of unknown function containing leucine-rich repeats; GO_component: GO:8372 - cellular component unknown; GO_function: GO:3676 - nucleic acid binding; GO_process: GO:7169 - transmembrane receptor protein tyrosine kinase signaling pathway" /codon_start=1 /product="hypothetical protein" /protein_id="EED86550.1" /db_xref="InterPro:IPR001611" /translation="MYFNFRVPNNLTGSLPSELGHFVDTYSFLLESQEYLSGPLPETI GNMTQLYSLAILFNGPDFGGEIPESLFKINTIEGIRIEDNLGEWSLPSDIVVDEVSVL THLHLRKIRLTGTAPSWLSQLTNLTILDLSLNGDLYGPIPESLGEFPSIKYLNLYGNQ LNGTIPASLGNLTQVTTLVFGNNKLQGTIPDELGRLSNLLLLDLSYNHMSGSIPSTFS DLANLEYLVLNGNNFNGTIDVLEPLTNLTSLLIRKNSFSGTIPIDIFSELEGEIALDL GFNQFVGEIPTNFGSLTNLSELNVVSGVISFSFNDIGLTLSLPIAAYFVATENSFDED STDAIVCNNTKFIILDCSTCTCCDVCCEGEGDSNVCDIHMTAYGHLGLDCADWWMFCS DVTYDPLPVSS" gene <242234..244036 /locus_tag="THAPSDRAFT_25560" mRNA <242234..244036 /locus_tag="THAPSDRAFT_25560" /product="predicted protein" CDS 242234..244006 /locus_tag="THAPSDRAFT_25560" /note="GO_function: GO:3723; pseudouridine synthase activity - RNA binding [PMID 9982]; GO_process: GO:6396 - RNA processing" /codon_start=1 /product="predicted protein" /protein_id="EED86506.1" /db_xref="InterPro:IPR006145" /translation="MASSFLRSPAATKEVCTRYARNHLAGGLSTRHKGSACDAFVGIY HTSKKTISIRPKYGFCVGVRRASRSPLFSSIANNVDSSKETSECDNVSALLQDAFSTG ETDAVQDSLVSYSILQSLLDDNQSAAEDVADILIKSAIEAATSDCIGTGRNVGGLNRV NLAAILNAILASCCERTDDLGGTVALALLEQMDEMHSDDETTMVAPDLVSLSLVYHAL EQTSSHPEAQQAILERAQRLAKKGAGSQRRKALAAERRRKPPGSDEGDIRLVEQRLQS LYGSDIHILYQDDDLLIVSKPPGMVCYHNKKTSAGKITTSRKKKSRAANNGKDDGSND GGDGKLMDISLEDALIDMAFPLSTVNPSARGIVHRLDRGTSGAIVLAKNDETHLRLVA SFFLRKATKVYSALVPGSCLQNIDGDEDANVSAMLTIGSEGEIDLPVDRRPAKSKYRV VKMYGHQEEQTQQPTPEAILLEVQTLTGRKHQVRVHCASGLGRPIFLDPLYSTKPAAA ASAPKQQSRDKKKAQKGGKETEIVDDEALPTAIQNVLKVGRYHPEQFFLHAKSLSILG VTVDAPFPTWWSDTFDEWELKKEA" gene <245382..>247421 /locus_tag="THAPSDRAFT_11390" mRNA <245382..>247421 /locus_tag="THAPSDRAFT_11390" /product="predicted protein" CDS 245382..247421 /locus_tag="THAPSDRAFT_11390" /codon_start=1 /product="predicted protein" /protein_id="EED86507.1" /translation="MTRANQHPLLSLTSSQLLGTRSPAGCISANGASQIDAHGRCRLH PQIVLRKKSALRGWKIVNSSCSECDLERSSSAGSGGRLSKRRSAGGSSSTARKSNGAT YGKQQRRRSGSSKKRVYDSPNDTASLSDSPGGESHHRRQQRGERRPNGSNEGRRSKHF DQHGLSTRTAETLKLIEMPFQSGNDVAKNKSSTTTSKTTKETAGKQKSEPSRRRSLDS SLPGRAGTATFHNYRSSGTEKEVPMHHQRRAAVPDPPEKCTSKSYLRQSSAPLLPTQS ITMNESSTSTLLSRDPDGVILKEDYSDSSRLLNGLKRSGGGQRQRRRHSGGSSKQQQS VPTATMLPTMHEDDDVVVSTESDVQDAVLHQSSSYHHQAAITTSYQQSSASSITPSQQ GQNTDPCIYFRPRRQHHHHRRDESSYISTCSSFTESLTCAYVDDTDDEESVDETVNYH SANVVNGSQLDEGSSKQSQEVKQIIICGMPYSLVLPNRTDTAVTASSFQYYGKYTGQL NAKTKLPHGLGSLRLTTDEIKEGVWHEGLLLDEFGGGGEDDDEAANEDTFNHVHSSPI SQKESSVVHVSSDSNNADMALEFCCLPCNDEGGEFLTTLPRLCVSQDEQQEEEEDDNR TPDGSCTDAISIYEGNVAEDEDSGNENSSTFSLPCAMAARSMGGGTDSVRSFITM" gene complement(<248089..>250047) /locus_tag="THAPSDRAFT_11391" mRNA complement(join(<248089..248169,248335..>250047)) /locus_tag="THAPSDRAFT_11391" /product="predicted protein" CDS complement(join(248089..248169,248335..250047)) /locus_tag="THAPSDRAFT_11391" /codon_start=1 /product="predicted protein" /protein_id="EED86551.1" /translation="MSEDKENIDPNRRKAVASSAISPAKKRLKKTAANSKVINVNVDS TTIHRMKTLLYDVAASDRMTKVFTEIINSNSNNEIVRIMRSTMKQQISIRHDESTNEK DRPQSMNDMEMQFRDEMYKAIVAEGNKCDVDFSNRINFGDCIDASMDSLCTLNYATLK EIKRPKEGRGKYNLINGHSSGCPRMMEVSVGKRFQSLKFEEAKKEGEKARSDFRLENL RFNDGIVIAFDKPKKYGTTLQTIQRNLGLKSVDYLHAISVLLRIFHALLHWEVNKKPL NSLCCSAEIFLAQFGEGKENGFDGKLISLKGQCRHPEWMMRGFAAGCTNAELEPFDRA LCTFLANSKEMREELKSRGVDLDEFNEPVSINLSFSDLDGLSEEERACYLLNRQEGSR LGGQRCGLLHHITSKVASSVLESASSFEDLRLSDDLFDDAVMLCDVVHEGSVKTTLSA LQSLKKGHGLLSLMKTIAKNNTLTDITNATSLDELKMSEEDRALLHQHGETKSIASLF AIKRGHKDRMASEGVTVDGKSKKSLTMIRSRETKRQEELETLGSKRLFCGYCGETKFF KEGSRSSLTKEEVLNRERSASTKQESQNERR" gene complement(<250200..>252162) /locus_tag="THAPSDRAFT_11392" mRNA complement(join(<250200..250953,250997..251795, 252033..>252162)) /locus_tag="THAPSDRAFT_11392" /product="predicted protein" CDS complement(join(250200..250953,250997..251795, 252033..252162)) /locus_tag="THAPSDRAFT_11392" /codon_start=1 /product="predicted protein" /protein_id="EED86552.1" /translation="MSMTLPYHKPTMMVSSYTQIHRRSFPSTFRLRVSFKALSESCTV LPKIPSQSNFILGNEQRQQQMMFMSNANATTPASQTHTTSSGASSSVTSPPVTSVPMM GGGGGGGLVSSNFLCVLQNAVARSNHPPQMQQQQYYHSSLGIPQQQQAVVPQMALSNE ASSSLDGQQNQNIQLDLIKTVLLSRQSNAANNAAVNQPTTAKGHQGLPAPTSNNYRID NNNNNNNNNNNNNFGTSEPVLLEQIKRKQAQAQDSGAAATACGISYQQQHYGMSSSFS SAGRNSGSLHHQHDIVSHLLESVRRQNHSRKPRRGTTNQHQGGSSHQQTNVASLLSML QQRQPTQYQQLQQSGGIDNGNPLVAPLRRLQDERQKGTLMCLINNQVMQSASPQTKAT STPTSNFVPLSAVETESAGSWYNVNRTTAADFALSQAERAITSGTSSSPDKLDATAHH AEKKRSPPKKRQPSFPSVSFFRTSRVFSLDEFPLSSIVEDAKSACSLAEELQPSHDNL VNILSRSFQSDDFNVAALGKEILGSVIERQKEKICWFAIGCCFGLLLPYLTK" gene <254023..>257098 /locus_tag="THAPSDRAFT_264655" mRNA join(<254023..254316,254428..254587,254639..256353, 256442..>257098) /locus_tag="THAPSDRAFT_264655" /product="hypothetical protein" CDS join(<254023..254316,254428..254587,254639..256353, 256442..>257098) /locus_tag="THAPSDRAFT_264655" /note="Homology to ATP binding domain of cyanobacterial serine/threonine kinase; hypothecial protein; GO_function: GO:5524 - ATP binding" /codon_start=1 /product="hypothetical protein" /protein_id="EED86508.1" /translation="LVLIGGSSGSGKSYLVDIVSGDLLDSGWDYIPCKFHQNLPVMSN IASAIEVFLSSMFEDEYEQEYSNAIASALELNLSSASIVSLCELIPSLRALFPHDGHF LEIEVEHSRNRMNFLLRSLVNAISNSAHRALILFFDDLQWADQSSLDSIIIVTPNVLF VGAYRKDEVDDSHMLAPYFTKWIHSATVAVTALNLDAMNKADANVMVSEVLRLPERMT RSLTDVVQSKTLGNPYHVKEFLKSIVDESILNYSLVEKRWVWDIEAVKATSIDENVLE LLQRKLLRMSGDVQVALNALSCFGSQIDMSIISLLYTEQQRLNFIDNLEKAVQEQILE RRGSAYSFVHDLLRQAAYDLMTSEEKGRSHYVLGIGLITNLPEYDEPQYKSIAFIGID QINKAKALRCVTDDLHSVKFASLNLLAGERTIGQCDYVSALSYFEYGISFLDGNGWQS SYQLCLRLHESACLACFASAQSNKMMSYVDELLNHATCLEDRFCAYQVMVQSLGALGN IEMAIDKAFSVLEELGETLPDATPTAISSELTTTKKLLESYTKEDILALPRTPVMLKC KVMKMINAISSFLFVASPRHLPLLSCRMVKLSLKHGLCPDSAIGFALFAHSLGSVLRD IERGYHWGKVAVAVMESFTSKGLQLQAKVKCIFHSFLSFWVEPVQSTAKSLHVAHQEG LQIGDLEFAMWSACHYCRQSLMCGDNLQIKERECEAVAMQMAQFKQQLVSGSFTSHYQ VVVKLSGCTQNPFEVAFGGAIVSEDELIQQVKASGKLGTLQTIHFDRMFVAYWLKEYE KAAELAEKYRGRMVMRFSDVYHVFYEGLLAFRFLRCSSGETRRKWMDLAQESLTAYRT WVKHSAWNFENKLLLLEAEHLFTRGETEAAKELYEESIVSARKHRFVHEEGLAMELFG EFQSATGNAEEAKEQKILARVCYEKWGA" gene complement(<257280..>258551) /locus_tag="THAPSDRAFT_11394" mRNA complement(<257280..>258551) /locus_tag="THAPSDRAFT_11394" /product="predicted protein" CDS complement(257280..258551) /locus_tag="THAPSDRAFT_11394" /codon_start=1 /product="predicted protein" /protein_id="EED86553.1" /translation="MTKPSLLVVLSLLSAQSALAVVCEGNASNSCNGLQNTCTSEGLC TVDSTTGSCCSLPATSLIALRSAATCNSNADASICSGEVGITSLTTSTVGATLAPEEL TYAPTPSPSITSKPTETCYNIEIGIILDTYPEETSWEITEGRKSTLQDPSATVVSASP FYDPNRYREASDAHIVCLPAGKYTFTIFDSDKEDGMCCGYGEGKYVVTYQSTGEMITQ GAEFGSSETTKFAIPFEAPTLRDANGDGVEDRTKNIIPTIPLTSDGMPSCSNEFGLHL QTDDYGVETTWELRERDEMSTNYTDGKVIASGGPYTSEFEYDISYCLNPGKYYFIFYD WQCDGLVGIKSNGYYTVKVNGMDVWTGGTDMNGYEEVVDVEFLNPLVGAESEESSKAY VASYSAGERLVGNERWIQVLVATAAAVMAWN" gene <261259..>262080 /locus_tag="THAPSDRAFT_264656" mRNA join(<261259..261830,261943..>262080) /locus_tag="THAPSDRAFT_264656" /product="hypothetical protein" CDS join(<261259..261830,261943..>262080) /locus_tag="THAPSDRAFT_264656" /note="Putative protein containing a trypsin-like serine protease domain; putative protein containing a trypsin-like serine protease domain; GO_function: GO:4295 - trypsin activity; GO_process: GO:6508 - proteolysis and peptidolysis" /codon_start=1 /product="hypothetical protein" /protein_id="EED86509.1" /db_xref="InterPro:IPR001254" /translation="RIIGGRVTNERRYPYAVALTKGGDRFFCGGSLIARDVVLTARHC LGGSYNVAIGRHNLTSSNTAGDEVPILEEIGHPLWDKYTDVYDFALVILSRPTTIPGI PLVKINSDSSLPQVGSSARTMGWGDTAQDDDLRRISDVLMAVDVEVISNNECRNAKGT EGNLYNSYKDYIYPSMLCTHTPGKDACQGDSGGPLVIPGTKATEDVQIGVVSWGIGCA TQLFPGVYSRVSTVYDWI" gene <265021..>265363 /locus_tag="THAPSDRAFT_38361" mRNA join(<265021..265191,265247..>265363) /locus_tag="THAPSDRAFT_38361" /product="predicted protein" CDS join(265021..265191,265247..>265363) /locus_tag="THAPSDRAFT_38361" /note="GO_function: GO:3677 - DNA binding; GO_process: GO:6355 - regulation of transcription, DNA-dependent" /codon_start=1 /product="predicted protein" /protein_id="EED86510.1" /db_xref="InterPro:IPR000910" /translation="MEEDSELPGKKKRRVKVPKDPHAPKRNIGAYSHYMKHNRKLIQE ANPKTDSKDIVSATKLVASHFKDLDEEEAAKYTKMAEVDKARYVKEMKSYRE" gene complement(<265838..>267283) /locus_tag="THAPSDRAFT_11397" mRNA complement(join(<265838..266231,266316..266759, 266901..>267283)) /locus_tag="THAPSDRAFT_11397" /product="predicted protein" CDS complement(join(265838..266231,266316..266759, 266901..267283)) /locus_tag="THAPSDRAFT_11397" /codon_start=1 /product="predicted protein" /protein_id="EED86554.1" /translation="MLSLRSIQQCLLGTILCLSLLMLQTSYNLSLSSFNYDDGLADND RPVAWSAKNVDAVRGLTSGVGLVTTDGMWSVTNSLSNGKSFGASIPVIRIIAFTDKHY LNIAKVWYERLEKLGYTEHYIVCVDETILKPILNPDYSHPILGFGRQLFSLRIYYTRE LLLNGVHVLVTDLVSLLLIVICVHKDMFKWFLVQGTRFFSNTLVFFVVLHRQDNIFNR YVPLQGFLEEGYDVFHAYEMRYPENLWREYGFVVCAGHQFLRSSEASIAYLDMVLGNC YEMNNCDDQVRYNIALHYMLKMNWTPNPNRANALRISSNNTENDGLLVESFTGVSDEI NLKVKVWDRDFAWRLAGNIPDKCPSANNWVAMPTKADHLVDAKGSNKVWRKIHLFSVW DDLCHGIKTNLSKS" gene 268565..270386 /locus_tag="THAPSDRAFT_42867" mRNA join(268565..268605,269072..269736,269870..270386) /locus_tag="THAPSDRAFT_42867" /product="cytochrome c1" CDS join(268582..268605,269072..269736,269870..270014) /locus_tag="THAPSDRAFT_42867" /EC_number="1.10.2.2" /note="putative cytrochrome c1 based on sequence similarity, EST support, and presence of defining domain; cyt c1 heme protein; GO_component: GO:16021; mitochondrion - integral to membrane [Evidence 5746] [PMID 5739]; GO_function: GO:5489 - electron transporter activity; GO_process: GO:6118; oxidative phosphorylation - electron transport [PMID 6119]" /codon_start=1 /product="cytochrome c1" /protein_id="EED86511.1" /db_xref="InterPro:IPR002326" /translation="MAQQLKSVFTRYNKGLALAATSTAALALSTLYPATTKCDDEILH APSYPWDHLGLVSSYDCAALRRGFQVYRQVCATCHSIERIHYRELVGVTHTTEELVEM ASEVDVVDGPNDEGEMFERPGKLTDALPSPYQNEEQGRLANGGALPPDLSLMVKARHA GQDYLFSLLTGYCDPPEGKAMMPGLYYNPYFPGGAIAMPPPLNDDGVEYEDGTPATIS QQARDVVQFLNWCAEPEADSRKKDAFTYMIVLGFTAVMTGYYKRFRWSTFKTRQLTYV K" gene <272008..>272448 /locus_tag="THAPSDRAFT_11399" mRNA <272008..>272448 /locus_tag="THAPSDRAFT_11399" /product="predicted protein" CDS 272008..272448 /locus_tag="THAPSDRAFT_11399" /codon_start=1 /product="predicted protein" /protein_id="EED86512.1" /translation="MADESELQSPLLCAGADINVGEANNRVTLHGESMDLRHIFNTST VANSTAASSHYQEHHANESTNQQQQPSPTSTTLSLDNQSSTFTNDYHASNNSLSLKGA IRSLSRKGMKCYFDARFKVARLVSQGGDGGVNRSSSGGAAVSAV" gene 273091..275236 /locus_tag="THAPSDRAFT_25565" mRNA join(273091..273614,273951..275236) /locus_tag="THAPSDRAFT_25565" /product="predicted protein" CDS 273955..275211 /locus_tag="THAPSDRAFT_25565" /codon_start=1 /product="predicted protein" /protein_id="EED86513.1" /translation="MGMIGTYYFCSAVILSILLSLVVDMFDEWDRQRVEEDLLDRRRE LEFVLRGEEEEGCLELTEEVEEPSPLNRTLVLDHSSLHSRSHYSHHRHHQPHLSHHSF TSQSQMTNTISSNYNPTQQPLPAPTTNGRLLKQALMIVFSAASLPLVCFAVTLPTMQR LVYGGAPSLLHEVLGMVWQKEYSLISLVKTTGDAGGWDTLLMVTFGLFVVVGPILRSV CLIMHVLLGLPVALLGDCIERPRHRTTFRMILYQVTSTFQRGLRPIIDALGAFCSWEV LIVAFIMIQLEMPSITDTIYQDDRCQEADPEYGRTCIEVQFNAMDNFLLVGVAWFALV AASSLLLDLAADGEERSLIETKKKYEYGQPIPQRRSNLNSDQSWKGALGGRKSIGDNI NMDRGSYYSSVQVEEERDCLEEIVFV" gene <275724..>277808 /locus_tag="THAPSDRAFT_11401" mRNA join(<275724..276024,276111..276382,276468..276831, 276908..277122,277197..>277808) /locus_tag="THAPSDRAFT_11401" /product="predicted protein" CDS join(275724..276024,276111..276382,276468..276831, 276908..277122,277197..277808) /locus_tag="THAPSDRAFT_11401" /codon_start=1 /product="predicted protein" /protein_id="EED86514.1" /translation="MMLLLVAITSNNIKQALSFVVRTQMTTTASTTTRLQSSVLDPNA APVETKGERQLPKIIQGGMGVRISSWKLAREVSKRGELGVISGTAMDVVFVRTLQDGD PEGHFRRALATFPNQQMVERALDKYFIPEGKAANKPYRSLPLWTINPPRHLEEAAILG NYCEVWLAKHNDDGSPTGGVIGINRLTKVALPTIHSLYGAMLGGVDYVVMGAGIPAKI PGVLDALAEGKDCSLPIDVSGASSEEDEAYAMDFSPRNFWKGSGTSDVARTPLERPLF LPIVSSTTLAQSLLKRSNGSGPTRGIDGFVIELHTAGGHNAPPRGFHFEPEQAKGLNE LGEPMYGEKDMVDIEAFGKVAKGLPFWFAGSYGKKEKLCEVLEGGGNGIQVGTLFALA DESGMDSKVKQEILAEIATGHELKVFTDPAASPTGFPFKVLDVEGIPTLADKEVYDAR PRVCNLGYLRQPYLQDNGEVGYRCASEPVNDFIAKGGDAKSTVGRKCLCNALCADAGF PQVRLVTNKETGEKDVFTEPSLITTGDDVNLCKELIRQEEDGTWHYTAGDVVDYLLSE WELKSSIAKQIEAESVEAVHI" gene complement(277921..278904) /locus_tag="THAPSDRAFT_270007" mRNA complement(277921..278904) /locus_tag="THAPSDRAFT_270007" /product="hypothetical protein" CDS complement(278036..278743) /locus_tag="THAPSDRAFT_270007" /note="Hypothetical protein with similarity to yeast metacaspase; Hypothetical 46.6 kDa protein (Metacaspase) (model%: 54, hit%: 37, score: 374, %id: 44) [Schizosaccharomyces pombe]; EST support; with sequence similarity to yeast metacaspase; hypothetical protein; GO_function: GO:30693 - caspase activity; GO_process: GO:6508 - proteolysis and peptidolysis" /codon_start=1 /product="hypothetical protein" /protein_id="EED86555.1" /db_xref="InterPro:IPR001309" /translation="MATLIDDGRNEEPTYANIMLAFERVARDSQPGDTVWVHYSGHGG RLRDDSNDESDGYDETLIPADFKRRGQIRDDDVLKYLVKPMKQGVTVTVVCDSCHSGT VLDLPYQFIADGKHDEMERNAMFLLGKRRKSWKKVTRASGPSASVTSQAILNDLETPD ARTYSVNDPYPMTRLPSVPEVDLSPPTNDTPHKSYSMLVSPSPTNMLKRFASVEVVHN EKGNVVEVDEDEEQWEV" gene <286058..>287047 /locus_tag="THAPSDRAFT_264659" /pseudo CDS join(<286058..286150,286478..286647,286995..>287047) /locus_tag="THAPSDRAFT_264659" /note="hypothetical protein" /pseudo /codon_start=1 gene <288250..>289137 /locus_tag="THAPSDRAFT_11403" mRNA join(<288250..288379,288463..288550,288643..288674, 288754..288835,288933..>289137) /locus_tag="THAPSDRAFT_11403" /product="predicted protein" CDS join(288250..288379,288463..288550,288643..288674, 288754..288835,288933..289137) /locus_tag="THAPSDRAFT_11403" /codon_start=1 /product="predicted protein" /protein_id="EED86515.1" /translation="MTTTPEPKFWPDWLGDDTCVFNEEFPQYMQLNPSWTGSTLEDCC RRYYSWRYDDCMVEGGGTSNTATLYYPNWEGSDHVCVNDGEAPAYITQAASAFMFEDL KDCCETYYWWNMAKCLGSEANAGSNKYYADYSQSKCVKDCTDSDCGGLVGGVWDELYD DKAVCCDEKFWWVEDCDA" gene <290189..>291104 /locus_tag="THAPSDRAFT_11404" mRNA <290189..>291104 /locus_tag="THAPSDRAFT_11404" /product="predicted protein" CDS 290189..>291104 /locus_tag="THAPSDRAFT_11404" /codon_start=1 /product="predicted protein" /protein_id="EED86516.1" /translation="MPSLDSLSDNSTSSPPIVAVAIAAAASSAADSTTDGTSATRLVT AATIVNAAVNNSANATTVAAIPQPSINLNSASNDDGEEYTEQELANIARGSKRAEAER AKKKRQEDKLQQQLQQLEEIKNAPFEAIQVTADGTDVHKIGTVLWSNVELKFRQEYMK HKNIKFGRSRSKDLLGSVIVQYLKAQPYKNAISNSRRTSSISSNTQRGNGTLVRKGAK PNFLVVDGDGTMFRAANVLLHHKECYVATKNALGRSELDSGLGHRVEWVTMTNTYNKT YDPDNVDTTIDCVQTYNDLEVFGIDPMTA" CONTIG join(AAFD02000034.1:1..2886,gap(2023),AAFD02000035.1:1..286285) //