LOCUS CM000654 454954 bp DNA linear CON 14-MAY-2014 DEFINITION Thalassiosira pseudonana CCMP1335 chromosome 23, whole genome shotgun sequence. ACCESSION CM000654 AAFD02000000 VERSION CM000654.1 DBLINK BioProject: PRJNA191 BioSample: SAMN02744045 KEYWORDS WGS. SOURCE Thalassiosira pseudonana CCMP1335 ORGANISM Thalassiosira pseudonana CCMP1335 Eukaryota; Sar; Stramenopiles; Ochrophyta; Bacillariophyta; Coscinodiscophyceae; Thalassiosirophycidae; Thalassiosirales; Thalassiosiraceae; Thalassiosira. REFERENCE 1 (bases 1 to 454954) AUTHORS Armbrust,E.V., Berges,J.A., Bowler,C., Green,B.R., Martinez,D., Putnam,N.H., Zhou,S., Allen,A.E., Apt,K.E., Bechner,M., Brzezinski,M.A., Chaal,B.K., Chiovitti,A., Davis,A.K., Demarest,M.S., Detter,J.C., Glavina,T., Goodstein,D., Hadi,M.Z., Hellsten,U., Hildebrand,M., Jenkins,B.D., Jurka,J., Kapitonov,V.V., Kroger,N., Lau,W.W., Lane,T.W., Larimer,F.W., Lippmeier,J.C., Lucas,S., Medina,M., Montsant,A., Obornik,M., Parker,M.S., Palenik,B., Pazour,G.J., Richardson,P.M., Rynearson,T.A., Saito,M.A., Schwartz,D.C., Thamatrakoln,K., Valentin,K., Vardi,A., Wilkerson,F.P. and Rokhsar,D.S. TITLE The genome of the diatom Thalassiosira pseudonana: ecology, evolution, and metabolism JOURNAL Science 306 (5693), 79-86 (2004) PUBMED 15459382 REFERENCE 2 (bases 1 to 454954) AUTHORS Bowler,C., Allen,A.E., Badger,J.H., Grimwood,J., Jabbari,K., Kuo,A., Maheswari,U., Martens,C., Maumus,F., Otillar,R.P., Rayko,E., Salamov,A., Vandepoele,K., Beszteri,B., Gruber,A., Heijde,M., Katinka,M., Mock,T., Valentin,K., Verret,F., Berges,J.A., Brownlee,C., Cadoret,J.P., Chiovitti,A., Choi,C.J., Coesel,S., De Martino,A., Detter,J.C., Durkin,C., Falciatore,A., Fournet,J., Haruta,M., Huysman,M.J., Jenkins,B.D., Jiroutova,K., Jorgensen,R.E., Joubert,Y., Kaplan,A., Kroger,N., Kroth,P.G., La Roche,J., Lindquist,E., Lommer,M., Martin-Jezequel,V., Lopez,P.J., Lucas,S., Mangogna,M., McGinnis,K., Medlin,L.K., Montsant,A., Oudot-Le Secq,M.P., Napoli,C., Obornik,M., Parker,M.S., Petit,J.L., Porcel,B.M., Poulsen,N., Robison,M., Rychlewski,L., Rynearson,T.A., Schmutz,J., Shapiro,H., Siaut,M., Stanley,M., Sussman,M.R., Taylor,A.R., Vardi,A., von Dassow,P., Vyverman,W., Willis,A., Wyrwicz,L.S., Rokhsar,D.S., Weissenbach,J., Armbrust,E.V., Green,B.R., Van de Peer,Y. and Grigoriev,I.V. TITLE The Phaeodactylum genome reveals the evolutionary history of diatom genomes JOURNAL Nature 456 (7219), 239-244 (2008) PUBMED 18923393 REFERENCE 3 (bases 1 to 454954) AUTHORS Grigoriev,I., Grimwood,J., Kuo,A., Otillar,R.P., Salamov,A., Detter,J.C., Schmutz,J., Lindquist,E., Shapiro,H., Lucas,S., Glavina del Rio,T., Bruce,D., Pitluck,S., Rokhsar,D. and Armbrust,V. CONSRTM Diatom Consortium TITLE Direct Submission JOURNAL Submitted (18-SEP-2008) US DOE Joint Genome Institute, 2800 Mitchell Drive B100, Walnut Creek, CA 94598-1698, USA COMMENT URL -- http://www.jgi.doe.gov/thalassiosira JGI Project ID: 2662235 Contacts: E. Virginia Armbrust (armbrust@u.washington.edu) Igor Grigoriev (ivgrigoriev@lbl.gov) The clone of P. pseudonana that was sequenced is CCMP1335 and is available from the Center for Culture of Marine Phytoplankton (http://ccmp.bigelow.org). This clone was collected in 1958 from Moriches Bay (Long Island, New York) and has been maintained continuously in culture. Annotation was done by the JGI Annotation team and the Diatom Consortium. Chromosomes 7 and 18 are complete and present in GenBank records CP001160 and CP001159, respectively. The JGI and collaborators endorse the principles for the distribution and use of large scale sequencing data adopted by the larger genome sequencing community and urge users of this data to follow them. It is our intention to publish the work of this project in a timely fashion and we welcome collaborative interaction on the project and analysis. (http://www.genome.gov/page.cfm?pageID=10506376) Annotated scaffolds were added in January 2009. FEATURES Location/Qualifiers source 1..454954 /organism="Thalassiosira pseudonana CCMP1335" /mol_type="genomic DNA" /strain="CCMP1335" /db_xref="taxon:296543" /chromosome="23" gene complement(1383..>4297) /locus_tag="THAPSDRAFT_25810" mRNA complement(1383..>4297) /locus_tag="THAPSDRAFT_25810" /product="predicted protein" CDS complement(1400..4297) /locus_tag="THAPSDRAFT_25810" /codon_start=1 /product="predicted protein" /protein_id="EED87505.1" /translation="MSEAVHTLLSQAIQSARYNEAVVVTFGDDGQNLVVDGRASDDHG SEVDANNGSNSGRGGTREIVLVACLDAENGGIVWRDAAAAVSSSATTGDGDNLLQDDI GILRHLRTKRWALPMLNDHRRNEMYSIAVKEACDVVVERRLKELQLREDDGSATHGEG EVDDGDEGRNNVGDETIRILDIGSGTGLLAMMGARYTQNAIKESSNKSTDATILEDIV NTSDASQLNVRVTSVEMASAMARLARMTVHENGFTSDNVVVVEHHSSDVEFGIDDRRF QSSSSAGGGDGGAMDASVCTNVKQQQKADICTSELLESGLLGEGVLPAMRDAWKRHLK DDAVVVPTRARVKAVLVEGMTLGEEDGGEKTTASLNSATSFFGPELHSFEEASGGVWF NTRAPTSTSNEGNCLRGISVPLHANAMINEDYGGTSLASLTQEYDGYSLHPSANRTLH SNEYRGIRVLTEPVVVLDFDFASGVDAIPPPEGRTVTTQVVPNTDGTVHGVLFWWELD MGAEDSGTYSTEPIGYNNSNGSDNWQDHWQQVLYLFGDDHFRREDEMTRQVVNGRPVD VVASHDDGSISFDIKNAYTTTDASRPSQRRKLNEEDKDVDVQPRQMTNANITPTRALQ LNDSQRTRVLHAAIFHALETKGKDAPLLDLSDMGLCAMMAAVQGATRVTSLESSSGTM PTLAATIAQIGNKLPKQGCEFQIIQALAEHITSEYIAGGVADIVVAEPYYEMLEGWHL QEALNYFYLVRSLKMRNVISPKAVSVPAEACIMGCVVQFDEFYNAYGKVGDKSGGSDE MVQGFSHRTLNHYGDRYHTYDVSLPLWQYQWKRLSKTFCVSTLSYEGETPTIRGDKQW VVADFERTGTCQAVVFFVDYLCRVHNGTHSNQSNGDRFGIISTSSSSHRQTVRKLASP IEVTEADLIGGIKFHCRASFESIDPNYNDNLDDHMFSFKVIKRGDDDLCLR" gene <4690..>15006 /locus_tag="THAPSDRAFT_11974" mRNA join(<4690..14763,14848..>15006) /locus_tag="THAPSDRAFT_11974" /product="predicted protein" CDS join(4690..14763,14848..15006) /locus_tag="THAPSDRAFT_11974" /note="GO_function: GO:5515 - protein binding" /codon_start=1 /product="predicted protein" /protein_id="EED87420.1" /db_xref="InterPro:IPR001478" /db_xref="InterPro:IPR001680" /translation="MAGLPPPPPPHPSTIGIPSTPARNTVTPANNNNNTPVSSTPQTP SSTGMSNSFISPGLTSSLHGSQTFGNIRGGFGSAFGGVYNNNNNNNNDRACDTPGEAS TLSSGMSATGRSVYGSGGGGGDRRSKLVRERSRSSAAFASSTGRRLDNKNRSLRGSLI LQGKTTSSDAAAGNDGGGVVSSGGMRGAGSGGVVVPRAFKTKEDDNESFIEEEQSPAL FTSFCYKGVEVIFTAVQNNNGDGCELLGIHARTRVPFQTMIVNPFVDSSGDDSNNSGK KNKAINPQIVALTSHPTTGQIYAATSYGTIHSFSPIKADPSKHAYGKWRWVEGCVAFV REVFGYDALGVSLSASELGGDDDDEEKEGVVCFRRRRRQQNVASSAPSSAASPLANYP LKDGQPPLPSTNNNRGGREEYGVMICASLTEKRVLIVHKDQLSIYDFSTSASSDKGAI SGDVRMPPPTVWSAFPKTKNRKARVREALLLWTHKMQGAIINHCSFSGDGCAIALVLR GEGVGVPYPFGVRTFVRDREDGTHSVAASGKAAGSATGVKTTMGNSSGGSEVVKPGGK PPRHRRTGSELKAMPVENPPLPKGGNNFDSVLENVFGGDNTTTQAAGEPDSPSPRKRA AKGILYKRGPFLVHSAPVTRVSFRGFGTTTSYSAHNSGWNETEEGNDLLLTSCSTDSS VRIFSQNSWRQLMHWASPPKSRADWVRGITAANLGDLDTSPMYTTSSKKKKVEGQMSP IPKDGADPLYYNQQQHHGQSPSMPPSSDNSVSSDASGGINRALLASHHHHVPTPSSFP SHSVPGTHAGAWIAELTFRNAFPALRLSRLSYMKTGGDDALPAHFESVAAILPPGSIT EEVVLDDSEEGERRLEVEGIWPAWDPWEPDTTGSASSGSHGNDGDTSKRLLGPSAAST ASVGNAQPPPMTGPRWLGDGAELGGSHIPPSELRITSSHLNGDCLVQIEMPLWGDKDF GAMEFGAPMRYVMSIPGSSSSPQLAELPCANLEFESGSRLCARTSLDRRSVDLCWRMH GAVNLEEDFATTAGEGPKRFKDLSLTPLPLCLPSLVLPGHHYKSADNLQSLLTFDKHS VSSLHWWADENFGGPPRLVTLTQGGTLIVYEMPPPWSALEPPMPSYDPFDSSESRGSS IASDFGAEKEGGLMLDDGFDEEAGGEHRWDYEVTITPHPDFGIGLRLEAQAQGMPAIA GSFKKHPLSGGRLPAERSGAIALGDELISVNDVNLEGFRFEDAIATVRQIGFDSYGAP LQMRFRRCKGKKRGSAPSSIGSKGGRLSSSDKGTPESKGTNIIMGGAVGGSLATVEVG ADAENQQEFGRIIAIVRDAVGSSHDESMMTSSPAMLLLPWNFGKGAIVSHKMYGGALV LWAVPDQRLIKAARLEVVLDIDPENARFYEIGSIAFDEQAGDSGSTIKSITFISSTEN GWLVAVNDSSGNVSLLFVETFSTATTGSRGEITASFRHYPSIFNVHGNDAVRNKGTPK GSFVLRSFSVELFGCMSTRVEGYKEVTIWSALPQTTHSEKGSTDLAYSSSIVAIRDIV GLSDEYILDFRWVSSGFVDAFPWLVVFARSAAVVYRRPCSLLSWQPCAVLAYPSLSSA EASSPHDVYPHLISALRCAVLTNDEQSQMRSDWHPESILASICTEEEGTKTALKSYVK GVYSWLSQWMNADESMRPSWEGQGPLSGAPFRIVNDKTVSDDDDKEDDAVETSTNLMS ALSVNPVSARPQSEEEVLLSKLQKALCPIDGTSQSENASSLLHNTPRSREFLMTMSVD KQTQKKEVESKQPLPAPLQSLSKDELICVWAIGEVLSQKPSFSSLDSYSQLCLFSVSL MRRLLDVKTASEQSTESKPSGIMPSYVGGRPTFTKQMSSNRFEQEPVRFDTAASAAFL SALMSNTQSTLVVACRPKGEKYDWVMARAVGIPFWVRSKKTLVSIAEEIAQAIYKSTK SVMDCALYYVAMRNMKTLRAIAATDRSDSGTKFLKFIIDHDFSSERGRKAAEKNAYSL LRKRKYATAASFFLLAEPPMIKTALDVIKSQMKDITLAFMVARLMENAPKSSAMPDDA LTIGGGFNLSSMGGGGGFAGGGPIGGTSLDLEEDGAKFSEWSPSLGPSARSVLQTKGT SPAVEDNCMESVKLLWLNRPNEAMLRLAHMPTNSVADASSINDVAVPSISGDASSVTN GKVLQKTNEVINFCSGPFLLKAMEPKKRVLWSSALLVSRAMGRCGIENSAMRILLQVA DPSYVEPSDKDTRLKADRSKTSAMSSTVNAVPSSRAPTSILDSFDAPTQTHTDATSSI FDSFDMPPPKPKAPSQTQPEITSSIFDSFDAPSPKPKVVAPPPPSQADPMASSIFDSF DAPLPRPKAPTQQQQSDPMASSIFDSFYAAPPKPKPTIQQLQTADPMTSSIFDSFDGP TQKQATKSAAQPAAPKVLETKDVDEIEEEQPFDLPEFPALWNEWREQLIQNVAARRLL REMARILSSFNGEPHYCNIETFTQHKHPLIPMGAAEVLHKACDSEGLLNSIHKSLYDL RTSFGVDENIIIAQALVLLSTNNHPRRIVLSVLLQCLLGRSDFAEDLVRDAASMQMNC SELLGFSNDMITDNKDSKYYISSQWARRDSANILWQLELCLWLHRGGAFDISSIALKE TLLAVRVGLATASWGRCHQSLDTLIKAEPDCLLDFDAGKNLWRSMKIIVVNENTLDNV EGVSSGGWEFLVDCRRDEATEMLRDGKTGQFLIRPHAQDPGVFTLSFKTNLVPTETLP TTNYDEAGGADAAQEQQDTPSPSKVIKRDDVVQHAIVRLSDSGFRCGSFGPFTALVKL LHAVSDSLPFNLRFSDPPVKGIISERGTQTSPNSFLFRKLALHSTAEFIHFHGPKSAE VSEVDDCLPKKGALHNTSCEEKNDDTVADVNRRYALFSQLLFLTEFRKQLCAVAAAHD DEADVGASSHGERKSDAAENIDDEYDGSISEGSLDVDEEEILGLAARMVRPLLNWCKA REIEIVDEIAPLISDIKHKSETLLSVAINASGDEFEVPSIESFGGDSMIRRMIQAGSG VDFRTLRVGEPGNSVIVVLFNKQDAIKWLVANETGNDETEALERLRVMELLRVIEPIT SSDLSLPKGYAATHPSTSSQYRFVDPWEVEALESKAGETASAALGRGKYHTLSVGLIA AACEKIVRASGGLHLLGLWSTMKGGISLTKALCSAHPAWERDAGGDLQMKGGFLMEPS PYENSIRQHLYGNSLFRRLKLPQRFLALIQVELLDLKNVTSPSGTSSLTAYALLRLKR QGSSAPLNHKARSLDSACTQARKISKISGPNAPASWGSLVRFRFPLPEDVNCEGKSFD SDRESLFKGPPTCLQVTVYEKKFMSDVELGGADVNLESLGSGGQIEEWVPLRVGKDGI TW" gene complement(<15513..>15869) /locus_tag="THAPSDRAFT_11975" mRNA complement(<15513..>15869) /locus_tag="THAPSDRAFT_11975" /product="predicted protein" CDS complement(15513..15869) /locus_tag="THAPSDRAFT_11975" /codon_start=1 /product="predicted protein" /protein_id="EED87506.1" /translation="MFSSKLLLLSALPSTKAFLPTPTKASIMSATSLRSITFEPPPDD NCELDGSDCEESVFERKRKEKMNANQSTKERYAAMGVQLSDADLIDSNIDQYANAPLG GTLMAGISLSALCEDD" gene <17517..23924 /locus_tag="THAPSDRAFT_25811" mRNA join(<17517..17586,17726..19248,19349..20196,20275..22764, 22847..23924) /locus_tag="THAPSDRAFT_25811" /product="predicted protein" CDS join(17517..17586,17726..19248,19349..20196,20275..22764, 22847..23900) /locus_tag="THAPSDRAFT_25811" /codon_start=1 /product="predicted protein" /protein_id="EED87421.1" /translation="MNFVAHVEDHLRDLGAEARKTHPGVKEASERAIIQLRSLQTQYV AAVRRAGASAKNAKQFPPKQSDAFSSSSASSSAAANPPQHPTTALFRSQDVLRPFLLA VNYPDASYSLLVISLESMQLLLRGDAICSEDGVQISRVLGIQAWECAGRLGLSGSGAI SGRGGGGEGSSGMAVGMIHAASGALGGITGMVLGGGRPPTPTLETGSAVSHHHHQHHS SNRTPKEDQSIALKILQTITMLVDSRSVELSQEVLGACLLTCLILGAGQSYPVKKSVD DTKERKERVSHSAPGKDSAAGGTKGSVQRAALATLNQIISILFERAKDVMFGKLPLPA VLTEASHDESTILTVASRTLTDLCTIIDNYQCRSPHQLFGAFAVAGKEGLAPSPATCL ALLDMIFKQQCGDFFQVCQNSCGGDGLEQNNGLSFASQVIIETGQLVLAILLSQKMEQ TSDSMDFCYYYYATVLGSTILTNYLSSSTLEFYELFDATTLAINESGKGKAVMTNIAQ EIMRLLVVFVTGATDTYHREVFEDGYVFNQTERESLNIGKTNAENVSTNRRSASRALS TQQPAQPTPDPLISNELIWKAFLSLEVISRLVRSHLDQIAWLDFIVSQRVVGDMTIVS SIAKAASDLATISSSTRERILHVVLTAHDDQPPETLETGAENTGDAPTAALAAKFADS LSIGVDEIPLCDAGVATWLSFKCILSLVKSLKKSVASSEMANQSTTDSDTRQAIVLEI LNESFGPSVSVLQHYVKRMSGSHVVVSQTLLAYEELASASMVVDCKQENLRRQAIVTS LCKLCLPSWGKKRSHCQLKESNIDSLQTMLWIVHQHFDEICDEWHTVLSTFDQLSIVP IKSSKLPDSYTKTAQEIAESYTRLPAFTTCFSSDTLCNFVMSLVQLSEAVSFASPSEQ NLDISRENSESFSNGDINDDFASAGRDGIGGKLMSFAGRAFGGGGSQPSINPNVSFRR SGSDAGSSQSSKTYSEDFLELICSQMLAMKLSTPRSTIRKLPLPLLLLTVVAEANAYR FSVVEEVIAVHLCDLVAKAPTMELRSFSMETLIHFIPLSLSESDEHTAPTLKPTSLPI DRYKNKPLGTIQRKGSHDEGVEDIAKSSPEGKSPLLKILCQTMQITPQPDTAETGLNA LHIVLEEAGHDLSRDNLITVVETLSVLAGWDDYGNETDKTTSISRSSKAWANVSAKSF QNLKLILDEFLEPITSTEARTGSSDEARKAIIECCVAFGKSRHDVNTSLTATGMLWTL ADQDATPGTLDVVLSKLSFLALDNRPELRNCSVNTLFSCAVGLGDKFTDAQWEKCLNQ TVFGIMRDISFAINGSDSNQASSSDEGARSRRYKVAVHHSRDSATKQWSTTQVLVLRG LERVLRLFFARLLATTTAGADDKDPWFLQTWKAILRTSLECATLAGGRETLDVRLAGI ELALLCSHLSCKAGLVASAASARVGTNMEVVGGALRSVRAATTVQNKDQNSGKDMTIV DPEVEKWRDHLFDLSFAALGEYRVYLEQLDTQVAAEAEKKHMLTDSVQTQVLTKLVGE LSKLYECCKGNEMLPGVSALQLDIFVEDDDCYESRFVHLILTIMNNADSDYNSRYLNQ VQRGTMSLLQAMASNSSLRAFKALITLSGDYMFVRSNALGGNDDNHGEILELEAAKTV ASAFDSDDLSGEAKVVVMCSVLVQYLNIYGSPEVRISDSQPSDAGVRKTSEARYDILT SVLDSGLEAASHIDAQTTDESATLDAIWERILLTISSLMLPSNDTRYDGYENHSKSFL NIIAVVLLHLPVRKYAVAASILEKGAARAVEVAFECNEDRKNKTIDPVKVVEGAIDVF LACFMGLCQKMPSSPSVHTLSNQILGDTLDSKVFGDNAGLESKTRQNLALAVCESLKT TFSQELLVGMFPLLCRLTSVENDGLRRAAGKILGGINLAEAISREHRRAEQAEIRASE IEEENSALLEEVEYLQAENEELQRQLAIFSESSDIT" gene <24237..>26337 /locus_tag="THAPSDRAFT_11977" mRNA join(<24237..24773,24867..25950,26024..>26337) /locus_tag="THAPSDRAFT_11977" /product="predicted protein" CDS join(24237..24773,24867..25950,26024..26337) /locus_tag="THAPSDRAFT_11977" /codon_start=1 /product="predicted protein" /protein_id="EED87422.1" /translation="MLSLHFTLLVWLLLLMTQVVVATMGSVYDNGHKYTLSLEQQLSR ILERSLNVKTDEGQERRINKKSKAKKSSSKSTKRSKTKKSKKKKGKDDEDLIFTRTSC GAPLNNYRTCYDRVIDPSNDLTCDAIRLDNPFTSPYIGEDEELVSGKMHLTLTQYSIT AVTCEDTIAMEEISLEYLKDKVGSDNTFTPVCVFIANNAYDIQPVQNDTVQTVAFQLE VAFAFKKRFSKEIEQEEATTSPITRTPRNGQQLDHQHRALGEKCSKTDFGVCCSGKAF NSNAARESKECNNGCNYMKCGKRKKKQKNGRFLQSSSACDINLARKSFKNVVAAYTDF EPVTTRAIVGGNNTTDVAVCGVYSYIHDLLGNASMTCEAYEVHDCETNEDIVIQPNET VCNVDNPTTSPTEFPCDIEDEGTECCADSDCRAGESCFSGSCFCDSTKGDEGYCCTSQ DCDAGASCVFNQCATCSLESPGVECCEDDDCGDVAVYKCEGKQCFDRECSIGDVGTEC CVSSDCEDGMSCEFKQCVKKGQLRFTLRWTGDDDLDLHVLTPGGGLIYFGNDIDSETG GELDHDDTAPEGWGLHIENINFPVNKTLPNGTYTYWVHSYSRRGVGNDEWSVALYVNE EYYVGHNGNGTSTQYNYTKL" gene complement(<26400..>29102) /locus_tag="THAPSDRAFT_11978" mRNA complement(<26400..>29102) /locus_tag="THAPSDRAFT_11978" /product="predicted protein" CDS complement(26400..29102) /locus_tag="THAPSDRAFT_11978" /codon_start=1 /product="predicted protein" /protein_id="EED87507.1" /db_xref="InterPro:IPR001199" /translation="MIAAARSPMAARHAATATSNGLKPSLSITRGVTSARNGVLRSSS NNVSSRGAPSPVSSSLALRIGNNHLQRKSFSLVTSALRASSRNTTALVLSCVTGASAG VLFASNNNIAYAEAPRDNGGASSKKKSMDAESQSLVDELLEFLGMKSKSDVTADSDGK SDDASNYSSSDNNNDDAEEDGTINTSGGPGKRVKVPLDPELLASLPIIPLSQLEHPTG EHKNKLLVSHEGIVYDVTEFANHHPGGKDLLLTATGLDLGHFFDNYTVHGNSDKASNW LANMAVGKLSESDAKMARERTTSVVHVEQRHKWLNKARRRIVFIAATLPMWMTVRTAV RFVGWCIPSLGRLLARLVPVAVPGLSVGAEPLKINSGKKGEAEVEEGGVVKVKEGAPK VAVIGGGIAGCGTAWALRQSGFDVTLYEAREQISGNARTFDWDFSPFRSKEEEKIVKS CCSVTAWPPLFYKNYTCLLNTLDVETVHQPLSWFLNSKVPGAVGTLWAADPTPYGGSL RNVMKKDFDIYDKVVRISDTLSYLFTFRWAPWRRNDSPSMYDSHTGIGLLNPMNIVPL YSMFRGLGGSELWWQVVFTPHYTASFLVDELRPFPAVFGPLIEAQIPLLPNDQNAQSF KSVRSAQDCNITTCQTWKDAGKGIREVFDKLTKDIDLRENTRIREVEVLANGKKRIHD EYDNTMDVDRVVFACPANAVGNIYKRCGWLANTIFSTLVYADDHHPDSGHMHAVLHSD GTVIEEKYREECLKRASNYVEVTEKSDGSINIENQYNFGVQTPGPGVYDLPLEKKPVM LISHALGEGKSIDPNLVVGTANHARAHPLYSGWNVMAMLSLRLVQGKNGIYYCSNWTT PGNCHDMSFLSGLVCAHAIGAEYPFEENGEAKKDFHRLRDLMGF" gene complement(<32982..>33485) /locus_tag="THAPSDRAFT_38860" mRNA complement(<32982..>33485) /locus_tag="THAPSDRAFT_38860" /product="predicted protein" CDS complement(<32982..>33485) /locus_tag="THAPSDRAFT_38860" /note="GO_function: GO:3676; helicase activity - nucleic acid binding [Evidence 5524] [PMID 4386]" /codon_start=1 /product="predicted protein" /protein_id="EED87508.1" /db_xref="InterPro:IPR001650" /translation="NAQLQRFLRKGVAYHHAGLDNNDRRVVEEAFMSGSINCLCATST LSMGVNLPSHLVIVKGTSAYRGSGTGHQDLDTGTLLQMIGRAGRPGFDTSGTAVIMTD SHSKTRFENLSLGLKVVESHLLDGNRLSEELNNEISQGVVTCVEEAVDWVKSSFLFRR INSHPLYY" gene <37213..>37613 /locus_tag="THAPSDRAFT_264883" mRNA <37213..>37613 /locus_tag="THAPSDRAFT_264883" /product="hypothetical protein" CDS <37213..>37613 /locus_tag="THAPSDRAFT_264883" /note="Distant similarity to cysteinyl-tRNA synthetase; unknown protein" /codon_start=1 /product="hypothetical protein" /protein_id="EED87423.1" /translation="EKLNNNKQPSPNILPNQLFYDESKYSAYDENGIPTKDVEGNDLT KSAMKKLNKLHQAHCKRHTKWKEHDTDATTEELAVVGGRASSIGDDPPEAHWEDSLDP SFCQVVVGSFGRRQGLEFSSDMGPFVHLFQV" gene complement(<37689..>38702) /locus_tag="THAPSDRAFT_38813" mRNA complement(<37689..>38702) /locus_tag="THAPSDRAFT_38813" /product="predicted protein" CDS complement(37689..>38702) /locus_tag="THAPSDRAFT_38813" /codon_start=1 /product="predicted protein" /protein_id="EED87509.1" /db_xref="InterPro:IPR005654" /translation="PPKGVYLHGGVGCGKTYCMNLFYDALPSDASKQKVHFHKFMLNV HKQMHKAKMINKLQGDAILENVLQTILEEGKVICFDEFQVTDIADALILKRLFTGLLE DGAVIVATSNRPPRDLYKGGLQRDLFLPFIDLLEETSVVVSMWESDMDYRLVGVSSES HNRGPHRVYFVDGKDDDGRGKSSKDEFEELFNTLTKGSLINSVILDVQGRQVFVPKAS EEYGIARFSFYDLCGKAKGAADYLAIGERFHTVFIEDVPKLRYHEVNLVRRWITLIDA LYECHVKMVVHAATTPDEMFTVDLENEHCDEVFAFDRTRSRMEEMRSEAYLKKKWVGS RPR" gene <40685..>41578 /locus_tag="THAPSDRAFT_11981" mRNA <40685..>41578 /locus_tag="THAPSDRAFT_11981" /product="predicted protein" CDS 40685..41578 /locus_tag="THAPSDRAFT_11981" /codon_start=1 /product="predicted protein" /protein_id="EED87424.1" /translation="MYEIPVFSASIWMLPLASKLGTKLQFDKTNEYSTILWHEGKCCH PNHSGHLILNMVLAYCMVEEERSMISSNNEHSNDATSIEHDFTADERPVLRDPIYLSM EEEIVYVESETDQTYIDFTNPSSSNVTWSNIISFNDGWQYYADNSDKDKYGFIADDVK GGPHISFSLAVAMSGSGGTRVRTHVEISYTMSYENFGLAYAWLSDSSERRPSCTKPVD IGERQPIISGEEPLVAYWDEPSSVPTVVILNATIIEGEQRQSTLHLCLTPRNDNLRGD GHNKFKLLGIRLLPTSHINEQ" gene complement(41621..42758) /locus_tag="THAPSDRAFT_25812" mRNA complement(41621..42758) /locus_tag="THAPSDRAFT_25812" /product="predicted protein" CDS complement(41682..42653) /locus_tag="THAPSDRAFT_25812" /note="GO_component: GO:5622; ribosome - intracellular [PMID 5840]; GO_function: GO:3735 - structural constituent of ribosome; GO_process: GO:6414; protein biosynthesis - translational elongation [PMID 6412]" /codon_start=1 /product="predicted protein" /protein_id="EED87510.1" /db_xref="InterPro:IPR001790" /db_xref="InterPro:IPR001813" /translation="MPLSAERKQQYFSTMKELMTTYSKCFIVEIDNVGSMQIQQTRLA LRGKAEVLMGKNTMMRKCIREFVEENPDTPIAQLEACCRGNVGFVFTNGDLGAVREVL ESNVRPAPARVGAVAPIDVIVPKGPTGCDPGQTAFFQTLQIATKITKGQIEMTTDTHL ISAGERVTASQAALLQKLAMEPFTYGLVLKSVYDSGSLFDAKVLDITDDVLAAKFVSA LNTISKLSLALNIPTQASVTHSIANAFKAILSVTVELENYSFDKADIYKAYLADPSAF AGSGGGGGGGGDGGETSAAAAVEEVEEEEAPPAVDMFGGGDGGGGDY" gene 43873..46353 /locus_tag="THAPSDRAFT_25813" mRNA join(43873..44119,44208..44880,44957..45056,45136..45430, 45533..46353) /locus_tag="THAPSDRAFT_25813" /product="predicted protein" CDS join(43942..44119,44208..44880,44957..45056,45136..45430, 45533..46251) /locus_tag="THAPSDRAFT_25813" /note="GO_component: GO:16020 - membrane; GO_function: GO:16491 - oxidoreductase activity; GO_process: GO:6118 - electron transport" /codon_start=1 /product="predicted protein" /protein_id="EED87425.1" /db_xref="InterPro:IPR002916" /translation="MTSRKPQPSPPSPSSYKHIASSWTPTIRYKALKLSQVICAVYIL IMTFRYYPRGLIDPSAPLGEWRIVDVWNPSNTEAGVIQLNGDPYGQKRAVVAKDRASL IFLAISRISAFTLYPPMFLIFMTKCKATINFLMRTPLSLFMVDDQHELHSFCGKYIAF DVWVHTLFHCLRWGIQGNIDLLWTNITGLSGLIVVLATPLITFPMFINPLRLAMSYEV RKGLHYLFYLFAIGMCFHNKTSAFPNGGFNQVVLGFCIVYYTLDSLYVMIFMTEMIET TVFNVLPSGVQMTMAVSDHFQKTYAQGGYGYVCLPWVSKYQWHAFSLFEHPTDPNLRQ VFMMKVQGGDWTNSVHEQLQRNTVRPVWISGPFVSPYNNALDFDNTICVASGIGITPA LSVIRAHRESRRINLIWTCRDVAMLEFFSDHLYLNNDDGWILIFYTGKEPLSPAIENA NSNVILIKRRPDLHRIVPNIIYGIESGKGVPEIRQPSEKVKAKEFLADKLEDLEEMLL SEDEIVEELTMLAHEKGFLLSNLVAEDDTDEGKGLLEVVKEHFSSTPPVKHIQQDTSH NNTGPYQRRKSQRVRQVSNIRRNSMMDTGYCPSEEHEGADEFVRNLNKRLVLSTWGLL YCGGSKPVENTLRQISKEYNIALHPESFAW" gene complement(<46444..>47312) /locus_tag="THAPSDRAFT_11984" mRNA complement(join(<46444..46691,46734..47092,47260..>47312)) /locus_tag="THAPSDRAFT_11984" /product="predicted protein" CDS complement(join(46444..46691,46734..47092,47260..47312)) /locus_tag="THAPSDRAFT_11984" /codon_start=1 /product="predicted protein" /protein_id="EED87511.1" /translation="MGHGGFRRETNPFYHGYPKLQRQVAVEGDAGESHADSIGIGRQE WESFSFRELYKHFDCSSHARDQTKRLYSPMKRNTLNDKYREQFGKATPSLEAYVPPYY ANYSNGKGRGVFASRDIKVGERVHDGMKRTVFFDDEMSCDVMEWVWTQQVGDARKFCM CVNLNDSAFMNNGGKDGSNIAPKPNEFSLDFYATRNIKTGEELLYDYGQYDTDWGFFG L" gene <47712..49479 /locus_tag="THAPSDRAFT_25814" mRNA <47712..49479 /locus_tag="THAPSDRAFT_25814" /product="predicted protein" CDS 47712..49448 /locus_tag="THAPSDRAFT_25814" /codon_start=1 /product="predicted protein" /protein_id="EED87426.1" /translation="MYPNLHHHIASSNTNPRLNTIVMPKFQYIEEFLPLASRYPILAP LFTKSSDITSSLATIKAAKASFAKRTADGASSSSSSVGLDIDHDEYDPSSVLCLVVGE GSEPRTAVVAAIKYGWCTVAIDDKLDEKWETRDTLPTTCNYVGFRGGMDRFLVEGRDH IESSLCDLSDVQHLVIICMDQNNNNFDHLRSLRGRLGIADLRAIYNRSPATVVSLSSQ QIISRCPLKLTPRISSIDEGILSPYRQVQVWSFLRCSPSIGSMHSSMSTSSSLYKLNA SDGRRINSKSTISLSSHMSNLKTGHSSSKVDQLQRQQALALASKKQLLASKAASVPKQ EFDLEYKPKSKVKRRSISASNGLLKILPRVHPKCRQTTPLTEARRDEKDTPVDTNAES IVQLDLHLSSKNPSALTFATDSTSNSSDDDDLTSNTSSNKSRHHQYKEGDFVEVNIGD VYQVGVIDKQPLKGANLYNVTLLTDTPAWKEMQGTSIIYQNSGNKMPEVSGRHIRRFI PAPLGEILHVCVNGTVRKCRVHGYSFDNENDMSPQNMKYSVKFASQDGQGWTNEKYRV PVQRAYRQLYSL" gene <49838..53145 /locus_tag="THAPSDRAFT_25815" mRNA join(<49838..50065,50155..50871,50996..51514,52030..52398, 52481..52814,52905..53145) /locus_tag="THAPSDRAFT_25815" /product="predicted protein" CDS join(49838..50065,50155..50871,50996..51514,52030..52398, 52481..52814,52905..52924) /locus_tag="THAPSDRAFT_25815" /note="GO_function: GO:3676 - nucleic acid binding" /codon_start=1 /product="predicted protein" /protein_id="EED87427.1" /db_xref="InterPro:IPR004088" /translation="MDPAHQDAVARARAIAARLAGTGATVDTTGGSGGGSSSGGGYGS ASASTYGHGGGGDVNAQAQAALDAAFSGGGGGQSAYAPTGDSTNKRGVQEALASLIPG LGTLPDAKRAKTDSYYGSADSGKTVKKLWIPSDRNPGYNYVGLLIGPGGSKQRELVAA SGGDVKISIRGKGSHSKPGSEAVPGMPEEPLHVLLEGSATCVANAEKLVRELLEDSEV ADKEKARQLSSLGGEDGDTATKGGASSYTPKPVAQILGQMSGNTALSAYGPNGGGGEE QVEEKIGVPNGMVGFIIGRGGESITSMQRRSGCRVQIQKEHEMAPGTTQRVITLTAPN AESIAQCRAIIESMIVERMQQTGATTAALASSSTALGGGSSASSQMALLQKALNEGQK HITVQVPDADVGLIIGKQGAQIRTIQDTSGANVQIPQVADAGNPSVRTVNITCPTLEG AEFAKQMIEEVLASKAQVGGGGMGGGGGGGGDVTIQVNCPDKDVGMIIGRGGCVIKQM QTTTRCRIQIPPTAPPGSMYRVISVSGTAAGCQQVQQMIETIVAEQSSQAVMSGVAYS NGAQPQQQNYYGQQAAYGQQAYGQQAAYGQQAATGQKDYSAEWAAYYAAQAAAQGGAA ATTTAEPAAAPAAATATSETPAHDAYYEVFWQYAAYYGEEAARKHYGGWSPPVGTPNP NANAGASATSSAPAPAAPAPAAQVGEAKDSSVRKVSNLPALTKGTN" gene complement(53263..>55981) /locus_tag="THAPSDRAFT_25816" mRNA complement(join(53263..55180,55242..55615,55857..>55981)) /locus_tag="THAPSDRAFT_25816" /product="predicted protein" CDS complement(join(53289..55180,55242..55615,55857..55981)) /locus_tag="THAPSDRAFT_25816" /codon_start=1 /product="predicted protein" /protein_id="EED87512.1" /translation="MTNAIDERHNAKTSIAEELAQSDMDADVNVDVSAKTNASWKNSN KHNEYQQNTTGYVTKKITKNWDVTFDFLLDSLKHTGSDLKSDRRSKTLQYSLAQELPF TAPVHRQHNKATTQSKYSGLVILQHHDLLILIVLTDGLIGSATIIKQEHTWQAEATAL HENKTSTGLRGGDAPDLVSSSAQLQQRVLGAGTAREDPQTIQAQRERLKRMNDERKRK NAAARGKKNDDKKGRQQGGRDKEESGRGSSIQQGGQRVSGNNSKKQDKKDALKDEMKE QSSGANKPDKKGDKLTMQEIKEQMKGDDGEDDKLTKQEKKEQMKMEQQSSSGSSGGGG KKDEVVVDNGDKAKEQALYNQQNQQAQQQSSSTTNKPSFISKEEWESLSKQEKKDLLK LKDGDLVISDNLRPGSVDLDAVDDYDADESAGSVGTEVDDVKPSIQDSPQVKPPKDEP KEEDMSANDAFVSSSSSTSTSATNSDTSSSKSFSFAEKLQDNSIQLDGFETPSKLKWN MSGQGWNLDTSTYYETKTSIKSGITSNTEGSETVNSDLALSTDASFVGGVLSFWIKAD LELPNEAFYVSVDDEVALPPISPSDPQEWVEYSVAVESGQHEITWSHVYNPFGLESLP RRSGNKVGLWMDDLRLSPFDGSYNQGFEDSGELTMTTGGDATWEIDDASNGISGSYSI VASSKNIHSDSGSSNIEFVLNSKKGGSLKYNVLSSTTAPHDDFAIYLNGKPAEAIFGQ SLSYEYKALDIPAGKVVVTMMHRKNPGGLSGSLLESLGIVETEGVTRLDDVRFLPM" gene <56551..>58344 /locus_tag="THAPSDRAFT_11988" mRNA join(<56551..57303,57397..>58344) /locus_tag="THAPSDRAFT_11988" /product="predicted protein" CDS join(56551..57303,57397..58344) /locus_tag="THAPSDRAFT_11988" /codon_start=1 /product="predicted protein" /protein_id="EED87428.1" /translation="MSSSASYLRHVAARPLHQPSALSIHRVLNTLFDGATATTNQDQD SYPTLQQTTSSSSSSTPQAPQIDNYDNASWDTRATILFIEAIVLTVIVLCVLIACTRR HFRRMDLASQRAASLQEWAAVRMPAGSGGSMEVSDATEDIENGVVANNASADATNTTT DNTNLSNERTPLRQLTSLPSTILSILLSPLRMLDSAINDWSTERVRTNNYDISYYRSM VERWEREKEAGREKEDERGERLRRAFEKGCMVWEIEEEDFVKPKNKGEINDDNMEEHN DNNNNNGSESGDFEIGLDETGSDDEAAATGEKSITETTTADTTDEGESLSNEITEATT ELKDSDVCEDEVSHQQQMNNAKNSVQHDNGVDKETRAEPNNTEEASSNTPITSDEDSA PGYLYLNRQRANTTGSLTTITASTSQSTLPESHSTHSMPSSFTTQLPFTPEVSSGRTI PNQCAICLCDYEKGDTVVVSCNKLCPHAFHQECIVEWLAKMQEGTPCPCCRRTFVELD EYLPGNKNTNVATGTNVITSSQQSPEEAERLRQIRRRRHIELGLQRGRAFNASVISMW " gene 63967..>69428 /locus_tag="THAPSDRAFT_270116" mRNA join(63967..64220,64260..64395,64496..65005,65086..67125, 67457..67661,67867..69107,69276..>69428) /locus_tag="THAPSDRAFT_270116" /product="hypothetical protein" CDS join(63970..64220,64260..64395,64496..65005,65086..67125, 67457..67661,67867..69107,69276..69428) /locus_tag="THAPSDRAFT_270116" /note="The HEAT repeat is a tandemly repeated, 37-47 amino acid long module occurring in a number of cytoplasmic proteins, including the four name-giving proteins huntingtin, elongation factor 3 (EF3), the 65 Kd alpha regulatory subunit of protein phosphatase 2A (PP2A) and the yeast PI3-kinase TOR1. Arrays of HEAT repeats consists of 3 to 36 units forming a rod-like helical structure and appear to function as protein-protein interaction surfaces. It has been noted that many HEAT repeat-containing proteins are involved in intracellular transport processes.; putative protein with heat domain repeats with similarity to hypothetical protein predicted by genemark.hmm; GO_function: GO:5524 - ATP binding" /codon_start=1 /product="hypothetical protein" /protein_id="EED87429.1" /translation="MGVVRAVSSLTDPSAKVKMDLPVLKILLGFLIAFGLGDGNEDVR NESRNAARDIVAYYGSSEDVISFFLPQFESVLTTGKADENIAASDYRKEGVVVSLGSI ALHLNDDADADKIDDIIDMLLNALKTPSEDVQASVALCLSKLMKKGRTQARIETLLNN LMDECINGQSLASQRGAAYGISAAVKGSGIASLKKFDVVKRLEESCTSGSPPNKEGSL FAIELLSSRLGILFEPYVIVLLPALLKAFSDSNDHVRTAADKTVGLIMSKLSGHGVKL VMPAVLEAFDEPEWRTKQASIHMLGSMSHCAPKQLASCLPKVVPKLTEAFSDTHPKVK NSAESALEELCKVIKNPEISSISTILLKALTDPASGTVHALESLISTEFVHAIDAPSL SIIIPVVHRGLRDRAANTKRYAALISGNICTMVNDPRDFVPYLPILLPDLKSTLLDPI PDVRSISAKSLGSLTRGLGESTFPDLRPWLIETLTSEGGSSVERSGAAQGLTEVLVAG GAHLTEKVMVSEILPLSTHPKAGTREGVLWVLTFLPSALGQAYSSLIDESLPALLSGL ADDSETVRDVALRAGRVLVRSNGKAHKDKILPALEDGLSNEDYRIRVASLTLLGDLLS MLGGTKVVKGNADTQDDIRQAERAQAQIALVLGNETRKRVLSSLYLSRSDTAAVVRQS AVQVWKTVVSVTPRTLREILSELVDQIVSALASGDSERTQVAGRCLGDIVSKLGDQVL PEIIPVLRDSLYRGDEFTRQGVCVGLAEVIACSSKEQIIKFLDILVKVVQDALCDEDE QVRKMAASCFQSLYQVVGSRTLEEVVPALLVAMESSDEVVKTRALNGVTGILSVRSRE LLPFIIPKLLKAPLTASHADALASISAATGETIHMHFSTIIPTLIFETASFVGSDEEE KEREEAIRRCARAVCHNVDTSGVNWLISEIASKCTNDKDSVRKEGCWFFQVVIEESKF MLAQTISRLNDDSKVVLKSTSEALRALTTCVPAEELVTHIQFIRNLIASMVSEARYRK GGVGDGQFYLPGFNMPKGLEPLLPIYQRGVLYGDAHTREISAAGLGELITITADKYLA GPFLIKLTGPLLRIVGDRNPSAVKIAIIQTLGLILQKGGPALRAFVPQFQTTFVKALS DPSRQVRIEAIKALALLMPLSTRVDPLIKELVATSLSKGSNVTAETAGLVAIQTATLE ALAVVLKHGGSKVKLPESIPSALDAGKELVAHEDEGIRESASKVIGYACELLGVDTAN DTLQELVSDRASNLTSSSTETKHGIACITRRILSTSVGKDVDRSIYANITNTTLTLMK DDSAVVRSASSVAIGAIVGSSTDVKATLALVEKHIHKNMDKGEELEVQQAVATGLCVT AMLQPGIFRRSEGLALINGALKLAMSGAQRVQFSYNDFLWIALDVKNGETGLEEYLAL AHFDSAKTMKPLYEKVLKKMKPVKECPSDEVDHDAPIFSRCEINVLFQPCRHSITMTS SFIDDITISPDLQCHPQ" gene <75181..>76041 /locus_tag="THAPSDRAFT_264885" mRNA join(<75181..75382,75482..75879,75973..>76041) /locus_tag="THAPSDRAFT_264885" /product="small GTPase" CDS join(<75181..75382,75482..75879,75973..>76041) /locus_tag="THAPSDRAFT_264885" /note="contains a RabGAP/TBC domain; GO_function: GO:5524 - ATP binding" /codon_start=1 /product="small GTPase" /protein_id="EED87430.1" /translation="ERLSQILFVYAREHPEIGYRQGMHEILSYVLLVLEMDLLQQATE DEKKRLMTESLSPMGMSRFGSEGKHLLHDAFNIFECIMMALAPAYDAIPVGDETTATL MEAAKIERGESPMEQMTSSIVSKIRYVARDEALFSHVLYMPVPPQLYFAKWVRLMFGR EMAGGMKDVMRLWDAFFDLAWAASALDNQTEVSTSMALMNPDPNEGIGYLMNYPPVED IGLLV" gene complement(<77785..>78105) /locus_tag="THAPSDRAFT_38802" mRNA complement(<77785..>78105) /locus_tag="THAPSDRAFT_38802" /product="predicted protein" CDS complement(<77785..78105) /locus_tag="THAPSDRAFT_38802" /note="GO_function: GO:5554 - molecular function unknown" /codon_start=1 /product="predicted protein" /protein_id="EED87513.1" /db_xref="InterPro:IPR005345" /translation="MAKHHPDLVMCRKQPGIALGRLCEKCDGKCVICDSFVNPSTIVH ICDECNYGSLEGRCVVCGGVGVTDAYYCRECVGVGKDRDGCPKVVNLGSTKTDLFYER KKVGF" gene <78651..>79688 /locus_tag="THAPSDRAFT_14638" mRNA join(<78651..78837,78958..>79688) /locus_tag="THAPSDRAFT_14638" /product="hypothetical protein" CDS join(78651..78837,78958..>79688) /locus_tag="THAPSDRAFT_14638" /note="protein kinases are enzymes that belong to a very extensive family of proteins which share a conserved catalytic core common with both serine/threonine and tyrosine protein kinases; putative protein kinase; GO_function: GO:4672; ATP binding - protein kinase activity [PMID 5524]; GO_process: GO:6468 - protein amino acid phosphorylation" /codon_start=1 /product="hypothetical protein" /protein_id="EED87431.1" /db_xref="InterPro:IPR000719" /translation="MELRVGKKYRLGRKIGSGSFGDIYLGTNMTTGEEVAIKLESVKT KHPQLLYESKIYRILHGGLGIPNVRWYGIEGDYNVMVLDLLGPSLEDLFNYCGRRFQL KTVLMLADQLLGRLEYVHTKSFIHRDVKPDNFLIGLGKRQSVIHIIDFGLAKKYRDPR SHQHIPYRENKNLTGTARYASINTHIGIEQSRRDDLESLGYVLMYFIRGSLPWQGLKA NTKKQKYERIMDRKMSTSTEQLCKGYATEFRSYFEYCRSLRFEDRPDYAYLKRLFKEL FYRKGFQYDNMFDWTVLNLQQERAKLPPER" gene <81225..>82688 /locus_tag="THAPSDRAFT_11993" mRNA <81225..>82688 /locus_tag="THAPSDRAFT_11993" /product="predicted protein" CDS 81225..82688 /locus_tag="THAPSDRAFT_11993" /note="GO_process: GO:6520 - amino acid metabolism" /codon_start=1 /product="predicted protein" /protein_id="EED87432.1" /db_xref="InterPro:IPR000277" /translation="MTFTTATYRSTLTLSRRVNKSTIKQLGATRQLSSGNENKHATPT SFATKCASLYHPSPKANRGLAPPIYFGSTYLLDDADHGARLHDKKEAAFTDDDGFVYS RWGSPTNEACAKQIAALEGVEEKGGTMLFGSGMSGITSALMSVLKAGDHAVFPYTVYG GTHEFLKEFAVHWGVEIDFIDGAGKNGPEAYKSAFRENTKVVYCETPANPTCRITDLA GVGKVVDEYYGTREANPSRPWVMVDGTFATPYHSRSLDFAGIDVGIQSCTKYLGGHSD ILAGSVSSNSPEFLHGLAKVQKLITAPLNPMDSYLLMRGIRTLDVRMQRHGENALQVA KMLEDHPLVESTFYPGLESHPDNTGGDDCLVVQAFRSGRDSPADAAIMPQTYGGMIAF IVKGEGNVALERAKKVCEGLRVVNLAVSLGSVESLVEHPASMTHAMIPREDRIAGGLD DGLIRISVGIERASDLVEDLKSSLDRVMESESEERVA" gene complement(<84797..>85309) /locus_tag="THAPSDRAFT_38827" mRNA complement(<84797..>85309) /locus_tag="THAPSDRAFT_38827" /product="predicted protein" CDS complement(<84797..85309) /locus_tag="THAPSDRAFT_38827" /note="GO_function: GO:3700 - transcription factor activity; GO_process: GO:6355 - regulation of transcription, DNA-dependent" /codon_start=1 /product="predicted protein" /protein_id="EED87514.1" /db_xref="InterPro:IPR003711" /translation="MTTVLEISYSDAVVHVPIERAYRLSRYRAGDAAIKPRLSRVKGE AWSKAKRKVEANTVQMAEDVLALYATRETLNRSPFDPSLEGKVKTFATSFPFEPTPDQ KKCFEDVENDMVWRSRPMDRLICGDVGFGKTEVAMRALFRAVANGRQAALLAPTGVLA AQHFKQIVRRM" gene <87259..91308 /locus_tag="THAPSDRAFT_25820" mRNA <87259..91308 /locus_tag="THAPSDRAFT_25820" /product="predicted protein" CDS 87259..91071 /locus_tag="THAPSDRAFT_25820" /codon_start=1 /product="predicted protein" /protein_id="EED87433.1" /db_xref="InterPro:IPR000008" /translation="MTWTDESVEDAGVTSVGSITTETNEDTPTGIAVDNSGGSSDDSN NEMAIIAPTPKQTKHVHFSFVDDNTANCTDSSDSGSSVETDGDGDDREDATAEEKAQV FTTAVTAVTAAHNSNTDVNNSNMGNITSIPNTNTSEQPLTFLIEIVGTTFSTKQQNRF SKTRSSWGEDSRGMHCTTTWVDPATTKRFGKATKREELVLHRTKTLRMDGTKSLTNST SDLSTVTTEASSAGSDAATKSAKSINTTFVDDGEDEGLMHAEHIFTVDDASLFLFHTS MEQMVNASKNSKDSKNINITDEEEGDNYLNSGGLRFDIFEKPLDALSTVYRTVLADSV SVKEANAIISSDSSSPLLASYRLLGSVFLTPKEILERCDEERFEYDLHDGLRKEQQSK NRQGKSLQRGNDTMRRSGSVGRLALRIRVATEMDIAFIETLQSSDANSSEISIAQLND ALRKNNPDRETLKPVQLITEMDENVLTAQTSLKAISNIAPVAQESMRYLFSNDVEKRV MVKPYPDPNRVEETMWFTEAELHEECWKPSTNWIKAGSGSLGKVYLEVLECRGLPNVD TGPGNKTDAFVSVVYEDVMVQTEVIDDSLSPMWMPWTNRAYIFQMNHPSTAMYIGVAD YDVGPLEHECIGRCAIQLSKLSPGTLYTLSYNLYESSNLTEKGESSGTIILRLRVEYD EKKYLMEGWKTPPVQWVNSQQWKSHRVAKYCCDGPHDTEVFEMKLFRSHINELLTAKR TLTYVIGDSLHSLIFWRGQVKVGNTWLPLHSAIVFSFSVHAVEKPHLLPSFFFFACGW IMISSMCQRESHPSPWHRGHSFYHYWNILLHGKSFHVPQEIRPLQGHKEALKYEKAWQ DRLVEDDARWAKQAELDAKVKGISDDTVIRTKAKANAPLVDPISAMAGARLLPYQQRL GRYCDKVRYVRNVMNWSESVMSFWYTVGCIGAGIFGLFVPWAFLIRWTSRLIVWICLG PWMKGLDIFLHGLTVEEEVERQKMKSSNQIKQAFQLQCKAAKTLRENTLKLKAFRVRL FGKYITRLPEYNLTRHEDVPLPESTAEPYCGSEIKPSLIVPGQNLTGVMIPNSGRDLT STRATAPEEKKKKLIEHYQQFSEAQQRQTISMPIDDEVVEGFELVKPEEGQITCVPSE KEQTRLVRRLSSGRDPNVSERELEAALDSPLQKGYDPTKQKRWVFSASIQLSFKQESA DNRSDLELCDDRNGRKMIQYVPKEIEEEGVEIIPLLGMGSNLDENESPNVDEFDGNAR VSVLYVKG" gene complement(<91408..>94551) /locus_tag="THAPSDRAFT_11996" mRNA complement(<91408..>94551) /locus_tag="THAPSDRAFT_11996" /product="predicted protein" CDS complement(91408..94551) /locus_tag="THAPSDRAFT_11996" /codon_start=1 /product="predicted protein" /protein_id="EED87515.1" /translation="MTNPRRKTASSASSRKAAVPSTASNNNLHDDDHEPSPSLMSSPD SPCPDNESSNKLHSLYLLLPPLLLLSGLSVFTQSFFLSRTAFTARSSCQIGSAGELLV DALQLSEDHVDFMRDEGWLTDSKGVNGGGGCWTPRRVDSMAILVVDALRFDFARDHLP LSVGSRLFPGKLSNSTRSKGRGYSQLYQFVADPPTVTMQRLKGLTTGGLPTFADITGS FGGASVDEDSWVEQLKNTPWTRRHHISVGGDGSKKPLIGFVGDDTWVDLFPTQFDDSN PYPSFNTRDLDTVDNGCLMHLPRLLDGLLGLKKQSEFSSNNKHNNATSFELIVAHFLG VDHVGHTYGPNDPHMERKLNQMDGMLSHTLDAIDDAPEESCIVAFVLGDHGMTEDGNH GGGTSEEVNAGLFVHFSPGCHYEDESMQQYRIGRLDGGEIGFDSVRAFESIHQIDLVP TISLLLGLPIPYANIGGLVPDLLPTPRTGSGDVASPTPHSATALALNAAQVWSYLDAY SKISSDLPVDRLKELKELLDSATLVFKEALASSQKQALLHADEGKQHHDSIAYRQACG LFKLFLAESTDLGKRVWTQFNEGGMMFGIGILVVAWIMTFPLWKRNVRNELWAVLSWR RSDDSMKGHIKCAETASHNKLVPSASSMQSFRQIELIAAIAFMIFQCGVLTFGNSYID HEREVVTFFLAILCLLVFRRWYFATAGHTSNAGLQNSVVYLPLMVAFSARANDVFVTG HGLDPSIRMHLAHHSVVFLSSLLVLILLRLRWFCGSSSRSALIDVVAIICLAFSWWEK RSRDHSRNGFIMARCAIALIFAGLLHSCYAMYRRKRMTSKGGYRSDREWTEMTQLASF RAMMFITIVTGPSTASTAVLMVVQTAALSQMMEAGGNREIDAPVMAALWRLAIRHVFF ATNHHCSFNRLHFSAAFVATETFEFHIAGASLFMNTFGWEIIGTTLVLVLSRTCRGKA RRSVWTWFCFYQWTETVASCMSVSVMKRHLMVWAIFAPRFMFAAVFTGLTLFLCFVDV VVSIVTTSSAVMRKQLV" gene complement(<95136..97143) /locus_tag="THAPSDRAFT_25821" mRNA complement(<95136..97143) /locus_tag="THAPSDRAFT_25821" /product="predicted protein" CDS complement(95136..97103) /locus_tag="THAPSDRAFT_25821" /codon_start=1 /product="predicted protein" /protein_id="EED87516.1" /translation="MAGSGTPRSARGARNFTYDDALNAPTAPKTYSSPHKLQSQSTRT MVSPSTMSQENVISPSMMKKRIHIAGKHQHAPGSGSPVILRSPPRNSTPRGADGKIAP LSPSSRDTVVRRDSRDDTAAARKRESGEQVRSPRSGSSPSAESAAAASPRSRRSSRSV TGADVIAAASAARAAKRQQEKQMVASSSNNTSSSNTKNADVDVDVDDSNNEDTLSPLP NKHINDNDNDNDSPRTKFQKQLFNNLRTPDASLNELLSVLTDERTSSSGHHAARRVNA CGAIKTLTLNKANQVALARTEGVVASITSVLCHVEATEEERTRCINALMHLCVPEKNW RIVYMFPQTPEALARNMADRYPQIRYAACLALSFLAKKNRKEVVVNRILMYSIARVLE VDKNGALERVSVQDKKVFVGSRLCALKMMLHLSKNKDVSVQLAKTECIPAALVQIANK MEVAANILCIAIMTNLTRHPENSVGLCRIDGLVSVLVSHIMPVASNNDGSNGDTADNQ ECAKCALYALQNLSCTGSIRQELANTPNLLAALTKNAFKTQHPEQQLSALHSLKNLSD DPFNLVTMTNTPGCTATLLALANDGTNAMAQYLACDTLATLSHWLLTLSAAKTRRKIE KDGGLEDRKEEEEDKDLGRRTFQTLTWEMWA" gene <99475..>101625 /locus_tag="THAPSDRAFT_11998" mRNA <99475..>101625 /locus_tag="THAPSDRAFT_11998" /product="predicted protein" CDS 99475..101625 /locus_tag="THAPSDRAFT_11998" /codon_start=1 /product="predicted protein" /protein_id="EED87434.1" /translation="MSPSASPNPNAAIYDPQAKCIRFPTSPNGVERIDGSIVQIVGNN NSSCSSDFLSDSKHPPRMLLGEVMEITIMDERMGALDVDSTLSQHPAQQTQQRKPPIS TCNTNTKLTQEQLHQKQHTLHSTHNSQSQSLIHRHHNDRQALIANTIFNQEEKAYITM QMEQMHHEELVELQERQMEEMEALMNLVFQCVDGGDDKGERHQQEENTTSTERCDTNA EEQVASSNATPLATKPAPPKKSTAELVSELRREEIQTAMKDKTMSREEKQKKLAEIKA RYSALRATAVEPPARAATTPTITGVVATPKRTAGLWNEATVATVAVNRVARKYHPSPP IAPSDVEATLPKPSASRTFARWNKAAVFSATATAIGSGHPKEEKEAPQVSKPPQKINY QKMDLIVQQLKLNDPTLTSLVLDGRNNITADDWKALFQSLEENSQLQYLSIDNCRLTD EVSVSLVLALVENETITSINLGNNQGLTDDTGKGLVKVLKRSNHVIKQLDVTGTKISA KVQMKLQTYLDDRDENVQFERLQEARKLRIERLLSFSASDTVEQPSEESVDEDAEVMH GSKMVGSGKSPSKKSSKAAIGSASTVSHASGGSRSSRTGSKGSRKPNRRSSLTNSVTS NESGPLKASIKRPAKANKDVYRSVALQMASLGADVATGVGSTANQMKELRKMRGECEH CGQKCFEKRMFKTTPLTIPNAVFEGRCLKCKPMS" gene complement(102635..105196) /locus_tag="THAPSDRAFT_43097" mRNA complement(join(102635..103476,103569..104327, 104455..105063,105154..105196)) /locus_tag="THAPSDRAFT_43097" /product="importin alpha 1 subunit-like protein" CDS complement(join(103180..103476,103569..104327, 104455..105063,105154..105171)) /locus_tag="THAPSDRAFT_43097" /note="Strong sequence similarity to alpha importin in other organisms (65% identity over 500 amino acids); strong EST supporting data; contains conserved armadillo repeats (IPR000225) and importin beta binding domain (IPR002652); GO_function: GO:16491; protein transporter activity - oxidoreductase activity [PMID 8565]; GO_process: GO:6606; intracellular protein transport - protein-nucleus import [Evidence 8152] [PMID 6886]" /codon_start=1 /product="importin alpha 1 subunit-like protein" /protein_id="EED87517.1" /db_xref="InterPro:IPR000225" /db_xref="InterPro:IPR002652" /translation="MNKEEERKKTFKKSIDIDEGRRRREETTLQIRKSKKDVRLAKRR QMPAAMDNGDTPAGLAASSMLAMGGVAPGGYGAVDHGGGGAMATDGSSGNKLENLPQM IQGVMGADPTVQTECTTQFRRLLSIEKNPPIQQVIESGVVPRFVEFLGRDDNPALQFE AAWALTNIASGTSEHTKVVMEVGAVPIFVRLLMSTNDDVREQAVWALGNIAGDSPPCR DLVLQCGAMPPLLSQLHQGSKLSMLRNATWTLSNFCRGKPQPDFEAVKPSLSTLSQLI FSPDEEVLTDACWALSYLSDGPNEKIQSVIEAGVCRRLVELLLNPSPAVQTPALRTVG NIVTGDDLQTQFIINNNALPCLLALLSSPKKGIRKEACWTISNITAGNKDQIQAVVDN NIIPPLIQLLTNAEFDIRKEAAWAISNATSGGSPAQIKFLVQQGCIRPLCDLLTVNDA KIVTIALEGLENILKVGDEEANVTGSHNEMSTYVAEAEGLSKIEELQHHSNNDIYEKC VNILEKYFGVDEEEEMANIAPEMAEGGGQFAFSAPQGMDDGNGGAPTFDFGN" gene <106074..>106592 /locus_tag="THAPSDRAFT_38780" mRNA <106074..>106592 /locus_tag="THAPSDRAFT_38780" /product="hypothetical protein" CDS 106074..106592 /locus_tag="THAPSDRAFT_38780" /note="good supporting EST evidence; Protein tyrosine phosphatases dephosphorylate tyrosine residues.; with sequence similarity to Drosophila PRL-1 prenylated PTP; putative protein tyrosine phosphatase; GO_function: GO:4721 - phosphoprotein phosphatase activity; GO_process: GO:6470 - protein amino acid dephosphorylation" /codon_start=1 /product="hypothetical protein" /protein_id="EED87435.1" /translation="MTISSKPTLITTPNLRFLIMDAPRQSNLHLYIKECRRHHVTDIV RVCEPTYLGAELKSAGIELHEMAYEDGHSPSEEILGRWLDLVEGRFFGSGGGGSKDAT IAVHCVAGLGRAPVLVAIALMEFEKMDAVEAVMMIRRNRRGAINEKQLQYLEGYKCRR GGGGGGCACVIL" gene complement(<109744..>112697) /locus_tag="THAPSDRAFT_38789" mRNA complement(join(<109744..110694,111829..112178, 112259..>112697)) /locus_tag="THAPSDRAFT_38789" /product="predicted protein" CDS complement(join(<109744..110694,111829..112178, 112259..112697)) /locus_tag="THAPSDRAFT_38789" /note="GO_function: GO:8324 - cation transporter activity; GO_process: GO:6813 - potassium ion transport" /codon_start=1 /product="predicted protein" /protein_id="EED87518.1" /db_xref="InterPro:IPR006037" /translation="MFIILILDKIGTDSVMLTALTVFYISGIIDITEALKGFNSQGLL TVLVLFVVAEGLNKTGALNWYVGKLFGHPTTLAGAQLRVLLPITLLSGFINDTPLVTI TLPIVIQWAKKVRLSTRYLLMPLSFAALLGGCCTIIGTSTNLIVVGLLLERYPDDPKF QNMSLFAIGKFGVPVAFVGIAYILLMTPLLLDRKKNEYNTSLMNSSSSDDLLLGAKLT QWSPAAGRTIKRSGLRDTGGVYLVSVKRQATGNVHTAVSPEFVVEDDVIWFSGTASSV GDLRKIPGLVLYESGEVDKMNEKVQNRRLVEAVVSRNGPLVGKTPKEIKFRTSHGAAV IAVHREGRRVHELPGNIKLQAGDVLLLEAGKTFLLANKNRHDKAFTLIAEVEDSSPPR FRMLIPAVILTVGAYVCYMLKLSTLFGTAMIAAIMMVICGVLSEEEARSAIRWEIYLT IAPAFGVGSALINSGVAGAMATFLVNLGNAMGIGNAGVLGSVYVCTVLMSQVVANNAA AALIFPIAMDAAEQIGMDLSLMAYAIMLAASAAFMTPFGYQTNLMVMNPGGYSTSDFL IMGTPMQVVLAFVT" gene complement(<113365..>116262) /locus_tag="THAPSDRAFT_264891" mRNA complement(join(<113365..114214,114323..114383, 114477..114690,114724..114996,115083..115153, 115202..115474,115568..115693,115838..>116262)) /locus_tag="THAPSDRAFT_264891" /product="glucosylceramidase" CDS complement(join(<113365..114214,114323..114383, 114477..114690,114724..114996,115083..115153, 115202..115474,115568..115693,115838..116262)) /locus_tag="THAPSDRAFT_264891" /EC_number="3.2.1.45" /note="based on sequence similarity to other glucosylceramidases from metazaons, a ricin domain and the glucosidase domain; GO_component: GO:16020; lysosome - membrane [PMID 5764]; GO_function: GO:16787; hydrolase activity, acting on glycosyl bonds - hydrolase activity [Evidence 4348; protein transporter activity] [PMID 16798]; GO_process: GO:5975; sphingolipid metabolism - carbohydrate metabolism [Evidence 7040; protein secretion] [PMID 6665]" /codon_start=1 /product="glucosylceramidase" /protein_id="EED87519.1" /db_xref="InterPro:IPR001139" /translation="MLSPFPPSLKMNHTPIKMKTSGYQAIPEEAEGEQPMELSRQESS NNSSGSDDGDGSNAKRNMLMVALSLFLLCVAYSAGRSSSTSGGIGGGISNGKGKLSIL PQDVNIQNEIPSRLCEVYYDPHQASRDVTVIQTSRAEPSRSPILGFGGAFTEAAALNY MSLNEEGREMVMQLLFGREGLGYSLGRTHINSCDFCIKSYSFDDTPDDFSLSSFDTLV SHDLNVGMVDMMLLATKTYTESHPKEIEQHHTMRIIASPWSPPSWMKAPTESDVKGAD GVGEDSKYAKAWALFFSKFIDAYANHGIDFFGVTVQNEPEFPAPWDACAFDAKSQRDF IANHLGPRLAKSNPNTKLLIFDHNKDHMVDWARALLEEDNPAAKYIDGTAFHWYAGMP NMHRMVSELDTMKVDKSHILLGSEACHCPTTGYAGGDLDIAWARAMRNAHTVLADMAS GSNGFIEWNLILDSIGGPNHLGNMCDSPLLALNAEGVSAKYLDVGIVVQPMYYYMGHI TRFVRPGSRAIHALVDSSIGSPEARTFRPKGQTVAGGGINDLARIGIEVTLWPCEGST RQEWTYNSDNQLQVFGHDWLGVPTTSCLAEKSDKDMGGLMLTTCNVTESKAGLYDVIP VDDKSDRVNIVMKKSKVDAKKSCLVVQPLRNNGGAYGPRGGGQINVGDCSHSWAEWTF DTTTGEISSMAFEEVGGEVCVTTGWPFLQVGAFDTSSTGDKANAVVVLNEAGEAANFI LRNDGVEIMSSSIPPHSIQTISFN" gene <116432..120091 /locus_tag="THAPSDRAFT_25824" mRNA join(<116432..119104,119318..119562,119639..120091) /locus_tag="THAPSDRAFT_25824" /product="predicted protein" CDS join(116432..119104,119318..119562,119639..120014) /locus_tag="THAPSDRAFT_25824" /codon_start=1 /product="predicted protein" /protein_id="EED87436.1" /db_xref="InterPro:IPR001680" /translation="MTIHTTCSLSFHGAAANSCPHSLLWVLAPRDGSSVDVVDLGETS SSASTGGAADNVSGSDGLSSRDVRSANIGVEGDALVDTVLYASSGVINLATSTSNGAQ EGGHINLMNVSQTLVTRTLDQYSITDDADGTGIGTTKKVLARGEAVEDAAALRSVTAL SWIDGGSSSTAEDGSFGIVAAFSDGTVTSWRWSDTNIIGSNASGWKEHVLIGHDPSTL SSSSSSDATVRHYHNDNTAAKLHVQESIADISATNLTSSAWLVGTASSSGLLLCVSHV GSGDAADEKVVKGEEGGIVHTVYVRQIGHSAASSVHFVKHVANDNINSNITTQECFLF AGSASPRNNKVWVYTIPYSSNTAPSSSWAPFPLLTIGDPTYHGHLLGHQDWITCFAWW NINTNGNNESGEDDGDDMNGGRKYEDAILASSGHDAKIRLWKFSSLISADTTIMPVNN GVAVMSSIGEEDIVSEVDDENDEDEANIDDLEEEEGEARLVIQHDPSSGYSTTAVSLE ALLLGHEEGVTSLNWRRSKQQTDKPCLLSSSMDRTILLWMEEEDDDTHGGGGVWVPIS RVGSAGGILGGSIGASLMGFVDACFSPEANRIVGHGYGGSLHFWTQNQSGKENHNHEE EETLMSARWIADPCITGHFCSVEDMAWDTNGEYLLTVSSDQTTRLWAEVPMTASHRRW MEVGRPQVHGYDMTAITCVGGLGCGNDSGEPRHRFVSGADEKILRVFDAPLATLRLLR TLKKLSEPTSRNNPAEEGGVDSSSWRVERAFLPSLGLSNKATAECEQESAKYAGPTND DDFVEALDTVEGGTIDELKLPSERDLGVTTLWPETRKLFGHESELVCLDAYRAPEGTD CPSLIASSCKARNDVASAAIRLWNVKQGKCVDILKASSGKDRRICIWRRDGDPLTSDS VSYQLSAAVDSAHKRIVWSVHFCPFQPNILASGSRDGLVKIWHVVETATGTDEMKLLL RFEPSCKSGKNEPVTAVAFAEGVLPGGEEASNHFGILGVGTESGRIEVWSVPLSVNDS VLSSSLLYAVHANECHFVAVNRLAWRPIVVGNYDDGNESRDDNSLGLTLASCGQDCGV RLFNLRFNS" gene complement(120624..>122810) /locus_tag="THAPSDRAFT_25825" mRNA complement(join(120624..120936,120982..>122810)) /locus_tag="THAPSDRAFT_25825" /product="predicted protein" CDS complement(join(120711..120936,120982..122810)) /locus_tag="THAPSDRAFT_25825" /codon_start=1 /product="predicted protein" /protein_id="EED87520.1" /translation="MKLLRVGRSASKNSEHHPSSTGTGCRATSSSTAPTMTTTTTDNA AMAQYQSITPPAANKSRMKFKSSSSANNNKAATRDGASPTFPEEESPSTVAAPTLEQL SSTVWRTDGRAGSGAGSSGIGGATGTPKNSPFIRRTFSTEGHSKQRLPSNLSSNPSLT PPFLGDASQQQQVLRRSQGGDPPDVHEVRRSSGSLMKRRSSMPVSAAKHDKSMTRRNS ESHQQQNRRASTSSQTSSNSSGASSTRGRTSLESKLEEHKKVDVVVAENSLALLGTSN TKQNKDAMSNSSSTSNNEQKQRQVSSNRATPLQDAIKEFTFPLELNNSNNTNTSVNPT SNPYVINPFSSTPSTIPSRNSYQPSTRSNNRASSSKQRHYQRTSHRHHQSDSIPEHTP LNLPNFQSSQSSPLEDLLFSSNSSVFSSDDTVNSTIRTAAELQHLIEAMQIEFARLKN AKLKVEAECDKLQTDYVEMQESLERQFIQVCEERDAVKEKRERELIAYEQMERENGVL KESLRKECLERSYVNDRVVALEGENEKLKMSLKMSVRVNKVGSGGGSSKRESFKKKAV QNASQEYHRSTSTSNTDRSSVCTNSTTNDIVGEFYDSNEEHLDTPTPNESLGTNTTSS PHPSTPKQQQRQRTRSASASGTSSKTKRAAREEASNEVNHVLDKFRVLKGKQKSCSTR QL" gene complement(<123613..>126371) /locus_tag="THAPSDRAFT_38776" mRNA complement(join(<123613..124016,124102..124642, 124782..125343,125410..125908,126074..>126371)) /locus_tag="THAPSDRAFT_38776" /product="predicted protein" CDS complement(join(123613..124016,124102..124642, 124782..125343,125410..125908,126074..126371)) /locus_tag="THAPSDRAFT_38776" /codon_start=1 /product="predicted protein" /protein_id="EED87521.1" /db_xref="InterPro:IPR001680" /translation="MAPVDSGSDSDGSASDGDDGSVNEEESRSDEEYDSAAEHDDDDD NNEAEGEDDNSVEVSDQYSSDEFEDQGSSDDDEDRLARQAELDSDEDDDSDDEDGAAA SLLHTDNLSSDDEDPTGTDNRIGRVPLHWYDDYSHIGYDAHGNQVIKSSSAFNNQDLL DQAIQVADEMEGDGKFKVYDALNARDVTLTPRQIELIRRLQSGAFAHPEHDANPDYID YFSGVDPEISGINSNRYEKKSRFQPSKWEKLQVRRLLHRLKCGSINMDDKPFELWKGD EEDELALRKGPQHMPAPKLPPPGHAFSYNPPEEYLPTKEELAEWSEMDPEDRPYGHYI PQKFHNLRSVGAYQHAVKERFERCLDLYLCPRAMKRRLNIDPESLVPQLPRASDLRPF PTTKCIRYEVPGGDESGGDGVVRCLSVSPEGQYLASGGEDGVVRLWEVQTGRLLRSWD LVDKPIASLEWNPNRSHHTLLAAIGQCSVIISTGTAGPDDAEVTDALLSAAASCKNGG NVAPDSRASKAVKWISLKKKYNTSKATPISAYGGTSGPIALIRTNKDISSLRWHRKGD YFVTVSPKAGAASVLIHQLSKAASQQPFGKMKGEAQLACFHPQKPFLFVASKEHVRVY HLVKQVMVKRLMSGCRHISSLDVHVSGDHLVVGSLDRRLVWFDLDLASTPYKTLKYHE RALRSVGFHPRYPLMASASDDGSIHVFHSMVYSDLMRNPLIVPVKVLRGHSVVNSLGA LAMVFHPTQPWLFSAGADGKIHLFQDL" gene complement(128140..129906) /locus_tag="THAPSDRAFT_30747" mRNA complement(join(128140..129313,129485..129906)) /locus_tag="THAPSDRAFT_30747" /product="phosphatidate cytidylyltransferase" CDS complement(join(128761..129313,129485..129819)) /locus_tag="THAPSDRAFT_30747" /EC_number="2.7.7.41" /note="CTP + phosphatidate = diphosphate + CDP-diacylglycerol; cytidine-5'-diphosphate-diacylglycerol synthase; GO_component: GO:16020; integral to membrane - membrane [PMID 16021]; GO_function: GO:16740; nucleotidyltransferase activity - transferase activity [Evidence 4605; ATP binding] [PMID 16779]; GO_process: GO:6644; phospholipid biosynthesis - phospholipid metabolism [PMID 8654]" /codon_start=1 /product="phosphatidate cytidylyltransferase" /protein_id="EED87522.1" /db_xref="InterPro:IPR000374" /translation="MGLTVAGWVYSGNYIFTLLFTLMTALGQLEYYRMVMKAGIYPAR RISVVGACAMFVTALFAPDLHQIVLPVVSTWAMVWFLTMRRTITSISEIATTLTGIFY LGYIPSFWVRETNIHPYTLNKLPSFLPKTIHLPITTGANFIFWSWLCIAFSDVGGYFA GRKFGKTKLSAISPAAGKTSPNKTVEGVIGGCAFSMILATLGAWIQKWPYWAIVGPIH GVMLALLGLVGDLTASMLKRDAGLKDFGDLIPEHGGIMDRVDSYIFTAPYGWFMCAYM IPWLKGIAKGSAPAGALAV" gene <131205..>132767 /locus_tag="THAPSDRAFT_12008" mRNA <131205..>132767 /locus_tag="THAPSDRAFT_12008" /product="predicted protein" CDS 131205..132767 /locus_tag="THAPSDRAFT_12008" /codon_start=1 /product="predicted protein" /protein_id="EED87437.1" /translation="MHLHPSPLDKSTPAYPSARFDSNGYCIAHPSIRLCRLTNDGKYK IVRKTCFKCGSAGLMTDAHENKIAVHGYKKKGSKHREIPSLLTASGGGDGASHNHIHG EGRLSKDKEKHGNKRYDRHDNNSNDTKSTSNIKKERAIVVSSTKTTDGEEKKRSSHTY RIGCAFAEAPPNTNTKSRSRTLSPNSNKLTRKSRGITLSPLRKSLSSNTTKVSSTSTS FMRKFSAEMSSPRNIKDVIRIKLPSLIKSSPKTDRAVHGSNKSKSSKVAPNVTTSNVD KSTTTSRYDAAPFNKEGCCNIHPDVHLAEKDRCGVWKVIQDACLKCNPTESEVTKSPK PRRRSSSRLKNSRKDEPLKVKDSDDKIHAHAELENQQSPIPSMKVYYAKLKEAKASSS PNKTTLLNNGGDRCDDTLPSLSCDASPKSLHETTNINSINSSPQPAFSSAYFSRGSCF SHVGFPSLPLPTDDVDWDTSFTATVKQNNNNGVALSGDCIVGNLKSNGRKSGCSRPRT TRRHRVFPEQVY" gene <133720..>135579 /locus_tag="THAPSDRAFT_12009" mRNA <133720..>135579 /locus_tag="THAPSDRAFT_12009" /product="predicted protein" CDS 133720..135579 /locus_tag="THAPSDRAFT_12009" /codon_start=1 /product="predicted protein" /protein_id="EED87438.1" /translation="MYTSPLDKSTPAYPAARFDIDGLCISHPSVRLCHLTTDGKYKIV RKTCPKYGTAGLMNDEVSGKVNHPHGYKKKSKVASKSRENPNALTAKRSEGRRIERHN SHGTHKTMKRSLGINSNDDTNEDITFVPAQAAPFSRLQQGGRYNQSMREDYNTSSCSS TYRRSRTTSPVDKERGSTPRRYTDNRGRDNYPTSSSSNDNNKSTTKVPKSPKLETTPD ALKQALRMKLKQTKHGMPSLPKLTEKLTGKALDTNITSVHDISDRAGELEDSLNRLNI VNGGRTSDDERQEKFRSSPFDGDGYCHQHPNIRLAKRTMLGSWKVMMHVCPECCHESC CKDTTAGSVCSRGSNRSGRRSRCSSNSRRNRSTSRSSPHRSSGRCSRASSIKSSDDGT ATSTLSRRELDEIVDELSSLSVVPVHPAERVDVEQTRSAQYVKHTAQGVEKVTATKLP PPPRNPRRKTNRQSVRKSRFVEKKNPTVVGESFDTVSLLPPKEAFTETIDCTTKITSD DSICTLPTVASASPMSFNKISISPAFSSAYFSRGNDNYHSGGISRKGIPSLPLYSRND WRNESLDSFVKDENHEKFVHANAMKQQRFSKEVDEESVISELSGSADGVIWGC" gene <135866..139118 /locus_tag="THAPSDRAFT_25829" mRNA join(<135866..137460,137684..138287,138385..139118) /locus_tag="THAPSDRAFT_25829" /product="predicted protein" CDS join(135866..137460,137684..138287,138385..139074) /locus_tag="THAPSDRAFT_25829" /codon_start=1 /product="predicted protein" /protein_id="EED87439.1" /translation="MSSEEEYQSDGNSESASEEEASDYDDDELNKSSRSRGRSSGTPR ASRARQTVKAARSRQQQQQQQENSHHDDNVRRNPRRYTTDEMDREIAMSMYNEQSSDD NEDHDDDASSGDDVAARRRAPSKLSVRAAKQKKGTPVKRKPKKSSAKERTSRSKKDEF IANSDEEDDGILSTSEDSKSEESEEWDGNGFDDSAHSSEAEDSNSDEDFSTNSKKKKK GRSDSKKKGKGGGKQQRTVNTTKRTPTRRSTRGTTTTSKHDDDDNNNSDSSEEEIEYA ITPTGKRRPRRHCTAKTEERMHKVVEEDMKTEKEALSGVLEEDDVELLDSDKEGSGSD VGGAKREEELLKRNGAVNGYGSAVASPSRKYKYDEDEDYNSDANGESSSGGGDSAEES FEEESDGESYNTSRRRSSRSTSKRSVKQRTSSRRGDKKKSYTEVDEEEIGTADEDSED EVTGPSLLLSPAKKSSGSYGSPLARQRARRGREPLAVYEGGSSDIEDEKPKSKRKRRN VGSDDEESDGSDSDRGKQPYFQHNNAPEKLQFLQPPHFRSPMEDDMIDQIGSRFGRGA LVIENSTLYKKMKQRSWGGSVGFADALDDEMDEFDSDGEYTGGTGGVGQNFNDRFQRY VQNLMGSQDVYCCPVCYIEADRRLGNMGDEEIAEDDPSDDVDNANTEHVDRFSFLDDP LTILGHLDHQTFEIASTFCFRLLSGVKNHLKAVHGVNLKEIEGNDLFKRFQIRASDGL LQRWLRKSLRTHIVQGDMVRYWLGGENQAFVLLLSQIDKGELRGEQSGEYGSDFSFSF PNRARKLWREVSAPYLKLQDDMEDFVADSDEEEASGPMINPHFTPPSLEDEGGDIMTP EERMIEHLKKRNAARNVQSCSDEGASGGGDDSSDDELEVLPKPSEMEEVEEEDDWTKS KRYDSAKKSRKMKTLVDSDADSDDEEEPKVPATSGSARKRIDDSSDDE" gene complement(<139337..>140935) /locus_tag="THAPSDRAFT_12011" mRNA complement(<139337..>140935) /locus_tag="THAPSDRAFT_12011" /product="predicted protein" CDS complement(139337..140935) /locus_tag="THAPSDRAFT_12011" /codon_start=1 /product="predicted protein" /protein_id="EED87523.1" /db_xref="InterPro:IPR002848" /translation="MGFRTHHLSRILITIHPIIKATTTRSLHTALPLYYRQMDYLTEI GSELRLTEQKRRSANDRSWELNAALVRVVMAHEASMEPSSTNQDELVKEEESLDSLVK ETVLSSAFYDNIVRDGLGDMKTESKAHPRAPRLDSLSLKMEEYARYKAFRHFLSEGNL LSPSAPCFVATEGDEQRAVVTDEEYLGGCILLCHDLAKYGVSRATNAVTDPDAVPAVQ KARDIVSKVLEMLLEFDFRNGPLRRKYDGTKYKLKTLETVLYELSVAGAGGGNKASSR ETTEGGPLKKMKLEDKKGGDVDMTDYDASGTIPNDEIAAIKLRMDHRDVLRERLIKAC RDGQKSAKQSIFALHRGDTTRASNLLREVETLYNNDLLTILKEEPSLRSGSLSGVLEE YVEGIMFYTWLHGEDNANGGSSKKPSCKILKPSELPLSVSSEEYLGGLCDLTGEVGRY AVARGTVRDKESVKLCLDTNKSIQNALKIMGKLPGSIGKKQTALIRSVENLERMIYEL SLMEMTGREVVTAVEDSPEDIGGD" gene <141175..>141483 /locus_tag="THAPSDRAFT_38861" mRNA <141175..>141483 /locus_tag="THAPSDRAFT_38861" /product="predicted protein" CDS 141175..141483 /locus_tag="THAPSDRAFT_38861" /codon_start=1 /product="predicted protein" /protein_id="EED87440.1" /translation="MHWFVKTETFSKPFPQVKSYLEAHREWVRCLREKNEGEQQTIVS GYRVDANDRPGGGGLLIFAAESYEAAEKIVREDPLVKNECVDWQLNKWIAETGDISLE " gene complement(<141748..>142317) /locus_tag="THAPSDRAFT_12012" mRNA complement(<141748..>142317) /locus_tag="THAPSDRAFT_12012" /product="predicted protein" CDS complement(141748..142317) /locus_tag="THAPSDRAFT_12012" /codon_start=1 /product="predicted protein" /protein_id="EED87524.1" /translation="MIDPGTKTDGEEFEAYNIRRWGSSGWTHSLKRAGRKVGANFNDW KVWPNTLKAHQLIAYVTDPKRHGENESKPTTTSECNAAIFDAMYECGENISLTETLVK IATDRLGVSQSEVPLLQTHLENNEGGKDVMREIQTGRKRYNIQGVPYFIIGAVDGEQS LGRPYGFSGAQDPSTFVEIFEELAASLEE" gene <142778..>143211 /locus_tag="THAPSDRAFT_12013" mRNA join(<142778..143078,143123..>143211) /locus_tag="THAPSDRAFT_12013" /product="predicted protein" CDS join(142778..143078,143123..143211) /locus_tag="THAPSDRAFT_12013" /codon_start=1 /product="predicted protein" /protein_id="EED87441.1" /translation="MTPFAILTLLIVALIGTTAAYSTPKSTSQSQRQTSATVVNRSTF LATAASTCLAFLASSPPAFAKDVDPALKGTKADPEFQACLSQCVYECTKPKGMEQRNE AADDDGVSQVSLEMEGGRIWGCDIYGA" gene <143617..>144933 /locus_tag="THAPSDRAFT_12014" mRNA <143617..>144933 /locus_tag="THAPSDRAFT_12014" /product="predicted protein" CDS 143617..144933 /locus_tag="THAPSDRAFT_12014" /codon_start=1 /product="predicted protein" /protein_id="EED87442.1" /translation="MRSTRQICQLYGFEKLECAAMRNAAFETAKEERDADSIVTEDSV PLRDRINSQLFVTRETETGKVLWKRPMQETDMIAHDIDARTNLQHRLPCHSFRSMLPS LAEGKRPPAKSPDWIRSCTRGILFSAWHAYRNERLASNADDAKFYFPEHVYCWFSDEA NSTMTTEDEDRWAFYQGIKGLAISDAEGWIMYQMLNDMQGEHFTSFLFHTLKTIKSQM ESSWTDQVGVVFSHAEGLQSMSVLQERFKSLGQSLENDNYLCGINLWISLDSAHDSLR VLFYSDRIKSSATTKALSKRATEMTVDVKGEPCIDVFSLVQIIMKEYIVHMQKQTTLL RIMFETGSTGNLTDVYDTPGEVDINEEIEIDTEYIVSFQQFVKIIKTIWPTLMLKEVA LLYREAYDMMYPTSQWNQPAPDGISFASFMSAADKRCLFSRVRANL" gene 145772..148383 /locus_tag="THAPSDRAFT_270120" mRNA join(145772..146056,146168..146362,146448..146481, 146573..147108,147309..147634,147717..148383) /locus_tag="THAPSDRAFT_270120" /product="ABC transporter" CDS join(145826..146056,146168..146362,146448..146481, 146573..147108,147309..147634,147717..148245) /locus_tag="THAPSDRAFT_270120" /note="ABC transporter, ATP binding protein; GO_component: GO:16020 - membrane; GO_function: GO:166; ATP binding - nucleotide binding [PMID 5524]; GO_process: GO:6810 - transport" /codon_start=1 /product="ABC transporter" /protein_id="EED87443.1" /db_xref="InterPro:IPR003439" /translation="MLDGTAIVLNHGNRYGLIGRNGCGKSTLMKALGVRAIPIPSGID IFHLKEEVEPSGEMTALDAVMSVDEERARLEQEQEILMDLLTSTYERLDALDADTAET RARSILQGLGFTHAMQSKFTKDFSGGWRMRVSLARALFIQPTLLLLDEPTNHLDMEAV IWLEDYLSKWDKILLLISHSQDFLNNVTTHTIHFTNKRKLEYYDGNYDQFVKTKSELE ENQMKQYNWEQDQIKSMKEYIARFGHGTSKNAKQAQSKQKVLDKMVRGGLVTKPEVEK PMNFKFPDPGHLPPPVLAFHDVSFGYPNCEPLYTNVNFGVDLDSRVALVGPNGAGKTT LVKLMSGELQPSLGDIRPHGHLKIGRFTQHFVDVLNLEQTPLEFFDTVYPGTPREEQR KYLGRFGISGKMQVQKLEELSDGQKSRVVFAKLGRDAPHILLLDEPTNHLDMESIDAL AKAVNEFEGGMVLVSHDMRLISQVAKEIWICDHKTITMYRGDIQNFKMDMRAQMHLDD DGGKGSKSASGGKGKLRGDASVMKKSDEDTKKEKKTKEASKVSSSSANGGAKNSALDS LLAPKPREPSETKDDIWDVKEVKESLPKVDEQPVAAERKKYVPPHLRNKQ" gene <149323..>149514 /locus_tag="THAPSDRAFT_38788" mRNA <149323..>149514 /locus_tag="THAPSDRAFT_38788" /product="hypothetical protein" CDS <149323..>149514 /locus_tag="THAPSDRAFT_38788" /note="The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn2+ ions. The FYVE finger has eight potential zinc coordinating cysteine positions. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue.; hypothetical protein with putative FYVE zn-finger, rabphilin/VPS27/FAB1 type-domain; GO_function: GO:8270 - zinc ion binding" /codon_start=1 /product="hypothetical protein" /protein_id="EED87444.1" /db_xref="InterPro:IPR000306" /translation="VPDSLRSACPGCYQTFTYTTRRHHCRLCGDVFCDACSSSRTVLP LDGPEFDVPVRVCDWCMKDV" gene 159285..>161805 /locus_tag="THAPSDRAFT_25831" mRNA join(159285..160061,160901..161701,161764..>161805) /locus_tag="THAPSDRAFT_25831" /product="predicted protein" CDS join(159297..160061,160901..161701,161764..161805) /locus_tag="THAPSDRAFT_25831" /codon_start=1 /product="predicted protein" /protein_id="EED87445.1" /translation="MHPNSGQLHISQSPSESMSRRPKRAKTSDSSSPPSPPVAELPPP FASAPVVVNDAVMEQLQHQSSTIASLNESVAHLRSMVNDLITSHSSLKKEVAVLRAAS EQSQPHAPLAPTNEGNMMQQQQQVPHLPTMNAIQTERDSSLQLAATTYRLSPPAAAAA INDTTSRTTDEETDAIIADLKEKKQWPVKRNAFKRFSEDASLIKVRFHRSLKYQHLPV RITGDTKEGRLNCKLCAKKDVSRNTAWMCSSCEMPLCETIDLGSFSMIGCGARKRSQD IDSTASFNVSAGAILLAFGVPRVVLVAIFVAFGGWLFLVLDTLPTQLNRIHQQLHRKQ LSPLASSNDNMNDLNDPSAVQMQRAVEAVAAEAADSLRVQVADRPPVAVGVAAAEPVK EDVLDQMHIDALVAAEPINVNEDDPVSVAACLRQVQSQLQQQIQVILAMKDKINELTV QRDTIQRELDSIKSSSKAPRTSTARGTPLPRNAAAMAHPLSHLGVTQDSTDVEIDAIF HAKKGGQLGGSQWPREAYSSSGPDSTH" gene complement(162348..163269) /locus_tag="THAPSDRAFT_270121" mRNA complement(join(162348..163080,163138..163269)) /locus_tag="THAPSDRAFT_270121" /product="hypothetical protein" CDS complement(join(162493..163080,163138..163230)) /locus_tag="THAPSDRAFT_270121" /note="supported by a Thalassiosira cDNA; putative protein containing similarity to unknown proteins" /codon_start=1 /product="hypothetical protein" /protein_id="EED87525.1" /db_xref="InterPro:IPR001251" /translation="MPLRHYRAEKGNLIEAIRKIKCTLRWRELFGKQEELRQLADTIA HENETGKIYCRGYDKQGRAILYLTPGRENSTNELNNMKHLVYHLERAIACTRRHSGRE KVCIVIGYEGFKLSNAPPMSTTKHTLTILQGHYPERMFRAYICDPPLVFRTFWSVIRH FVDPCTLEKIAFCSGKEGQTLLERDFDVDMTERQAGGQRDLRRFSSREFLFATSFDRT FDEKYVEE" gene <166446..>170866 /locus_tag="THAPSDRAFT_264896" mRNA join(<166446..166550,166587..166846,166970..167609, 167805..167869,167913..167933,167993..169529, 169607..169819,169904..170221,170309..>170866) /locus_tag="THAPSDRAFT_264896" /product="ABC transporter" CDS join(<166446..166550,166587..166846,166970..167609, 167805..167869,167913..167933,167993..169529, 169607..169819,169904..170221,170309..>170866) /locus_tag="THAPSDRAFT_264896" /note="On the basis of sequence similarities a family of related ATP-binding proteins has been characterized. ABC transporters are transport systems which consist of three different proteins which function together to import metabolites into the cell. ABC stands for ATP-binding cassette. The three components are: 1) ATP-binding protein - resides in the cytoplasm, associated with the inner membrane, provides the energy for transport through hydrolysis of ATP. 2) permease - spans the inner membrane, makes the channel for the metabolite to pass through. 3) periplasmic substrate-binding protein - resides in the periplasm in gram negative organisms and is attached to the cytoplasmic membrane in gram positive organisms (and is therefore not called periplasmic), this protein binds the substrate with high affinity and initiates transport through the permease. The likely stoichiometry of these proteins in the cell is two permeases, two ATP-binding proteins and great excess of substrate binding protein per transport structure. The proteins belonging to this family also contain one or two copies of the 'A' consensus sequence or the 'P-loop' (see IPR001687); GO_component: GO:16020; integral to membrane - membrane [PMID 16021]; GO_function: GO:5524 - ATP binding; GO_process: GO:6810 - transport" /codon_start=1 /product="ABC transporter" /protein_id="EED87446.1" /db_xref="InterPro:IPR001140" /db_xref="InterPro:IPR003439" /translation="LADTDLSDVLNVDSSAENLRKFHEMWEAEKHRAATEAVVPAAYP SLHRAIAKDFLSTLWFVQPLMLASSVGKLVQALALGMLLESFDSGDGKGYLWAGVLVL SGFVVLLCHHQSFFWTWRKGMQYRVASVSAIYDKSLRLKSTSSTDELSSGKVVNIASN DVERFLLASAYGLYIIWVPILSIGILALGWYVIGSAFAAGFVMLIFGFIPIQLWLSKK FAMMRSKVAALTDQRVTLVSQAVSGVRVMKMSGWEDSFEDRIVSIRAKEVDQIERVNR YRALNEAVFYVSNVATSVAVFLIHVGTGGVLTPMNVFTTMVLVNVAQLGETLVVNLAF VGVSECSVSIGRIQKFLESPELEQLLLHLASSDEENNRSENETIDYSGLTLALNDVNV QFDMGQLTCIIGEVGSGKSALLQMLAGELPSSYGMVRHRSECTLAYASQDPWMMDGTV RENILLGKPFDATFYNEVVHACGLSVDFILLRNGEQTIVGDKGVQMSGGQRARIALAR ALYRDSDIILLDDPLSAVDSKVGRLLFYSAIQDLGVNRGKCVVLVTHQHQFIGDSRCV MMSGGSIVCDGSYEQCVAASDGKLTLAVQNKESEDDDTTPALGISPGSIDVSSEDTPT ALSKKQPTKATTENIEDDSKEASQTGVVTRDTFINYLRAMPGGLWTGLLMLALFVATQ GSLLACIAVVGKWSGLSADGQSSGRIIGLVVGLVVAVSFFAILRAFVYFHLTLYAAKR LHDDMTSSVLRAKVQFFDMNPLGRILNRFSADVGSIDDLLPPALFDFLVILFIVLGGL VSTISLLPATLVFIPPLVWYFVAVRRAFVATSRELKRMEGLSRSPIFAMLSESLSGIS TIRSNNALEYFQKKFLGVHDAHGRSFFAFLACSRWLGFRMDGLMFIFLAVASFVAVIV QDQDWLDIDPGVLGLALSMVMQLGTYFQHGIRQSAEVVNQMVAVERVSGFCDLPSEAA LENDFDNSINEWPTKGDITVQDLSVRYRVGLPLSLQGLSFKIKGGTRVGVVGRTGGGK STLVQSLLRLLEAEDGQIVVDGVDISKLGLHKLRRSISVIPQSPVLYGGCTIRENIDP LHNYDDEQIHAALLYANMLDTIKTQPYGLDTTVADDGLNFSVGQRQLLCLARAILRKN KILVLDEPTANVDAGTDKLLQEAVAKNFRGATIIAVAHRLDTVIDYDKILVLGAGAVL EYGSPHELIEKVNGAFASMVNDTGSAMKRELTSRA" gene complement(171133..>173792) /locus_tag="THAPSDRAFT_25833" mRNA complement(171133..>173792) /locus_tag="THAPSDRAFT_25833" /product="predicted protein" CDS complement(171267..173792) /locus_tag="THAPSDRAFT_25833" /codon_start=1 /product="predicted protein" /protein_id="EED87526.1" /translation="MGAGAVGFGRSASNGFANAASESNTSSGGSGSSGAERSGNIRSF FGLRRKDNAVVVASNDGDAPIVTADTCTRDTTSIQLEQPLHIDTSFQQENNIRTYGNS TNDKLTPSFNPRISPSDYLYHTSPNGGSNAPLTPLSLHYDSNSRLLELFSSLISNGGT NGLQRGKQFNEQYSLAYAVGIKFIEVALFQIPKHGYYESEGYELDKRKSLVEANRVVE LLGGLLDELDDVDGTIGGEGGGGGSVASSIGRSERGSISVKERKETVQKLAVVAKRSF QEAVESRSKRNYLDCSGAASASAASLSTFWKEYVVDGTDNLCSFWSLDCGDLVGVVDD HNTNVGVDHADDEMEEKKQDQVIAGGLQSQKQQQRVRSEFSEHVVPTAAFSESIWPKR SVLKSFSSQSSSEVPDVASFVGVPENAIIKNNIQSAPRRGESLVFTEQYQSATPIKEV TTPTSEIAPSLIQQHSVIEDVPLFTQDGFDQEELQLALSLSMTENNITQSASSNEYRF GDDESIQLQYGKLVSTSPTISTASLSKLYKEQYMTLRDKEQFHIRFLDTYQGRIRGST NGCTVIAPLTCIQYFVTSEQNRISSASYVAGVVCDYDAWDGGLPDEHIANVIDVYAQT ILSEVRNNLNLAADSFIVPSDVHDHLMDIGLLSPTSFVGVCGGNVLDDEHLGQLKNTL LLLNDERERKRLKGRKLAAVFFFHGHVVALHVIDKGPDDVWIELIDSLPNPDTWARRS TTTSFASSDCEEREPAQAVSRRSDHSDDDEWERHVEYDDRLPLNAVRVRCVDDCFDTL IRHYAVSKFSREEHFFIDQTQWEDNNSHFDPRVFQAFIWAEAD" gene <174121..>175182 /locus_tag="THAPSDRAFT_12021" mRNA <174121..>175182 /locus_tag="THAPSDRAFT_12021" /product="predicted protein" CDS 174121..175182 /locus_tag="THAPSDRAFT_12021" /codon_start=1 /product="predicted protein" /protein_id="EED87447.1" /translation="MLHLTTSQLHDAITRHIDESKPDPLAKNQMASLHLSWESSPQMM GDGVDHQLNGGGGSGDAEREAVEERESNLRGTLTLSSDYVSIFIHVMCRWKTNTEHVS SSVDGTKSTSSDKQFQNEFYFRLTTTASFVPLSGYNNNSQSNDDETTKVDRKEKAAKR KLRAGMVKRLSADSLIRRLLVESEDGTLKEKDNTAIVKGVPLCEALIQSNGDLNYNIT VDNASELEERVNVNTESVEGIRNAILTHCEDNLDVLELLLNMPYLPRHKFDSVDKQGD EDKQNKATMSMLAERAYLRLLEDAMYDACEKEGEEEMLDNLNISDKNEDESSVDSREE ESRGKSKRCGGVEGKRAKR" gene complement(<175248..>176351) /locus_tag="THAPSDRAFT_12022" mRNA complement(<175248..>176351) /locus_tag="THAPSDRAFT_12022" /product="predicted protein" CDS complement(175248..176351) /locus_tag="THAPSDRAFT_12022" /codon_start=1 /product="predicted protein" /protein_id="EED87527.1" /translation="MNNFELDVSSSSSSSSTSQPPRQINAAAAASSSSDESDHEQVDE SILYELRSALHLSVGAICRHEDRIDADSSTDSADSDDMNSNTNDGSSEKPVFTMSKDA IVALTDLTFHYSTKLLANDLAAFSSHAGRRTVKTEDVLLVARKDKDGILAELKRELKE RIGGEVNATKTGRGGSNGGTKKKASSATTKSSSVNNNSISERSNSKGSKLTNEKSMLS SSFSDSSIDELDKLIREQKRSSSSATAAKTASHKNSNNGSDHGDEGEDLSDFIVNDNN YYNSNDSSSEVEFELDTSSKKKKSKDKHKKELSKRGGKSSGKKSSKSNGKVRSKATIG LSDSSDEDVGRGRGGKFQVEGASESIAIDLDSD" gene 176947..>179067 /locus_tag="THAPSDRAFT_25834" mRNA join(176947..178000,178083..178300,178386..>179067) /locus_tag="THAPSDRAFT_25834" /product="predicted protein" CDS join(177299..178000,178083..178300,178386..179067) /locus_tag="THAPSDRAFT_25834" /codon_start=1 /product="predicted protein" /protein_id="EED87448.1" /translation="MKQSDRSCFFGCAKEHVTPACSVELIAFLAHHGYIETEAGTTPI QQQPPIIYSYSHSKGLLLLDPTLLHTHNISTQFILASRHDVNCFGDPFVQNIVFSLVG ADTVMLNWILGLQHHQRQSHGKQQPNAKRNADGGKRNRFVYHWESGKELDLDLFHMDH YAFSSSSGSTSIADGRQQPMASSPLSNLLFNNLHKHPLYRLLRFLAFKFAVLLSTLFI FFLTTSLVSFTFQETQDRMLEFTLQLQTRVRMRMPLAGLIVGHVLENLVFVPVMVGMI FFLIEFYGGDRFLAFAVLSMVWICEVFSAISIRSIQGMYFFPRVFFLYFTLFHVYFFS CPVGFTYASLASTILFLCHSMLFFWNRYELPALAQGLITPDNPRMMMLVVGNRDGTGM MMNRMPSSSSLPMPRVHPRIHQPLLYPPPVPQISSFDDPTNNTETTEPSTIDYTPHSN QHALRTYVAAASSSLQSIRSLASIQSLADRAASPNWLFQGGYGFANTGGTGDSEDDDS YMAYVGDQRRSSLVLEAERNTGTLS" gene complement(<179157..>180150) /locus_tag="THAPSDRAFT_12024" mRNA complement(join(<179157..179964,180050..>180150)) /locus_tag="THAPSDRAFT_12024" /product="predicted protein" CDS complement(join(179157..179964,180050..180150)) /locus_tag="THAPSDRAFT_12024" /codon_start=1 /product="predicted protein" /protein_id="EED87528.1" /translation="MSSSASITPRLAASDSSGSVYKPKRCLSAYNLYYRFKRSKILDA CAAGNDDKANIEHLLKTTPGLEAHPHDISSMHKTSMYELCREIIKDDMKDDLLPRDTS KRSHTKTHGAMTFVAMGKIMSTMWKEQDDEIKQIFRDLAEQGRKEYQKLLKDLDVDVK ELRTKKCSSKKIKVKEPEEEAKIKEKPPLPEVLHHHAIVSVSDSDDQHEDEVQLPLFD FAFMQDHPLGNVDHSPDNLEQSFCRQCAPTRSRMSLVFHNVSMEPIESLNTVSTSSDE DESDNVSSYDFLKLICHLDEALEQAM" gene complement(<181129..>183972) /locus_tag="THAPSDRAFT_12025" mRNA complement(join(<181129..181336,181410..181467, 181543..181858,181937..182001,182083..183757, 183838..>183972)) /locus_tag="THAPSDRAFT_12025" /product="predicted protein" CDS complement(join(181129..181336,181410..181467, 181543..181858,181937..182001,182083..183757, 183838..183972)) /locus_tag="THAPSDRAFT_12025" /codon_start=1 /product="predicted protein" /protein_id="EED87529.1" /translation="MANDTVMNASDTSSPPSSAKAKRRKTTAKLTICTFLFLILIETN EHVYQVSDDGTTNSQHITWHVPDYSYTISDNVKKEYKQWHLKARHLSQLDLIPSNWTP AVNWKNHPAERSDRFPSVEERLRYYMGKWYNASVPMYGSQFERDTFIQRKTTRQYGTF SDILVNLYNLDKEKLTDCYNNKKELHVMAPYCRDYMDLAILHSEGSANVIHNIGDGLP TYVPEEIVKYPMFNKVRPLCDDNRNSVFVNKICHNKKRVEPILLPLNRKRHFGVASLV PENDIQWEEKMPKAVWRGQYGKTDKSHNGSDIDNTHDIKYALVSKHLNSSLVDAKFSR HADSAPPLMAGSYLDMKDQLRYKYIISIEGNDVSSGLKWMLFSNSIVLAPSFTWEGWA MEGLLEPHVHYLPLKEDMSNVEEMIAWAEDHPNEVQLIRERSTVFIHDLLFHPEAIHD EREVIQGIMERFEHNFGSRGSTKQLHPLEIDWESNSHPTDRGHRFPSVEERVQYYMGR WYDNNNSMFMKRSNVQQLATLYPDTVVTDAPFIASGHVLTQCAMPNSTFTQDVRRLCR ESLTEFDERSTADLKSNAFNRLRKTERGEGIKLASMSSWRSDGKGMKESKRVILDDSV RIISVGSPPKDTQVPIFAQSRDVDGDNEGSAILWPFGMDLNNSVMTGIVEKADTNFRS KEADAVLARDSRTDYESLRELLSHRYLVTTGERTSSDVLEALLLSQSIVLMPNERQST SWLMETALKPYVHFVPIKRDHSDIAAQMQWCENNLEAARKISERASLYVHDMLFDAKA GKDNEEIRFRIMERYLDLYG" gene <185605..>189364 /locus_tag="THAPSDRAFT_38845" mRNA join(<185605..186176,186279..186421,186470..187992, 188087..188286,188364..188700,188792..>189364) /locus_tag="THAPSDRAFT_38845" /product="ABC transporter" CDS join(<185605..186176,186279..186421,186470..187992, 188087..188286,188364..188700,188792..189364) /locus_tag="THAPSDRAFT_38845" /note="On the basis of sequence similarities a family of related ATP-binding proteins has been characterized. ABC transporters are transport systems which consist of three different proteins which function together to import metabolites into the cell. ABC stands for ATP-binding cassette. The three components are: 1) ATP-binding protein - resides in the cytoplasm, associated with the inner membrane, provides the energy for transport through hydrolysis of ATP. 2) permease - spans the inner membrane, makes the channel for the metabolite to pass through. 3) periplasmic substrate-binding protein - resides in the periplasm in gram negative organisms and is attached to the cytoplasmic membrane in gram positive organisms (and is therefore not called periplasmic), this protein binds the substrate with high affinity and initiates transport through the permease. The likely stoichiometry of these proteins in the cell is two permeases, two ATP-binding proteins and great excess of substrate binding protein per transport structure.The proteins belonging to this family also contain one or two copies of the 'A' consensus sequence or the 'P-loop' (see IPR001687); GO_component: GO:16020; integral to membrane - membrane [Evidence 8372] [PMID 16021]; GO_function: GO:166; calcium ion binding - nucleotide binding [Evidence 5524] [PMID 5509]; GO_process: GO:6810 - transport" /codon_start=1 /product="ABC transporter" /protein_id="EED87449.1" /db_xref="InterPro:IPR001140" /db_xref="InterPro:IPR003439" /translation="ASTTSGQIVNIATNDVERFLLASLFASYIFWAPLQSLAILGLGW YVIGWSFAAGFGLLIFGFVPLQLWLSKKFAMLRTKIATITDERVTLVSQAVSGVRVMK MSGWEDNFESRISSIRKKEVDQIERVNRYRAFNEAIFYLCNVATSVVIFLIHVGSGGL LTPRNVFTTMVLINVAQMGELWQAVFLLVAFECWVSIGRIQKFLESPELETKTKAPLL GDDSNDDAMVISNATCHWNIQNSDEFESAGLIVALNNINLEFAVGQLTCIIGEVGSGK SALIQMLAGELPPSTGTIRQKPDLTMSYAPQDPWIMDGTVRENILLGRPFDDTFYNSI VNACGLSIDLAQLRDGEQTIVGDRGVQLSGGQRARIALARAFYCDSDIILLDDPLSAV DSKVGRLLFYSAIQDLGLKRGKCVVLVTHQHQFIGDSRCVMMRGGNIACVGTYRQCVA ASNGRLTLTAQNTISQDDLTALDRDSPQDSEAAIPTKEAVISSEKCTDTEQYTKATDL SKDDHKEASQVGDVKLDTFLNYLRAMPGGLWTGFFMLILFVATQGSALACIAAVGTWS GLPAEEQTSWSILGVIVGLVFTISVLAIFRAFLSFHLTVQASKRLHDDMTRSVLRSRI EFFDTNPIGRILNRFSADVGINDDLLPTTLFDFLVILFLVLGALISAVSVIPVTLVFI PPLVWYFARVRRTFVATSRELKRMEGLARSPIFAMLSESFSGIATIRSNNALEYFQKK FQKVHDAHSRAFFAFIACSRWLGFRMDALMFIFLAISSFAAVIVQDQDWLDIDPGILG LALSMLIQLSGLFQCFTYSLQSAEVVNQMVAVERVIGFSELPSEAALENDFDNSINEW PTKGDINVQDLSMRYRVGLPLSLQGLSFKIEGGTRVGVVGRTGGGKSTLVQSLLRLLE AEDGQIVVDGVDISKLGLHKLRRSISVIPQSPVLYGGCSLRENLDPFHHHSDKQINEA LLDVHMLEAVKSLHCELDTTVADGGLNFSVGQRQLLCLARAILRKNKILVLDEPTANV DSRTDKLLQEAVAKSFRDATIIAVAHRLDTVIDYDKILVLGAGAVLEYGSPHELIEKV NGAFSSMVDDTGDEMSKLLRSIAKKGE" gene <192115..>193023 /locus_tag="THAPSDRAFT_38800" mRNA join(<192115..192428,192525..192762,192835..>193023) /locus_tag="THAPSDRAFT_38800" /product="predicted protein" CDS join(192115..192428,192525..192762,192835..193023) /locus_tag="THAPSDRAFT_38800" /note="GO_function: GO:4601 - peroxidase activity; GO_process: GO:6979 - response to oxidative stress" /codon_start=1 /product="predicted protein" /protein_id="EED87450.1" /db_xref="InterPro:IPR002016" /translation="MDNPSWDDGSLAPIFIRLAWHSSGTYDAASNTGGSNGAGMRFAT EAADPENAGLEVARSFLEPVKAKFPQISYSDLWILAAYVGLEHTGGPMIEFHSGRVDH VDDMDPETGTVKGWEGLCTHVRNEVFYRMGFNDQEIVALLCGGHVYGRCHPNFSGYAG PWVEHPTQFSNEYAADMIEDDWTLFVNKVHGKIDNEPNQMMLLSDMILAWDPAFRQYL EVYAEDEDRLKSDFGAAFKKLTELGCGF" gene <194741..>197373 /locus_tag="THAPSDRAFT_25837" mRNA join(<194741..196018,196041..196111,196246..196315, 196471..196605,196775..196972,197197..>197373) /locus_tag="THAPSDRAFT_25837" /product="predicted protein" CDS join(194741..196018,196041..196111,196246..196315, 196471..196605,196775..196972,197197..197373) /locus_tag="THAPSDRAFT_25837" /codon_start=1 /product="predicted protein" /protein_id="EED87451.1" /translation="METTNEDEDAPFMTSWDDAFIVTTRLQSALSSRGVSTPIASSTV SPLEGSGLSGDLSLLTLDTNKYVLKRTRAGVDAKKYSKRLGLQREGAFYETIGPWIQV RLDMIPTTSQSVHTFIPKALYAEHSDITGQKAIVLEYYDNATEAGRYFPHSVLTKNAM EYDGKINAKTITLEATRMAASFHGTFYLSPALLSSDLITPHLRMANWIQGKDKESFIA SQSEVAGRWKRCKARWEKGSYYGGEVKLEKEFVDVMDKSCDEALDFDKFVSKWSVEKD GDGTAKQSTIDWSLVHGDFHPGNMLCLNNSSTNKPQLILIDWEVVGVGSGPQDIGQFL ISHLEPSEAYDMIDEVTAVYRKSLLETLEAVNSKDNTNTAPTVVPTVKQMKHEIIFGG IERWVWLFGYMCDLEESMPSLYMQYFHDQMHGWIRMSVCLVLEEETGGWMEIGADRWS MSHDLITRKYPRCIATISAGFEKGDFLVREEKGQRVYFDFFEEAFNYIAAKGYARMPK GEEAEWMDLMGWRVMKWRGNYSLCLDFGISTGHAPFHELLGWICYLDVKRNRVGLFAF EIVVICRELCHLQNLDLEESIHASVKGGMKYNTGRLLMVLHKPLIPHKGEDYDESMRG GRGYEEPDMENNAMLPMP" gene <198538..>199872 /locus_tag="THAPSDRAFT_12030" mRNA <198538..>199872 /locus_tag="THAPSDRAFT_12030" /product="aspartate aminotransferase" /note="Has EST support" CDS 198538..199872 /locus_tag="THAPSDRAFT_12030" /note="I got to this annotation as one of seven of the aspartate aminotransferases called by grail/genewise. The 5' end of the amino acid sequence may contain a signal peptide. This model shows high sequence similarity to putative aspartate aminotransferase in Arabidopsis; GO_function: GO:8483 - transaminase activity; GO_process: GO:9058 - biosynthesis" /codon_start=1 /product="aspartate aminotransferase" /protein_id="EED87452.1" /db_xref="InterPro:IPR004839" /translation="MLSSSLATILLISCSRSCSAFTGALTTASSQISRPFSFQLRDTP PDAPIEPPLNPLLSQIKPSKTVEVFSLVKQMEAEGETVTSLCVGEPDFAPPQCVLDAA TKAMTNGQTRYTAVTGTLELRQAIADDLKSRKGIQYNPMTEIVVGNGAKQCVYQGLLA SCGAGDEVIIPAPYWPSYPEMALLVGATPVILETSVNDGYLLNPTELEKCLKEHPKAK ALMLCNPSNPTGGVHSTELLLRIAKVLEKYPNVVILSDEIYERLVYTEDGKCTAFASL PNMFHRTITLNGFSKSHAMTGFRLGYLAAPARFAKATSVLQGQITSCASSVSQAAGVA ALREVDESWLENNAEIMKEKRDYVLRELSKMEGVSVAVPPNGAFYVLPDVSSYYGGDD TQLCLDLLKEKKLALVPGESFGAPGTVRISYATSMEELEVAMTKLREFLETR" gene <200093..>201208 /locus_tag="THAPSDRAFT_12031" mRNA <200093..>201208 /locus_tag="THAPSDRAFT_12031" /product="predicted protein" CDS 200093..201208 /locus_tag="THAPSDRAFT_12031" /codon_start=1 /product="predicted protein" /protein_id="EED87453.1" /db_xref="InterPro:IPR001611" /translation="MFVKKDQRKVPQILSDATSRVNTPPDTASGTKDGEGEGSSIVRE LSFARRAPEFLPARNVSLLLQPSYRPALDHLVQLSLYDCGLKTLVEISRDGLDEGVTL FPKLEQLDIGRNPLLTNDSVSAAFHTQFPSLLELWCDNCSFGPVIPPTMLKLHGLEVV RMTGNKLEGVLEEGIGAKYWKQVKILALDGNALTSVGTGLGELTQLEKLHLRQNKLTS LPEGVPSAHNSNLVMLSLSSNKLISVPASLLDTGSSLKELYLNGNQITEVPEFMGEKL VVAKKLNLAHNKIGEGSTAVGEGDVEMEGNENNLLPRDFVERFGMPDVLSGNCTKDEQ CIVLLEGNPFTEARRKKHLDEERRKAKEAEMEVDAEA" gene <203360..205501 /locus_tag="THAPSDRAFT_25838" mRNA join(<203360..203957,204033..204577,204657..205501) /locus_tag="THAPSDRAFT_25838" /product="predicted protein" CDS join(203360..203957,204033..204577,204657..205286) /locus_tag="THAPSDRAFT_25838" /codon_start=1 /product="predicted protein" /protein_id="EED87454.1" /translation="MTISRYLPILLLSSASAQQAPTHNIQQQQNADPLRRHRLQQRKK KTTRQLASSLNTLTDTNVLLEELVLLNEKASRQADTITTTVGNCNGDSPPDVYAIGST LSICIATTDSANIISSLKTVTATSNNGIAEELVDGVGDANLATTVEGIGSAAVTIETL ITLGYYYDAIGEGMAPILAVEGTAVVEGVEVAFAMEVPFDLEVPLADISTEVPAVATN LPASGDLCSCSPLQYDFILSLDQDCAVNDLVDNAGIGETICTADAAGGGASSITITSV QFLEFDTSGELLVINQDDTYATTTLVNGDSISFKSITNELSADAALGDQLDYVPGGVQ LTIRGTMDNGSGGIVDVSNRITWSFTNSCGEGDVTVESGDGIGWITIDGTTDAHPEFC PSSVPIITTIAATSAPETTASPETTEAPIETTEVPIPEMSMSMSMSMSVPGEVVTTPA ATTEAPIPETTVPSVEEPECMSMSMSMSIPEEEVEEPEMSMPMSMSMSMPEEEVEEPE ESGSMSVPQSMSVPSLFGKSGKSTKATKTLKDTDMAKSSKSDPKADKFHQPKASKIVK EEMSVGGGKSTKRLFSKRGVSMSA" gene complement(205689..>207093) /locus_tag="THAPSDRAFT_25839" mRNA complement(join(205689..206858,206974..>207093)) /locus_tag="THAPSDRAFT_25839" /product="predicted protein" CDS complement(join(205752..206858,206974..207093)) /locus_tag="THAPSDRAFT_25839" /codon_start=1 /product="predicted protein" /protein_id="EED87530.1" /translation="MSPTTNGRTNNSPICQRTSSTALNFLGSDGGILGVGAPEVVHHL LDEYRDASLERDQSCDYHSFTSLTTLASCQYTLIPNSITLLHFTTTRQAVTLIVGYFV LGPSDLYKLVKEIGKFIQNFRTLGAEATKSFESTMENQLEMTELRKAQAELNEAFSFR RSINTDSDAEAFGGTSFTDNASSASEGSGAIAATAVVGPAAGATTVEGEDGAVVKKKR RLVRRKKKVVVEKPTEEEREIEKEYPDLDILESDFPESRTGISEEERLRTERLERLGG STTTSEPDWFKASDEDVASKILDQPAPANPALERYETDRFKSQLSAEQWNAQIMANED ELAPLSMVMKRLAILEEEKQSADRLLEEEYQKRMDNEDKYYLEKRRVLEEAITDIQEG VYASKDAETASGPKFS" gene complement(208423..209355) /locus_tag="THAPSDRAFT_25840" mRNA complement(208423..209355) /locus_tag="THAPSDRAFT_25840" /product="predicted protein" CDS complement(208608..209321) /locus_tag="THAPSDRAFT_25840" /codon_start=1 /product="predicted protein" /protein_id="EED87531.1" /translation="MCMHVDLQVAMSSILSKLTGKDDTSAPPLTPKDIVAALQSRGWE AEIISASSISQDMVEVDPAGILKCVDGRGSDNTRMAGPKMPGGIYAIAHNRGTTSVDG LKEITKEVASKGHVPSVHGDHSADMLGCGFFRLWVTGEFDSMGYPRPEFDADQGAAAV KESGGVIEMHHGSHTEKVVYINLVENKTLEPDENDQRFIVDGWAAIKFNLDVVKFLVA AAATVEMLGGPRIAKIVVA" gene <209939..>210882 /locus_tag="THAPSDRAFT_12035" mRNA join(<209939..210178,210449..210745,210775..>210882) /locus_tag="THAPSDRAFT_12035" /product="predicted protein" CDS join(209939..210178,210449..210745,210775..210882) /locus_tag="THAPSDRAFT_12035" /note="Vti1-like SNARE component, Qb SNARE; GO_component: GO:16021 - integral to membrane; GO_process: GO:6886 - intracellular protein transport" /codon_start=1 /product="predicted protein" /protein_id="EED87455.1" /db_xref="InterPro:IPR007705" /translation="MSSSIPFERYDEEFLSLTEQVTSKLRSLDPSTSSGLPPTAEADI KMAHNLLLQADDLLKQMGLEARGVDDAGVKRDLLGKVRVCKTRLANLRDDYEAAKSNV ERNSLGLNSDIESGGRNSRGRSSGSKERLLSNTDALQSQSETLANARSIMAETEGVAM EITEELGRHRETITSAHGRGKEDSAEYGEEGGAAEVDFVWCRGDGGGGVFVFSL" gene complement(<211324..>212877) /gene="Ppx1" /locus_tag="THAPSDRAFT_264901" mRNA complement(join(<211324..211864,211886..212508, 212536..>212877)) /gene="Ppx1" /locus_tag="THAPSDRAFT_264901" /product="protoporphyrinogen IX oxidase usually abbreviated" CDS complement(join(<211324..211864,211886..212508, 212536..>212877)) /gene="Ppx1" /locus_tag="THAPSDRAFT_264901" /EC_number="1.3.3.4" /note="Converts protoporphyrinogen IX to protoporphyrin IX; EST support; GO_function: GO:4729; ATP binding - protoporphyrinogen oxidase activity [PMID 5524]" /codon_start=1 /product="protoporphyrinogen IX oxidase usually abbreviated" /protein_id="EED87532.1" /db_xref="InterPro:IPR002937" /translation="DCLVIGGGISGSTLAHNLHSTHSLNILLAEARDYLGGNVQSVQH TDDDGTFIYEKGPNSFATQPSIVRISHELGIEEELVFADESLPPWVNHNGKLHPLPKG KGGKGPKGQIELFGLAGDLLSWPGKIRAGIGAFVGHPAPPTDREETIQDWVTRILGEE VFYRIIDPFVSGVYAGDPNTLSMKAALPKISRIEQYSYDIDWNKFGAIFYGGLARQVE LTKERKADPPDANWVDFEYGNPGSYKKGLSTLPNAIKEELGEKVKLEWSLKKVEKVDG GYVATFDTPNGEEKVNAKTVVSTMPTHAIGSILEPVLPGSTDLFSQKGIYYPPVAAVT LAYPKSAFKDVELDNEFGNLKDLPGFGSLNPRTEGVRTLGTLWSSSLFPERCPSDYNI LLNYIGGSRDPAIGTMEEKDIIDAVDIDLRKVLLDSTAPPPKVLGLKVWPTAIPQYEL GHLEILQELEAMEKKVEGGGIWICGNYRSGVAFPDCVTFGYEHAQVVKDYLD" gene <213453..>216062 /locus_tag="THAPSDRAFT_12037" mRNA <213453..>216062 /locus_tag="THAPSDRAFT_12037" /product="predicted protein" CDS 213453..216062 /locus_tag="THAPSDRAFT_12037" /codon_start=1 /product="predicted protein" /protein_id="EED87456.1" /translation="MDVRGGGGDQEHQYHPQQHEHVVSQASHCQRSDMRGGAQRLKSP PTPQFQDLDYEPDAEDEADEYYEYLNDNDADYATADDGGFIMGDDGEYTDEEEEYYYD DYDTAEEFDDDEQMILGEDNLFYFPDDDTNNPSDNAVDDDATSDQRRRFSVKNPLRPM KTTSNNNINNKHRSNTSLTIRGALPTAIPSLISSVSNNLSSLPSTISMLPSTLSSNIS NGFHFGMNSMANMGDGLSKALSALPSSLSNMSVENGSIMVISVVMAVVFVRTVVLGSG NLLGGGKGEVIRGRFFGRGGTRSSSGGRKKTRKNKRKSRVVNTKQGGASKLWARLFSK SSKEEEESSTSVANGGIGNSKYGDYSEGYPGEEEPDDGIMDLDDIESEFTMKQRSHWM VPTSLSSLRSSIGNVGKALVPKTANVLVENVWIWMCATGEKMGVGGFRRRVRIGSSSK NNGSGGEEMKGELDIGNTVDLAKDNVGVFASSKDDGSKEASVNNVTLSKESTEKVGVL QNQLDTLTKSHESLEQEYEASLRMLHEARMELRQLQSQRNNGIGDTDDSSQKKQMEAM VKKLESKYKQQMKDQVERVRGQMEGKLRSEMEVELQEKVEAKMRAELEEQFEQQMEAR AKEMQESFDAELQRELDLRVDEIKREKEDIAGTKFQEAINEAVAHEVDQAVEEAVQQA VQREQQKARDEMIRVRQGIQKVLERERRLMKEQVKRSTGQVREWVKQQQMEQLARREE QLQEEVEQLQMLEEEEAMGDRRRGNARQGGGAGARRSGQINSGRYGSSFYDDRKRAAR PRDMDDEDIPRSFAEKEDYYEDVDGIGDGQDESMQQQQQRRVSRDSPSSEGSGPRNVY KQAAQRKQQQRRK" gene complement(<216212..220923) /locus_tag="THAPSDRAFT_25842" mRNA complement(join(<216212..217485,217783..218601, 218698..219782,219896..220923)) /locus_tag="THAPSDRAFT_25842" /product="predicted protein" CDS complement(join(216212..217485,217783..218601, 218698..219782,219896..220809)) /locus_tag="THAPSDRAFT_25842" /note="GO_component: GO:5576 - extracellular region; GO_function: GO:8061 - chitin binding; GO_process: GO:6030 - chitin metabolism" /codon_start=1 /product="predicted protein" /protein_id="EED87533.1" /db_xref="InterPro:IPR002557" /translation="MKIATSALTLYTLSKQTTSTSAQQCVSGSFVLEFTGTCNYDTVL AAYTDQVYDVAGGRSASCASSAEDDLDAKLLDANLTIETLCGQIYNSAHSAPFTDAAK KGRDLKFESHFFNGRTEWQEEVETIYETQDSSATKILKEDAEAVRAFYEGVAQRKRVD WPGVLPNFQSSVTDANGLATCTTNAAMCCWPKDRQANDNNGNCAKPYDLNCVDKDPAD NTDLCFVDLERGAAVNDFESGDGLIVFPGDNNDGEGAIHCHGLAWGNDVNDHTARYKA NNLFFVSMYDHMYQRGYVENIPGAPMCGCVEQMPTVSRSDCTQVDLTETIKIAYDSLA TVSPFKGKITSVDVDFNACRGINNRNNDLWAYMAKLYYQGDISAEQFGEAGRIITNDG CEEAVKYQLNKQSLKTGYDHDDFTWTFITGRDSMKLSEGYGNKAFTRSLTQESLSQPN GIAYRACATCEKTHKKIYYRRLTEVPEEFDLLTNLLYYGNNGGGNNVWNEDFTMHSTY DDALTGANPWKCPGNSYNYGAGFPGECSPTGARVRNQQSRFNNSGERNDVAWYVNKPE RHSLEIVPTNVIKSRDYAAGAAVEAEDGTIYVTGAGRDIWGNADDFNYYSQPAEGDQT IIVNAASQSTPQGDGWSKAGIMIRQSMDPDSPHASVFLTGNQAWIKLEKRMNTYTSFI GQDDGSGNIDWTVMKSFDIPLMGDSFNVGLAVCSKRWYQMEVVFENYSTDSYYFPSAA PSISSAPTALVPSVDIGAVGLAGSASESSTGTWTVSGSGYDIWGRNDQFHYANFVHSG DVTVTMFVENYDVEHYWAKAGLMIRDTLATNSMHYSLFMTGGAGLAQMWRGCTNCGMG NSQTASIKDSSLWLRVQKVGNVFTASYKKVGGTEWVAFGSTQTMAFTGDSFYVGIANT SHDNAKIGVLKGTNFEVVETTPAPVPIQSIVFGTLSQSTNADWNPASSVSLCPPSHNG FLPLPGCTSYINCWGGTLQYKITCPEGLLYDENRGYCHDSVKCNVLNDAGNPTLVIDV VSDVVCPDYHSGRLGVDDCSAYVDCVNGFEVGRERCDEGTMYSTSTENCERDVTECSS IVSMGDAVKEGNGCGEGQCFTADGECAEFASCFIDPCESIPCNDNEICEASYCGGCDA VCSLQLNVAPQLIDDSGMYLPEKDDGSLVVKSSDVPKLIDDSGMWPIATMPTGGTEKV PDIALENVQPAPSTMTTAATATATKTTVPTVLEITSPGQFYPDWEGRSCNPKDDNAIA KVGQWYSYYPSIFNCCVANFLSDLQTFKACLDFDIETLEPPQQEGGLKGTKVKYYPNW DENTCNQEDGSTPEWVKSTLKGKKYLCCFEFFKWAFSACMKDA" gene <221437..>223065 /locus_tag="THAPSDRAFT_25843" mRNA <221437..>223065 /locus_tag="THAPSDRAFT_25843" /product="predicted protein" CDS 221437..223065 /locus_tag="THAPSDRAFT_25843" /codon_start=1 /product="predicted protein" /protein_id="EED87457.1" /translation="MQLSTTLTVSFLALQQFIAAAHRVDHKAAPSVPTNQRLRRAKTV KSDEETVQETAPATTQKGAETVSASDSGKPDKKGDKEDKKKSVESSSYYSYDGKAGKE DSGIELQDDEFTVGDPIDVHFELEPAKLVERGVTLNMTNSGDWKVGIFMKMENPQGGA LEPIVSLVPSITEKDEDTLEGDVTFSDDAAIVALMGGREPSWPTDLIQFGTGYDIWLL DEVGAGILGPEDFVLELTEEMKEEAKLIEEEKEKEPKHGLMLYDKGGKKKGDYSSYSD GNSSKAGKAEEAAPLLVLGTDVQSLSEYTLVTDKTEYLDNEDIVVSFDISETSQVVRR LQSSDLQYRKQFKQRRLLPKADKEDEDKGKKGKKDDETTTSTTVAAGGTEGSTTIPSE ESSTTLPTDVTTTTNEDAQGEDELEEGPPDMIDPELEEVIDRDDLSLYKIGIFMRMAR PQGGKLEPLVSVPLCGQVDCSTFTVDQLKAGSVTFGASHLSSMTGEWPLNTGEHGTGF DIWVLNGRGEEVAGPTEFTIPELEVEGGDEEEIA" gene complement(<223870..>225480) /locus_tag="THAPSDRAFT_12040" mRNA complement(<223870..>225480) /locus_tag="THAPSDRAFT_12040" /product="predicted protein" CDS complement(223870..225480) /locus_tag="THAPSDRAFT_12040" /codon_start=1 /product="predicted protein" /protein_id="EED87534.1" /translation="MSKATTSLAIALLVHNANSQCTTSGTISTFSGKECNRALFEANL VDGCTVADLFDTTTVDADTQIADLCKYDAPVQFVEINGYYQLDKRYFNGGGPLIDSAE PFGVEAGRILRFDANSGGNTLIGWPEYAALVGYNAQELSTEENPELGDHGYPPNFDIV NSCDLNTVMCCFIDDVADTGFAAEDSTTDVCRHDLLNSPKANHIKDGWSVFPNAETST HCVGFTWEDGADSDLFKGNALYDISLRNTANKGYIKSIPGAPLCGCIEQMPIVEKADC RTATGGDITFTFTHDAETGEVTASNVVDVTYADCAEADLAAHIKATHPTFADAIDLHL VGDGGCAADLTTYLNDEQFLVAGTHATKYKSITEADGWKFVAGEGIRFLPPKIDAEAA DAEFRALINAGCKDDGDVDRPCLIRRFCDSCSSETHRDIYYKRLTPIPEFGEAEGQVY FLDLFLNNWNSQPANVLNTDFELYSTYEDAIAGTNGWKKCNYNDAGVGFPRDCGPEWN IGSQWNSYIRDGASANNHGFYVELASTA" gene 225833..>229048 /locus_tag="THAPSDRAFT_25844" mRNA join(225833..226395,226488..227080,227123..>229048) /locus_tag="THAPSDRAFT_25844" /product="predicted protein" CDS join(225906..226395,226488..227080,227123..229048) /locus_tag="THAPSDRAFT_25844" /codon_start=1 /product="predicted protein" /protein_id="EED87458.1" /translation="MRPSTFALVSLTLKELSHLTNAQTCSGIQGPFSFQFAGSCNYDT ILEEYTRQVFDAAGNLPGTCGSVVGGETSTGLTAKDDLDAKLAGTSVEEICNAAYDSA EVTKFHNAARQGTDYHFEQMFYNGRSHWQEEVETLYESVDGTATSILKQDAKAVDDFY RADGTHGRVEWPNMLTNFDDATCTMNAAMCCWPKERQANDNNGNCAKPYDLNCVDKDP ADNTDLCYVDMEDGSASNEFGGDGFAVFPGDNNGGEGAIHCHGFAWANDEYDSISRYK ANNLFYVSMYDHMYQRGYVENIPGAPMCGCLDQMPIVSRSDCTQVDLEEEFKLVFDGT DFAATLIKSEIDFNACRGINNRNNDLWLPPFAHITFTITSSLRQAYSARLYEEGRMTN EQFGQVGRTITDNGCDHAVAYEQEKKGLTTGFMYDLDAWTKVAGRDALKEAEPFGREA FNAALFKYSLTAPSDLATETLDNTPILMRVCATCTKTHKKIYYRRLTPIDNEKFDLLD NLIYQRNNGGGDNVWNVDFTMHSTYEDALTGVNPWKCPGDAYNYGDGFPGQCSPTGAR VKDQDSIFNWLDNTKTDVAWFVNKPADVGVQDLIDNPNTRSDGFYADIDIGDVNIEGN TLEHNGVYHITASGKDIWNQADTFHYHSQPGFGDVDVSAHLSSFANIRNQHAKAGIML RADHDDNSEYIFALMTGSRGLYLQGRSSKGNYAKTYGSNYATNPVQTAAWLRLVKKMD LVEMYRSDDGVEWTLHASANIFFPNDTFRIGLAVTSHDNRYLSEATFEDYQINEYFFP TSSPSISSAPTVWEPAVDVGDVQRAGSITEPNADGVARVYGSGTGTWGHNDAFFFHNV QKSVGGALEVTVHVNNFSYHYQYGKGGIMIRDSNDPDAANAFIAIAGRNQGVVFQSRE EAGSPTTHHSFDWIQNNDVWLKLTKEADSSVVTASYKHSSDTEWTVAGATSLMLTGET VQVGLAVTAGDAYQHALAQLDYKEYDVVES" gene complement(229748..>232040) /locus_tag="THAPSDRAFT_43120" mRNA complement(join(229748..230921,231023..231468, 231957..>232040)) /locus_tag="THAPSDRAFT_43120" /product="predicted protein" CDS complement(join(229973..230921,231023..231468, 231957..232040)) /locus_tag="THAPSDRAFT_43120" /note="GO_function: GO:5489 - electron transporter activity; GO_process: GO:6118 - electron transport" /codon_start=1 /product="predicted protein" /protein_id="EED87535.1" /db_xref="InterPro:IPR006662" /translation="MNNISGPGRRSAMSSVDFYRRVPKDLTEATSLGAIMSICAITVM AILFFSETLAFARTAMVTSIALDENDQPQIRLNFNITLMDLHCDFVSVDVWDTLGTNR QNVTKNIEKWQLDEDGQRRIFSGRNREQREVVHEEHEETLEELHEDGEQAVELHPENF KAFLEGHDMAFIDMYAPWCIWCQRLHPTWEKFGEKVHELGMPVGVGKVDCVVHAQLCK DEKVMAFPTLRWYKDGEAILPDYKMDRTVDALVGYAKRKLDMEQKYKDWESKNAGGNA DARGKPRGGTSRPEHPGCQVSGHLMVNRVPGNFHIEAKSVNHNLNAAMTNLTHRVNHL SFGEPITKLPPHMENTPFMRKVKRVLKQVPEEHKQFNPMDDTEYVTAQFHQAFHHYIK VVSTHLNMGSSSKSEYSVNDVNAVTVYQMLEQSQIVFYDEVNVPEARFSYDMSPMSVV VQKEGRKWYDYLTSLCAIIGGTFTTLGLIDATLYKVFKPKKL" gene complement(232315..>234089) /locus_tag="THAPSDRAFT_25846" mRNA complement(join(232315..233476,233629..233821, 233928..>234089)) /locus_tag="THAPSDRAFT_25846" /product="predicted protein" CDS complement(join(232365..233476,233629..233821, 233928..234089)) /locus_tag="THAPSDRAFT_25846" /note="GO_function: GO:5489 - electron transporter activity; GO_process: GO:6118 - electron transport" /codon_start=1 /product="predicted protein" /protein_id="EED87536.1" /db_xref="InterPro:IPR006662" /translation="MYGDINGGGSRRRAVGTADLYRHVPKEITEVRWCSRSTCALMHA FSWFKDALRDATKIGVVMSLLSIFIMILLFFCETYAFSRSTISSTIAVDPNSEQLLRL NFNVTLYDLHCDYASVDIWDTLGTNQQNITKDIVKWNLDDQGQRKKFAGRNAEQRAVT HEEHDETLQDLADALGGELHAVALDPESIVEFHKRHNGQAIIDFYAPWCIWCQRLEPT WEKFARQVSDERINLGVGKVDCVTHAQLCKDQRVMAFPTLRWFENGKAVMPDYRGDRT VDALVDYAKRRVGSNEGSNDEEFEEDHHPGCLISGHLMVNRVPGRFQIEARSVNHELH SAMTNLTHRVHDLTFGALSGPPGHMLHVLPFFDTVPEKYKHTNPMQDKYYPTYEFHQA FHHHLKIISTHIDYLFSRSTVLYQILEQSQLVFYEEVNVPEIQFSFDLSPMSVNVSKE GRKWYEYVTSLCAIIGGTYTTLGLINATLLRIFKPKKL" gene <234704..>235630 /locus_tag="THAPSDRAFT_12044" mRNA <234704..>235630 /locus_tag="THAPSDRAFT_12044" /product="predicted protein" CDS 234704..235630 /locus_tag="THAPSDRAFT_12044" /codon_start=1 /product="predicted protein" /protein_id="EED87459.1" /translation="MPQLHTNNTQYGVSDWNPSQNAMLLRPDAREFGQTGNQIREFFH AFDMARDENATIFLTEEGFPMKVLREIFLGLEGGTPYWKERAESLFGVKVIDQLNDDN LIESFESWEYIQKTVKLFRYRSTRHTLEQRKNHRRYLLQNLYQLTAHEMMLNPHSQST TALCSSLYDFFGVGEKSGDDQSTSYHSYNITDKYTVIHSRYLLRLGALAEHKLGVDSQ ALTNMPPDFTSAILSPLGMLNNSIIMISDGQNIEAAERLSLDTVVGPVFQLVPSNISS MVSDMVRKRGVPVFKIVYLFGIEDPHCLTNFL" gene complement(235994..239306) /locus_tag="THAPSDRAFT_25847" mRNA complement(join(235994..238131,238245..238874, 238965..239306)) /locus_tag="THAPSDRAFT_25847" /product="predicted protein" CDS complement(join(236052..238131,238245..238874, 238965..239248)) /locus_tag="THAPSDRAFT_25847" /codon_start=1 /product="predicted protein" /protein_id="EED87537.1" /translation="MTIITPAALLTLAATVSTSTAQECSVAGGFTLEFEGSCNYETIL SAYTDQVYNAAGALASGCSTSAKADFDAKLLDANMTVEDMCSQIYRDDNTAPFSDAAA KGDDLNFEQHFFNGRSEWQEEVETIYETEDGTPTSVLKEDAESVRAFYEGVAQGKRVE WPGSLTNFQSSVTDANGLATCTTNAAMCCWPKDRQANDNNGNCAKPYDLNCVDKDPAD NTDLCFVDLERGAASNDFESEDGVIVFPGDNNDGEGAIHCHGLAWGNDVNDHTARYKA NNLFFVSMYDHMYQRGYVENIPGAPMCGCVEQMPTVSRSDCTQVDLTETIKITYDSLT SAFTGKITSVDVDFNACRGINNRNNDLWAYMAKLYYQGDISAEQFGEAGRIITNDGCE EAVKYQLNKQNLKTGLVHNAEVWTKVAGRDGLNDGLPFGKNTFYISMFEHSSTAPSDP SAEVLDNTPILLRACAGCVKTHKYIYYRRLTSIANGDFDLLYNLIYERSNGGGDNVWT EDFTLHSTYEDALTGANPWLCPGDAYNYGDGFPGQCSPTGARVKDQGSIFHWSSGTQL DVAWYVNKPESEGIQTFNANTRSGSFTDVDVGTVGATGRTMVTTIDGEDNYHISVASG DIWNQADNFHYLAQSKSGDIDVSVNVASFTNIATVWAKAGIMLRADNSPNAQYLMATL TGGRGVTLYGRTSRGNWATEFGSRYQENPVQTSAWLRLVKKMDLVEMYRSDDGVEWTL HASANIFFPNDTFRVGLAVTSNNMGTTSEATFEDYQINEYFFPTSSPSISSAPTVWEP AVDVGDVQRAGSITEPNADGVARVYGSGTGTWGHNDAFFFHNVQKSVGGALEVTVHVN NFSYHYQYGKGGIMIRDSNDPDAANAFIAIAGRNQGVIFQSREEAGAVTTHHSFHWVQ NNDVWLKLTKEADSSVVTASYKHSSDTEWTVAGATSLTLTGETVQVGLAVTAGDAYQY ALAQLDYKEYDVVEEGAAMRRMLRA" gene 239888..241578 /locus_tag="THAPSDRAFT_25848" mRNA 239888..241578 /locus_tag="THAPSDRAFT_25848" /product="predicted protein" CDS 239892..241502 /locus_tag="THAPSDRAFT_25848" /codon_start=1 /product="predicted protein" /protein_id="EED87460.1" /translation="MSKATTSLAIALLVHNANSQCTTSGTISTFSGKECNRALFEANL VDGCTVADLFDTTTVDADTQIADLCKYDAPVQFVEINGYYQLDKRYFNGGGPLIDSAE PFGVEAGRILRFDANSGGNTLIGWPEYAALVGYNAQELSTEENPELGDHGYPPNFDIV NSCDLNTVMCCFIDDVADTGFAAEDSTTDVCRHDLLNSPKANHIKDGWSVFPNAETST HCVGFTWEDGADSDLFKGNALYDISLRNTANKGYIKSIPGAPLCGCIEQMPIVEKADC RTATGGDITFTFTHDAETGEVTASNVVDVTYADCAEADLAAHIKATHPTFADAIDMHL VGDGGCAADLTTYLNDEQFLVAGTHATKYKSITEADGWKFVAGEGIRFLPPKIDAEAA DAEFRALINAGCKDDGDVDRPCLIRRFCDSCSSETHRDIYYKRLTPIPEFGEAEGQVY FLDLFLNNWNSQPANVLNTDFELYSTYEDAIAGTNGWKKCNYNDAGVGFPRDCGPEWN IGSQWNSYIRDGASANNHGFYVELPSTA" gene complement(<242306..>243934) /locus_tag="THAPSDRAFT_12047" mRNA complement(<242306..>243934) /locus_tag="THAPSDRAFT_12047" /product="predicted protein" CDS complement(242306..243934) /locus_tag="THAPSDRAFT_12047" /codon_start=1 /product="predicted protein" /protein_id="EED87538.1" /translation="MQLSTTLIVSFLALQQFIAAAHRVDHKAAPSVPTNQRLRRAKTV KSDEETVQETAPATTQKGAETVSASDSGKPDKKGDKEDKKKSVESSSYYSYDGKASKE DSGIELQDDEFTVGDPIDVHFELEPAKLVERGVTLNVTNSGDWKVGIFMKMENPQGGA LEPIVSLVPSITEKNEDTLEGDVTFSDDAAIVALMAGREPSWPTDLIQFGTGYDIWLL DEVGAGILGPEDFVLELTEEMKEEAKLIEEEKEEEPKHGLMLYDKGGKKKGDYSSYSD GNSSKAGKAEEAAPLLVLGTDVQSLSEYTLVTDKTEYLDNEDIVVSFDISETSQVVRR LQSYDLQYRKQFKQRRLLPKADKEDENKGKKGKKDDETTTSTTVAAGGTEGSTTIPSE ESSTTLPTDVTTTTNEDAQGEDELEEGPPDMIGPELEEVIDRDDLSLYKIGIFMRMAR PQGGKLEPLVSVPLCGQVDCSTFTVDQLKAGSVTFGASHLSSMTGEWPLNTGEHGTGF DIWVLNGRGEEVAGPTEFAIPELEVEGGDEEEIA" gene <248147..>248763 /gene="cht1" /locus_tag="THAPSDRAFT_264902" mRNA join(<248147..248173,248290..248419,248528..>248763) /gene="cht1" /locus_tag="THAPSDRAFT_264902" /product="chitinase" CDS join(248147..248173,248290..248419,248528..248763) /gene="cht1" /locus_tag="THAPSDRAFT_264902" /note="see gene id 109953" /codon_start=1 /product="chitinase" /protein_id="EED87461.1" /translation="MDGVDMVMLTAETDANQVHVAAAVVEITTTVEFRGRTPIANVEL LVLEEITPNVPQDNSATLTAQTVHSTMAFPTRTEHKKRVESRTITTTAELRGLLPIHN VEYRVAGGMMQSVLMAKHATLTAQTVHR" gene <248895..>250000 /gene="CHT1_1" /locus_tag="THAPSDRAFT_264903" mRNA join(<248895..249240,249347..>250000) /gene="CHT1_1" /locus_tag="THAPSDRAFT_264903" /product="chitinase" CDS join(<248895..249240,249347..>250000) /gene="CHT1_1" /locus_tag="THAPSDRAFT_264903" /note="see gene id 109953; GO_function: GO:16787 - hydrolase activity; GO_process: GO:8152 - metabolism" /codon_start=1 /product="chitinase" /protein_id="EED87462.1" /db_xref="InterPro:IPR001223" /translation="YYQSWAIYRNGNCNPIQPNQIDVTLFGYTHLAFSFAGISYSGYM EPYNGDTGFYSMYSTFNSMKATYPTLKTLIAVGGWTFDQSRFVYVSSTEARRTAFASS VVTFLETHGFDGIDLDWEYPVTRQGTAADYANYPLLCQALREAFDNAGHTDWLITIAT SINSDKLALGYDLMGMAPYVDWFNMMSYDIYGSWDSTAGANADVPFITNTMDYVFGLG VPREKLVLGLAAYGRSVRLSSTSCTTDGCAINGAGLSGCHGEAGNLPYFQIDETYLQT GNYDSLTLNPTSLSMELVTGGNQYWTSFDNAETINIKHNYANSECMRGVMWWAVDLM" gene <249976..250756 /gene="CHT1_2" /locus_tag="THAPSDRAFT_270127" mRNA join(<249976..250266,250357..250756) /gene="CHT1_2" /locus_tag="THAPSDRAFT_270127" /product="chitinase" CDS join(249976..250266,250357..250662) /gene="CHT1_2" /locus_tag="THAPSDRAFT_270127" /note="putative chitinase. Contains chitin binding domains also referred to as a peritophin-A domain common in animal and insect chitinases. This is an extracellular domain with six conserved cysteines. Also contains a o-glycosyl hydrolase domain suggesting that protein can hydrolyze chitin oligosaccharides. Also contains the chitinases family 18 active site; GO_component: GO:5576 - extracellular region; GO_function: GO:8061 - chitin binding; GO_process: GO:6030 - chitin metabolism" /codon_start=1 /product="chitinase" /protein_id="EED87463.1" /db_xref="InterPro:IPR002557" /translation="MWWAVDLMKSPLDNYSTNSPTTSSAPSTATKSPSKAPTPLPTSS PIIPADCGVACDAGYTGLMPNQECTAFYHCSNGVKLGNNIQCPSGTLFDFNFQTCNHA NAVSCTCTAGTAPVPSPPTGTSSPTVTGASPTPGDPNAICDACPPSSYTMLASNGCTG FYYCNAGSASPFTPCPEGTLYDSGFKGCNWVDKVTCSC" gene complement(<250735..>251886) /locus_tag="THAPSDRAFT_12049" mRNA complement(<250735..>251886) /locus_tag="THAPSDRAFT_12049" /product="predicted protein" CDS complement(250735..251886) /locus_tag="THAPSDRAFT_12049" /codon_start=1 /product="predicted protein" /protein_id="EED87539.1" /translation="MRTALTAVSALSAALMPSASHSFAPIHSLAHPIATLPATSSSNE EDTTQVLLLDHININHQKGRHDLLKAFYFDFLKCSIDSRKLDNYQSGRKTVWANVGMH QFHLPEGKPDAQVFDGMITLVHSNLEGLMERYNQYLDGEDAFVPLMGTEFDVIVEEDG EDIMMIVNDPWGTQFCILPSDDVDEDRAAYLGEQPLLKNHELSEGLCLEDLTVYVPHD ANLEGIGRFYQYVLGAPTVDELTTENQISIAMGDRQTLTFQHHPDGEAEVAHHDLTYE VKDDEEETDDSRPFYPSNHGPHISMYVTNLPYAYQMAEKLDALYVNPRFKRRAYSEEE AIDQCMFRILDIVDPLDETKGVILRLEHEIRSTKTRDGKKYKSCPLFDV" gene complement(<252461..>253738) /locus_tag="THAPSDRAFT_12050" mRNA complement(<252461..>253738) /locus_tag="THAPSDRAFT_12050" /product="predicted protein" CDS complement(252461..253738) /locus_tag="THAPSDRAFT_12050" /note="GO_component: GO:16021 - integral to membrane; GO_function: GO:5554 - molecular function unknown" /codon_start=1 /product="predicted protein" /protein_id="EED87540.1" /db_xref="InterPro:IPR004776" /translation="MTWVYLPVLNALIQILITIALGFVSDFFGIVSADRLVPEAVHLV FYVLLPSLIINGIGIQIDLYTESNVWAFIVAFLILRAIALILSLAIALFVNWNQQQRV LGLGDVAVLWLSLSWISTVILGVPICTAVFENPTLGAKYGIMAGISSFIFQLPLQLMF LECHAAEEAQRTSEATGRSTNSAREELPSQEKERESAVITIQEASEESFHQLSTEDIA PTAIGDRIQEAELHLRRSWWSLVYAEHLPDMDLWLDILRRVLKNPVVDGIFVGIVISL STAGRYLRCPSDTCVEGLEWISATLGWLGNCVSPLSLFAMGAWMHSQRKLILIPIPNL CVAMVSKLIVVPLLMVGLAKGMKLNNESARAAVLIATLPISLASFSLARQYNVGEKDL AANVAFGTLLMLPTVIVWNIVLDSVGLYPIEIV" gene <254179..>255987 /locus_tag="THAPSDRAFT_264905" mRNA join(<254179..255567,255622..>255987) /locus_tag="THAPSDRAFT_264905" /product="nadp-reducing hydrogenase" CDS join(<254179..255567,255622..>255987) /locus_tag="THAPSDRAFT_264905" /EC_number="1.12.7.2" /EC_number="1.6.5.3" /note="Complex 1 electron transport, shows most similarity to operon identified in the bacteria Desulfovibrio fructosovorans. Supported by EST sequence; GO_function: GO:4672; electron transporter activity - protein kinase activity [Evidence 5524] [PMID 5489]; GO_process: GO:6118; protein amino acid phosphorylation - electron transport [PMID 6468]" /codon_start=1 /product="nadp-reducing hydrogenase" /protein_id="EED87464.1" /db_xref="InterPro:IPR001041" /db_xref="InterPro:IPR001450" /db_xref="InterPro:IPR003149" /db_xref="InterPro:IPR004108" /translation="IANALVSLRVDGRPVKVAEGATLLDAINTSGSHVPTLCYHPEFQ PKAVCRMCLVNVKEANATATAGKLLPACRTKVEEGQEVTTNSEDIKAFRRRDLQFLLN RHPNDCMRCEAAGNCKLQSLVQEECVEDMWPTKTSRGSDEHPHLLLDHTSPSIWRDMS KCIECGLCVDACSAQKINAIGFAERGSGMLPITAFDKPLSETGCISCGQCILRCPVGA LIERPDWHRVLDVLDDRKRTTIVQTAPATRVAIGEEFGLEPGSVSTGRMINALRELGF DYVTDTNFSADLTIMEEAHELLQRLQGKREGALPLFTSCCPGWINYVEINRPDLIPHL STTKSPQQMHGAIARNGPMAKQIAAQTSESEEPYIVSIMPCTAKKDESVRPGNRGDID AVLTTRELAKLIRHRDIPFASLSNDGEYDSPMGESSGAGAIFGASGGVLEAALRTAAD TLGLDGKNVDTLQHEQLRGVDRGIKVASIKGVGSVAAVSSIGSAIELLNTDHWKKFLM IEVMACPGGCLGGGGEPKSDDKDILKRRMAGIYSIDKNAPIRKSHENKEVQQLYKDFL SFPLSEISERLLHTSYAPR" gene <256553..>257992 /locus_tag="THAPSDRAFT_12052" mRNA <256553..>257992 /locus_tag="THAPSDRAFT_12052" /product="predicted protein" CDS 256553..257992 /locus_tag="THAPSDRAFT_12052" /codon_start=1 /product="predicted protein" /protein_id="EED87465.1" /translation="MMCVKIVFGLVMAPICIVLLLVATLSGIGPLINYLLTRRDKESL DRDTILQQHPEMFLIDIPAGINHASGDKSYRAMVRVTKPTEPPADKLPPVIVPGGLAS NLMTMSRHQDELTSKYGFTVVNFDRLGVGLSDAYPSSTSQSPSAADVAREMNYVMTHC VGIDDTEKWIQVGGSMGTNVATAFVALFPNRLCGFFNLDGLPHAFLQIQCKKFLKDGK QIMDVMNMIQWTGIPRLAFTAAIQPMLPVMGDAFTKQQMIGVMCREQFFTTTGLEYTT LMSCCDLECAAWGKQATTEYDGDTLRVLASIAPDESVILNESKGVPRKVTTERSKSEL GNCFLTHDDPECVKVERTFRTLALQDPTDIDRNKTHCNWPQPSPKHPVGEFVGGVDED TTIYPLAPQFKAMVVRIMCARDYTGLEQNYSQEARNHAAARCSLQSLMSDDAKVYYYP QLSHLNLWQQVSEVTSVTNEIAQAVMRRK" gene complement(<258094..>259554) /locus_tag="THAPSDRAFT_12053" mRNA complement(<258094..>259554) /locus_tag="THAPSDRAFT_12053" /product="predicted protein" CDS complement(258094..259554) /locus_tag="THAPSDRAFT_12053" /codon_start=1 /product="predicted protein" /protein_id="EED87541.1" /translation="MQTNTSINNNNKNNDETSPESAALIRGAASTTKSVAKATRGTMV GRLPSMASASSSSRGAASTVSTATAKSASRSVVSKGVVMKTASRAPRPSSTAMTRGAP RSSATVRIPRTSSPKLSARIPKGRANTIAKASDRVSTTWDVYTAGIEGGGGNESAGDT TSTTKRQQRRAEKWSRNIPSLNQMASVGRVFVINAVLGMAVFATYEGMIDYLAPPFSK REEVDETDAPLFAENEPTVYDATIIDTANIELEHHTPQDAVNDKDESDAMDRASIPQH LLAGGLGGAAHAILSLAMELKGTVATKNSSGLTMPMNSNSLRVQIPKRINSMMSYNAK CLPSTASVLSLQYPTMRYSLASIAHHSLAHSALFGSYQFTKRRLLQYSLDFEDTNTDS TNATSMSSSFSIKDMAHASIIATAGGIAGQLQHITSHFTEQWLGLGGSGVKTSFRLAT LPSLRSTLVAFPPSAIGFLAFEYGKLMVTGDEDTSD" gene <260249..>261557 /locus_tag="THAPSDRAFT_38775" mRNA join(<260249..260535,260633..260688,260803..>261557) /locus_tag="THAPSDRAFT_38775" /product="predicted protein" CDS join(260249..260535,260633..260688,260803..>261557) /locus_tag="THAPSDRAFT_38775" /note="GO_process: GO:9058 - biosynthesis" /codon_start=1 /product="predicted protein" /protein_id="EED87466.1" /db_xref="InterPro:IPR001296" /translation="MIEPTPFTHVSGYANRFKEMLRYLSKAGDNVDILTVDSKTPKEE LPKEAFGYSIEHTQGFVFPLYNHISLTVDLPEMKGAKMMERRRPDLIHVTSPGFMLYA GLFYARVMRIPLLLSYHTHLPLYGRNYLGFIPGIEEFSWGLIRFAHSRADLTLVTSPQ MKEEMEANGVPRVEVWRKGIDTVRFHPKYKSEEMRRKMTDGNEGDFLMVYVGRLGGEK RLKDIRPVLEQIPNARLCIVGKGPQEEELKEYFKDTNTVFTGQLSGDELSSAFASADV FMMPSDSETLGFVVLESMASGVPVIGCAAGGIPDLIRDNDTGFLVQPGDTEGYLNCAK TMMDTEFRTEMGVRARVEAEKWGWEAATSVLR" gene complement(<262241..>263165) /locus_tag="THAPSDRAFT_12055" mRNA complement(join(<262241..262436,262459..262808, 262932..>263165)) /locus_tag="THAPSDRAFT_12055" /product="predicted protein" CDS complement(join(262241..262436,262459..262808, 262932..263165)) /locus_tag="THAPSDRAFT_12055" /codon_start=1 /product="predicted protein" /protein_id="EED87542.1" /translation="MCSTNWLGLVGLDPEFASAIHGIFRDTSFCHVTLMNRRGKALSS ISRDDCNSLWVVVSTCVQVEAVVTEGVDWGGSSDLQEINEDNDVIPLFLKQCKHRSYN FGCAIFGSHFFIGGIATDVGPKDEDELYTSINECPSSNAFASINGEKTTFNVKDCMDL LLNYSLGSFNLDDILAAETKATSIDNSEHDLSSNAFLDIKETMELANFLVTNLVGSDE FIEMACEEAGGLRAIASLAIGGKGALWYYVKDVVTSQLDIM" gene complement(<264378..>265988) /locus_tag="THAPSDRAFT_13485" mRNA complement(join(<264378..264964,265052..>265988)) /locus_tag="THAPSDRAFT_13485" /product="xanthine uracil permease" CDS complement(join(<264378..264964,265052..>265988)) /locus_tag="THAPSDRAFT_13485" /note="A number of transport proteins which are involved in the uptake of xanthine or uracil are evolutionary related . They are proteins of from 430 to 595 residues that seem to contain 12 transmembrane domains; GO_component: GO:16020; integral to membrane - membrane [PMID 16021]; GO_function: GO:5215 - transporter activity; GO_process: GO:6810 - transport" /codon_start=1 /product="xanthine uracil permease" /protein_id="EED87543.1" /db_xref="InterPro:IPR006043" /translation="IPYLMAILMGLQHCFAMVGGLITPPLVIFKFTVCGFPFCPSLEQ YAVSASLIVSGICSIINVAKLPIPFTNKIFGRQLYLGSGLLSVMGTSFTFLPVFEQAI NQMKQSGYGGEEAYGAMLGTSMVCSLLELALSLMPIQRLKKLFPPLVTAITVMLIGVS LIGTGMKYWGGGVVCAEMGWQTHKSITEYDPALSFGPPFPTCTNGETSLTYGAAPYIG LGFSVLCFLVAIELFGSVFMKNCNVVLALLFGYMVAGVSNYEGLNYVNFDNLELAEPV TFLWVETFPISFYGPAVVPLLIAYFVTTVETIGDLTATCEASELPIEGEEYESSLQGG LTADAINSILSSLMTSPPNTTFSQNNGVIGITKCASRRAGFACGVWLILLGVFSKVAG IISSIPDCVIGGMTIFLFANVLASGVNLSRNVDLNSRRNKFILAMSLAVGVGVTVWPY AFLDMRNSPYTAAFWTCSDCSETMKGVRNGVSIFLSTGYCVGTAIAVLLNMILPVDAD " gene <266976..268474 /locus_tag="THAPSDRAFT_25852" mRNA <266976..268474 /locus_tag="THAPSDRAFT_25852" /product="predicted protein" CDS 266976..268454 /locus_tag="THAPSDRAFT_25852" /note="GO_component: GO:16021 - integral to membrane; GO_function: GO:8717 - D-alanyl-D-alanine endopeptidase activity" /codon_start=1 /product="predicted protein" /protein_id="EED87467.1" /db_xref="InterPro:IPR007369" /translation="MYIFVPFVPSILLNFMNDNYQTLKQQFDTAAASLSTDAINSATV EYSVASALYSQSMDSLFVLLLSKRFMLYFIATMATFYAGWRAYGGIEVIASGVFGGQG EALDRLNKEILKGETISYDSYGQTELDNDATEDGDKLFATLIDENPQASNAGNALAII LPLVLASSLTISYTSVTTLVGNNEQYDLSSGSEYLDNLQQWAGDYLLYLSSLPSLILC LLFTAAEFRWAFANDAEEKLQKSDSTVASNSSILCTGNVLALLYVVSAYLAKVHPTLS FNELPLDLWPSHNGVNIALAVAVTRALALFLVQPTTKSIRTIALALLGITAFDAISVF GTAANAVDTLSESPVSVMETVARSKISASSLWQPGLLEVIVGHDNSRVSDALGLGDVV FPACLVAWAVAADRTNTHKLRDNDEGDADKDSWTYKYTSSAVSGYILGSILTEIVGSF SLLGKGSGLPALVFLVPCMLLCVTATALQNGEVEDVWGTNSE" gene complement(268449..270735) /locus_tag="THAPSDRAFT_25853" mRNA complement(268449..270735) /locus_tag="THAPSDRAFT_25853" /product="predicted protein" CDS complement(268494..270674) /locus_tag="THAPSDRAFT_25853" /codon_start=1 /product="predicted protein" /protein_id="EED87544.1" /translation="MRTSLLLRRRGGAPPATSDIARTSSAYSSLCPSSRRPTAMAVTA TTSAAPVRCNVSFADSRWSTSHLCTSNIHNMQRRLVSSSVPTNPKSSLRKVIRPFLRA CHPDAVMSNLDNINNNDDQDTVTTRGNKRPPRHPLSAQAKEVNLKAVQTLNGLIDTLE ELMVRCTPPSYYSNDTSTNNDNHRSAASLPELNAKYEIEFILPSSKHIELSSQLTTTK HKRNERESLTLRSITVSFPKDLRANVRKWALTSFPDHSSLLYTDQRELQHVKRQEEEA MEVVYRLRHVAYLELVRLLTIAGMEVPSSGLDTLQRQRYQQQDAGSLGGRQRKEEGEW TLSDHFLYELGIDPTEDIRGTDDGDSSTTTSTTTQQQQQQQQYSAFFGRTTQGTAAAP PAYSHPHLRQQRQSFMNSIPWDTFRQNYDQAFLDAQADWTTSRLKLFNPNTREGKERR EQLVSQICGGVRVWRNVVDGDSGDDVDEEEVDDIPEGLDVVQQLIAIRRLSLILYDNF DYLQFERMGRMWERLVIVLTPPRNRRNQHGQRKDDTENKGIGRGVGDHPGRKLNKWER RMKRRERATPPSRGRMMRVAETHYNSLKNKRDNSQRDGDTVGDDVHQSQQQQQTQQQR KSTPSINAAESGYKFSYGTRSDQGTGHVTAYIPIDFGSDELVRQLYTHLYDYFDNCCG DVGFFNYGPNRDVSVNADGVGGLDAGSRGTDDEKRNKSKSEARV" gene <271639..275888 /locus_tag="THAPSDRAFT_25854" mRNA join(<271639..272887,272932..273222,273298..273530, 273623..274057,274142..275076,275158..275315, 275402..275888) /locus_tag="THAPSDRAFT_25854" /product="predicted protein" CDS join(271639..272887,272932..273222,273298..273530, 273623..274057,274142..275076,275158..275315, 275402..275793) /locus_tag="THAPSDRAFT_25854" /note="GO_function: GO:8234 - cysteine-type peptidase activity; GO_process: GO:6508 - proteolysis and peptidolysis" /codon_start=1 /product="predicted protein" /protein_id="EED87468.1" /db_xref="InterPro:IPR003653" /translation="MDFDPVKINKPHRKGSHQKNRPRGKPVQIPQPSDGGTHRTHNND PWYDGMNDYDDDSNSGSGFSHAADDGSVGERRRSSRPTTSRGENDRKKKERRQRQKEA CTRSQQQRKETTVEMSSDEDEEVEPNRHEPGRSFDGISLGGGESEAKVASWKNKYGVQ EGETDIVDAVKSRKRWKRANHHGPSSLTEVYSGTQQFQRIRQSERSRHRDLMVSRSSH YEDPLDEARALQDHSPRHAQRREMQACAATSRRGKSETATQQMSRFMGMLGSGGTARK YGTGSTVSNRSRKSSHKEGGISSINGNISIHEGFNSFIPRKRNSSSESAQSTSEKKRN NDVFDQVDEAPIAPQKKMSHQHAIDVDKSSRSDDLSFSEHTNAREEEVDIEGALASVN NEYVAAKSSFGAIKRVRDGREARKAALIATTVAFYYALIDYAVTQGTPSDVDFSTRFT TKQKDVIETEPHRGSGRKRPDIISKRTQKGKQSDPTADRMRSSGMAPVEASSIADEEW MSRMSESDKKKKKQLKRIQLQSRIPSLISSQTSLPSSRKSGRLSGKGKKREGASENNA IEIEDSSSSSDDESVDSDSTEAVGEMLHSESLTPRRKTRAVGDLAVLDAVRIAIGEKV FSSKCELKIQFSSKEPYLLLSYSKRDSAKIQEHLIPLKSETLTEVKYFIATKDNDDYD SEIDDSMTFISFRIQPTEENGLDVFSNAYNQESNEEKHSREYVSVECRDTDQFQALLM QMKEHDDLSVWCNNGSELKVPELPKYNASLAKANAAEVNARKRESIGMKTRSGKGRGR KVLIDAENKLLLVYPFKGADEEMLAEASTGLKELSGHHLGVKDEMDVEEVEATANDSS DQNDDEATVEVKGSARAHYVTICQEDYNRLEPGQFLNDSLVDFWMRWYDHCSRVALPP MTLLSRHPLSFPPPIFHSHYRISREQSHLGNKSDVHFFTSHFMSTLEEENDPSSVASW TKKKKIDIFKKKLIFVPVNADLHWSLCVIVNPGLLYQPPAKSSEASDAMDVDTEDETV SVELMDTEDEAEAKEGSCILFLDSLKMHRKDKVARIIRKWLDFEWKRKHGIEDPKQKF FISRDMQLLTPKIPYQENGCDCGVFVCRYAYGLYLMRRQMFTPEDINDNFKGMITNGS AFAFDMKDIARIRGEFTTLIDRLSPKYLAIKDAEEKAAKRAKKKAVAGETTVEKAVTV AASSEENENTPQKMAAEVIGELTTDL" gene complement(<276282..>277371) /gene="SelD" /locus_tag="THAPSDRAFT_264907" /pseudo CDS complement(<276282..>277371) /gene="SelD" /locus_tag="THAPSDRAFT_264907" /note="selenidewater dikinase" /pseudo /codon_start=1 gene <278704..>281050 /locus_tag="THAPSDRAFT_12061" mRNA join(<278704..279513,279705..279784,279867..280746, 280850..>281050) /locus_tag="THAPSDRAFT_12061" /product="predicted protein" CDS join(278704..279513,279705..279784,279867..280746, 280850..281050) /locus_tag="THAPSDRAFT_12061" /codon_start=1 /product="predicted protein" /protein_id="EED87469.1" /translation="MSTKDITPPPTATSSAAITSRSSLEKMTVKQLKEYIQENDMEVP RGASLKLKKDIIAFIWGYTAGDGGANDGGDGVKGKEEEFGGAESFLNESSNSDVTRQP QKRKPTKLSIGTGMPPLPSQPTSSSSSLEDSINDNNQEEDEPYYLTPKDRIVLHVLDR YPPLHDAIISACSTSNTDTNTPTIDTITTSNIDQCDLNSLSYTTPNGLGENDMRQTYH PLLANATQTDLDLVFIGTASCTPGVTRGVSCTALRLNWRSHKVVDGKDVGFKLSIQRT SSIKPGKVSKIFITHCHGDHSFGLPGLLCLMGTDRDRDAPPIDIYGPEGLRMWLRVAI RYSVSRVVPPYRVHELMDVPMAPEWEQGHRRNGRFYYQLKNEGGGRRWGNKGLAGEDA VSWISRAPMMNLEPSRDFGEIEGGRDIYPRYDHPQCHDGAPVWEVEKDSDVSVYAAPM SHGVPCVGYVIEEHDRPGRLQPENVLPVIERNHAGLIEAGVRHPMKVMAMVKNLPVGG SYTFPDGTVVKQEDVVEPPRKGRKVVICGDTADCRGLEGLAQGADVLVHEATNTFLPG VDKDGDLRGVTRDAKIHGHSTPFMAGEFAKRIGAKKLVLNHFSARYKGDQSIESMTIM TRMERQAMKASGLPETSVACAWDYMILPIPRN" gene complement(<281131..>282069) /locus_tag="THAPSDRAFT_38770" mRNA complement(<281131..>282069) /locus_tag="THAPSDRAFT_38770" /product="predicted protein" CDS complement(281131..>282069) /locus_tag="THAPSDRAFT_38770" /note="GO_function: GO:5524 - ATP binding" /codon_start=1 /product="predicted protein" /protein_id="EED87545.1" /db_xref="InterPro:IPR003959" /translation="RSLALSIRRDIIQESPGVGWNDIVDLNDVKRLLKEAIILPKKYP QLFTGLRAPWKSVLLHGTPGTGKTLLAKAVATESNAVFFNVSASSIVSKFRGDSEKLI RMLFDLARHYAPSTIFFDEIDALMSHRGGMNGGSASGNEEHESSRRIKTELLVQMDGL LANNTDVFVLAASNLPWDLDTAFLRRMEKRVMIPMPTKEGRKEMIKSHLSDFSPSLFK KDELLNRCAEQTEGYSGSDIKNLCKEMSMRPLRRMLTQLEQTPTTWSEQNLSLLVKRN PITEQDFVQSLSTINQSTDAELCARHTKWSESHGAQ" gene complement(<283195..>283689) /locus_tag="THAPSDRAFT_12063" mRNA complement(<283195..>283689) /locus_tag="THAPSDRAFT_12063" /product="predicted protein" CDS complement(283195..283689) /locus_tag="THAPSDRAFT_12063" /codon_start=1 /product="predicted protein" /protein_id="EED87546.1" /translation="MPRTLDPITTQNQHLPGDILNPTSIPALTAHNLKGRNTSKRPWK LRPQKRASSLVTSNRINARSKKWEQRVAERDLKMEVKSAQQEMIEHRRTLAREKKERR LENERRKMENEYNHAKRSMQTLNMKKVGGTMKSMNKKQLRMIKKTRMNSKTGVVEFVG AYAK" gene <284217..>284678 /gene="Rpb8" /locus_tag="THAPSDRAFT_38815" mRNA <284217..>284678 /gene="Rpb8" /locus_tag="THAPSDRAFT_38815" /product="RNA polymerase" CDS 284217..284678 /gene="Rpb8" /locus_tag="THAPSDRAFT_38815" /EC_number="2.7.7.6" /note="One of small subunits associated with RNA polymerase RpoB; GO_component: GO:5634 - nucleus; GO_function: GO:16740; DNA-directed RNA polymerase activity - transferase activity [Evidence 8270] [PMID 3899]; GO_process: GO:6350 - transcription" /codon_start=1 /product="RNA polymerase" /protein_id="EED87470.1" /db_xref="InterPro:IPR005570" /translation="MATATSARVTLFEDIFEITHLNPEGKKFDLVNRLSATGTTFECD LLLDYNCQIYSLLEGEKMTLVLASTLNLDGTPDDHTSYNPATSHNNELTLADNYEYVM HGRVFDVSYKKDGVVVIAISFGGLLCRLTGDQRHLSSVLPDMRLYVLIKKD" gene <285661..>286290 /locus_tag="THAPSDRAFT_18351" mRNA <285661..>286290 /locus_tag="THAPSDRAFT_18351" /product="predicted protein" CDS <285661..>286290 /locus_tag="THAPSDRAFT_18351" /codon_start=1 /product="predicted protein" /protein_id="EED87471.1" /db_xref="InterPro:IPR003697" /translation="IRLLLASQSPRRREILDMMGLSNRYTAQPSPLDETALQLELSRQ DITPQKYARTLAERKAHAMGLALSANGKSGNGITLIIGSDTIVDLEGSIMEKPNDEAE ACSMLRRLSGNWHEVHTGVAVYGVGAGMNTSSGDNVKCMFSFTDTARVKFATLSDKDI QSYVDSKEPMDKAGSYGIQGIGGQLVESMVGDFFTVMGLPMHRLSRELSK" gene complement(<286452..>288327) /locus_tag="THAPSDRAFT_12066" mRNA complement(join(<286452..287708,287807..287943, 288024..>288327)) /locus_tag="THAPSDRAFT_12066" /product="predicted protein" CDS complement(join(286452..287708,287807..287943, 288024..288327)) /locus_tag="THAPSDRAFT_12066" /note="GO_component: GO:5634 - nucleus; GO_function: GO:3700 - transcription factor activity; GO_process: GO:6355 - regulation of transcription, DNA-dependent" /codon_start=1 /product="predicted protein" /protein_id="EED87547.1" /db_xref="InterPro:IPR000232" /translation="MSDTTSSTRSEIHKSIRSSASSGRSTSPSSSTSSNQQQQQPKKR RKKKAVPREKYVYRDFANVEPGTLVKDKNVLIHEVLPPENLQNQKLPAKLDAMLSDPD LAGMISWMPHGRAFRISNRARFSSQVLPRYFAHNNYQSFVRVINFWGFRRIISGTDKD AYYHELFLRGKPHLQERMKRLSVCERKTPVDKENKCPDFYELAKTSPLPELLSRNGTS TTASTEQEPMQVVQAPQVAASSVAVVPPTVQASSSQSDVSFSIDNQAAAALLSSLQSS MPTKQPSTEESGKSPPAVAHSTEESDVSIPPSVAQLVGEKVIEMVKNGNGTVPTAAAP ASLSTSKDSLLSNQQTSILMNLIRQQYIQFSHFQTMMPNTSQDNISSEQVLALLQSKL QPSQQQLQVPSMGNKSDVVAQLLLQSQGSLVPNVRMNLMFQPASSTSFLQSDMSQMKA FPPSPSKNTNEDAMTQLIRFMESNTNVQQKQPQTVKSNGPYQEEIMMHLVRSMQQQQP VPTPVAALSVSNLTNAMNNANTIQSMSQIYQRNTANSQAMCLVCQLTKQERCLHQSM" gene complement(288732..>290458) /locus_tag="THAPSDRAFT_25857" mRNA complement(join(288732..288903,289132..>290458)) /locus_tag="THAPSDRAFT_25857" /product="predicted protein" CDS complement(join(288818..288903,289132..290458)) /locus_tag="THAPSDRAFT_25857" /codon_start=1 /product="predicted protein" /protein_id="EED87548.1" /translation="MSSFSPPATSTTVPSPSIYPKHNSRQLWRCSPTLRKLAIASLFL LTNLIRWETQDSTLSSASFPPSSSSIASTSSSSSLPLATVAYAISITSCNFKLTRRLF DGAAVLKRSIELNSWPKHPHSRYASNFYALVLKQDDGELDECSKILQLAGWEVLLQDN PIVYTNLIKEPEGSSLKAGIGGDGCCGDKELIKLATYKLTNHPIAVHLDLDTLVLHPM DELYNAMHFNSDTVEGKQARMKLGEVVAPTYLNRRSSGNPAMDANITMESILTNMTVN AYFTKDYNMIGSYRHAQRVGVQGGFLVVRPSVATYTSILELIYSGEFYPGHDAGETGW FKSGYGRHIWGSLTIQGLMAYYFDVLEIQHSIELNRCRYNNIADNARVSTFANNPKYP RGTLLPFVRDESNPRYNFVDTTCRDSRESCDDVDCQRFPLEKVRLSHFTYWYCQGEGN YVAMANRNISAFPDHYEVVV" gene complement(291928..293200) /locus_tag="THAPSDRAFT_43128" mRNA complement(291928..293200) /locus_tag="THAPSDRAFT_43128" /product="hypothetical protein" CDS complement(291950..293020) /locus_tag="THAPSDRAFT_43128" /note="based on sequence similarity to other conserved hypotheticals; putative protein of unknown function with sequence similarity to hypothetical protein" /codon_start=1 /product="hypothetical protein" /protein_id="EED87549.1" /translation="MGVDFCQVDVDGEELLLKQSIQQWKQSCTSSNDATATSDQCLVV HTAGPFQGRRSPSLLSACLDLSIPYVDVCDEWDLAEISKEELHQKAVDANVAAIVSCG IWPGVSALMAAEGVSQLLADDDDTEIESIDYSFFTAGTGNAGPTIVSATFLLLATPAI TFLNGLRKDKEPWTEMKEVDFGNGVGNRRIWLLDNPDVPTTALYLKESKQSQPPNVSS RFGTAPLVWNYLFGAMKALPRSLLYNRDAMQNFSLFSEPIIRLVDFLVGATNAMRVDV TARNGKKVTMRMAHSDLEQCVGLATAAFALEVANSMKQEGGGTISSGVWFPIELGKEA RENILRVSKEDAFIYELGSKVL" gene 293604..295337 /gene="FCP_1" /locus_tag="THAPSDRAFT_270220" mRNA join(293604..293909,294296..295337) /gene="FCP_1" /locus_tag="THAPSDRAFT_270220" /product="predicted protein" /note="Has EST support. 4341585:1" CDS join(293662..293909,294296..295181) /gene="FCP_1" /locus_tag="THAPSDRAFT_270220" /note="contains bipartite plastid targeting presequence with conserved motif at signal peptide cleavage site" /codon_start=1 /product="predicted protein" /protein_id="EED87472.1" /translation="MKFSTVFLTAMATHQATAFSPAPINSATKSLTLSAATLEAAPAA ADNAADVPAAKEFATMDMGVEKDFAVGDGFVKDSERIQPGRYNDKANSIAIPFLPRPT PLDGSHAGDYGFDPLGFSETFDLYTMQEAEIRHARLAMLAVVGWPMSELLAPDWMLQN GCAPSVLNGVNPLSFLAIAGFLGAAGFFEFKTSLRSNVGTPMGKIHEKDMSAIWEFGV AGDYNWDPLNLYSAAGNDFKGRKGLRDVEISHGRMAMLGITYFAAWEALTGNPIVENS MFFHPNALLPALVAGYAAWSQVYEIGPLNEYPIQVKYTNEGEMKLKRFQNGIADTIEQ NAETTEKIQETFNKLDSQFGIVEKVTALPGQISEKVRSTYKYW" gene complement(<295638..>296391) /gene="NDK1" /locus_tag="THAPSDRAFT_12070" mRNA complement(join(<295638..295702,295806..296177, 296379..>296391)) /gene="NDK1" /locus_tag="THAPSDRAFT_12070" /product="probable nucleoside disphosphate kinase" CDS complement(join(295638..295702,295806..296177, 296379..296391)) /gene="NDK1" /locus_tag="THAPSDRAFT_12070" /EC_number="2.7.4.6" /note="Probable nucleoside disphosphate kinase (NDK) (NDPK) (NDP kinase) similar to gi|127984|sp|P19804|NDKB_RAT Nucleoside diphosphate kinase B (NDK B) (NDP kinase B) (P18) (model%: 86, hit%: 97, score: 520, %id: 66) [Rattus norvegicus]; GO_function: GO:4550; ATP binding - nucleoside-diphosphate kinase activity [PMID 5524]; GO_process: GO:6183; UTP biosynthesis - GTP biosynthesis [Evidence 6241; nucleoside triphosphate biosynthesis] [PMID 6228]" /codon_start=1 /product="probable nucleoside disphosphate kinase" /protein_id="EED87550.1" /db_xref="InterPro:IPR001564" /translation="MERTYIMIKPDGVQRGLIGEIIKRFEQKGYKLLAMKLVSPGQSH METHYEDLAGKKFFPGLISYMTSGPVCAMVWEGANVVKEGRKMLGATMPSESACGTIR GDFCIEVGRNVCHGSDAVESAEKEIAHWFPEGVNAWESCEKDWVYEG" gene complement(<297501..>298888) /locus_tag="THAPSDRAFT_264915" mRNA complement(join(<297501..298074,298126..298332, 298435..>298888)) /locus_tag="THAPSDRAFT_264915" /product="aminotransferase" CDS complement(join(<297501..298074,298126..298332, 298435..298888)) /locus_tag="THAPSDRAFT_264915" /EC_number="2.6.1.-" /note="putative aminotransferase similar to Deinococcus radiodurans" /codon_start=1 /product="aminotransferase" /protein_id="EED87551.1" /translation="MTSSKEQSSPPPPPIRYDLYRGHPNNSLLPTLEIQSILSSCAKD KDSLTRALKYGTNEGDDALLSALRSFIEKRTVHDDGSSDTTSSGFFITSGVSHGLELL CATCTNGGEVWVERPTYFLAPKIFQSHGLVVKPLPMLSDRLDLNKSDGCVPPPKMMYI IPSYHNPTGRSMTVAEREKLASFAIRNGILLVADEVYHLLDWERELNVDGTDDTSNQT TSPTLKGCCISVSSFTKIFAPGIRLGWIQAPPFIIQRLISHGYIISQGGVAPFTGRLM TNAIESGLLDSYLNKLKMEYAQRCDLVCDLLKVEPRIVVLTHNSPIKRGGGYFLWVQF PFGVDSDALLAFCMNDYGVKFMPGSRCDPFAGEDDDNGSSGALIKCCARLCFADLDRD ELVDATKAVIQAFRSFVDR" gene <299549..>300001 /locus_tag="THAPSDRAFT_12072" mRNA <299549..>300001 /locus_tag="THAPSDRAFT_12072" /product="predicted protein" CDS 299549..300001 /locus_tag="THAPSDRAFT_12072" /codon_start=1 /product="predicted protein" /protein_id="EED87473.1" /translation="MSLEDDTSDESTCSCNSECEHQSELSYEDPLHFPRRSTAAVAIP SSSQRQRHVTFIDEVKGLTPRYVVTETHYRPATSRSEYPKLYYTSKDYTKFETDANME EMICRLEQEIQDLQHMTEMGQVVEEKEERRRREELIANVARCYHLKVQ" gene <301941..>303966 /locus_tag="THAPSDRAFT_25860" mRNA join(<301941..302976,303011..>303966) /locus_tag="THAPSDRAFT_25860" /product="predicted protein" CDS join(301941..302976,303011..303966) /locus_tag="THAPSDRAFT_25860" /codon_start=1 /product="predicted protein" /protein_id="EED87474.1" /translation="MQSARDSVSDLLQVKQEQQNQQQLQIHQNGGANISKDALNSSGS SGRPISFRLKQRGGVTGNNVDGGDGSHRQRRLQLAQHQHVDCTDDRGTYDDFCYEPIE RGDDYDDDHWPLWEMTGDGDPWSVTATTNNNTNAASSKDFLDSFGNAYTDVNTCQPGD DTHSSIVECERIFTVPEVYNELTNLSASELRHDREDCSIFNCLAPWRQSSASFASSSH VSLPLEQSANGAFPGTINGRQNTHQRSNPKGVIRRTSMAVQLYLGMPASVRRSFTPSM YKVQLEDDEWGDAGDNTTGGGRKKRASIIGSDHHQIFTMDEDDVERIETSNDNNNNNN KQHQCNKEEESLRKFTPDEAIQVWFQREDFDHFKAEMTLLIQESEASRELAEIWLDAS ECERRRSSSELNGRDNSNNSNVGKGARGTAHHTKSRSWWHNYDHSRRGLERYASPGQA RQILASYKVALQKVMDEQIRQRMLRFFCIPNAIDPERIAEVYHEYTAWSRDLALAAGA SDADAVRTNFDDDNRKTREYYILKQVISSGYKVHKHMPQFMLPKCIQTKGFLDEKESL YCDDPELARGSKTFLDSIVRKKSQTGEAAREEMSNLHSGDLIGPVSPALAASLQVNEG GGDGVGGSATTGTMSPRHAKGPVKQKSLASKAKNYPFQQ" gene complement(<305165..>306058) /locus_tag="THAPSDRAFT_12074" mRNA complement(<305165..>306058) /locus_tag="THAPSDRAFT_12074" /product="predicted protein" CDS complement(305165..306058) /locus_tag="THAPSDRAFT_12074" /codon_start=1 /product="predicted protein" /protein_id="EED87552.1" /translation="MNREFTSIKDFMQNGLSSTSTATNHRRSRTTNNLSASFCHASSL GSPTLHNTTPHSSDTRYTASASEEWSIRLAEVKPTEEIDVTEYSNEDFDNLKKKDPFM YYSIPEIRRSSFLMSEVDANESEDNDVVVENEETIAMAVNAINAGAIDIGTNIIGSIN PQVSQSSSSRGQGKQPVFRNGQLHRQSSAHLRTISCPAGMLANADISHSQRNAILNSR RRSGEQRRNGGLNDSLVVRKSVRLSVEAHPDLILREFEDDEDVFGVLERNNEGDGADN EGGGDLDERFRRLVADFRSGE" gene <306879..>310679 /locus_tag="THAPSDRAFT_12075" mRNA join(<306879..307466,307557..307817,308077..308338, 308415..308522,308614..309075,309159..309428, 309478..309704,309914..309991,310082..310132, 310223..310318,310410..310526,310659..>310679) /locus_tag="THAPSDRAFT_12075" /product="predicted protein" CDS join(306879..307466,307557..307817,308077..308338, 308415..308522,308614..309075,309159..309428, 309478..309704,309914..309991,310082..310132, 310223..310318,310410..310526,310659..310679) /locus_tag="THAPSDRAFT_12075" /codon_start=1 /product="predicted protein" /protein_id="EED87475.1" /db_xref="InterPro:IPR006970" /translation="MMLRLPFAAIALFIAISLNSASSSQQRRMLLHPNESIVTHVRRA KAQKQKMGFTITAEVSDDTSNLFGGSPSVLVYLDVIEASPAITSDTIVENGGRRSRVD IDSIHTILVSDTSTLEGSETFTLLAIDPQTDKVHGIVEKKGSKAFKIKQKKGDKTIAT EEDEANMLAPDWSCGVGNDEEIGGRRLKEEDHEHHDHHHKQHHHEHEDPTASILENLS SSLRGTKVNPLGKRRLQSGTNYNYQVDMYIEVDGSFIDKSGGSMETALNQVNLMVTAA NVVYEKESALYDSATGTSNALSIMRTNYAQSTWHTAGIDIHHAMLGNGLGGGIAYVGV LCHPDYGFGLTASMSASFVSLDYKVVWDMKAFMHEIGHNFNSGHTHDAYTPVIDTCGT SCPSTTGDKWSTLMSYCQHCPGSYGNIMYTFGGSYDGSGTKSDISNWLDNPELVANYD STHQSVDPRREAHRMYTHASTRGGNCLAVNENPSPPSTPQPTISPKPTSNPTSFPPGI ASYDPSLRAPKCSSSASFCDSGSLLNGRGSVSGGNEPNAASNVLNSCNDGSSGTYHSD ESIDRIKVSVVGGGILQEGATIEIDATVYAWSTSADTADFYYTNAAMTNPSWTLISSL KPAATGVQSIKAQVTLGAGSLQAVRVKVEFSTLDTFFSSCLYSDFRYNGSVSSCSAGS YDDTDDIVFAVAQDSSTASPTSKPSSPPSAHPITLSPTSPPSKQPSKQPVTSPPTNNP ITQSPTDNPTQPPTQQPVTSPPSDSPTSSPSKSPSASPSKTPSASPSKKPSLPPSASP TPQPTHKPITASPSSKPTFQPSKAPSTDSPTYTPSVKPTSIPVHIEFE" gene complement(<310896..>313379) /locus_tag="THAPSDRAFT_12076" mRNA complement(<310896..>313379) /locus_tag="THAPSDRAFT_12076" /product="predicted protein" CDS complement(310896..313379) /locus_tag="THAPSDRAFT_12076" /codon_start=1 /product="predicted protein" /protein_id="EED87553.1" /translation="MDESKIHHSSSLSADAANLAPSSTITSSHNDGGIIAASTNTSHR GGVGGSSGALSRHASSSGALSHPKASSHSTTATQPTAAATTTLVPRPPINVYLKIAII RNPTDVVVCEDPEHPGRMKRLVVGNLLDEKRMTLEGKAQEGNAIEGVGGGEATVEGDK VNNTTTTASNNEELKPPPSESSNKKLEIPVPTITTVKQYTTDIQPTFPIPQSYVRYIP PTYEENDVTVEYNVDSEDEGWWRENEDFGPGSKGKIVWVDGSGGEVGVSCTVERKASD DIRKKRAKWGDDEEVDNDNDASSFPNLTMHQVILQNPYLLQSSHSTKYLIQKYQPRLP LVVLEQMLDVLEKATGFEMIVTLKQAEEVLVRKVPRLEEIFGGISEKRLDGNTAIEVR SKTGITPLERRIPTLAPPVTLKQVISQVYTYWMQKRSKLKKPLLRRYWPVTSSTDLNP HMVFRPRDKEKRKLRKKRQNDIEAYNKMKMLKMDFERLDVLCDLIVRREGIHANMVDL TNEYYQERLYEWVDTTGLPRDKNQLLNRRAMENALVDIPRYFEDGPIVKMVKGGNKKR KRTSTATAVRGLFGGVGGQGSLHVGGANKTNTSSNVALQPRKNVVVAGHDDGFPAPNF LQPLATRESHHITSWDDAVPFIPSYENGKETPIHRFRHRPRLGRGGRIVIDRVPCPPP ASEHDQPPPTVFSYGSEMKRSGYDVTVLGADGPKYNVNTSTTDNRFNAKGAPKATAAS SLQELMPKSLGNQTLLSRRIEEICAMGLVEDYQSSTNADTSKGGNAAAGGSAILVGDG LDETLVPLEDWMEASEEIYGTEHFVIGPL" gene complement(313938..>315233) /locus_tag="THAPSDRAFT_25861" mRNA complement(313938..>315233) /locus_tag="THAPSDRAFT_25861" /product="predicted protein" CDS complement(314256..315233) /locus_tag="THAPSDRAFT_25861" /codon_start=1 /product="predicted protein" /protein_id="EED87554.1" /db_xref="InterPro:IPR001440" /translation="MPGHPKAEAFKAEGNKFFKDGQYSSAIAKYKEATAIDPNVPAYW SNMAACYEKIQEYDQMEDAARGCIKADKSFVKGYFRLATALKAKNDLEGCIKALESGL AVDSGNADLKKMKKDLTELQRGETVAAYIRKCDEQMANGDIGGAYKTLELASRMDAGN PDIERMMSRVKPKFERMEAQRKANLSPDEVHKERGDDAYKNANFEVAIDHYTKCIEGL KRRGEEQSDLSMKAHSNRAACYKQISNFDGVIEDCTAVLEVDPENVKALVRRAQAFEG VERYRFALQDVKTVLNMPYASVGKTNVDMCNMMQHRLNRTVQQLKASNV" gene <316502..>320720 /locus_tag="THAPSDRAFT_12078" mRNA join(<316502..318039,318127..320632,320682..>320720) /locus_tag="THAPSDRAFT_12078" /product="predicted protein" CDS join(316502..318039,318127..320632,320682..320720) /locus_tag="THAPSDRAFT_12078" /note="GO_function: GO:3677 - DNA binding; GO_process: GO:6310 - DNA recombination" /codon_start=1 /product="predicted protein" /protein_id="EED87476.1" /db_xref="InterPro:IPR001584" /translation="MIELISEEQWKQYLPGFTPIAEMTPSAARQLHNEAIEQATSVPR HGYSSGHAGAIATPEQYELLETEQPRWNDPKHPGMHPLYPPLATETQKKEAEATHLCA LNEWKNWTQLNLRMRANLDKAIDNNCKFTALPNGTSTFGALTPQGIINRIVDEWAKPT PREIEDNESKLTQPFDQHRPIMELVRRLQQAKLFANWWGGNKITDARLTTSMLARLEA CPVYTEYTKRWNRRGDEHKTWKQCQRFFMEAYREIRNTMNNNVTSGTYNYGTAFNAIQ DDDDRSVNSQATVSELTRQIGQSVTMYHELSQNLQERDMEIERLRRENEQLHHMANLS VNRQNVHPIPSWQPYPPPPLAMAPPPFAHGQQPLPPASYNQPPPASPYKQRGTKNNPQ QRSNTRGYGGYGGNTGGQYKNAGGAGGYVPPQVGYHQHGGGQTQTGRGYGTNREPHPR KFYPNMLYCYSCGFDVDHDGTGCPPHKQKEHHLHHATMTREQARQWMKDDRFKGSHKG SHKTHYLPTPTLHNNDNVNTNNCQSNYYNLLADDDTEDNDDDDDATVKTSNTSGKKTA RDGHQDAANKATTALHIPKTWAIGDAGATGTFLLPGAPVINMQPAAHPLTITLPDGQC IQSTHTCNLDIPWLPPAATKAHIVPGLAHTSLVSIKQLCDNGCRVIYDDMACRVYYQH QLVWIGQREPSTGLWILPLQPNRPNASNIKKYQQRYEDDNLELANNVYSMTSKSELIK FLHQCAFSPTIPTWIKAIDNGQFSSWPGLTSAAVRKYLPPSPATDKGHMKRLRQNIRS TRPKQHSPSTTAEDNMANRLKQLISEELDANPPEEKMEEGNGTNVFCFAAIADKIEGT VYVDNTGRFPVRSLEGHLYLFVLYDYGSNAILVEALKTMESKEFIAAFQKKISYLTQR GFKPRFNVMDNIVSKAVQSFLEEHQIGIQIVEPHNHRVNAAERAIQTFKDHFIAGPST TDKDFPLQLWDQLLEQAQDSLNMLRTSRVNPRLSAYHVLEGPHDFNRTPWAPPGTKAV IYDAADARTSWAPRGTDGFYVGPAKQHYRCYRFYTPETGGYRTSAKATFYPSHCRMPT ETALDRLGHTAVKLTNILAKLSTQPTIASRHLTALKQLTDIFADMTKQQKHITEGTIQ RVSDQGRHKQQQRVEEEVQPTTSTNPTRKNNIIKLKLNHLRHTRRNTPNVLKPSEATI TDTTYAPIFYDEQVMPSATPNPSSVPPRRSQRISLRSPAIISQEAVNFITEQTWNNTQ QMLPLSLQNTPNMEHFAAPVIHPTSGETITDYRKLMKDPATAEIWTKAFGKEIGSLAQ GDDLTGTKGTDTIVFLDRIGIKNIPKDRIPTASESPQEEI" gene complement(<322539..>325211) /locus_tag="THAPSDRAFT_12079" mRNA complement(<322539..>325211) /locus_tag="THAPSDRAFT_12079" /product="predicted protein" CDS complement(322539..325211) /locus_tag="THAPSDRAFT_12079" /codon_start=1 /product="predicted protein" /protein_id="EED87555.1" /translation="MNDDDASSSSSSPSLEEPSSPWPSWSSPAKHPTTTATMESTIAS PTTHSPSSSSSSSPAEPPGAATYTQSSTADPPTRDNSPASSPDGSGGRNLLSSHFAKR SAVDHRERRGRDPSVVVGGGDMMNYSSHDDFDTTSVTSNITNMMSYNMTGGSSVVSYN AIGRLGSVSGGSSVANYDNSDKKGGDSPLEKHFHTTTTTTTDNNNITISRDAPTSSSQ TIVSTEDDDSIEQSVYSANSTSTNGSYNASKRAMKERKEQERYEGVKARVYNFGTGGY DREVGQQQQHYHHHQQQRQQQQQQNRVCFADQPSYQNVPPVDDYGYGSAWRGSGTMRS VTNRSIHSSSVNNSSNNVVNCYGSSAFGDSQYNGNAFPSSRTSTTISTSSGVKSSMIV KRLSQFNYLQKNATLLGLLSMGMLFFIGSYFNSGLTSSGVSGGDDARTLGLGQQSNSA NLGAVVQSGSFGSGLNGGASKGIMDNAHPSGIHPSNGGEEVDWSKVPPHLRGYYSQIV GAPQKQIHQQQQQVMNQQQQEEQKSQQHEIAAAQAIQSIGNEAVEFKGPTIDFEEFQR QQDLEAALANGQALPPPQQRQQQPMHEVANNEQAEVAAKEEEMRQQHAEEEALRIAAT AEEDARQKRESDIAAQREQQHHQQQKQKQKQQLEQEETARLHKRQKEEAEAMRVREKE HQKLIEDERARLEKESQQQQQQQQHGVFEMAMPSEAEIEEAKKLAAQDALRLQEQQQQ QLSEMMVSNETEAKQQASQEVQRIQGNQQQQQQQQAKQANNSNNDNNGEAVLFPQLQS PELIAEAQERQIQAEEWMRNKAAAQNNNSNDKQHLRNVPDALLKEVNEKAAEVEQAQQ AVFAGESLDIGSIAELQKELSELMAMMGGGDKGR" gene <326351..>328024 /locus_tag="THAPSDRAFT_264918" mRNA join(<326351..326885,326943..327443,327495..327675, 327736..>328024) /locus_tag="THAPSDRAFT_264918" /product="amino acid transporter" CDS join(326351..326885,326943..327443,327495..327675, 327736..>328024) /locus_tag="THAPSDRAFT_264918" /note="Possible amino acid transporter based on BLAST on interpro hits; GO_component: GO:16020 - membrane; GO_function: GO:5279 - amino acid-polyamine transporter activity; GO_process: GO:6865 - amino acid transport" /codon_start=1 /product="amino acid transporter" /protein_id="EED87477.1" /db_xref="InterPro:IPR004841" /translation="MSLFDLTAFGVGCTIGSGVFVLAGVAAHSYAGPAASISYLVAGA VAALSGLPYAELSAAFPMDGSTYAYSYITLGEVFALMSSLCQTLEYGGSSAAVARSWG NKFVDWLRERHSDSDGNALPQWVDSFLDPGFGINPMAAIIAFLCTLLLLWGVQESKMA TNVVSSAKLSLITFMILSATFSNWDPFVPPEFGPSGIIQGSSILFFAFLGYDQICNLS GEAKNPVKDVPRAVVYTLMIDGGIYMMAALALTAMLPYTEISTVSGFPRAFGANGWIW AEKLTAIGEIVTLPLVVLTGVQSQTRLLFAMSKDKLVPELFGRLTFAKKSASPCGCIN ACSNAKKEDKIAAFVPFQYMDDLISAGALFLFSLTDCCLLILRYKCPSESFLGTVEYD DTVSIFSIATVRKDYAFIPIAALQIALTVVFALLTLGITIYMALYCPEQSTTRFSLEG DGYGIGRRRFRTPLVPYLPALGIFANWFMIANVGWVGIVMLVCYLLFGILVY" gene <329160..>329941 /locus_tag="THAPSDRAFT_17492" mRNA join(<329160..329411,329492..>329941) /locus_tag="THAPSDRAFT_17492" /product="predicted protein" CDS join(<329160..329411,329492..>329941) /locus_tag="THAPSDRAFT_17492" /note="GO_function: GO:3824 - catalytic activity" /codon_start=1 /product="predicted protein" /protein_id="EED87478.1" /db_xref="InterPro:IPR000594" /translation="LNRLSSATVAIIGLGGVGSWTAEALVRSGIGNIILVDLDDICIS NTNRQLHALSSTVGKLKIEEMKRRLLDINPGCNVTLVHDFISVDNANSIIQSMLPKLT LVIDAIDGMYEKTALILACVDNSIPIVTCGGAAGRTDPSKIVIEDLTKVQEDRLLFKC RKLMRQKHGFPKVAIPGKGKKERVRNWRILAVYSLEVQQKVGQVSETTSSFRTCDGAL GTACFVTGTYGFVAAS" gene complement(330054..>332109) /locus_tag="THAPSDRAFT_25864" mRNA complement(join(330054..330820,331034..331447, 331530..331855,331943..>332109)) /locus_tag="THAPSDRAFT_25864" /product="predicted protein" CDS complement(join(330300..330820,331034..331447, 331530..331855,331943..332109)) /locus_tag="THAPSDRAFT_25864" /codon_start=1 /product="predicted protein" /protein_id="EED87556.1" /db_xref="InterPro:IPR001611" /translation="MKTSHLLLAIGTLASTTEAAREKTNLRKSRNVKEDGLSKELNTE DIFFWTRRMNMSLPPSKPTIRPTNNIPATSRPTNNVPVTPQPTEGTFDTPQPTEGIFD TPQPTPFPTVGEIFTSFPTDDSAGGGTFNCPAASFIGCTAIDPSDPQDECDVVGEPCL DGNPGEFCCRDACPRNYCTAKEGIAPITPQPTLKPVTTTPGPTFNCNLEPDERARILT RLIEEVSTVDDLKQDGSPQNKALEWIVNLDLSEVCPSDDAAVQQLIVQRYVMAVFYYS TNGDEWKECSAPNRFTVNTITNADAEYAFKVNVIEFENNNIGGSLPSEMQELSKLRFF ALERGSLSGPIPSSFGNLQSLLLLDLDFNALSGELPEEMWSLQQLRQLDLNDNDFTGT LSSSIGQLNELRFFQIDNNNLVGEIPASMGDITTFSLIGLSGNDFTGGMPDSLCSLRP SPLQTLVVDCSIDCAVPDCCTSCVP" gene <333329..>334220 /locus_tag="THAPSDRAFT_17651" mRNA join(<333329..333555,333663..333706,333808..>334220) /locus_tag="THAPSDRAFT_17651" /product="predicted protein" CDS join(<333329..333555,333663..333706,333808..>334220) /locus_tag="THAPSDRAFT_17651" /note="GO_function: GO:3824 - catalytic activity; GO_process: GO:8152 - metabolism" /codon_start=1 /product="predicted protein" /protein_id="EED87479.1" /db_xref="InterPro:IPR001345" /translation="SVYTLILLRHGESEWNSQNRYTGWCDVSLTKRGEMEARSAGRLL YENGIEIDHAFTSVLRRASFTCNMCLNMAKQHWVPITKTWRLNERHYGALQGYNKDTA YDDLNIDQELVMEMRRSYATPPPRMEDDHPYWHGNDRRYSKLTEEQLEKSRAESLKDA AERIMPFFNKVIVPSLREGNRCLVVSHANTIRTLIKQIDNISDEDIKQLSIPTGIPLI YRLDRNLKPV" gene complement(<336054..>339447) /locus_tag="THAPSDRAFT_12084" mRNA complement(join(<336054..336079,336220..336703, 336796..336867,336949..337235,337334..339236, 339370..>339447)) /locus_tag="THAPSDRAFT_12084" /product="predicted protein" CDS complement(join(336054..336079,336220..336703, 336796..336867,336949..337235,337334..339236, 339370..339447)) /locus_tag="THAPSDRAFT_12084" /codon_start=1 /product="predicted protein" /protein_id="EED87557.1" /translation="MTPRPPYIDKSNLISIIISIINNNSNPLLPPLRTIYCHCIIIIN IIMTESEESDSSVFHPSFVMRRSLAVPRGEEESDNLIINNRLDGNGTTGGDMNNGRDG RRPLLSRHREETTTDDDDDSRRSVDNDGGGVSLADLTECSSVNPHKMYTNNKRVGSSG NNRSNRSGKRRSNRSDRDGERMRVDRMDAVGRDLDRGNDDRVESSSSWYRRRHHREQQ QEEEDDDDDETNNKDGTSYISRGLSHSRHSSSRSRRRTTTNARGGYTSANNTSNEGIF GFLGGSGRGGGPMGIVFGVFEGVNLKTVLMCVLAMFMVVKMGRPPPEYHHHHGSGGVG GGSSSGGDSTANHWDYDERKDGVGGDIEGDSVGGASFRGAAILSIEDNNGSEEGEEDG GSINDFMNKRGASDGGIDFADDGVNSSVDGGGGGGGGSAVVGSVDYGNQLQVNQQAFQ QPPPPPPLLNQYGQPLDQFQQYQQQPVAGGMYGAASDGQSQELAYEQQQQFQQQTQQH QQYGQPLQLQTQQSQYSSVQQYPPQTQQGRYTDPNANGAVGVGGYGVGQTSSYNQQEQ QQIPPPPPPIDSQSVEIQPPPLLDQSTQQEVDQAPEVQTQQTQQVQIPGLPAVPLLPQ DPPAEGAIDLEELNNFKDSWDPHSTTDIPMFWHIPKAGGSSIKDAMGGCHRFVQATEF GVTDGHISDPTVQIVYPKIPGGDPDTDRSPFVNIDSTTVAGIQRAKELGFADAHLAQV VVSPFVFETNDLFTPTAQGRLFSVFRHPIDRAISMFYYIQVADWEPSYKPELKEWTLE QYAQSDIVENNWMTRQLSNQLGGELTEANLKKAMEVVRRKFMVGLMTEIEPTMSRFEK FFRWTFRVNPPNQDACRDRLMSGGSNSNKANKKPLPTKGEPAWDLLAHQNNFDLQLYN YIEQLFVEQEAFVAGLPDNFRMVDATCCASKLKGL" gene <339608..>342102 /locus_tag="THAPSDRAFT_12085" mRNA join(<339608..340109,340139..340192,340228..341344, 341585..341651,341680..>342102) /locus_tag="THAPSDRAFT_12085" /product="predicted protein" CDS join(339608..340109,340139..340192,340228..341344, 341585..341651,341680..342102) /locus_tag="THAPSDRAFT_12085" /codon_start=1 /product="predicted protein" /protein_id="EED87480.1" /translation="MSETRSLHLIRERRLPSRRANQQQHINGGDVTASLRSSINSQGS LYPSNSHYSTTSTIISPSGNRLVMVNIPEDDVDIELDKDEIRRMIHQIDESGDIDNMK GESGIFKPLDGASVSSSSVPSRSSSRNVANSIAICGINTACQSQQTELDKAQKQLKEY IYSPPFRLVWGGSVLFIGEYDGCQECMVGWMVESIGRLVRGHVDECIVAIVHCFLGAT LYAETFTSQYIVLPTGADAPQPVRGASIPLSIERNIADWAQDTSRVSPVSSLDGYGSI AVSEEALFRDAPLVWALPFTGGDVLDVVFGQCLNLVQASDKGVLDGNGEGENLAVVLV MGRKYVNTDTTTQDGIARAASLSLGSSGMADVVYSPLLHEVSGLFTSHNQGRLMFVAR HPIEREFSRFRYLRETSLTLLRDRHDELMNMSYVDFSTSEFVADNWMTRTLANKPNEG ELTAQDMLNAKEVLRRKATIGLYGDMVSGARHFTRYMGLDNSRNGGKLDDATLMCFQN ALSEESRKDGVGAINLNDEETLEGSIAWKNIMEKNKLDLELFLYSEMLYSFKKSAVQG SFLQLGIVCTSSSKDEEDTSINKCPSSIAFVSIDGEKKIFDVKESMDGLIKYSLGTFK LDDFLKAETKATSIDNSEYDLSSNACVHYAQGILRFLETKELADFLVTNIVGSDEFIE MARKKAGGLRAIASLAIGSKGALWYYVKDVVTSQLDIM" gene complement(342368..>344371) /locus_tag="THAPSDRAFT_25866" mRNA complement(join(342368..342925,343022..>344371)) /locus_tag="THAPSDRAFT_25866" /product="predicted protein" CDS complement(join(342500..342925,343022..344371)) /locus_tag="THAPSDRAFT_25866" /codon_start=1 /product="predicted protein" /protein_id="EED87558.1" /db_xref="InterPro:IPR001680" /translation="MAQSSATQSYTCSLSGLSPLPPGDAVVTPSGYICSRKLLLTKLS ENGGVDPFDDKGVRALDESSLIELSGGRQPAAVVPPRPPTGTSLPSLLSSLQNEFDAV LLELYDTRRALEETRRELSGALYQNDAAVRVVARVCRERDEVRLRLEGVLKEGVAARD SSSSSGSEVAKRVRDEQDVSTDRSHGGADSSDANPAKKAKVDSSSIPPADLEAMSATW ATLSKNRRTIAKLKRTPEEIAAIEGTDGIYAKLMTANNGEDKKVNLHKSNAKAGVLAL AHVTGGDGGEYVISGGHDKQAIVYNASTGVIAASLTGASDDVVAVDGMMVDNGLVVVV GSNDGSVRLYSVGEEGELIGAVNVDDSPIVHVAVHPSSTKDEVRILAATKGGSVAVLK YNSEGGIMKVITQLRSEEGAVLSGAAMHPDGLIYAVGSEDGKMVVWDLKTQTCAATLE VFEGKPINSISFSENGYHLATSSSVASIPVLIWDLRKQKIIGTIPPSDEVGRVSSVAF DPTASYLAYSGEECTKVCVVKDWDRVVCTLNKGGKKGKKKSDGLLPGGVVWGGEGFGL GGRGGKVWLCAGCDGEKPVRFWGVE" gene <344730..>345779 /locus_tag="THAPSDRAFT_12087" mRNA join(<344730..344783,345052..345145,345253..345487, 345593..>345779) /locus_tag="THAPSDRAFT_12087" /product="predicted protein" CDS join(344730..344783,345052..345145,345253..345487, 345593..345779) /locus_tag="THAPSDRAFT_12087" /codon_start=1 /product="predicted protein" /protein_id="EED87481.1" /translation="MNSAERGMSWGSSPRVTAPQSAMKSLASRTATAMVAAVIIIATT ADRAHASCSFPSTSHLSTPSNIKLTLIKSNSLCADRSGKEYSYGGFDSQTTLDQCFNS CAVDKALFLSRMVGVDWNCGTGACDCLYEAFTLNNNDCKGFDRCNNVYMGSGQVEGST AVNDIGCYKKVDNGNGFDRMEQSFLRGRE" gene <348037..>349682 /locus_tag="THAPSDRAFT_12088" mRNA join(<348037..348066,348143..348397,348530..348578, 348790..349468,349562..>349682) /locus_tag="THAPSDRAFT_12088" /product="predicted protein" CDS join(348037..348066,348143..348397,348530..348578, 348790..349468,349562..349682) /locus_tag="THAPSDRAFT_12088" /note="GO_function: GO:5525 - GTP binding" /codon_start=1 /product="predicted protein" /protein_id="EED87482.1" /db_xref="InterPro:IPR004095" /db_xref="InterPro:IPR006169" /translation="MSDAVMKQIEEIEAEMARTQKNKATNYHLGTLKAKLAKLRSELI NGPGGKSAGSKDAGRGFDVTKSGDTRIGLVGFPSVGKSTLLTTLTGTRSEAAAYEFTT LTCIPGTMKYKGARIQVLDLPGIIEGAAEGKGRGRQVISTARTCNLILIVLDAAKPLT HKKIIEAELFSFGIRINQTYPNVKVVKKENGGIGYQEMTPQTKGMNAEVCRMVLKEYK ISCAEVILREDITVDQFIDVIEGNRAYIPVLYVFNKIDALTIEELDILDQMPNYVPIS SQHSWNIEELMEEVWSKCNMMRIYTKPKGQIPDYDEPVILHSEGNPSVEEFCNRIHRS LIDQLDYAWVWGRSAKHQPQRCGKEHRLMDEDIVQLVKKSGGG" gene complement(<349833..>351136) /locus_tag="THAPSDRAFT_12089" mRNA complement(join(<349833..350593,350722..>351136)) /locus_tag="THAPSDRAFT_12089" /product="predicted protein" CDS complement(join(349833..350593,350722..351136)) /locus_tag="THAPSDRAFT_12089" /codon_start=1 /product="predicted protein" /protein_id="EED87559.1" /translation="MRAAAVIAVVSAALSPSTSYAFTSSTSTRKTSSTSLFIFGNDNS QQQSTANKGLTPLPKGISPFEKSLSKSIDIQATFRQYSKKAIDNAANDGVKLLEVEFP PLLGGEKSKSQFDDFDNIQELDRNKDWTMLLAPMFLGESSFQKGRTWLIFPDLKECEL AKQEWLGQRYQEATFTTIEAVTNFVGQLGSSSSGANNNKNSNDGGREVVSYDAPWGSN LISGLSKVLGGNDGDAGLLGDQSSLDALVEGENSPAKLWLVIQPGNGGPVEDWVNCEK MHNMQKDTTMVVINGALDKVRGGFYPAIFFPKLAATVDRFWKRFESVFYLKPFSDKGV YGWLYRVYPEPWQVVYESVRPGKDGNAEVTYVPVAVLEKRPTYAEVVDMLVKASQNR" gene <351414..>351935 /locus_tag="THAPSDRAFT_12090" mRNA <351414..>351935 /locus_tag="THAPSDRAFT_12090" /product="predicted protein" CDS 351414..351935 /locus_tag="THAPSDRAFT_12090" /codon_start=1 /product="predicted protein" /protein_id="EED87483.1" /translation="MNFSNLPLLHASAGLVAAKDFASYEGKAPAEGCYPESTRPVRNG ERFKFNLFAEDSVCGDEVGKQFQFGRFLQVGNAESCTNKCVNGVSDSLAHSLMGVDYD CYSGGCDCLYSKGVLTGTDCDEYFNGMCDRSSSLKGVGSVATSIMSVEKACFKLVGTE AEDAEVAYMRRRN" gene complement(<352605..>354285) /locus_tag="THAPSDRAFT_12091" mRNA complement(join(<352605..352712,352820..353652, 353859..>354285)) /locus_tag="THAPSDRAFT_12091" /product="predicted protein" CDS complement(join(352605..352712,352820..353652, 353859..354285)) /locus_tag="THAPSDRAFT_12091" /codon_start=1 /product="predicted protein" /protein_id="EED87560.1" /translation="MKLTTTAALLLAIQGCDAFSFTSSQSFIIGRAHLQKTHGWTSSS SSVHPSSSLTLASSRNDIDVTDKTKSTNDNINTNTIDASSIEDSDILAKLTSELQELF ACTSIVYSYLAKEDGEVTVEDVVLACDRVDESASGEGCDDELFLNRHLSLRHKSHLFG RYHLLVKLLKSNYDAYIKTAEFLSPSRIPRGELPNVQDVAYFKEETVTNYLDEGGVAL VPDCELDDMTYNDSSLDKVLLSIFRKLVAENTGGIQNDTPGIKGLLIQGRQFMTKELP EGVSYEDHTIAQHTMVKNTLGGLMTPVLPPFYRIFMSGIVPKLGTEFDGKQLGPWFYA PFLTSMVTPIFFGFLVGPSRPNRRADGQRGGLVVEKCKFLQESGCKGLCLHQCKIPAQ EFFKEELGLDLTVKPNFVTQECQWSFGETPLPPEEDPSFPRGCLVGCESRKDMAGRKN EALCM" gene <355151..357451 /locus_tag="THAPSDRAFT_25867" mRNA join(<355151..355958,356070..356959,357071..357451) /locus_tag="THAPSDRAFT_25867" /product="predicted protein" CDS join(355151..355958,356070..356959,357071..357415) /locus_tag="THAPSDRAFT_25867" /codon_start=1 /product="predicted protein" /protein_id="EED87484.1" /db_xref="InterPro:IPR008380" /translation="MNHSTVTSTLLSILLLQQAVLEVDGFFFPIQQRCNTIKQSSFAS TTTLQSPTATAVSTKLDVANSDMSSTNEVQTDAITNANAQTTLQSATTATDYTTKPPT PSDYCTMTTLPRHPTNEAANEILIQTEVALRNMQEKELFMACGDERGDIWSSESGRPN TTTTITTTGSSNNNIGRGVDSDNVVLPLDEEEMEIPMEVEQESVYANSYVDLGKVDTV GFDYDYTLVTYTQQLLELIYDMALKRLVEEKEYPREMLESGLRFDPFFSIRGLAVDRE NGWICHLSYTHKVAVAWEGRHRVPRPRLMAEYSGKRALTPKERKTRLKPLNDLFSMAE CCLMADTVQFFLDRDIPFCPRSAVNDVLGSITGTHVSGEFHRQVAREPEKFFEAKPYL KSVLDGMQQSGKRLIFVSNSPFWYVEAGMNYVVGPEWRDQWDVVIASAGKPAFYTEDN RPFREVDINTGKVKFKQVTKLEKGRVYTAGCLKELTKCINWSHPLFSAKDDPPYDDIY SPLTSPNVMYIGDSLFADLVDAKREFGWTTAAVTPEVGVEIELQRKTEFKIAERAIEM LLNSLRVYQRILGTSLRSKDDLAVMDSMERMVSAWRDEQTRLLGNPFGSVFRARYQPS LFAHSLRRYSDLYMSNVGSLRHYSPQHRFYPESPKLLSHEIRGSNPECCDIDDDIW" gene 358331..359512 /locus_tag="THAPSDRAFT_25868" mRNA 358331..359512 /locus_tag="THAPSDRAFT_25868" /product="predicted protein" CDS 358374..359399 /locus_tag="THAPSDRAFT_25868" /codon_start=1 /product="predicted protein" /protein_id="EED87485.1" /translation="MATTNYTQQSQLRPPTKGFHCSESIEKKGIIPSIIFRCNLPPPN DNKTLQFNCREPSCGALEGEETAATLTDEDYAVDPNFFDSGYSMAGSTGFKVWTGSRL MIETLTWPQCEIDPERLVGIRRRLLGGAARVVELGAGVGVVGTYLAAVGATVLLTDLA TLVENAIDSNLLQNEGIATDGDDYDTNNNNPPPSWLESSSNCRRIGKGWAATTPLDWT CPIDEQLTKEQSESIDLVIASDVVFLVSMLTSLLDTVESIFKSSSHNNPSFILSFQRR DAKDGEGSSSFTTVNRIISSVKKRGWTIDCLAWRPVTVRKEECNGDVKEEQSEVFVFD INPCWVA" gene complement(<360249..>361790) /locus_tag="THAPSDRAFT_12094" mRNA complement(<360249..>361790) /locus_tag="THAPSDRAFT_12094" /product="predicted protein" CDS complement(360249..361790) /locus_tag="THAPSDRAFT_12094" /codon_start=1 /product="predicted protein" /protein_id="EED87561.1" /translation="MTIKQRLIGTLACIVLFQWCHIFLTYSKPWYLPSINFSHNLPWY QPKPKRAIICITGLLDQLELNSKLSHLISPFLQQKYEIELVFILQTPKNNTHIMRENA YVPNGRLFPPDREPYSFVKSFNGPELEYGWPETIPGDWTLPIQKNESVYTSKEQIIGK LNDEYGALIESGNMKVNIEIYDPIINPPINVEHFFSFVVRDSDFIKTGSLRRDRFERL LWRTVTHVRMLESYSRCWNYVKDNDYTSAVRVGEDAYLDDAVDVGSIESIITERIGME TMEKERRVIASLPCMTLRGHVNDQIDFISPDSTKTYFSSPYSLLYNNGVFNEHTIYNV SLYLRHLYSVSGLAVMVPSEDGWVPLGDIPHIDTLPLHNASMINLKSLVQSHQDTESK HYYPLSYDEFIRNITKYATNGEVSGVEKETTETENDVIEKRALVCITGQLARVELENK LETLIKPIQSAGYRVDVTLVLSAGNVWNKAPPSDFLPSFENEEQVLEYLKSKADLDVI SENIT" gene <364226..>365263 /locus_tag="THAPSDRAFT_12095" mRNA <364226..>365263 /locus_tag="THAPSDRAFT_12095" /product="predicted protein" CDS 364226..365263 /locus_tag="THAPSDRAFT_12095" /codon_start=1 /product="predicted protein" /protein_id="EED87486.1" /translation="MAATFFNSKDALHLLPLLLALFCATTTSYHTVNSLSFTPQQPQI KLPNFLSALTGRSSGAKTFVYTHLEGNGQLWQASNGNNKVSVVIDPLASQLDFGVPWG YRANKKSLSEQATIDMICNANPSHCLLTMGLDDHTHLPTIEKLMERMPKLQYVVAPSC EKKLLDAGVDGKLITVLKHGQSCELENCGRVTATEGALVGPPWQTRENGFLLALNGNN SEDEDALSIYYEPHADVVLDNIKQLRADVMVSPVTKQSLPAQVPKEGQFTLVYGGDRT LEIAETLGAKIVVPLGNGELDIEGPLAKLVEASGGVDEFEQLVDERNMKSSDAIRVEK ATPGVTLSVSL" gene complement(<365603..>366232) /gene="lhcx1" /locus_tag="THAPSDRAFT_264921" mRNA complement(<365603..>366232) /gene="lhcx1" /locus_tag="THAPSDRAFT_264921" /product="fucoxanthin chlorophyll a/c protein, LI818 clade" /note="Has EST support" CDS complement(365603..366232) /gene="lhcx1" /locus_tag="THAPSDRAFT_264921" /note="Member of LHC superfamily. Assigned to LI818 clade on basis of matches to Chlamydomonas LI818 and Isochrysis galbana (Swissprot FCP_ISOGA; gi535080, 535081); GO_component: GO:30076 - light-harvesting complex" /codon_start=1 /product="fucoxanthin chlorophyll a/c protein, LI818 clade" /protein_id="EED87562.1" /db_xref="InterPro:IPR001344" /translation="MFKLALLSLIGSAAAFTASPMAKTSTAMNAFSASDLPGALPPMG FFDPLGFAEKADEKTLKRYREAEVTHGRVAMLAVLGFLVGEAVEGSSFLFDAQISGPA ITHFTQVPDGWDALIITFIGAAEAQRAQTGWVDPNDASYDQPGLLKDSYYPGDIGFDP LGLKPEDPEELNTMITKELQNGRLAMLAAAGFLAQEAVDGKGILEHFSS" gene <366611..>367378 /gene="Lhcx6" /locus_tag="THAPSDRAFT_12097" mRNA <366611..>367378 /gene="Lhcx6" /locus_tag="THAPSDRAFT_12097" /product="fucoxanthin chlorophyll a/c protein, LI818 clade" CDS 366611..367378 /gene="Lhcx6" /locus_tag="THAPSDRAFT_12097" /note="Member of LHC superfamily. Assigned to LI818 clade on basis of matches to Chlamydomonas LI818 and Cyclotella cryptica.No intron; GO_process: GO:9765 - photosynthesis light harvesting" /codon_start=1 /product="fucoxanthin chlorophyll a/c protein, LI818 clade" /protein_id="EED87487.1" /db_xref="InterPro:IPR001344" /translation="MKFTLLSSAIVATSAFVAPSPSTIASTALFSTEESTEQDIITPV SPSVAAINGWTPNETQNCFGLPGSVAPTGYFDPLGFAQDGITLNEIKRNREAEVMHGR VAMLATLGYFAGEALPSPFGITGPANDQLQQVPLPAFLLLTAGIASAELKRANIGWVE PDFGNWTKTLWKLRDNYYPGDVGFDPLGLKPTDAKAFADMQTRELQNGRLAMIGAIGM ISQELVNHRTIMGTIDFYNKVYSGVNPYEGCGDGVIC" gene <368273..>368902 /gene="Lhcx2" /locus_tag="THAPSDRAFT_38879" mRNA <368273..>368902 /gene="Lhcx2" /locus_tag="THAPSDRAFT_38879" /product="fucoxanthin chlorophyll a/c protein, LI818 clade" /note="Has EST support" CDS 368273..368902 /gene="Lhcx2" /locus_tag="THAPSDRAFT_38879" /note="Member of LHC superfamily. Assigned to LI818 clade on basis of matches to Chlamydomonas LI818; GO_process: GO:9765 - photosynthesis light harvesting" /codon_start=1 /product="fucoxanthin chlorophyll a/c protein, LI818 clade" /protein_id="EED87488.1" /db_xref="InterPro:IPR001344" /translation="MFKLALLSLIGSAAAFTASPMAKTSTAMNAFSASDLPGALPPMG FFDPLGFAEKADEKTLKRYREAEVTHGRVAMLAVLGFLVGEAVEGSSFLFDAQISGPA ITHFTQVPDGWDALIVTFIGAAEAQRAQTGWVDPNDASYDQPGLLKDSYYPGDIGFDP LGLKPEDPEELNTMITKELQNGRLAMLAAAGFLAQEAVDGKGILEHFSS" gene <371664..>372187 /locus_tag="THAPSDRAFT_264925" mRNA join(<371664..371905,371957..>372187) /locus_tag="THAPSDRAFT_264925" /product="hypothetical protein" CDS join(<371664..371905,371957..>372187) /locus_tag="THAPSDRAFT_264925" /note="Poor hits to most other sequences, but does align with an EST from Phaeodactylum.; hypothetical protein with domain similar to flavodoxin; GO_process: GO:6118 - electron transport" /codon_start=1 /product="hypothetical protein" /protein_id="EED87489.1" /db_xref="InterPro:IPR008254" /translation="LVLYGTQTGNSEAAAEEITSSLSSQLSKLPFKVTPKLLTLDDFL ELRHGEWTRVVVIVCSSYGVGQAPLGARKFREWCDEVKFLSGVKFALCGLGDSHYTTY FRNPTVIEEALTTLGAVRVGKLGKADASGTDDMEQSKVIDRWIGDLWGVLETSIV" gene <374253..>376038 /locus_tag="THAPSDRAFT_12100" mRNA join(<374253..375494,375517..>376038) /locus_tag="THAPSDRAFT_12100" /product="predicted protein" CDS join(374253..375494,375517..376038) /locus_tag="THAPSDRAFT_12100" /codon_start=1 /product="predicted protein" /protein_id="EED87490.1" /translation="MVTANNLPLWGRLQPILPFHYTNSHPTDSYKDVCLSDGPSKLTL PGNFGREALLRPFLDGCPCLFRHHQGRLPNEEADGDSSKRKRCRDCRSIHSLAKFIPR RNLLEFAYEMCHEKQPLVDVDVQHDDTVTCPRQMILKVSGSNATQIVHLRRATDEVNQ SASQSSTSSKYEVSDMDEVSLYWGEFGIAAATTDGDDDEESTSPSSLLRFRIVRMTNE VGAAHTVTDDAQTNQQQALQPALSEGVVHDNSNNNNNNNNNNNNNKSATAQHEDEPMA SKAINTAGSDDTPSQRKEDDVNANIDSTKPSQKAIKSPTIPDVDTPSSVTSNTDDRKA HCASIDEACNNSLACESEESSAPLTLPTYFLATCSRSFSYDSSQRSQSPASNTSYYKQ VDGDDATLINETSELGEGWEWRSIRSGKRKAGSDDDDATMPTPKKSNVTADVARKTLR STLALEAEGEANNTSSDDGKSTVLLSSLSYDQILKLHQDTNNEIASQQPALRKAVLSL TLALTSNASSWDSAFLKQCGANDKVTSHRMGTTPDGNQQRPSSSTDTKASSKGSRMKQ QWIPRLLQGTNIVLNTRDDTV" gene <377553..>379486 /locus_tag="THAPSDRAFT_264926" /pseudo CDS join(<377553..377688,377819..378046,378104..378380, 378420..378609,378625..378883,378911..379266, 379348..>379486) /locus_tag="THAPSDRAFT_264926" /note="phytoene dehydrogenase" /pseudo /codon_start=1 gene complement(379825..>384828) /locus_tag="THAPSDRAFT_25869" mRNA complement(join(379825..380251,380330..381217, 381319..383946,384094..>384828)) /locus_tag="THAPSDRAFT_25869" /product="predicted protein" CDS complement(join(379880..380251,380330..381217, 381319..383946,384094..384828)) /locus_tag="THAPSDRAFT_25869" /codon_start=1 /product="predicted protein" /protein_id="EED87563.1" /translation="MTAAPDDAALMMAASSRTPSTSSGIIDGSTAATNNANTETLARS NLLRLETTALLSESSLHIHPGVAPGSGTLDSTAAGAAADGNNKVHYEARWSPLVRSYL NAVNGAITSLDACSLGPEVCVSGGGGGVSSERGMDKIDGGKKIMYRVPLLSDKFQKHV RGAKKSSSVGIGDNNNNNNSHPWSFPFQGGTSLSLSPIGSLGHLGNAGLANRHANGNV VPVLDVAVLFDEGFVGGKDYLNGRYGDKRNILAVHIAKQLSQKKHRSKVGAVHLTNVF GDERKVALLLTPPLDGEDRADKGSGSKKGGKKRKKGEDKRKNKGKLRFRIRLIFGVKQ ANMLPKHHGGYNSNDENDDEESSIAWNCWIPRTRLFPNRCNNRSEKKSDHEHDDGSGG SVEKSTPHYTNSLAEDLHLVSTTHLISSTLSTLTATGSNVTPTSSFHETLLLLKVWAL QRGFLRGHDTFTTTTLAVVLVYLYRTKGIGKRMGAMQGFTAFMKFWSEVDWLGEDSVG GNSGTSAAASLANVNVDVVLKRKVQKKAAFVIPEEGRNESQTISHCEQARLYLDDVRE NGEDDSPKTLLECYKRHNTSSSSSCSFITNNSHHDSPILLDPTMTINFLARLSPSFIR ESRAEANAALRFIHGHEREEVGGGVFRKLFLETNRFWTRYDAYVRIPMSVVPKIAVGG KKKGKQGGDLQVWGHDVEDLGYDESVCRGVVEVLSRAMGDRVTAIRAFTSGNGDIRAT ASVDVGEEAATKPINDSDQCHTNPIRGTSSCGYAAGLSDRAPQPPALPSTQDDPCLVV GLRIDPNSSRRIVDRGPPAEDVEGSNAFVALWGEHHAQLRRFQDGAIVRAVVWNAPAG EAMSLDDVRFAGEDRSMGGIVERVAQHIVKLHFTDAKKSSTKQGKGPKSVSFELRNMV SFIDGVASTKQPSPFSDSLTLHKNAMSAFDSLADFLRRNTATTVNTIGGGKKASKLGL PLSIDEVEPLSPSLRYSALYPPMPHPLLGGSNLSGDKRKISGVVVGEPILIQIRFEGS SRWPSSLNAMGAAKCAMLIQLADGIEKMKQEPGQLGADDLGAFDGPIDVTPNYLDLGY RGFSFRIVVRADQELRMLNSLKNPTDEAKILQLSLINRHVRGSMHHSLIHAVHTRHPS ASAVARLAHRWIASHMMSDMIPHEAVELMVAKIYTDSAESNSSKLPLADTAPATVTAG FLKWLRLLSSHDWAREPLIVDPQNHITINDRGLIHSQFNIVRGADYSRGPAMYIISPA DYDGVEDMSGSKLLGEEENTSQVPAAENIWAPSITANHPESVVLSRASALAKCSHDHL TSCIMRGSKGSSWVAAFQESPASLTSYSALLRVDPSFVTDPGCSSTASDSTIIFPSKD DGSVQIQTPFERSLQKRYAGPKELRKKNFKNLVLEKDTLHEWQPVKSLVSTLRARYNE YAVFFYNEFAPDLIAMIWRPAAFVPQPFSAMVSEFKRPVSDVWKEDTLVITNSDDLMC EIGCASKDIVTTLKVLDDKKPVDVAPAVKRQKKSWKDDYEVSSDEE" gene complement(<385209..>386669) /locus_tag="THAPSDRAFT_12103" mRNA complement(<385209..>386669) /locus_tag="THAPSDRAFT_12103" /product="predicted protein" CDS complement(385209..386669) /locus_tag="THAPSDRAFT_12103" /codon_start=1 /product="predicted protein" /protein_id="EED87564.1" /translation="MCHEKQPSVDVDVQHDDTVTRPRQMILKVSGSNATQIVHLRRAT DEVNQSASQSSTSSNYEVRDMDEVSLYWGELGIAAATTDGDDDEKSTSPSSLLRFRIV RMTNEVGAAHTVTDDAQPNQQQALQPALSEGVVHDNSNNNNNNNNNKSATAQHEDEPM ASKAINTAGSDDTPSQRKEDDVNANIDSTKPSHKAIKSPTIPDSDTPSSATSNTDDRK AHYASIDEAYNNSLACESEESSAPLTLPTYFLATCSRSFSYDSSQRSQSPASKTSYYK QVDGDDTTLINETSTKRNWKKDGSGEYQTTPQKSIRSGKRKAGSDDDDATMPTPKKSN VTADVARKTLRSMLALEAEGEANNASSDDGKSTVLLSSLSYDQILKLHQDTNNEIASQ QPALRKAVLSLTLALTSDASSWDSAFLEQCRANDKVTSHRNGTTPDGNQQRPSSSTNT KASSKGSLMKQQWIPRLLQGTNIVLNTRDDTVQVDR" gene <387395..>388873 /locus_tag="THAPSDRAFT_12104" mRNA join(<387395..387961,388040..>388873) /locus_tag="THAPSDRAFT_12104" /product="predicted protein" CDS join(387395..387961,388040..388873) /locus_tag="THAPSDRAFT_12104" /note="GO_component: GO:5851 - eukaryotic translation initiation factor 2B complex; GO_function: GO:3743; GTP binding - translation initiation factor activity [PMID 5525]; GO_process: GO:6413 - translational initiation" /codon_start=1 /product="predicted protein" /protein_id="EED87491.1" /db_xref="InterPro:IPR000086" /db_xref="InterPro:IPR000649" /translation="MILTINNIKRVVSIFILRPKSNSNEYQIATFKRCTTMPTFPNHW AGISGSLEEKESPLECAVRELGEETNINELFMEYEGDRVEKMESKRGYDDNEGSRHEL LQSSMKEGLYLDIAKKSSNGAFGGRVIRVYPFALKLQQSTLWSRIEMRGTEHDEMRFM DVHEFLELSPCVPGLQTAFHHATAGFYLRLPNDIKTWANDRVNGAAYLAQHAVSLAAF HTKASLKDSTTTCVAHNKPTAAQSIAMLRPSMVAIVNVMKEFDRRCNAEDEMKQSAVD DIDRIRDELLHSLRTEAGRCVEMGLEAILENYNEWRATSSPSSEFVVGTFSRSSTLKL ILERSLQLIDGQSTSQVKVVCSQSTPGGEGEHMASDLLNASWISDESFQQQLQQGRIN LVVVGADCILPHGIVNKVGTAQLATICKASKVPILCCTDRWKLWEDEFPPPLEEIFEL VSRDALDRVLLPPEKR" gene complement(388908..>392568) /locus_tag="THAPSDRAFT_25870" mRNA complement(join(388908..391545,391605..391777, 392043..392122,392258..392327,392508..>392568)) /locus_tag="THAPSDRAFT_25870" /product="predicted protein" CDS complement(join(389185..391545,391605..391777, 392043..392122,392258..392327,392508..392568)) /locus_tag="THAPSDRAFT_25870" /codon_start=1 /product="predicted protein" /protein_id="EED87565.1" /translation="MGVSSCLLHSSLRQCVISRLLHREPPIITSVIDRYHISYSQNSS GQTCCYVTVTGQPGVVFELTDGHGRHYEQLNPSHPSKTTGMPLSIALQPMNTTTQSSS VSSVSPPAHATDAAALGQENNGDQVNLRQCGGRGCNIRQHCVNNNNNNTTVKKETRSD SLSVIYESTPDGDIRTIELTKPGTSDKGEENADDDYASFTARLKDSYESRGRGTTTKP PTRHKQNLSSQFLDISISGDANNSGFGGEASSVADSAKSTTATDFAPPSPPSSPRGEP SMTATTTQREPSASSNMTIQNSHSDNGIVVPVPIHAKSLKEVTDNDNMASYSAPLTVP EFSRKHRRELSGRSNPALTHRRVNTVGDSEPVKAEHPHNMASIDEWKSANASTRDGAS RLGVASMPPPSTMGTHYPYYWNQLGGSAIVTASYDNSRGYYSSQHHQYHQYPPPAPYY NSSPQSDAGATYNVASYPPPPPIYSQSAESSLRKYYPESSYRGGHSAGAYSNAALAEA VASTRLYPTQEQTYGVAAGGNAKNDGYDTDESNGRKHRTESSLGNFLATSNIFDEDFV ERYGDHQAAAAAAAAAAAATTQHVEGYNSAPDLPQGVTIHRRTESEASFSRSMSDDNF LRNVKTGDDDPDAPRASPPTYSSCYQVGMPYQGQVDSHGALQSYVDYYGQQGMWQQQQ QQQLLGYYTQSGHLSTGEHGSMGDMASGVHSEHHKRFRRKCAVADCPNRVVQGGLCIS HGARRKTCSFPGCTKNVKKQGRCSAHGPPRRRCDAEGCEKVAVQGGKCISHGAQKKKC SLDGCAKQAIMGGMCKKHYDSVNGVVKVRASRRRSLKQKSASMECFSPIAVPLTDEHP KRESGHQRGLSFFQEIDNMDTIISDGMNDPRASSGASPVPVQFTISVALNNGNNVTGV " gene <393107..394602 /locus_tag="THAPSDRAFT_25871" mRNA <393107..394602 /locus_tag="THAPSDRAFT_25871" /product="predicted protein" CDS 393107..394207 /locus_tag="THAPSDRAFT_25871" /codon_start=1 /product="predicted protein" /protein_id="EED87492.1" /translation="MARLRFLPLAAALALLTFIQPSTKYPLQDTDESSSRKKTKHDVF GNNTNAKEHMTLRHRHDMETTSLDESPESTPTPPQYDVPIYLSDPSTGQISSMEWNLL SSQWDRQTTFSHLLQSINNPLLPTTTTNTSIPFLPNKKIITIFHLSPKSGSSTLRKAC LETQYDTCHKPRKGPNMKWPDGYLSPRALVKLMHECTSTHHYCVKHQPLILNYTTMYD TSTFLHVFPFRQYDEWVASALKQIHFRDGDDGCNEAEELLDRCQPHKYELDFGKYGKS YLAYFIHSLRMVRKSRKNRGTVNAHHHILLYDYTTLDGTMQGLNRLYGVPLLKGTDEK ENSVRPGGTCGGEMRMLDKFHDCFSDALLEVR" gene <394936..>396118 /locus_tag="THAPSDRAFT_12107" mRNA join(<394936..395051,395174..395292,395439..>396118) /locus_tag="THAPSDRAFT_12107" /product="predicted protein" CDS join(394936..395051,395174..395292,395439..396118) /locus_tag="THAPSDRAFT_12107" /codon_start=1 /product="predicted protein" /protein_id="EED87493.1" /translation="MSKSTILSPKANADDDMANGTKKYPKPWSMYTIFFRLEHYYIRQ SMSGGDIDDDIKQEIKLAPGHCDELEHPRPDKYKDIVLPPFWYSSAQRVITAKRRKHK KKLGRMSLTNINDKISRNWRSSDDSVINYCHMLAIAEKKRYEEHVAKFGGGTAGVGKK AKANAKRTNGTKSITKSEAVSSLNCAHSHADNKNHATLESEATAKNQTVDPAFTFSRS SMDLVDAIVSGFCSEILNRQSLEEPVKRQRSSYPFNRNTASDPANQTTFVCVPQWSQA DAASLLDALSDDDDGQVTVLCELCRGKM" gene complement(396501..>397645) /locus_tag="THAPSDRAFT_25872" mRNA complement(396501..>397645) /locus_tag="THAPSDRAFT_25872" /product="predicted protein" CDS complement(396590..397645) /locus_tag="THAPSDRAFT_25872" /codon_start=1 /product="predicted protein" /protein_id="EED87566.1" /translation="MALSPYYPNPNGAASSSSAATFQSSAPQQYHDQHVEVDNSANFR HEEAPSTQDTTQDCANTNTSTSHTTAKHRHSFIDFDNTENYSYSEIDLLDINLFRYDA GTTTNTATAVHNAASSSLSDVLCTDNLVLGDENLLDDAMPFALAVDFCPPHQVNIEHK KSMSSMKEIVSHDPNGSGGTVQKASMMCFSSSQQQAHQGLYHLPPLVGTRKNPSSSNA SPTITPIRRVCNKQGCTNRMVQGGSCISHGAKRRSCNIPGCTKKVKMQGRCSSHGPPR KRCEFDGCVKVAVKGGRCIAHGANKKRCSLDGCVKQAVTTGMCIAHYSEVNGVKFVSG KRKRRKTFVDDIAVDDC" gene <400207..402633 /locus_tag="THAPSDRAFT_25873" mRNA join(<400207..402067,402147..402233,402317..402633) /locus_tag="THAPSDRAFT_25873" /product="predicted protein" CDS join(400207..402067,402147..402233,402317..402570) /locus_tag="THAPSDRAFT_25873" /codon_start=1 /product="predicted protein" /protein_id="EED87494.1" /translation="MSGQSTNTNHPKAFADAATNGCNPIRKSTIDGQQHQQHSHQRIN SNVIYESTPEGDIRTIEHEPKDDFASFTARLKDSWDNDVDVAREEDGKMTKHKKDLSV QFLDISISGDPSQQLLGVGDAGVGGKGEDVTTSATTFDFEPPAPAAALLQREPTHLFA NPSKLSSSSSSSSSTSAGSNGGGTTSTRTAPGLSYPLYNAAGIMTSSNISDSPLLNSG IVVPTPIYAKSPPVPVAAPSPTDSSRKHRRELSARKPAMAHRRVNTKGESEAVVTKGH ERDNSIVMTAIQERTTPTPPSPGTAMPPPLGPPPPGHTYPYYYRGSLASPQDVYSSNI PAPPAAGGTYYDSSPRSDAGASYNTGVALPAYTDPRYSLSAQSSPLGCYPGNIVGSMS FPPQQPNLGQPEQEEQPIYSEEMLFAKLKSGNEDGHDNHHRKQSSLSLGSYLVSGAGD SHHRKQSSLGSFLASAGIFEEDFEIDEGYNSAPDGAHGHTKSLSTMSFLGSLSNDDFL RDAAGDGEGAQLAAAAHIGGMYGPPPSLPGYSQPQPFLGGHAIQTQFVGGQATDLDGL TEDQRRNRRRCAIPNCPNRVVQGGLCISHGARRKLCSFPGCTKNVKKAGRCSAHGPPR KLCEVEGCVKVSVQGGKCISHGAKKKSCSVEDCAKQAIMSGMCKKHHDQSKGLVKVRA PRSKKNSDDANNKQGQGAAAGHERGLSIFQDSETMDNIINNGVSGLLSSSS" gene <404134..407073 /locus_tag="THAPSDRAFT_25874" mRNA <404134..407073 /locus_tag="THAPSDRAFT_25874" /product="predicted protein" CDS 404134..406962 /locus_tag="THAPSDRAFT_25874" /codon_start=1 /product="predicted protein" /protein_id="EED87495.1" /translation="MASFIQKSDPWIENTFLYSCSFSNNDQHPLHYDTRDSHGYTKEY EENNNNTMHQEPSQFDSQQFECTENTIPSGLIDSQPDGLQGVVGGGIESNNNNNNSVV VTPPPSRQAKIVQIIHRGGSGWIGNSVVGKATVRGNTNRTITAAAADDDEHCPPYIIL HDGKYSTVAFLSREAMESVGLDTTLFDLDRIKNHDNSGAAGEESSPIKTRTRRGRAKK TLGHRADESDAGDISDASSVTSTMSSRAERAARRAKQPLSSTAINSNKVTLRDKSLVS ITHYTVSTVLQCCSSSNIHHNRVLNSLDIPSNIQKQLHNQLHSHLFLCLYLQGPITII GGENQGLIGNSVDVHCSVRLRQLLRDHDDGGVGDERGERGGRSHETLLEKLEACHLFY QNAMKKKVELQQQQRGSLMCWGSDVSEDVGQGRGGKKRLHGKRNNKGKTSRRKGENGS NFLVPDWPWTSRLDSVPSNITAAEHSPGNVDQLLGGGLDDMLGSHPDEYETESPSKRT AGRASTGTYLAQWDLIDSDDEEGKEEGAGNDGGEKATGDEVQQGDAAALFENVDYLDE ILDFSSESEAAAETLEKSNPPDDDAEDDERDEVRNNTASFVGIDQMIVEEDSEDDDGD NGHLYTQPDAMVNDDFDEDGPFSQVPISLKSDSPKKAYDDGPESQIPIGLRKQPQKRA YTRTQRRSFPEKSTEPINEESQIPLITRRTLRQVQSDDDDGPESQLPFGILLKQPPTA SKHHVEEKPEESRNESQLPFATRRSTFAKGGEQYYDSDDSDSWMKVVPRSKGNSNLSA VQRRAAKGSNNDRSQLAGEGSEENTPTNRRGGAEAAIEVVEEDDFSPDNHAQGEELQS SHNKSRRDRDHHVSFNLEKSNPLPASNVERASKPAVQSQSTSTSAATRVKSAKPPTRP VYGGMVPKKRKNYDMNEFFSRARRMYEC" gene <407759..408909 /locus_tag="THAPSDRAFT_25875" mRNA <407759..408909 /locus_tag="THAPSDRAFT_25875" /product="predicted protein" CDS 407759..408730 /locus_tag="THAPSDRAFT_25875" /codon_start=1 /product="predicted protein" /protein_id="EED87496.1" /translation="MNIKQFSLATYLLSASSPNTNSVHVTAFSHTPRRDATRLGYRTY SDEYGDGDRMSLLTLLEEPTATNVAASSSSSSSENNHVFQEGIVASAFASDLQTKTSK PLSATPTASPLMHWEGQVDVDGDRLSLSTPLRLSSPIQTAPSSASTSSRPATTSSIPS SRATSRTSPITPLASLSDYNHHVLSPSNTQLTLIRFHAPWCQVCRTTSVAYERMASKL SKKGVRFLSVNIDNNNPECEKNVLKDMLDVEAVPMAVVYHPSRGVLGKVKLNRGNLTE LKKRLGGYVSGALHRQGEMMWMEALLTGLLHQEERRGAGVGSAKSDE" gene <409964..412255 /locus_tag="THAPSDRAFT_270136" mRNA join(<409964..410267,410834..411084,411193..412255) /locus_tag="THAPSDRAFT_270136" /product="n-acetylornithine aminotransferase" /note="Has EST support" CDS join(409964..410267,410834..411084,411193..412077) /locus_tag="THAPSDRAFT_270136" /note="Involved in arginine biosynthesis. Putatively chloroplast targeted; GO_function: GO:16740; acetylornithine transaminase activity - transferase activity [Evidence 8483] [PMID 3992]; GO_process: GO:6520; arginine metabolism - amino acid metabolism [PMID 6525]" /codon_start=1 /product="n-acetylornithine aminotransferase" /protein_id="EED87497.1" /db_xref="InterPro:IPR005814" /translation="MKLLFSSLLLATTSTSEAFAPSILTSAAVTTTSTATQLHQSTTS SSPGSVTSNLTPPSKIDSSSIANLFDTRVQKTYGRYPITFVSGDGCALVDEDGREYLD FVSGIATCALGHNNPALTKAVCDQMTKLHHVSNLYYTPQQGLLAAWLCENSCADKAFF CNSGAESNEAAIKLARRHASNRGITDPVIITAESSFHGRTLATVTATAQPKYHKGFTY GGEMVRGFRYTPYNDEEALKKLVDEINTTPEEDAKAGRKRGLAAIMLEPLQGEGGIRP GTKEYFATARKLCDDNGALLICDEVQVGMGRSGSLWGHEQLGVEPDVFTSAKALGGGV PIGAMMAKGAAAEVFGPGDHASTYGGNPLACAAGLAVAEYLSEHDILTNVRERGDQLS AGLEEISKRYPSVLGEVRGWGLLKGVAIKDDAGCTAAELVGDAMKEGLLLVAAGPSVV RFVPPLIVKKEEIDEALARFEKAIEKRVS" gene complement(<412402..>413069) /locus_tag="THAPSDRAFT_12114" mRNA complement(join(<412402..412864,412916..>413069)) /locus_tag="THAPSDRAFT_12114" /product="predicted protein" CDS complement(join(<412402..412864,412916..413069)) /locus_tag="THAPSDRAFT_12114" /codon_start=1 /product="predicted protein" /protein_id="EED87567.1" /translation="MAEIAIGCASSRRVTMIKPTVGKVKTSAYALPDNEFVYGIESPL DKEGAGKAQYRSLVSKVIQSWSRTEPSKPPCSMQSFPATNRRALENGCLTSKAQREYG KQYPVMKQNPKQCTKSQSKASESLNDVVDQKLKAKTSDVITSLFQPQTQPQQPPQAFG IQSKKNKVSMTELLRCIPAELEEESDYPDLSGKKRKGRLPPAKST" gene <414060..>417528 /locus_tag="THAPSDRAFT_25878" mRNA join(<414060..415740,415877..415984,416071..416261, 416340..416549,416624..417025,417106..417281, 417312..>417528) /locus_tag="THAPSDRAFT_25878" /product="predicted protein" CDS join(414060..415740,415877..415984,416071..416261, 416340..416549,416624..417025,417106..417281, 417312..417528) /locus_tag="THAPSDRAFT_25878" /codon_start=1 /product="predicted protein" /protein_id="EED87498.1" /translation="MHYVTNTPVQATLPSYSTLVLLIDPISTSCRIAQEISKRGHHLC ALWTKDYIGDSAEQQSKRSPGTYGCGGLRYKAELEEGRDYVVGDGDVSSLVDVVQEVA STAKLSIVGCFSGSGLSRATRLADGLAVKLGLEPCLVPPKCVGNDGGVGGIVPDICNK NTQQELLKAAGLRCIRQVCSSTLDESTVHFLETEKYPLVVKPASKDRASGGIKLCRTK EEAVDHFNLLIGANQSNGSKMEVICQEYLRGTEVVVDHVSRNFQHKTCMVWKYDKRPA NGENHVYFGMIPVESDSYEAELAIPYVRKVLDALGCKNGPSHAECIITADGRGAVLVE MNVRAQGGDGSWSRLATALTGRYSQIEASVDAWLDEDEFDSLPDAPLSPFQSHGLQVH FVSYSEGEVVSTPGFEVLKHLPSFVSLSPSVGIGSTVEYTTDLATSPGVCLLMNKDEN KLKKDLDFLRYMEEINGLFTYKTNVESLARPTAASYGTPHRRIKSTVDRMEKPSLLRI LSNDRPELARQGMLMKRMTTVDSSKEVVIVTDPYSTGCLLVNEMCSRGYRVIGEMKYH AEVTECESLADTVQAIYKAAGSLRVVACLAGGEAGVDCADAVSERMSLRTNGTHIANR RDKKVQQEMIIAAGMRGVRQAAGQKFEDVVDFLQTEHYPVVLKPTDSAGSDGVKLCHN FEEAKVHFHHLLEVEAVNGGMNTEVLCQEFLRGKEYVVDQVSRDGVHKTMMVWVYDKR PANNAAFVYFGMLPVDPNSAEAKILIPYARGVLDVLGVKNGASHGEVILTSTGPCLVE MNVRAHGGDGNWRSLARGLTGGYSQVEVTVDSYLDKKQFSVIPNLPPTPFKANGQEVI LVSYFRGKVKSTPGYDVIRELKSFVYMETGIKAGSFVERTVDLLTGIGSVILVHSDKE IIDADVAKILWLKIRVVFGLHAIYRATSYLFALLFGRSLWHRREMEKKNELFEYEKSG ALFTAVSQRHLSGILVNEGDEC" gene <421481..>423395 /locus_tag="THAPSDRAFT_12116" mRNA join(<421481..421732,421894..421989,422087..422113, 422200..422265,422389..422449,422557..423173, 423207..>423395) /locus_tag="THAPSDRAFT_12116" /product="predicted protein" CDS join(421481..421732,421894..421989,422087..422113, 422200..422265,422389..422449,422557..423173, 423207..423395) /locus_tag="THAPSDRAFT_12116" /codon_start=1 /product="predicted protein" /protein_id="EED87499.1" /translation="MDDNNGRRYEPDRPKTKRKVVQSLIDHTYYNCSLVNVEEELARR SSQRHHKYKNPNRKGLTTNFPAKLHKILSNPEFRHIIRWMPHGRSWTVVNKDLLAKTV CKRYWNHESYESFNRSLNGWGFKRLWRAGPETRQYYHECFLRGLPDHTKLMERLVNPG KRIRDKQGEPNFWNIAKDFPLPPDPYDDEPTRNGGDYEDYDYENNNSVKETSSPHSTT SSRPSFTTRSVSLPTAPHAHGHGSRHHYHPYDLYPHGPHRRPPASFHRHSNSCPPASA HYDEHGWNHLHNHHYHQPYYPSAYQHYPSSTVIYNPYRSCGYSVNCDTSEHNSPMRST NAHSSATKSNAMVLGKDEGEMEDERKPPADGGCEEFVAPPPLPETGSTNPNIDDADVY SKEQSAETERKHEDDDATHYNHFKYNYDEANLQEFASEYAKDD" gene <423944..>430141 /locus_tag="THAPSDRAFT_25879" mRNA join(<423944..424170,424297..424396,424447..426087, 426172..426402,426484..426566,427040..427100, 427302..427390,427473..427897,428215..428425, 428524..428638,428732..428926,429004..429185, 429326..429599,429669..429752,429806..>430141) /locus_tag="THAPSDRAFT_25879" /product="predicted protein" CDS join(423944..424170,424297..424396,424447..426087, 426172..426402,426484..426566,427040..427100, 427302..427390,427473..427897,428215..428425, 428524..428638,428732..428926,429004..429185, 429326..429599,429669..429752,429806..430141) /locus_tag="THAPSDRAFT_25879" /codon_start=1 /product="predicted protein" /protein_id="EED87500.1" /translation="MSQSNSKRPHDYYNPSDEFNSPLNDADFNAAAADFGDFKASKDM RRKNSHETMATFMSVQTSFTKKGRGWECRGCSMLNDMECNQCEVCGGMKHDLSSQQQQ QQQQQLIRLCSMGSGKSSDTKSDKSKQSDKSKQLKQSSQQKDNSKQVSGSSQRSAPDP PPLDKKSRGGGRKKKMSCGRDAPGDAPEVVISSGGGIAGDRAAVKGKQTRNDAAGARE GTKKKGGEKGTSTTPSTAATTRPPITLGAAGLPGEGTSAVIAQLLGLTEELNLHDTTA AGNASNTHSSHSGSNKNNSKGRKKTTFATEVQDIGNATKLKAKIQGDREKAVRHKAAR RSGEKRESKEKQPQQQSELVISNTGSIRGDILVDTAPTQSTLTLGEASDLCASAQPWQ PSRNTEVEASVSQLNSNESRHKQHGGSEDQAKKEKRPKKDHGKKKGKQFKAEGVADGG DEGDSAEQSSPKQKDGSAGDDVDKRSNKAKKSRKQQQSGGEANAPSSVNLTLQAIQQS KQSKGKGGRNTNEDSKSPSQHQEKQQSREHVRKDEQGSTKATAKKDGNNGSRVKDAAA VTKGTKAKRSIANDDTPKEATKQRDKSHPKKADKSHDKSASNKASSSLAATSSHAPQI PPSQPQTTNNLNYGAGRPIVVVHIAEKPSIAQAVAQGLSDGGDTKSGGKGLPTYEFTN PPFNKAPQASKVTHRVTSVAGHVFNVDFPSEYQSWDSVDPAELFHAPVVRKPCKGSVV RHLQDVAKGVDFIVLWMDCDRRDTSKGDTGISIVQCCLMDHTPTLGFCVQRHVEMETF KPEPYWLMDLGVMNAGTMCRAVWDSGRSFNRNKVDDLVAKCKNSSSAVFAKVDAVVTK NRHQGRPVPMNTVAMLKACSKVLGIGPHAALSIAERLYLSGYLSYPRTESTKYPKSFD IVGTLKEQSYDNRWGQYVGELLKSGPNVGKGGADMGDHLTISTSSISLHCQLVTPGFL AIVLYKQYGDEADEGNGGDDEEEKALPPFAEGDQFGLFFAGSAKSGKVCVSDKWATLD VKEKMTTPPTYLTESELISKMEKHGIGTDASISTHIENIMKRNYVELISGRKLKPSRL GLVLAQGYHLIDSSLVLPQIRADIELECNKIAKGLADRDDVVKKAIEMFSAKFLYFAE NVNKMDILFGSSFAQLQDVGKPFTRCGFTRRYLQFITGPPPLPIGGELKQWTGRHCPV VGCNFELCLYSVGAPARTFPLCPNCFNNPRPEWGPVPGENAVAKPGEEDDEAKERSIR RLAGKNICRECPHGDKHPLIEDMTVSPDPDSGGVLILDPHLGPKLTFLMVSKWRLVST REPTIVHLPKSIDKVTILDKKDEVLGCHMMSIEFKDGESPIDGKKKYVSCFAIDEVLQ GLVRVHHGDERLKASGGRGGRGRGGRGGRGGRGGRGSRR" gene <430391..>431164 /locus_tag="THAPSDRAFT_12118" mRNA <430391..>431164 /locus_tag="THAPSDRAFT_12118" /product="predicted protein" CDS 430391..431164 /locus_tag="THAPSDRAFT_12118" /codon_start=1 /product="predicted protein" /protein_id="EED87501.1" /translation="MKQTSPASLLPMLQGPMASSFDAVAAVADGTRARARSISANKED TDGAQDGYRNQQSPSFEAVSALMMLSTSNKKPTPKPKKQPKAEAKQQWTRKKVPSAFN LFYQYKVHRIRTDPDESSHDYSPLPGLEDMSPNDPLLNKSEQEITRYRKEVIEKALSQ CVDQKNEYKQKLPEVAAQQFARGFVDMGKFMSKEWAKQDKATKCIFTHISKERKAKQD KAVFDAYRAASRRYAFPSLNVLDNRSMPLERHGNLSLGR" gene complement(<433387..>433857) /locus_tag="THAPSDRAFT_38807" mRNA complement(<433387..>433857) /locus_tag="THAPSDRAFT_38807" /product="predicted protein" CDS complement(<433387..433857) /locus_tag="THAPSDRAFT_38807" /note="GO_function: GO:3824 - catalytic activity; GO_process: GO:8152 - metabolism" /codon_start=1 /product="predicted protein" /protein_id="EED87568.1" /db_xref="InterPro:IPR000887" /translation="MQACIDGGFKITEFTLTTPDCLEHLANFREKYDGDVMVGCGTIM NTEDAERAVDAGAEFIITPVMLPDVIEWCAKRNIVIVPGCQTPTELVSAYRAGAPLQK LFPGVTGGPAWVKAVSSALPMLSINPTSGVTLENAGDYLRNGAASVGLVAPLFDP" gene <435350..>436686 /locus_tag="THAPSDRAFT_12120" mRNA join(<435350..435609,435740..436192,436284..>436686) /locus_tag="THAPSDRAFT_12120" /product="predicted protein" CDS join(435350..435609,435740..436192,436284..436686) /locus_tag="THAPSDRAFT_12120" /codon_start=1 /product="predicted protein" /protein_id="EED87502.1" /translation="MKLNLTTVKKAMGITPKSVGRVFILDGNNVVGHRVVNLLIDEGE TPDVTIRVGMREEEDNDKDVEKWNCVERVKFVWEDKTTYDNALKDVKTVFVTTPTTEN WDQHFTHFLSACNKAKVKRIIKLSMYHSLRSRAENPRRFFGDPEYLVGRDQFHDVPLV HQHALCDGDLILRGLDCTILFASHLMSNVLKYQGTTLCERKEFYGASGGKKVNYVSPN DVADVAVRAILDPKSHRREGYTLTGAAPIKDEEVATLISQRTDTEVAYVEKPLSFFDK DSAMLEKIKASGLEEEKFAKGDFERLAGREPETFEEYLSADHRMTVDERKALGLADFF TMTNKIELDFPSEPHFVTLVPKTEETPAETAQPVAAQ" gene complement(437192..438922) /locus_tag="THAPSDRAFT_25881" mRNA complement(437192..438922) /locus_tag="THAPSDRAFT_25881" /product="predicted protein" CDS complement(437202..438686) /locus_tag="THAPSDRAFT_25881" /codon_start=1 /product="predicted protein" /protein_id="EED87569.1" /translation="MKRERELVKQQQGGEERSDCSNTNTTPSTLRPPPPPPPLPPDSE TDCKEEEYARDKLAELTQAYEILSDDLTRLLYHRYGIVDGPEDVVGLLNGRMRVDSTM KKSSIAGEEATNGLNGDSSSSSEQGRLMELMGYPIGGFHTHHHHHHTQHRHNTQHRQR LHYLTATITEKLRPLVEGTVSQELFVENIYQECNSLKKSALGAQILRCVGRAYRVEGY RVLRMMEREKRQHGGHQALHWRRRRHSHHKVTDLVRDRWRDAKHLCSAALASTKLAIA EQKLKRVYTEHERRKEQRKTEREERRIVLARGGIGERANGNREELDLISNIGALPGEE EVAGDDFDTGMLSDDEEGDFLDEDNGRVEEELRHLQHQKAIQAILSVSQMEALWKITK IELDQTVRCACQWILSPTDYDGQWHSFFPSEQSPFAEDWQHHTRHCYSSHRSRYSSNQ RHDGWVGTTGESVSMEVGRLRAAAALVLVGDIMVECSKEGTSWK" gene <439793..>442440 /locus_tag="THAPSDRAFT_12122" mRNA join(<439793..440365,440528..441642,441751..442386, 442407..>442440) /locus_tag="THAPSDRAFT_12122" /product="predicted protein" CDS join(439793..440365,440528..441642,441751..442386, 442407..442440) /locus_tag="THAPSDRAFT_12122" /codon_start=1 /product="predicted protein" /protein_id="EED87503.1" /translation="METFYEGMDPSLPLSVYIRKQEECQDFANDAKVPISDQTMITTA TKHALQCGDYTEAWKEWNRGTDAQKTWRDWKNHWTRAFNENRAIQRLTGNSFRANATI ETELSTQLVTSLDNLAYAAVQKNDTIVKFIETIKQQQDTIHKLQAQNGELMVKLLGGQ SAADANAKKGGTTGDKHAWDPSGYCWTHGFKLLSSSTIFSGCANPPNLNTTALLDTAA NISLLANGAPSERANSQLTPKSVMQPKGDRLFTTETLLLLLNKVPLEAREAHRAPGIT NNLLSASALADAGCELFFHQTGCEVSLNGEIILRGWSDPDTRLWRVSLLADGSNSIVP ADQNIVQSPTVNGIYECENTGELIKFYYATMGYPVISTWTQAIDKGYFRGWRGLTSDR VQRFIKPNEQCEQGHMDQRRTGIRSTKSSHAVPPPDIVNTMEEPEQAPQNDKTNMVFM TIAEAEGQLFTDQTGRFPVTSNRGNNYIVLFYVVDANFIKSYPIKSRHRTELLKAYDD VYKYLRIRGYRPKLHRLDNETSKDVEDFIAEQNAKHQYTPPDIHRTNIAERMIRTPCT QNPNLSAYEAMEGMFSFDATPMAPIGTECMIHVKPSKRHTWGYHSMKAWYFAPALKHY RCIKVVTDAGDVRTTDTFKFLHHTLPVPKVSSTDRIVKATKHLRQAINDNTTTAPDEL HAIENLRALLLGSPPQPPLPETSVQPASPPQAETSDLPPPEPQTAQPPVQFVQTGSPP VQPSHRQSPHAIPFNDDESEAYTSSGTDDLPLPQRPSHYSIRHRR" gene <445816..>446356 /locus_tag="THAPSDRAFT_12123" mRNA join(<445816..446152,446229..446285,446307..>446356) /locus_tag="THAPSDRAFT_12123" /product="predicted protein" CDS join(445816..446152,446229..446285,446307..446356) /locus_tag="THAPSDRAFT_12123" /codon_start=1 /product="predicted protein" /protein_id="EED87504.1" /translation="MVSESIITPEKKRVSEGKKSPSDRKERHKLNKKPRKVRQWKPNQ DEEENFNGSTALGILQSTKTRILAAITCTMSNGSPGTKCDGTSACDGMDPEKVGCGSC NGSYACNGFGGASIGENSCNDYYACFNADAPEKSVYAKYCPGSGL" gene complement(<449574..>451739) /locus_tag="THAPSDRAFT_12124" mRNA complement(join(<449574..449778,449883..449964, 450104..450135,450233..450320,450404..450581, 450720..450770,450925..451011,451132..451218, 451339..451425,451500..451535,451641..>451739)) /locus_tag="THAPSDRAFT_12124" /product="predicted protein" CDS complement(join(449574..449778,449883..449964, 450104..450135,450233..450320,450404..450581, 450720..450770,450925..451011,451132..451218, 451339..451425,451500..451535,451641..451739)) /locus_tag="THAPSDRAFT_12124" /codon_start=1 /product="predicted protein" /protein_id="EED87570.1" /translation="MYLILCIHAYASLFICFASKQNPTRSPSVSPTKRRTETPTGFPT SAPTPSKGEESTPTSSPISYYGKGKGTKGTAPTPSKGEESTPTSSPFSYYGKGKGTKG TAPTPSKGEESTPTSSPFSYYGKGKGTKGTSPSQAPTASPITPDPTKAATTTTSTIST TTTTSMTTTPEPKFWPDWLGDDTCVFNEKFPQYMQLNPSWTGSTLEDCCKRYYSWRYD NCMVEGGGTSNTAMLYYPNWEGSAHVCVNDGEAPDYITQAASTFMFEDLEDCCEKYYW WNMAECLGSAANAGSNKYYADYRLSKCVKDCTDSDCGGLVGGVWDELYDDKSVCCAEK FWWVEDCDA" gene complement(<452777..>453733) /locus_tag="THAPSDRAFT_264928" mRNA complement(join(<452777..453333,453379..>453733)) /locus_tag="THAPSDRAFT_264928" /product="hypothetical protein" CDS complement(join(<452777..453333,453379..453733)) /locus_tag="THAPSDRAFT_264928" /note="No reasonable similarity to any known sequences.; hypothetical gene" /codon_start=1 /product="hypothetical protein" /protein_id="EED87571.1" /translation="MCLALQLQECWCSAKIIPCILCMDACASLFIANSSSHPQIDCSH FFALHPLQRRNDPPSTKSPSQAPTMTTIQKMIRGCDGAANDQFGYDLAVKGDWAAIGA YGVSSYTGAVYLYKHDAEWDWFGYTVALSGTGHLVVGARGDDDKGYNSGSVYLLTLNS SSNTWGDEQKIVASDGAASDNFGDTVAMSGTGHLVVGAPYEDDKGSNAGAVYLLTLNS SSNTWGDEQKIVASDGAASDNFGDTVAMSGTGHLVVGAPYEDDKGSNAGAVYLYTLNS TSNTWGNEQKIVASDGAADDNFGVTVAM" CONTIG join(AAFD02000042.1:1..454954) //