LOCUS HBJA01130309 5879 bp RNA linear TSA 01-FEB-2021 DEFINITION TSA: Eutreptiella gymnastica, TRINITY-DN39798-c5-g1-i1. ACCESSION HBJA01130309 HBJA01000000 VERSION HBJA01130309.1 DBLINK BioProject:PRJEB37159 BioSample:SAMN02740100 Sequence Read Archive:SRR1296768, SRR1296802, SRR1296866 KEYWORDS TSA; Transcriptome Shotgun Assembly. SOURCE Eutreptiella gymnastica ORGANISM Eutreptiella gymnastica Eukaryota; Discoba; Euglenozoa; Euglenida; Spirocuta; Euglenophyceae; Eutreptiales; Eutreptiella. REFERENCE 1 AUTHORS Scheremetjew M., Meng A., Cochrane G., Corre E., Pelletier E., Niang G., Holt S., Finn R., Kale V. JOURNAL Submitted (13-JAN-2021) to the INSDC. Roscoff Biological Station - Sorbonne University, CNRS / Sorbonne Universite, STATION BIOLOGIQUE DE ROSCOFF, Place Georges Teissier, 29680 Roscoff, France COMMENT Assembly Name: 37fc5a59e54183c1591ee587e54a4e93 Assembly Method: Illumina HiSeq 2000 Sequencing Technology: Trinity-V2.6.5 Eutreptiella gymnastica-like, TSA FEATURES Location/Qualifiers source 1..5879 /organism="Eutreptiella gymnastica" /strain="CCMP1594" /mol_type="transcribed RNA" /db_xref="taxon:73025" gene 1..5879 /locus_tag="EGYM00163_LOCUS44761" /standard_name="ORF type:complete len:1495 (+),score=357.62" /note="ID:GENE.TRINITY-DN39798-c5-g1-i1~~TRINITY-DN39798- c5 -g1-i1.p1" /note="source:transdecoder" mRNA 1..5879 /locus_tag="EGYM00163_LOCUS44761" /standard_name="ORF type:complete len:1495 (+),score=357.62" /note="ID:TRINITY-DN39798-c5-g1-i1.p1" /note="source:transdecoder" exon 1..5879 /locus_tag="EGYM00163_LOCUS44761" /note="ID:TRINITY-DN39798-c5-g1-i1.p1.exon1" /note="source:transdecoder" 5'UTR 1..46 /locus_tag="EGYM00163_LOCUS44761" /note="ID:TRINITY-DN39798-c5-g1-i1.p1.utr5p1" /note="source:transdecoder" CDS 47..4531 /codon_start=1 /transl_table=1 /locus_tag="EGYM00163_LOCUS44761" /note="ID:cds.TRINITY-DN39798-c5-g1-i1.p1" /note="source:transdecoder" /inference="protein motif:InterPro:IPR030960" /inference="protein motif:GO:0005737" /inference="protein motif:KEGG:00400+4.2.1.10+1.1.1.25+2.7.1.71+2.5.1.19+4.2.3 .4" /inference="protein motif:InterPro:IPR006264" /inference="protein motif:InterPro:IPR001986" /inference="protein motif:GO:0016765" /inference="protein motif:GO:0003866" /inference="protein motif:InterPro:IPR000623" /inference="protein motif:InterPro:IPR016037" /inference="protein motif:InterPro:IPR008289" /inference="protein motif:MetaCyc:PWY-6416" /inference="protein motif:KEGG:00400+2.7.1.71" /inference="protein motif:GO:0003855" /inference="protein motif:InterPro:IPR027417" /inference="protein motif:GO:0009073" /inference="protein motif:InterPro:IPR001381" /inference="protein motif:GO:0003856" /inference="protein motif:MetaCyc:PWY-6164" /inference="protein motif:GO:0004764" /inference="protein motif:KEGG:00400+4.2.3.4" /inference="protein motif:InterPro:IPR036291" /inference="protein motif:InterPro:IPR022893" /inference="protein motif:InterPro:IPR023000" /inference="protein motif:InterPro:IPR031322" /inference="protein motif:InterPro:IPR023193" /inference="protein motif:InterPro:IPR010110" /inference="protein motif:InterPro:IPR013708" /inference="protein motif:KEGG:00400+4.2.1.10" /inference="protein motif:MetaCyc:PWY-6163" /inference="protein motif:GO:0003824" /inference="protein motif:KEGG:00400+1.1.1.25+4.2.1.10+2.7.1.71+2.5.1.19+4.2.3 .4" /inference="protein motif:InterPro:IPR006151" /inference="protein motif:InterPro:IPR036968" /inference="protein motif:MetaCyc:PWY-6707" /inference="protein motif:KEGG:00400+2.5.1.19" /inference="protein motif:InterPro:IPR013785" /inference="protein motif:GO:0055114" /inference="protein motif:GO:0004765" /inference="protein motif:InterPro:IPR013792" /inference="protein motif:KEGG:00400+1.1.1.25" /inference="ab initio prediction:InterProScan:5.28-67.0" /protein_id="CAE0833465.1" /translation="MSYKKVSILGKESIYVGNDLFDIIPGDLKKAVKSTKFAVVTDEN LAPLYLDKLKAAFKKEGIDLLAQIIPPGEQTKCRAMKEKIENTLLEAKCGRDTCILAL GGGVVGDLTGFVASTFLRGVPFVQIPTSLLAMVDSSIGGKTGLDTLHGKNLIGAFHQP VRIYIDMDVLKSLPERQLWNGMAEVVKSAAIWSADAFKQLEEEYEKVQERDPEILASI IWTCAGIKADVVTKDEKESGLRELLNYGHSIGHAVEALMQPGMLHGEAVAIGMVKEAE LARHMGLLSPASVGRLIRCIQVYHLPIHMPQLTVPDLMEKMAVDKKNKGGKKYITLLN SIGSCHMNKAIPVEDPLIEFILSPAVVVQPPSKPVNCTIQVPGSKSGSNRALLLAALA EGRCEMEGLLHSDDTQVMLDALAKLGACQFEWIAGGSKLVIKGNGGRLNIPSSEIYLG NAGTASRFLTTVCNYVKGKGHTVITGDKRMQERPCGPLVDALRANGCKIDYLNREGCL PLKIQSDSFKGGVIELSAKVSSQYVSSVLISAPYAQKEVTLKLIGEVVSPIYIDMTVQ MMREFGGVVEKKADNLYIVKQQKYKLPAKYSVECDASSSTYPLCIAAITGGKVKVQNV GKDSMQGDAQFCWLLQKMGCKVEQTVDCTTVEAPADGVLKPVEVDMGDLTDAFMGAAA LMAVAPGTSKIYGIANQRVKECNRIAVMREELGKCGVVCTELPDGLVINGVDAKTLHG ADIKCYNDHRIAMSFAVLGCKVPGINILEKHCVEKTYPQFWDDLKNIMGVAVGAPASA TTRTGKRGLENGDAPSAKKARTGSSVDTVVLIGMRGAGKTGLGREAAGVLGRKFLDLD HAFEAKHGPIMDFVKAKGWDAFRVAEAQVLQESVKAHPKGHIIACGGGIVETEAGRTE LKKLKQVVEVRRNIDDVCKYLKSDKSRPYLGEEPDLIFKRRDPLYIQCSQYQFNVPKG TTDWKPVHKELVGLLKRLEGGAQIVDPSKPSYFLSLSFPNYAAQAELIPKIVTGSHAI EARIDLLESQDKEFIAAQISALRRVSDLPIVYTVRSIGQAGKFPEDEAKMFDLLHYGV RLGCEFVDMETCWSCQARDKLLANKGSSKIISSFHDPMGCCSWPEMKMQFEEGSQKGK ADIVKVVGMATKIEDVFALRQVVSELELSQPVIALCMGEAGKLSRVLNTYLTPVTHPA LPMKAAPGQLSVSEIHQIRHSIGLLPKKNFYLFGSPISQSMSPTIHNTGFGVLGLPHV YSLSESPDPEHVRKVLQDPNFGGSSVTIPHKQNVQPMLQSLSPSAQAIGAVNTIIPQA DGTLHGDNTDWLAMRTLITQTLAASNRKAEVGIVLGAGGTARAAMYTLQQLSLSKYYI FNRTLEKAQQLANEFGGTAITSLDGIKDLDVIIGTIPADAQSAYPKELFNKKPVAVDL AYRPRRTPLLKQATAAGCETVEGIALLIEQGLFQFQIWTGCIAPREEIAAAVYGAYKD " 3'UTR 4532..5879 /locus_tag="EGYM00163_LOCUS44761" /note="ID:TRINITY-DN39798-c5-g1-i1.p1.utr3p1" /note="source:transdecoder" BASE COUNT 1423 a 1430 c 1724 g 1302 t ORIGIN 1 gcgccccctc tcgccacgct gcgcgaacac cacagcacgc taaaagatgt cgtataaaaa 61 ggtttccatc cttggcaagg aatcaattta tgttggcaat gacctcttcg acattattcc 121 aggcgatctc aagaaagcgg tgaagtcgac aaaatttgct gtggtgaccg atgaaaacct 181 tgcacccctt tatttggaca agctaaaggc tgcctttaaa aaagaaggaa tcgatttgtt 241 ggcgcaaatt attccacccg gtgagcaaac gaaatgtcgg gccatgaagg agaaaatcga 301 gaacaccttg ttggaagcta agtgtggccg agatacttgt atccttgctt tgggtggggg 361 agttgttggg gatttgactg gttttgtggc gtcgacattc ttaagagggg tgccgtttgt 421 ccagattccg acttcactcc ttgccatggt ggattccagc attggtggga aaacgggcct 481 tgacacttta catggcaaaa atttgatcgg cgcatttcac caacccgttc gcatttatat 541 cgacatggat gtgttgaaga gccttccgga gagacaactt tggaatggaa tggcggaagt 601 ggtcaagagt gcggcgattt ggagtgcgga tgctttcaaa caattggaag aagagtatga 661 aaaagtgcag gagcgggatc ccgagatctt ggcgtccatc atctggactt gtgcgggcat 721 caaggctgac gttgtcacca aggacgagaa ggagagtggc ctccgtgagc tcctcaatta 781 cggccacagc atcgggcatg ccgtggaagc attgatgcaa ccaggcatgc tgcacgggga 841 ggctgtggcg attgggatgg tcaaagaggc tgaactggct cgccatatgg ggctgctgag 901 ccctgccagt gttgggcgac tgattcgatg cattcaggta taccatttac ccatccacat 961 gccccaactg accgtgccgg atctcatgga gaagatggcc gtggacaaga agaacaaggg 1021 tggcaaaaag tacatcacgt tgctgaatag catcggatct tgccatatga ataaggccat 1081 ccccgtcgaa gatcccctca ttgagtttat cctgtctccg gcggtcgtgg tccaacctcc 1141 ctccaagccc gtcaactgca cgatccaagt ccctggttcc aagtctggct ccaaccgagc 1201 gctgctgcta gctgcccttg cggagggcag gtgcgagatg gaggggctgc tgcattccga 1261 cgacacacag gtcatgcttg acgcattggc caagttgggc gcatgtcagt tcgagtggat 1321 cgctggaggc agcaaattgg tgatcaaggg caacggcggc cgtcttaaca tcccgtcgtc 1381 ggagatctat ttgggcaatg cggggactgc ctccagattc ctgaccaccg tgtgcaatta 1441 cgtcaaaggt aaagggcaca ccgtcatcac gggcgacaaa agaatgcaag aacggccctg 1501 cggaccactg gtggacgcgt tgcgagcaaa cggttgcaag atcgattatt tgaaccggga 1561 gggatgcctt cctctgaaaa tccagtcgga ctcattcaag gggggcgtca ttgagctgtc 1621 tgcaaaggtg agctcccaat acgtctcctc ggtgctgatc agtgcgccgt atgcccagaa 1681 ggaagtcacg ctgaagctga tcggggaagt tgtatctcca atttacatcg acatgacggt 1741 gcagatgatg agggagtttg gcggcgttgt ggagaagaag gcggataatc tatacattgt 1801 caagcagcag aagtacaagc ttcccgccaa gtacagtgtg gaatgtgatg cctcgtcatc 1861 tacgtatccc ctctgcatcg ctgccatcac gggtggcaag gtgaaggtgc agaacgtagg 1921 caaggacagc atgcaaggag atgcccagtt ctgctggctg ctgcagaaaa tgggttgcaa 1981 ggtcgagcag accgtggatt gcacgactgt ggaggccccc gccgatggcg tcctcaagcc 2041 tgtggaagtg gatatgggag atttgacaga cgctttcatg ggggccgctg ccctgatggc 2101 agttgcaccc gggacgtcca agatctacgg cattgccaac caaagagtca aggaatgcaa 2161 ccgaattgca gtgatgcggg aggaacttgg caagtgcggt gttgtatgca cggagttgcc 2221 cgatggcctg gtgatcaatg gagtggacgc caagacactc cacggggctg atatcaagtg 2281 ttacaatgac catcggattg caatgagctt tgctgtattg ggctgcaagg tcccagggat 2341 caacatcctg gagaagcact gcgttgagaa gacttacccg cagttctggg acgatttgaa 2401 gaatatcatg ggagtggccg tgggggcccc agcatcagca acgacacgca ctgggaagcg 2461 tgggctggag aacggagacg ccccgtccgc caaaaaggcg cgcactggca gcagcgtaga 2521 caccgtcgtc ctgattggca tgcgcggcgc tggcaagacc gggctgggca gggaggctgc 2581 tggtgttctg ggccgcaagt ttttggattt ggatcatgca tttgaagcga agcacggccc 2641 tatcatggat ttcgtcaagg ccaagggttg ggatgcgttc cgtgttgctg aagctcaagt 2701 cctgcaggag tctgtcaaag cgcatccgaa ggggcacatc atcgcttgtg gagggggtat 2761 tgtggagacc gaggcaggac gtacagaatt gaaaaagctg aagcaggttg tggaagttcg 2821 tcgcaacatt gacgacgtgt gcaaatactt gaagtctgac aagagtcgcc cgtatcttgg 2881 agaagagcct gacctcatct tcaagcgccg ggatccgttg tacatccagt gctcacagta 2941 tcagttcaac gttcccaaag gcacgacgga ttggaaaccc gtccacaagg agctggtcgg 3001 tctcttgaag cggttggaag gtggggcaca gattgtggac ccatccaagc caagctattt 3061 cctgtccttg agtttcccga actatgccgc acaagccgaa ttgatcccaa agattgtcac 3121 aggttcgcat gccattgagg cccgcatcga tctcctggag tcacaggaca aggagttcat 3181 tgcagcacag atcagcgcct tacgccgcgt ctcagacttg ccaattgtgt acaccgtgcg 3241 atctattggg caggcgggca agttccctga agatgaggca aagatgttcg acctgttgca 3301 ttacggcgtg cgcttggggt gcgaattcgt ggacatggag acctgctgga gctgtcaagc 3361 gcgtgacaag ctgctggcca ataaagggtc ttcgaagatc atttcgtcct tccacgaccc 3421 aatgggttgc tgctcttggc cggaaatgaa gatgcagttc gaagaaggga gccagaaggg 3481 gaaggctgac atagtcaaag tggtgggcat ggctacaaag atcgaggatg tcttcgcttt 3541 gcgccaggtc gtcagtgagc tggagctgtc ccagccggtg atagccctgt gcatgggaga 3601 ggctggcaaa ctgtcccggg tgctcaacac gtatcttacc ccagtgacgc atccagcgtt 3661 gccaatgaag gcagcccctg ggcagctctc agtcagcgag atccatcaga tccggcactc 3721 catcgggctt ttgccaaaaa agaatttcta cctctttggt agccccatct ctcagtccat 3781 gtcccccacc atccacaaca ccggtttcgg ggtgttgggg ttgccccacg tctactccct 3841 ttctgagtcc ccggacccag aacacgtccg taaagtactg caagacccga atttcggcgg 3901 ttcctcggtg accatcccac acaagcagaa cgtgcagccc atgctgcagt cgttgagtcc 3961 ctcggcccag gccatcggcg cagtcaacac cattattccc caggcggacg gcacgctgca 4021 tggcgacaac accgactggc tggccatgcg cacgttgatc acccagacac tggcggcgag 4081 caaccgcaaa gctgaagttg ggatagtact gggagcaggt ggcacagcac gtgctgcaat 4141 gtacacgctg cagcagctgt ctctgtcaaa atactatatt ttcaacagaa ccctcgagaa 4201 ggcccagcag ctggcaaatg aattcggggg cactgcgatc accagtttgg acggaatcaa 4261 ggatctggac gtcatcattg ggacgattcc tgctgatgcc cagagcgcat acccgaagga 4321 attgttcaac aagaagcccg tggctgtgga tctggcatac cgaccaagaa ggacaccact 4381 gctgaagcaa gccactgctg ctggatgcga gactgttgaa ggcattgctc tgctgattga 4441 gcagggcctg ttccaattcc agatctggac tgggtgcata gccccgcgtg aggagattgc 4501 tgcagctgtg tacggtgcgt ataaagatta gaaaaatatg caaggcggtt gtatgctgcg 4561 acatgtcggg cagtaagtta catgacgtca tatggggttc acagtaggtc acaggcatca 4621 atgctcaatc gtgggatgca gtccagtgct tcaaaatgat gctcttcaca gtctgtacca 4681 cgagggcagt ttgttgcaac atgtcacagt gggcctgttg cggcacttct gagtggactc 4741 tgagagggcc cgtgattgca tttattgaca gggatagttg tataacgcac taatcaaagg 4801 gaccggaaac agttgcaaaa agaattgtgc tgattaattg gaagtgactg gagtggagaa 4861 catcagggaa aatggttgca acgaaggaaa ctggggggag tcgaaaggtt acaagaatcg 4921 gtagagatcg cagtgatgga caggtgttgg tgctgcacaa gccctgtctg tgcagtctgc 4981 ctatgctgcc cctgtacctg tatctcagtg ctgcgtagcg cagccaccct ttctcaaacc 5041 gcaaactagc gtttctaatg tcctaattat atatgttact gtgctggggt gagccatgcc 5101 atgcaactat gtgcaaacta ttccagccgc cccgttaggg aaacaaggca aacacaatgt 5161 tacgttgtgc tgggctattg gacaagtggc aagaaagtct acaatggtag ggcctgcaca 5221 gcatggtatg gaggccgcca gagtggggct gcaccagccg tggcggaggc ggagcccggt 5281 cctaagatga gggtacaatg ccaactgtgc ttggcactgc aatgcctcca ctgcaacgga 5341 ggaaccggca gcaaacgttg aatgatctgg acatgcagtc ttttcccctc cagcattggg 5401 ttgttcgttt tccagatcag ggcagcagag gctctttgcg aagctgctgt tggtgcgccg 5461 tttcataatt gtctgcatga cctttgtttt tccagacacc ttcagaggga aagggttcag 5521 cgaaaagaca gggagtagtg ggtgcaacct atggttgatc gccggtatcc agagcacaac 5581 agggagtagc acaggggcat tgggtcacag gtaaaaatca gggtgcaagg agtctcattt 5641 tgtgcacgta tctgggacat gccatggcag gagtgtactc aagcagcact ggcaggcaca 5701 gagtttctgc tagggcagat tagttcgttg tcgttccttt tggccagcca gatatagccg 5761 ccatcaagtg caagaacgca caagtctttc gttgccagca ttgatggatt ccattctcgg 5821 catgggtttt ggcacagtgc aattcaaata tgcaaatctg tgctttattt aagcaaaaa //