LOCUS       KV891522              368823 bp    DNA     linear   CON 24-FEB-2017
DEFINITION  Opisthorchis viverrini isolate Khon Kaen unplaced genomic scaffold
            O_viverrini-1.0_Cont67, whole genome shotgun sequence.
ACCESSION   KV891522 LASN01000000
VERSION     KV891522.1
DBLINK      BioProject: PRJNA230518
            BioSample: SAMN03378119
KEYWORDS    WGS; HIGH_QUALITY_DRAFT.
SOURCE      Opisthorchis viverrini
  ORGANISM  Opisthorchis viverrini
            Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes;
            Trematoda; Digenea; Opisthorchiida; Opisthorchiata;
            Opisthorchiidae; Opisthorchis.
REFERENCE   1  (bases 1 to 368823)
  AUTHORS   Mitreva,M.
  TITLE     Draft genome of the nematode, Opisthorchis viverrini
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 368823)
  AUTHORS   Mitreva,M., Pepin,K.H., Martin,J., Ozersky,P., Palsikar,V.B.,
            Zhang,X. and Wilson,R.K.
  TITLE     Direct Submission
  JOURNAL   Submitted (31-MAR-2015) The Genome Institute, Washington University
            School of Medicine, 4444 Forest Park, St. Louis, MO 63108, USA
COMMENT     The human liver flukes, Opisthorchis viverrini, O. felineus and C.
            sinensis remain important public health problems in many parts of
            the world. Clonorchis sinensis is widespread in China, Korea and
            Vietnam, while O. viverrini is endemic in Southeast Asia, including
            Thailand, Lao Peoples Democratic Republic (Lao PDR), Cambodia and
            central Vietnam. Human infection follows the consumption of raw or
            undercooked cyprinoid (freshwater) fish haboring infective
            metacercariae. Recent reports suggested that about 35 million
            people are infected with C. sinensis globally; with up to 15
            million human infections in China alone and another 8-10 million
            individuals infected with O. viverrini in Thailand and Lao PDR.
            More than 600 million people, mainly in Asia, are at risk of
            infection with these two liver flukes (Petney et al 2013 Int J
            Parasitol 43, 1031-46).
            
            The infections are associated with hepatobiliary diseases including
            hepatomegaly, cholangitis, fibrosis of the periportal system,
            cholecystitis, gallstones and are major aetiological agents of bile
            duct cancer, cholangiocarcinoma (CCA). O. viverrini and C. sinensis
            are classified as Group 1 carcinogens  metazoan parasites that are
            carcinogenic to humans  by the the International Agency for
            Research on Cancer, World Health Organization (WHO) (Bouvard et
            al., 2009 Lancet Oncol 10, 321-2). Therefore, not only do these
            liver flukes cause pathogenic helminth infections, they also are
            carcinogenic in humans in similar fashion to several other more
            well known biological carcinogens, in particular hepatitis viruses,
            human papilloma virus and Helicobacter pylori.
            
            The liver fluke endemic area of Khon Kaen, Northeast Thailand has
            reported the highest incidence of liver cancer in the world (Shin
            et al 2010 Cancer Sci. 101, 579-85; Sripa et al. 2014 Acta Tropica
            141, 361-367). In regard to socioeconomic impact, it was estimated
            20 years ago that the total direct cost of O. viverrini infection
            to the work force (between the age of 15 and 60 years) in Northeast
            Thailand was US$ 80 million per annum. More recently, it has been
            reported that liver and bile duct cancer, the end-stage consequence
            of liver fluke disease, ranks number five in Thai males among all
            diseases with highest number of disability-adjusted life years
            (DALYs) (see Sripa et al., 2012 Trends Parasitol 28, 395-407).
            
            The liver flukes used here as a source of genomic DNA for the WGS
            were obtained by collecting the metacerciae of O. viverrini from
            wild caught fresh water fishes in the vicinity of Khon Kaen City,
            Khon Kaen province, Thailand. Hamsters were experimentally infected
            with these metacercariae at Khon Kaen University, after which adult
            O. viverrini worms were recovered from the biliary tract of the
            hamsters at euthanasia six weeks after infection.  [The outbred,
            male Syrian (golden) hamsters (Mesocricetus auratas) were reared at
            the animal facilities of the Faculty of Medicine, Khon Kaen
            University, Khon Kaen, Thailand. Protocols for the experiments were
            approved by the Animal Ethics Committee of Khon Kaen University,
            approval number AEKKU25/2554, according to the Ethics of Animal
            Experimentation of the National Research Council of Thailand.]
            Genomic DNAs were recovered from pools of 10 to 20 adult
            (hermaphroditc) flukes and provided by Dr. Banchob Sripa
            (banchob@kku.ac.th) and Dr. Paul Brindley
            (pbrindley@email.gwu.edu).
            
            This assembly consists of fragments, 3kb and 8kb insert whole
            genome shotgun libraries. The sequences were generating on the
            Illumina platform. An initial assembly was generated using
            Allpaths_LG. To improve scaffolding and contiguity, we used our in
            house tool Pygap (Gap closure tool), which uses the Pyramid
            assembler with Illumina paired reads to close gaps and extending
            contigs. An alternate assembly was generated by first using flash
            (fast length adjustment of short reads Bioinformatics 27:21 (2011),
            2957-63) to merge the Illumina fragments. These reads were then fed
            to the Newbler assembler (Roche). Newbler contigs > 500 bases were
            used to fill gaps in the allpaths scaffolds using PBJelly (PLoS ONE
            7(11): e47768. doi:10.1371/journal.pone.0047768). The final step
            was using L_RNA_scaffolder (BMC Genomics 2013, 14:604), which uses
            transcript alignments, to improve contiguity.
            
            The repeat library was generated using Repeatmodeler (A. Smit, R.
            Hubley http://www.systemsbiology.org/). The Ribosomal RNA genes
            were identified using RNAmmer
            ((http://www.cbs.dtu.dk/cgi-bin/nph-sw_request?rnammer ) and
            transfer RNA's were identified with tRNAscan-SE (Lowe and Eddy,
            1997). Non-coding RNAs, such as microRNAs, were identified by
            sequence homology search of the Rfam database
            (http://selab.janelia.org/software.html). Repeats and predicted
            RNA's were then masked using RepeatMasker (A. Smit, R. Hubley & P.
            Green http://repeatmasker.org). Protein-coding genes were predicted
            using a combination of ab initio programs Snap (I. Korf, 2004),
            Fgenesh (Softberry, Corp) and Augustus (M. Stanke, et. Al 2008) and
            the annotation pipeline tool Maker (M. Yandell et. al., 2007) which
            aligns mRNA, EST and protein information from same species or
            cross-species to aid in gene structure determination and
            modifications. A consensus gene set from the above prediction
            algorithms was generated, using a logical, hierarchical approach
            developed at the Genome institute. Gene product naming was
            determined by BER (JCVI: http://ber.sourceforge.net).
            
            Our goal is to explore this WGS draft sequence of O. viverrini to
            better define proteins and other metabolites involved in parasitism
            that impact health and disease and are relevant to host-parasite
            relationships, parasitism, carcinogenesis and other biological and
            pathological processes.
            
            For information regarding this assembly or project, or any other
            GSC genome project, please visit our Genome Groups web page
            (http://genome.wustl.edu/genome_group_index.cgi) and email the
            designated contact person. For specific questions regarding the O.
            viverrini genome project contact Makedonka Mitreva
            (mmitreva@genome.wustl.edu) at Washington University School of
            Medicine. The National Human Genome Research Institute (NHGRI) of
            the National Institutes of Health (NIH) provided funds for this
            project.
            
            ##Genome-Assembly-Data-START##
            Current Finishing Status :: High-Quality Draft
            Assembly Method          :: allpaths LG v. 43357 (2012-12-28)
            Assembly Name            :: O_viverrini_1.0.pg.lrna
            Genome Coverage          :: 23x
            Sequencing Technology    :: Illumina
            ##Genome-Assembly-Data-END##
FEATURES             Location/Qualifiers
     source          1..368823
                     /organism="Opisthorchis viverrini"
                     /mol_type="genomic DNA"
                     /submitter_seqid="O_viverrini-1.0_Cont67"
                     /isolate="Khon Kaen"
                     /isolation_source="wild caught freshwater fish"
                     /db_xref="taxon:6198"
                     /chromosome="Unknown"
                     /sex="hermaphrodite"
                     /dev_stage="adult"
                     /lab_host="hamster"
                     /country="Thailand: Khon Kaen province"
                     /note="pooled from 10-20 individuals"
     gene            370..39168
                     /locus_tag="X801_00515"
     mRNA            join(370..396,1695..1816,1856..2005,2050..2194,2236..2343,
                     2363..2459,5611..5745,8424..8520,8751..8779,10792..10911,
                     11196..11797,13165..13229,14086..14152,14593..14741,
                     17455..17503,18351..18468,23843..23979,27553..27655,
                     29964..30084,34511..34616,36251..36361,36536..36649,
                     39106..39168)
                     /locus_tag="X801_00515"
                     /product="hypothetical protein"
     CDS             join(370..396,1695..1816,1856..2005,2050..2194,2236..2343,
                     2363..2459,5611..5745,8424..8520,8751..8779,10792..10911,
                     11196..11797,13165..13229,14086..14152,14593..14741,
                     17455..17503,18351..18468,23843..23979,27553..27655,
                     29964..30084,34511..34616,36251..36361,36536..36649,
                     39106..39168)
                     /locus_tag="X801_00515"
                     /codon_start=1
                     /product="hypothetical protein"
                     /protein_id="OON23559.1"
                     /translation="MLHEQLPVQELVVRIHVESEDAFKDYPLLVVVKQRNEVTSFRVP
                     TVVNELSVYHNVSRTLCPLKLSSDFSVNFTVELSTFSLKNVSFKFRAESVTDFDLRKD
                     DPLNFLLRPAQPVYFRYQFTHPLDSVAVKVVSPDNVCMVLSLQTMQCPVSDGLDTVQN
                     AGLYQTVTNLGVISANVCSMRTSTELQYPDGFYVVLVLKATDFACTGMERLLPIPTGD
                     SSYSHPNNFNYAYLSLIVAHFQQHREPNRQNKNVTLKILPAPSRWDYIIPISGAFGFF
                     ALFYLAGIIIICTSRYLPHRHERSVDLTPQHFDLLSEEAEAFCQPTDVVRNYGTATHI
                     TTTHPSLHTDSSLVDMLDTEESESRSLKPPKPLCTQRARRGNDSQGGDSRSRTSTTQQ
                     LPSTSHHADVSLSVPRGENSLGGSTASLNHSRTLIMEHTREHHGWFSSSDEEQPEGHF
                     VGAQDTLDKYSHRSPQVRSFTPGSVIQHEVSETRPITELRRPNHSPVSTGTTQTSAPD
                     KQAPSERGSREVLTNAERNTDDVTILLSQNDMFRIVPVSNLSRKRYTTLNRKYLLYFW
                     YLIIISIFYGLPAIQLIMIYQKTLVETGNEDLCYYNFECARPLGIFTAFNNIISNIGY
                     IMLGLLFLTATARRDLIHRRRRKLDPEVTETRGLPQHYGLYYAMGLALTMEGIMSACY
                     HMCPSFSNFQFDTAYMYILAMLIILKIYQTRHPDVNASAHSAYMVMAVVIFLGVTGVV
                     YGSQTFWIAFTILFLLMSVVLTGEIYYMGQWNIDYCLPRRLFSMIKSDGIRSLRPMYL
                     ERMILLLVANLVNFALAGYGVATRPRDFSSFLLSVFMINMMVYTLFYIFMKIRHKERI
                     LLAPILYMVAGLVCWSAAIYFFFLRNTTWEVTPAQSRALNHPCLLFDFYDAHDIWHFL
                     SASSMFFSFMMLMNLDDDLVDRPRDQIAVF"
     assembly_gap    32602..32701
                     /estimated_length=100
                     /gap_type="within scaffold"
                     /linkage_evidence="paired-ends"
     gene            complement(<48312..59191)
                     /locus_tag="X801_00516"
     mRNA            complement(join(<48312..48525,52248..52402,58487..58679,
                     58911..59191))
                     /locus_tag="X801_00516"
                     /product="hypothetical protein"
     CDS             complement(join(<48312..48525,52248..52402,58487..58679,
                     58911..59191))
                     /locus_tag="X801_00516"
                     /inference="protein motif:HMMPfam:IPR010432"
                     /codon_start=1
                     /product="hypothetical protein"
                     /protein_id="OON23560.1"
                     /db_xref="InterPro:IPR010432"
                     /translation="MTTEWSNYYASLEQWVKEQHQALSVSEQQMRQAAEEQNRWWRAS
                     FLAQKQYEKEMQRFQLRASQNLAASQRHSAAQVDNRHIPVQCKPVDFPFITLRRARIW
                     AKSELRKSELFLTRLEQLLFGGDGEQDMSDADGDLFDLDLLLIVDHIGRFFGAIIEAI
                     CISRCFFGRGPGGATIGKWIMNLRVVSCDEIIPLPEMVEVVPGSNLGIIRAIVRSVLK
                     STSFLTIFSFICMMLFRYHRCTYDIIAGSLVVQPLPIVVQMPDQNVGNDMGAAVQPNE
                     HENQR"
     assembly_gap    61687..65254
                     /estimated_length=3568
                     /gap_type="within scaffold"
                     /linkage_evidence="paired-ends"
     assembly_gap    89052..89405
                     /estimated_length=354
                     /gap_type="within scaffold"
                     /linkage_evidence="paired-ends"
     gene            97872..118919
                     /locus_tag="X801_00517"
     mRNA            join(97872..98028,104352..104440,104536..104637,
                     105449..105644,105812..106125,110474..110581,
                     114952..115031,118805..118919)
                     /locus_tag="X801_00517"
                     /product="F-box domain protein"
     CDS             join(97872..98028,104352..104440,104536..104637,
                     105449..105644,105812..106125,110474..110581,
                     114952..115031,118805..118919)
                     /locus_tag="X801_00517"
                     /codon_start=1
                     /product="F-box domain protein"
                     /protein_id="OON23561.1"
                     /translation="MSGGSASGLTTEDSDTTDDEDSESSTVATCTTEQTARKSYQDLA
                     DKVEKKLLTIRNPDHWATGDGFDDLPAEALDPEDEPYKKTHISALPHELLLRIFRWAV
                     GSHLDTRILGRLARVCRGFYLLACDSSIWRSICLRLWPRLLDHHSHGVRGEQLTAAVP
                     LHYGYKDWRDMAIHRPHVLLDGCYLCRITYVRPGEALSGIYRPMHLVVYYRGIRFYPD
                     GHVNMLTSYHAPNAVVAALGKPFAITQRDASALPTGTEADSSGLVQDSLSASGGGLLQ
                     GTYVLVDPDLIVCTLFRPRNKEKLHPFRRRRQAQLFSSEQQVTYNIRFKLSSSKKRLH
                     NVLHWDSYTIRQLNTTTQTDQLITLNVFSDQFPECRFSPVRSYVSTVAPEPL"
     gene            127508..135426
                     /locus_tag="X801_00518"
     mRNA            join(127508..127708,135214..135426)
                     /locus_tag="X801_00518"
                     /product="glutaredoxin-like protein"
     CDS             join(127508..127708,135214..135426)
                     /locus_tag="X801_00518"
                     /inference="protein motif:HMMPfam:IPR008554"
                     /note="KEGG: cqu:CpipJ_CPIJ019232 3.2e-08 acetyl-CoA
                     acetyltransferase, mitochondrial; K00626 acetyl-CoA
                     C-acetyltransferase"
                     /codon_start=1
                     /product="glutaredoxin-like protein"
                     /protein_id="OON23562.1"
                     /db_xref="InterPro:IPR008554"
                     /translation="MLSSRLSSLVTRPPRIFQRAPPLVTINRLTSQTLAYPSVPTLVM
                     FTKPDCSLCRAAIEQLRPYANKYFRLQLVDITEAKNSVWRKYQYDIPVFHILSVGTQL
                     THENKGTYLMKHHVDWNRLKNSIPELDQCPELDEY"
     assembly_gap    133310..133633
                     /estimated_length=324
                     /gap_type="within scaffold"
                     /linkage_evidence="paired-ends"
     gene            complement(<138738..147955)
                     /locus_tag="X801_00519"
     mRNA            complement(join(<138738..138959,139070..139252,
                     142165..142477,144024..144164,147819..147955))
                     /locus_tag="X801_00519"
                     /product="hypothetical protein"
     CDS             complement(join(<138738..138959,139070..139252,
                     142165..142477,144024..144164,147819..147955))
                     /locus_tag="X801_00519"
                     /note="KEGG: ddi:DDB_G0274493 8.7e-08 lig1; DNA ligase I;
                     K10747 DNA ligase 1"
                     /codon_start=1
                     /product="hypothetical protein"
                     /protein_id="OON23563.1"
                     /translation="MVPRMYILTDMRLSDHHAIVPSLDQAKVEQEIKLDHSGGIHSIP
                     LRKMTLRILLKRFMFFQKTNSAAHPGFAQSLKRHITVAKPSLSTVNTKLVRQRSCPRT
                     RKDQGAQEQRVDDAICHREDESNTPSYSLDFTDDLCSHSRPDEIEQDPNLLDSSSKNS
                     DRSAIATKVEELTPWERWLLQKEKIRRQELKRLRQKEANLRAKQAQAAASQAEKEQRS
                     LAKCKEWLESKQRLLKERNEAAAMARKEQLLKERFKKESKKLESERKYALWCEQKKHE
                     AKETAELIRQKELAAAASAQRRKEEAMVKFVAWMKSAQQRKPNPPVSYGYADGTVI"
     gene            162517..164347
                     /locus_tag="X801_00520"
     mRNA            join(162517..162771,162804..162988,163028..163162,
                     164245..164347)
                     /locus_tag="X801_00520"
                     /product="ubiquitin--protein ligase"
     CDS             join(162517..162771,162804..162988,163028..163162,
                     164245..164347)
                     /locus_tag="X801_00520"
                     /inference="protein motif:HMMPfam:IPR000608"
                     /note="KEGG: smm:Smp_055960 8.0e-72 ubiquitin-conjugating
                     enzyme E2r; K02207 ubiquitin-conjugating enzyme E2 R"
                     /codon_start=1
                     /product="ubiquitin--protein ligase"
                     /protein_id="OON23564.1"
                     /db_xref="InterPro:IPR000608"
                     /translation="MSEFTDQSRTGLQLGLPRTQTCDLFMASKKPQSSAVKALQKELL
                     ELNATPVEGFKVNISSVENMFVWDVAIFGPPKTLYEGGYFKARLFFPPDYPYSPPSMQ
                     FLTKMFHPNIYADGEVCISILHSPGEDPQSDELASERWNPTQNVRTILLSVISLLNEP
                     NIHSAAHVDASISYRKWKESNGKENEFEQIVSKQSNNRRITVPTVARAISLNFGAEPN
                     GATTPIG"
     gene            177465..178342
                     /locus_tag="X801_00521"
     mRNA            join(177465..177541,177925..178342)
                     /locus_tag="X801_00521"
                     /product="hypothetical protein"
     CDS             join(177465..177541,177925..178342)
                     /locus_tag="X801_00521"
                     /note="KEGG: smm:Smp_055960 7.8e-13 ubiquitin-conjugating
                     enzyme E2r; K02207 ubiquitin-conjugating enzyme E2 R"
                     /codon_start=1
                     /product="hypothetical protein"
                     /protein_id="OON23565.1"
                     /translation="MHHSPCRNTWATLVTETAQQTHGLFYLVLQTHADAEKDGVHVPT
                     TVEEYCVSGRPGNDPDQSETGQDGSDFDDYYEYYPSDVDDADEDDDVACSESLDAGAD
                     SAMEENEGEVASRPKHNAQPNQSAANVSPGQRVGDGDRESSSSQTSRAASSVSKATHD
                     SENS"
     assembly_gap    208140..208266
                     /estimated_length=127
                     /gap_type="within scaffold"
                     /linkage_evidence="paired-ends"
     gene            212774..>229928
                     /locus_tag="X801_00522"
     mRNA            join(212774..213081,217914..218190,220075..220495,
                     229624..>229928)
                     /locus_tag="X801_00522"
                     /product="hypothetical protein"
     CDS             join(212774..213081,217914..218190,220075..220495,
                     229624..>229928)
                     /locus_tag="X801_00522"
                     /codon_start=1
                     /product="hypothetical protein"
                     /protein_id="OON23566.1"
                     /translation="MMSVNTFTGVFQLFTEKLNYSHHRDGSQDETSVRYTKELRGFMK
                     GEKSFSKLIVRSDRHLWTQALQLLNTGKTPITRSIREGTEETQPMILVDNECWICLKL
                     KLTKRWTKLLNLQCLEKFGRSKSFSFPFIVNVKRNSKEYDTLIQKITGRSSRNQASKR
                     GRTRVCSSLGHRGREHAVGKTQESGFLAPCSKDDEVKSTNIVLIPLYPAASDQLHNSV
                     SYSSEHHDNKSCAQVDLDLDSVERTSDECIGNNKPTALLCMSLRNASTNSSCALSAGD
                     DLGRWYITFVPKVHQTKIVEFFFIACLTGHTPQALLGQTLDSFIHPSDSRRLSQWLSA
                     DASNGKNEMEKEADTLECRWRCANCTYTWLSISRVCSTQVLDQRTTTQPIQNNLHFYR
                     LTWLSGPTDLENFFDPISVSPVAVRSDSLSLSSEAMDKWCREMRF"
     assembly_gap    220989..221088
                     /estimated_length=100
                     /gap_type="within scaffold"
                     /linkage_evidence="paired-ends"
     gene            234763..>240625
                     /locus_tag="X801_00523"
     mRNA            join(234763..234819,237047..237561,238837..239486,
                     240369..>240625)
                     /locus_tag="X801_00523"
                     /product="hypothetical protein"
     CDS             join(234763..234819,237047..237561,238837..239486,
                     240369..>240625)
                     /locus_tag="X801_00523"
                     /codon_start=1
                     /product="hypothetical protein"
                     /protein_id="OON23567.1"
                     /translation="MNEQLVGASQTYINQQFVQPRSEEPPVKVTRRSHEPPHQAKKAE
                     LTDIPPIQLGPESFNNFSSVPPKQNTGFWNPTTQKAYLEKEYPGQMPTAPQFTQYLHA
                     RRLQEECNNRKLYIPQHLMVPNVNGKRSFCQAFAERVPCAVKGTPCCVKPHAILPNPL
                     ACMCGVQKCRAFVQQYCCSGTSHSQTTPGGPCATCSLDQCASHQRASIICELHPHVNQ
                     IWIMKNNWHSSNTCPHNESTNTTGTQPAANLLSGSNGSGSNGVTPGSCYHHACLYHSN
                     RCRVENTNPFPTCKCVSSVSSIPCCCSGNPTTPLMNTFNYAISNWCNNQQMLPQYPLN
                     HLNENGISQRGGEGAPNVFFSAAIKPATIPMVSNPIPNFAADGGRVCTGIQAQSNYQG
                     PLAGDFRNCGEPMENVVLPLEVYETKFLAHPHSSLPPEIITNTPAKPTTQPKQPYPST
                     TEGLLKQTHRFAKFEHLQPVSSKVLRPPSSHNESAEGQTNDLT"
     assembly_gap    253777..253876
                     /estimated_length=100
                     /gap_type="within scaffold"
                     /linkage_evidence="paired-ends"
     gene            complement(<264443..>265107)
                     /locus_tag="X801_00524"
     mRNA            complement(join(<264443..264561,264673..264798,
                     264834..264889,264925..265034,265069..265107))
                     /locus_tag="X801_00524"
                     /product="leucine Rich repeat-containing domain protein"
     CDS             complement(join(<264443..264561,264673..264798,
                     264834..264889,264925..265034,265069..>265107))
                     /locus_tag="X801_00524"
                     /note="KEGG: dme:Dmel_CG10839 8.6e-36 CG10839 gene product
                     from transcript CG10839-RA K10411"
                     /codon_start=1
                     /product="leucine Rich repeat-containing domain protein"
                     /protein_id="OON23568.1"
                     /translation="DKPTTIKEAIKKWEEKTGQKASEAKEVKLYAQYPPIEKLDASLS
                     TLTACEKLSLSTNCIEKITNLNGLKNLKILSLARNNIKNLNGLEVLGDTLEQLWISYN
                     ILERLKGIGECRNFRNQVIILGNPLEENNSETWRDEACRRLPRLKKLD"
     assembly_gap    267859..267958
                     /estimated_length=100
                     /gap_type="within scaffold"
                     /linkage_evidence="paired-ends"
     assembly_gap    271979..272123
                     /estimated_length=145
                     /gap_type="within scaffold"
                     /linkage_evidence="paired-ends"
     assembly_gap    274750..274849
                     /estimated_length=100
                     /gap_type="within scaffold"
                     /linkage_evidence="paired-ends"
     gene            278043..>298298
                     /locus_tag="X801_00525"
     mRNA            join(278043..278078,278114..278289,285149..285311,
                     292578..292796,298131..>298298)
                     /locus_tag="X801_00525"
                     /product="Pyrimidine 5'-nucleotidase"
     CDS             join(278043..278078,278114..278289,285149..285311,
                     292578..292796,298131..>298298)
                     /locus_tag="X801_00525"
                     /note="KEGG: smm:Smp_097180 2.9e-74 hypothetical protein;
                     K01081 5'-nucleotidase"
                     /codon_start=1
                     /product="Pyrimidine 5'-nucleotidase"
                     /protein_id="OON23569.1"
                     /translation="MDPELTESTKSRLTYLRDKYYPVEIDPTLSDAEKIPQMVEWWRL
                     SHESIVSCGLHRDALAKTVRECGLVLRDGVQEFTELLRFHQIPLLIFSAGLGDVIELL
                     LESFSMYTENVRVVSNFMQFSDEGLLVGFTDPIIHSFNKTAASIANGDYARLSSQRPC
                     VLLLGDSTGDVHMADGATVDDPTGISGTVLRIGFLNEAVEANLEKYKTLYDIVLVNDD
                     TFAVPLAVMKSILQCHQAVADGIAAPTTNNTDDIEL"
     assembly_gap    294164..294978
                     /estimated_length=815
                     /gap_type="within scaffold"
                     /linkage_evidence="paired-ends"
     gene            <308503..>311947
                     /locus_tag="X801_00526"
     mRNA            join(308503..308763,309472..309554,310839..311075,
                     311800..>311947)
                     /locus_tag="X801_00526"
                     /product="hypothetical protein"
     CDS             join(<308503..308763,309472..309554,310839..311075,
                     311800..>311947)
                     /locus_tag="X801_00526"
                     /codon_start=1
                     /product="hypothetical protein"
                     /protein_id="OON23570.1"
                     /translation="STYKTDVRPLCFLEFCELVCKILLWAVFPSLYPLVTCLECFVGN
                     FSIPETTLRGQRLSNNEFLVPPNVRFASFVPFERCCETEFETVFIKECRHTGLIPYML
                     GGHQPIVCRGEALLETRCSDPGEQIQTQPCVIRRIHKNTMFCSSNIHLYCCVFNSILT
                     IVVVVVLIHIMTKIYRPTTAKNPSGPVSVPHCESGRRPNDLGANPHLCRFNSCYWCSL
                     QFPELKEHMKEFYSDTFCYTSGRRI"
     gene            complement(321713..353100)
                     /locus_tag="X801_00527"
     mRNA            complement(join(321713..321868,323033..323182,
                     323878..324186,326761..326955,333306..333622,
                     342971..343169,352888..353100))
                     /locus_tag="X801_00527"
                     /product="eukaryotic aspartyl protease"
     CDS             complement(join(321713..321868,323033..323182,
                     323878..324186,326761..326955,333306..333622,
                     342971..343169,352888..353100))
                     /locus_tag="X801_00527"
                     /inference="protein motif:HMMPfam:IPR001461"
                     /note="KEGG: spu:585631 4.6e-53 hypothetical LOC585631;
                     K04521 beta-site APP-cleaving enzyme 1 (memapsin 2)"
                     /codon_start=1
                     /product="eukaryotic aspartyl protease"
                     /protein_id="OON23571.1"
                     /db_xref="InterPro:IPR001461"
                     /translation="MQLSLALIIFIFATTHCKTNIILPIRLRRTENEALFAASFPNSA
                     KMAVSNLAGLPGMGYYLPVTLGNPPQELNLLVDTGSADLAVAGRKLANVDRWFNASAS
                     STLECSGFSKQVRYQQGSWIGPLCKDRLTLVLRNMSHNQSITRLPPVPIHFSLIEDAQ
                     NFFLSHYGSTWEGIIGLGYPKLMMDNIPSRQNQVEQVLSHSPWIRVLWKLFEQTPEQL
                     HPFETLTSAWNLPNMFSLLLCGLTQQSFSVIGSSPVEMNGLLLIGGMNLTGMTQTVNT
                     ILYTPVREPWFYEVVLTDLRVEEQSVVEDCKELNVEFSIVDSGTTNIQVPERVFQPLL
                     GHIKTYVAKQSSSTATMLQNYASFWAGASMLCEASSDNQIGGASGLPHTLFPVIEFQM
                     LALGADMNKEALSLTLSPQQYIRFVGRHPKDGVMRDCFAFGIRPHHSGTILGAVFLEG
                     FFTVFHRNSLKVGAAKLFNLHQPHFNTKMDAFETNLFTCTFNCVHTIEKGFDASSKGR
                     VCDQ"
     assembly_gap    353430..353541
                     /estimated_length=112
                     /gap_type="within scaffold"
                     /linkage_evidence="paired-ends"
CONTIG      join(LASN01000956.1:1..32601,gap(100),LASN01000957.1:1..28985,
            gap(3568),LASN01000958.1:1..23797,gap(354),LASN01000959.1:1..43904,
            gap(324),LASN01000960.1:1..74506,gap(127),LASN01000961.1:1..12722,
            gap(100),LASN01000962.1:1..32688,gap(100),LASN01000963.1:1..13982,
            gap(100),LASN01000964.1:1..4020,gap(145),LASN01000965.1:1..2626,
            gap(100),LASN01000966.1:1..19314,gap(815),LASN01000967.1:1..58451,
            gap(112),LASN01000968.1:1..15282)
//