LOCUS       KN716699               75662 bp    DNA     linear   CON 12-MAR-2015
DEFINITION  Dictyocaulus viviparus strain HannoverDv2000 unplaced genomic
            scaffold D_viviparus-1.0_Cont550, whole genome shotgun sequence.
ACCESSION   KN716699 AZAF01000000
VERSION     KN716699.1
DBLINK      BioProject: PRJNA72587
            BioSample: SAMN02873952
KEYWORDS    WGS; HIGH_QUALITY_DRAFT.
SOURCE      Dictyocaulus viviparus (bovine lungworm)
  ORGANISM  Dictyocaulus viviparus
            Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
            Rhabditina; Rhabditomorpha; Strongyloidea; Metastrongylidae;
            Dictyocaulus.
REFERENCE   1  (bases 1 to 75662)
  AUTHORS   Mitreva,M.
  TITLE     Draft genome of the bovine lungworm Dictyocaulus viviparus
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 75662)
  AUTHORS   Mitreva,M., Pepin,K.H., Abubucker,S., Martin,J., Minx,P.,
            Warren,C., Palsikar,V.B., Zhang,X. and Wilson,R.K.
  TITLE     Direct Submission
  JOURNAL   Submitted (12-NOV-2013) The Genome Institute, Washington University
            School of Medicine, 4444 Forest Park, St. Louis, MO 63108, USA
COMMENT     Dictyocaulus viviparus, the bovine lungworm, is the cause of
            parasitic bronchitis in cattle (husk, verminous pneumonia,
            dictyocaulosis) with a world wide distribution in temperate areas.
            The predominant hosts are cattle, but it can also infect deers.
            Infection occurs on pasture as infective larvae develop from L1
            that are shed with the feces to free-living infective L3 that are
            ingested by cattle while grazing. Free-living larvae do not feed;
            survival on pasture is therefore limited to a few months depending
            on temperature and humidity. The parasite is able to interrupt
            development inside the host. Infective larvae that have been
            exposed to low temperatures on pasture before infection
            subsequently develop only to preadult larval stages that survive
            winter conditions as hypobiotic larvae in the lung and resume
            development in spring. These animals contaminate pastures the
            following spring and represent the major source of infection for
            other susceptible cattle. The strain being sequenced
            (HannoverDv2000) was obtained from the laboratory of Drs. Thomas
            Schnieder and Christina Strube (Christina.Strube@tiho-hannover.de)
            and has been maintained in calves since August 2000. Worm isolation
            and DNA extraction was performed by Christina Strube and/or the
            Genome Institute's production team.
            
            This assembly consists of fragments, 3kb and 8kb insert whole
            genome shotgun libraries. The sequences were generated on the
            Roch/454 platform and assembled using Newbler. To improve
            scaffolding, inhouse tools CIGA (Cdna tool for Improving Genome
            Assembly) and Pygap (Gap closure tool) were used to map 454 cDNA
            reads using blat to the genomic assembly to link genomic contigs
            based on cDNA evidence. Only joins confirmed by additional
            independent data typing were accepted and close gaps followed by
            the Pyramid assembler and Illumina paired reads to closing gaps and
            extending contigs
            
            The repeat library was generated using Repeatmodeler (A.F.A. Smit,
            R. Hubley & P. Green http://repeatmasker.org). The Ribosomal RNA
            genes were identified using RNAmmer (Lagesen et. al., 2007 Nucleic
            Acids Res.) and transfer RNA's were identified with tRNAscan-SE
            (Lowe and Eddy, Nucleic Acids Res. 1997). Non-coding RNAs, such as
            microRNAs, were identified by sequence homology search of the Rfam
            database (Griffiths-Jones et. al., 2003 Nucleic Acids Res.).
            Repeats and predicted RNA's were then masked using RepeatMasker (A.
            Smit, R. Hubley & P. Green http://repeatmasker.org). Protein-coding
            genes were predicted using a combination of ab initio programs Snap
            (Korf, 2004 BCM Bioinformatics), Fgenesh (Salamov A., Solovyev V.
            2000, Genome Res.) and Augustus (M. Stanke, et. al., 2008
            Bioinformatics) and the annotation pipeline tool Maker (M. Yandell
            et. al., 2007 Genomc Research) which aligns mRNA, EST and protein
            information from same species or cross-species to aid in gene
            structure determination and modifications. A consensus gene set
            from the above prediction algorithms was generated, using a
            logical, hierarchical approach developed at the Genome institute.
            Gene product naming was determined by BER
            (http://ber.sourceforge.net).
            
            Our goal is to explore this WGS draft sequence of D. viviparus to
            better define proteins involved in nematode parasitism that impact
            health and disease and are relevant to both host-parasite
            relationships and basic biological processes.
            
            For information regarding this assembly or project, or any other
            GSC genome project, please visit our Genome Groups web page
            (http://genome.wustl.edu/genome_group_index.cgi) and email the
            designated contact person. For specific questions regarding the D.
            viviparus genome project contact Makedonka Mitreva
            (mmitreva@genome.wustl.edu) at Washington University School of
            Medicine. The National Human Genome Research Institute (NHGRI) of
            the National Institutes of Health (NIH) provided funds for this
            project.
            
            ##Genome-Assembly-Data-START##
            Current Finishing Status :: High-Quality Draft
            Assembly Method          :: Newbler v. 2.6
            Assembly Name            :: D_viviparus_9.2.1.ec.pg
            Genome Coverage          :: 12.20x
            Sequencing Technology    :: 454
            ##Genome-Assembly-Data-END##
FEATURES             Location/Qualifiers
     source          1..75662
                     /organism="Dictyocaulus viviparus"
                     /mol_type="genomic DNA"
                     /submitter_seqid="D_viviparus-1.0_Cont550"
                     /strain="HannoverDv2000"
                     /isolation_source="Cow lung"
                     /host="cattle"
                     /db_xref="taxon:29172"
                     /chromosome="Unknown"
                     /country="Germany"
                     /lat_lon="51.00 N 9.00 E"
                     /collection_date="Aug-2000"
                     /collected_by="Drs. Thomas Schnieder and Christina Strube"
     gene            3999..11729
                     /locus_tag="DICVIV_11832"
     mRNA            join(3999..4061,4430..4579,5131..5259,5801..5944,
                     6949..7081,7135..7244,7563..7694,7771..7833,7925..8085,
                     8269..8365,8472..8661,8732..8874,8937..9032,9615..9794,
                     10369..10425,10519..10616,11278..11395,11521..11570,
                     11627..11729)
                     /locus_tag="DICVIV_11832"
                     /product="CRAL/TRIO domain protein"
     CDS             join(3999..4061,4430..4579,5131..5259,5801..5944,
                     6949..7081,7135..7244,7563..7694,7771..7833,7925..8085,
                     8269..8365,8472..8661,8732..8874,8937..9032,9615..9794,
                     10369..10425,10519..10616,11278..11395,11521..11570,
                     11627..11729)
                     /locus_tag="DICVIV_11832"
                     /inference="protein motif:HMMPfam:IPR001251"
                     /inference="protein motif:HMMPfam:IPR006797"
                     /inference="protein motif:HMMPfam:IPR008273"
                     /note="KEGG: isc:IscW_ISCW023769 5.6e-27 retinal-binding
                     protein, putative"
                     /codon_start=1
                     /product="CRAL/TRIO domain protein"
                     /protein_id="KJH42177.1"
                     /db_xref="InterPro:IPR001251"
                     /db_xref="InterPro:IPR006797"
                     /db_xref="InterPro:IPR008273"
                     /translation="MVQTYQSPVRVYKHPFELVMAAYEKRFPTCPQIPIFVGSEVTYE
                     YHSEDGAQWVIERKCQLNVDAPYLVKKVHPDNPEWTCFEQNAALDVKSFFGFETAVEK
                     LAMKQYAANLAKGKEILEYFIDELIKSGVTYIPQFEHKDSGSSADSAIDISKEHSSDE
                     TWRRAFTEERREFFIIALRFVTYWLPMLARRYDKTGKRRRSVKFVIAESKLEAEYIRR
                     FLGQLSPLEESRLCELKYGLHAHHKGKLPNDAHLLRFLRARDFDVAKAKELVHNSIIW
                     RKQHNVDRILQEWTPPSVMTQFFPGCWHHNDKEGRPLYLLRLGNLDMKGMLRSCGLEN
                     IVKLTLSICEEGLIKTAEATRQIGAPISTWSLLVDLEGLSMRHLWRPGVQSLLRIIEI
                     VEAHYPETMGQVLVVRAPRVFPILWTLISPFIDETTRNKFMINSGELVKGEISKYVDD
                     QYIPDFLGGTCLVSCPSGGHIPKSQYRPVQELPDDADVLRSMYTTASVTRGYPVEVII
                     PVTSMGCVLTWDFDILKSECEFIVYHTPKIIQEAMTPHSPSMLKPVEMVTAAIANNPL
                     PVVISDPSLTLGLDLTIEEKPVVFQEGDSMQGSHFCSRSGTYILQWRIPEISGQHNTT
                     FDFSIGTHKCRLMYYHELLNSADFRKGGSNFLNISKSCRCCLQYKKVSKEAKSHWLVF
                     CQFIHNFDSLLSRWEMFNISQTVGSVASLESCRSSFSSIAPPSQPGTPCVGSKIAK"
     assembly_gap    9340..9520
                     /estimated_length=181
                     /gap_type="within scaffold"
                     /linkage_evidence="paired-ends"
     assembly_gap    17384..17877
                     /estimated_length=494
                     /gap_type="within scaffold"
                     /linkage_evidence="paired-ends"
     assembly_gap    20400..21171
                     /estimated_length=772
                     /gap_type="within scaffold"
                     /linkage_evidence="paired-ends"
     gene            34753..38263
                     /locus_tag="DICVIV_11833"
     mRNA            join(34753..34807,34918..35076,35236..35340,35776..35858,
                     35914..35998,36230..36369,36617..36747,36890..36971,
                     37097..37238,37339..37462,37638..37754,37824..37906,
                     38211..38263)
                     /locus_tag="DICVIV_11833"
                     /product="hypothetical protein"
     CDS             join(34753..34807,34918..35076,35236..35340,35776..35858,
                     35914..35998,36230..36369,36617..36747,36890..36971,
                     37097..37238,37339..37462,37638..37754,37824..37906,
                     38211..38263)
                     /locus_tag="DICVIV_11833"
                     /inference="protein motif:HMMPfam:IPR017442"
                     /note="KEGG: phu:Phum_PHUM368540 1.3e-30
                     serine/threonine-protein kinase NIM1, putative"
                     /codon_start=1
                     /product="hypothetical protein"
                     /protein_id="KJH42178.1"
                     /db_xref="InterPro:IPR017442"
                     /translation="MLRTWCGSPPYAAPELLLGKEYDGLKADIWSLGVILYILVTGGF
                     PFPGDNVENLKRAVLSCHMKIPYWVSVECADLIKKMLVIHPFKRLSINAVMQHRFVVK
                     SSPLYRRPLKLNPTVLLFMQQHGLWTEEQIVDDVLKHNFESSIFATYEMLCDKIEKNS
                     LKGLTNDYPRRGSRGSILSGKANVEADSLKATVSTHHLAQLNFSTSIECDSDDSSASD
                     IYEESPSSSNNGKKQERCQFGLAMKEELGNRGEIRRHTLCASEKILSPDMMAQLPVSS
                     PLYQAVLNHSAFTDLNTSPGGTNLVTSQLPLFFALPLMPMLDYAQMLPVPNSERRASA
                     GENLLNPGTIESVNHIIHSAPTGNVARSIEEEGEGYLSKHAGKRNTVHTASSAFGPSS
                     PAPRHCPYTKASSTERRSSWASPTITMQQQQQLERMHRQASNSEVLESSTVAFSDMKS
                     IL"
     gene            <39450..40447
                     /locus_tag="DICVIV_11834"
     mRNA            join(<39450..39587,39655..39786,40093..40204,40269..40447)
                     /locus_tag="DICVIV_11834"
                     /product="hypothetical protein"
     CDS             join(<39450..39587,39655..39786,40093..40204,40269..40447)
                     /locus_tag="DICVIV_11834"
                     /codon_start=1
                     /product="hypothetical protein"
                     /protein_id="KJH42179.1"
                     /translation="AGRSPTHFEHRSAPFGCPQISITDEHNRSLAPGSSSFDPISFFE
                     QKVNEVVCGQRPATVIGFSPASKSGASSPEQSVKVDEGDGNDDRVKSFVSTLPLADVV
                     HELKNCLNEMHIQFEETSEVVYMDQEMNRLSLSTGVEIGCATLPPEHKSHVEFAIVAG
                     VSPNSELLCEELISRLRVLDPTASYE"
     assembly_gap    41442..41541
                     /estimated_length=100
                     /gap_type="within scaffold"
                     /linkage_evidence="paired-ends"
     gene            complement(42402..43660)
                     /locus_tag="DICVIV_11835"
     mRNA            complement(join(42402..42539,42627..42728,42896..42982,
                     43071..43153,43296..43418,43600..43660))
                     /locus_tag="DICVIV_11835"
                     /product="hypothetical protein"
     CDS             complement(join(42402..42539,42627..42728,42896..42982,
                     43071..43153,43296..43418,43600..43660))
                     /locus_tag="DICVIV_11835"
                     /inference="protein motif:HMMPfam:IPR002524"
                     /codon_start=1
                     /product="hypothetical protein"
                     /protein_id="KJH42180.1"
                     /db_xref="InterPro:IPR002524"
                     /translation="MGLHPIPRLILHSLSPLVVIGAYVSGLTIPQWVGESWMWRHADP
                     ILAVVLTLLFLLLIIPSFKEMMPYIFAHTPAKFHVESFQNEISETFPDVTCTHIHAYR
                     LWPGNVFEALIHLNFLVDKSKQAWSSDAAIRYTEVRQNVTSVLTRAGAQKVVVEPCFL
                     DATEIGRPWVGCVGVTCFRQDRGCCKVDEKLEVQVAR"
     assembly_gap    52203..52395
                     /estimated_length=193
                     /gap_type="within scaffold"
                     /linkage_evidence="paired-ends"
     assembly_gap    58456..58555
                     /estimated_length=100
                     /gap_type="within scaffold"
                     /linkage_evidence="paired-ends"
     gene            64134..67833
                     /locus_tag="DICVIV_11836"
     mRNA            join(64134..64305,64359..64452,64519..64665,65703..65802,
                     67648..67833)
                     /locus_tag="DICVIV_11836"
                     /product="HAD superfamily hydrolase, 5'-nucleotidase"
     CDS             join(64134..64305,64359..64452,64519..64665,65703..65802,
                     67648..67833)
                     /locus_tag="DICVIV_11836"
                     /inference="protein motif:HMMPfam:IPR008380"
                     /note="KEGG: phu:Phum_PHUM227090 2.1e-64 Cytosolic purine
                     5'-nucleotidase, putative"
                     /codon_start=1
                     /product="HAD superfamily hydrolase, 5'-nucleotidase"
                     /protein_id="KJH42181.1"
                     /db_xref="InterPro:IPR008380"
                     /translation="MKVSNLSINVLIIYHILEYKSPDLEILSFDLAVQRLIDIGYPEE
                     IRKFKYDPIFPVRGLWFDYSYGNLLKVDGFGNILVGMHGFKFLKAAEIEEIYPNRYLQ
                     LSESRVFVLNTLFNLPETHLLAYLIDFFDNHPEYTPLEDKTGLRGGDVLMSYKSIFYD
                     CRSALDWVHLESNMKEIILENMEKYVMPDDRAPLLLRQLREAGRQTFLLTNSDYGYTD
                     VINFDFILGTLLPN"
     gene            67900..73172
                     /locus_tag="DICVIV_11837"
     mRNA            join(67900..68001,68173..68181,70843..70918,71000..71087,
                     71470..71665,72025..72032,72524..72696,72874..72968,
                     73056..73172)
                     /locus_tag="DICVIV_11837"
                     /product="5' nucleotidase family protein"
     CDS             join(67900..68001,68173..68181,70843..70918,71000..71087,
                     71470..71665,72025..72032,72524..72696,72874..72968,
                     73056..73172)
                     /locus_tag="DICVIV_11837"
                     /inference="protein motif:HMMPfam:IPR008380"
                     /note="KEGG: ssc:100154612 4.6e-53 cytosolic IMP-GMP
                     specific 5-nucleotidase; K01081 5'-nucleotidase"
                     /codon_start=1
                     /product="5' nucleotidase family protein"
                     /protein_id="KJH42182.1"
                     /db_xref="InterPro:IPR008380"
                     /translation="MTFLLGHKWRSFFNITVVNARKPKWFAEGTVFRECTKVDTSTGA
                     VKLGFHTGPLKEGVVYSGGSCDAFHKIVKARGKDVLYIGDHIFGDVLRSKKSRGWRTF
                     LVVPELDHELMVWTDRRPLFEQLNQLDNTLADIYKHLDATARRKPQIHNILQKVKVER
                     YADLYASSCYNLAHYPSFYFFRAAMQLLPHESTVDHASVVNSSKKDLLERKETIGQQV
                     RGWTQKTSNENEFCHEEEEDEQSISDGEVHMTCTLRHSKSFSDDVTEHEVADTDIVVV
                     PPAYVEQEQQM"
CONTIG      join(AZAF01013451.1:1..9339,gap(181),AZAF01013452.1:1..7863,
            gap(494),AZAF01013453.1:1..2522,gap(772),AZAF01013454.1:1..20270,
            gap(100),AZAF01013455.1:1..10661,gap(193),AZAF01013456.1:1..6060,
            gap(100),AZAF01013457.1:1..17107)
//