LOCUS KJH42952.1 283 aa PRT CON 12-MAR-2015 DEFINITION Dictyocaulus viviparus phosphoprotein phosphatase 1 domain protein protein. ACCESSION KN716607-9 PROTEIN_ID KJH42952.1 SOURCE Dictyocaulus viviparus (bovine lungworm) ORGANISM Dictyocaulus viviparus Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; Rhabditina; Rhabditomorpha; Strongyloidea; Metastrongylidae; Dictyocaulus. REFERENCE 1 (bases 1 to 97522) AUTHORS Mitreva,M. TITLE Draft genome of the bovine lungworm Dictyocaulus viviparus JOURNAL Unpublished REFERENCE 2 (bases 1 to 97522) AUTHORS Mitreva,M., Pepin,K.H., Abubucker,S., Martin,J., Minx,P., Warren,C., Palsikar,V.B., Zhang,X. and Wilson,R.K. TITLE Direct Submission JOURNAL Submitted (12-NOV-2013) The Genome Institute, Washington University School of Medicine, 4444 Forest Park, St. Louis, MO 63108, USA COMMENT Dictyocaulus viviparus, the bovine lungworm, is the cause of parasitic bronchitis in cattle (husk, verminous pneumonia, dictyocaulosis) with a world wide distribution in temperate areas. The predominant hosts are cattle, but it can also infect deers. Infection occurs on pasture as infective larvae develop from L1 that are shed with the feces to free-living infective L3 that are ingested by cattle while grazing. Free-living larvae do not feed; survival on pasture is therefore limited to a few months depending on temperature and humidity. The parasite is able to interrupt development inside the host. Infective larvae that have been exposed to low temperatures on pasture before infection subsequently develop only to preadult larval stages that survive winter conditions as hypobiotic larvae in the lung and resume development in spring. These animals contaminate pastures the following spring and represent the major source of infection for other susceptible cattle. The strain being sequenced (HannoverDv2000) was obtained from the laboratory of Drs. Thomas Schnieder and Christina Strube (Christina.Strube@tiho-hannover.de) and has been maintained in calves since August 2000. Worm isolation and DNA extraction was performed by Christina Strube and/or the Genome Institute's production team. This assembly consists of fragments, 3kb and 8kb insert whole genome shotgun libraries. The sequences were generated on the Roch/454 platform and assembled using Newbler. To improve scaffolding, inhouse tools CIGA (Cdna tool for Improving Genome Assembly) and Pygap (Gap closure tool) were used to map 454 cDNA reads using blat to the genomic assembly to link genomic contigs based on cDNA evidence. Only joins confirmed by additional independent data typing were accepted and close gaps followed by the Pyramid assembler and Illumina paired reads to closing gaps and extending contigs The repeat library was generated using Repeatmodeler (A.F.A. Smit, R. Hubley & P. Green http://repeatmasker.org). The Ribosomal RNA genes were identified using RNAmmer (Lagesen et. al., 2007 Nucleic Acids Res.) and transfer RNA's were identified with tRNAscan-SE (Lowe and Eddy, Nucleic Acids Res. 1997). Non-coding RNAs, such as microRNAs, were identified by sequence homology search of the Rfam database (Griffiths-Jones et. al., 2003 Nucleic Acids Res.). Repeats and predicted RNA's were then masked using RepeatMasker (A. Smit, R. Hubley & P. Green http://repeatmasker.org). Protein-coding genes were predicted using a combination of ab initio programs Snap (Korf, 2004 BCM Bioinformatics), Fgenesh (Salamov A., Solovyev V. 2000, Genome Res.) and Augustus (M. Stanke, et. al., 2008 Bioinformatics) and the annotation pipeline tool Maker (M. Yandell et. al., 2007 Genomc Research) which aligns mRNA, EST and protein information from same species or cross-species to aid in gene structure determination and modifications. A consensus gene set from the above prediction algorithms was generated, using a logical, hierarchical approach developed at the Genome institute. Gene product naming was determined by BER (http://ber.sourceforge.net). Our goal is to explore this WGS draft sequence of D. viviparus to better define proteins involved in nematode parasitism that impact health and disease and are relevant to both host-parasite relationships and basic biological processes. For information regarding this assembly or project, or any other GSC genome project, please visit our Genome Groups web page (http://genome.wustl.edu/genome_group_index.cgi) and email the designated contact person. For specific questions regarding the D. viviparus genome project contact Makedonka Mitreva (mmitreva@genome.wustl.edu) at Washington University School of Medicine. The National Human Genome Research Institute (NHGRI) of the National Institutes of Health (NIH) provided funds for this project. ##Genome-Assembly-Data-START## Current Finishing Status :: High-Quality Draft Assembly Method :: Newbler v. 2.6 Assembly Name :: D_viviparus_9.2.1.ec.pg Genome Coverage :: 12.20x Sequencing Technology :: 454 ##Genome-Assembly-Data-END## FEATURES Qualifiers source /organism="Dictyocaulus viviparus" /mol_type="genomic DNA" /submitter_seqid="D_viviparus-1.0_Cont458" /strain="HannoverDv2000" /isolation_source="Cow lung" /host="cattle" /db_xref="taxon:29172" /chromosome="Unknown" /country="Germany" /lat_lon="51.00 N 9.00 E" /collection_date="Aug-2000" /collected_by="Drs. Thomas Schnieder and Christina Strube" protein /locus_tag="DICVIV_11054" /inference="protein motif:HMMPfam:IPR004843" /note="KEGG: cel:R03D7.8 1.5e-31 hypothetical protein; K01090 protein phosphatase" /db_xref="InterPro:IPR004843" intron_pos 10:0 (1/8) intron_pos 37:2 (2/8) intron_pos 68:0 (3/8) intron_pos 108:0 (4/8) intron_pos 132:2 (5/8) intron_pos 177:0 (6/8) intron_pos 219:2 (7/8) intron_pos 249:0 (8/8) BEGIN 1 MSAQRVGFEV SEKKLVTGQL SFPLLCRVLT NMELRIRKEV IELNSHEMPT YNKVEFIAGG 61 NGHSLVKVDQ MFLSGYENYK ITHLLQHGPY VYDWRPFELV SLLSQATEIF EGESTILILR 121 APIVIIGDIR GQYQDLHRWL CIVDFPPRSK ILFLGGVIDP EEPGSLDCLA FIAAMKVRFP 181 HDVFLIRGMG ETLPIVFHAR FRGLNNSAVQ SLVLSPKKKA ATRLCNSLPI AARISNRILA 241 VHSGLSHEGK GTYNLLSLDR FFPTGDSTSK FFVHTIVVYN SLP //