LOCUS       KE124997              444250 bp    DNA     linear   CON 06-JUN-2013
DEFINITION  Ancylostoma ceylanicum unplaced genomic scaffold
            A_ceylanicum-1.0_Cont223, whole genome shotgun sequence.
ACCESSION   KE124997 ASSL01000000
VERSION     KE124997.1
DBLINK      BioProject: PRJNA72583
KEYWORDS    WGS; HIGH_QUALITY_DRAFT.
SOURCE      Ancylostoma ceylanicum
  ORGANISM  Ancylostoma ceylanicum
            Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
            Rhabditina; Rhabditomorpha; Strongyloidea; Ancylostomatidae;
            Ancylostomatinae; Ancylostoma.
REFERENCE   1  (bases 1 to 444250)
  AUTHORS   Mitreva,M.
  TITLE     Draft genome of the parasitic nematode Ancylostoma ceylanicum
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 444250)
  AUTHORS   Mitreva,M., Abubucker,S., Martin,J., Minx,P., Warren,C.,
            Pepin,K.H., Palsikar,V.B., Zhang,X.W.E. and Wilson,R.K.
  TITLE     Direct Submission
  JOURNAL   Submitted (14-MAY-2013) The Genome Institute, Washington
            University School of Medicine, 4444 Forest Park, St. Louis, MO
            63108, USA
COMMENT     Ancylostoma ceylanicum is a parasite of humans and carnivores in
            Asia. The parasite was adapted to the Syrian golden hamster
            (Mesocricetus auratus) in 1972 by Ray and Bhopale. The strain
            (Indian) was distributed worldwide from the lab of Dr. Jerzy
            Behnke in the 1980s. The sequenced strain was obtained by Dr.
            John M. Hawdon (jhawdon@gwu.edu) from Dr. Ricardo Fujiwara at the
            Federal University of Minas Gerais, Brazil. The strain has been
            maintained in Dr. Hawdon's lab in dogs and hamsters since 2007.
            Worm isolation and extraction of nucleic acids were performed by
            Dr. Verena Gelmedin and others in the Hawdon lab, or by the
            Genome Institute production team. Voucher specimens are on
            deposit in the U.S. National Parasite Collection (accession
            number 102954). For the original isolation and adaptation to
            hamsters see Ray, D.K., Bhopale, K.K., 1972. Complete development
            of Ancylostoma ceylanicum (Looss, 1911) in golden hamsters,
            Mesocricetus auratus.
            Experientia 28, 359-361.
            This assembly was generated from fragment, 3 kb insert and 8 kb
            insert whole genome shotgun libraries. The sequences were
            generated on the Roche/454 platform and assembled using Newbler.
            To improve scaffolding, the in-house tools CIGA (cDNA tool for
            Improving Genome Assembly) and Pygap (gap closure tool) were used
            to map 454 cDNA reads to the genomic assembly with BLAT and to
            link genomic contigs based on cDNA evidence. Only joins confirmed
            by additional independent data types were accepted. Gap closure
            followed, using the Pyramid assembler and Illumina paired reads
            to close gaps and extend contigs.
            The repeat library was generated using RepeatModeler (A.F.A.
            Smit, R. Hubley & P. Green http://repeatmasker.org). The
            ribosomal RNA genes were identified using RNAmmer (Lagesen et
            al., 2007 Nucleic Acids Res.) and transfer RNAs were identified
            with tRNAscan-SE (Lowe and Eddy, Nucleic Acids Res. 1997).
            Non-coding RNAs, such as microRNAs, were identified by sequence
            homology search of the Rfam database (Griffiths-Jones et al.,
            2003 Nucleic Acids Res.). Repeats and predicted RNAs were then
            masked using RepeatMasker (A. Smit, R. Hubley & P. Green
            http://repeatmasker.org).
            Protein-coding genes were predicted using a combination of the ab
            initio programs Snap (Korf, 2004 BMC Bioinformatics), Fgenesh
            (Salamov A., Solovyev V. 2000, Genome Res.) and Augustus (M.
            Stanke et al., 2008 Bioinformatics) and the annotation pipeline
            tool Maker (M. Yandell et al., 2007 Genome Research), which
            aligns mRNA, EST and protein information from the same species or
            from other species to aid in gene structure determination and
            modification. A consensus gene set from the above prediction
            algorithms was generated using a logical, hierarchical approach
            developed at the Genome Institute. Gene product naming was
            determined by BER (http://ber.sourceforge.net).
            Our goal is to explore this WGS draft sequence of A.
ceylanicum to better define proteins involved in nematode parasitism that impact health and disease and are relevant to both host-parasite relationships and basic biological processes. For information regarding this assembly or project, or any other GSC genome project, please visit our Genome Groups web page (http://genome.wustl.edu/genome_group_index.cgi) and email the designated contact person. For specific questions regarding the A. ceylanicum genome project contact Makedonka Mitreva (mmitreva@genome.wustl.edu) at Washington University School of Medicine. The National Human Genome Research Institute (NHGRI) of the National Institutes of Health (NIH) provided funds for this project. ##Genome-Assembly-Data-START## Current Finishing Status :: High-Quality Draft Assembly Method :: Newbler v. MapAsmResearch-04/19/2010-patch- 08/17/2010 Assembly Name :: A_ceylanicum1.3.ec.cg.pg Genome Coverage :: 26.10x Sequencing Technology :: 454 ##Genome-Assembly-Data-END## FEATURES Location/Qualifiers source 1..444250 /organism="Ancylostoma ceylanicum" /mol_type="genomic DNA" /submitter_seqid="A_ceylanicum-1.0_Cont223" /specimen_voucher="USDA:USNPC:102954" /db_xref="taxon:53326" /chromosome="Unknown" assembly_gap 3554..5466 /estimated_length=1913 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 9023..9557 /estimated_length=535 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 12803..14078 /estimated_length=1276 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 17064..17279 /estimated_length=216 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 20256..20542 /estimated_length=287 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 24830..24929 /estimated_length=unknown /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 32348..32923 /estimated_length=576 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 33767..34027 
/estimated_length=261 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 34777..35430 /estimated_length=654 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 36507..36606 /estimated_length=unknown /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(36673..51511) /locus_tag="ANCCEY_07633" mRNA complement(join(36673..36696,38380..38585,40881..40971, 41096..41182,47038..47154,51497..51511)) /locus_tag="ANCCEY_07633" /product="hypothetical protein" CDS complement(join(36673..36696,38380..38585,40881..40971, 41096..41182,47038..47154,51497..51511)) /locus_tag="ANCCEY_07633" /note="KEGG: cbr:CBG09515 7.3e-07 Hypothetical protein CBG09515; K11140 aminopeptidase N" /codon_start=1 /product="hypothetical protein" /protein_id="EPB73263.1" /translation="MIYLKIYSAQTRSGLISDAFAAAQIDRLDYETVFQLLEYLPKEK SDLVWHMVKNGLASIVDFYGNEPDGEWAMKRKATMRRNHTAKETLDKNEKLFVTTANL SALKYISTKATSTLHAKYSKLSLSVADVGDPPLLSPVIAAQGAITGDSGVRPLSQGDT VACDTDRRSHRSVHYLEDL" assembly_gap 39343..39666 /estimated_length=324 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 42308..44058 /estimated_length=1751 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 45091..46222 /estimated_length=1132 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 47210..50691 /estimated_length=3482 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 55997..56905 /estimated_length=909 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 58589..59706 /estimated_length=1118 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 71443..82954 /locus_tag="ANCCEY_07634" mRNA join(71443..71572,73127..73197,73581..73721,73780..73936, 74004..74143,74208..74259,74706..74810,74872..74970, 75037..75140,75217..75355,75424..75542,75647..75714, 75795..75879,75939..76109,76173..76313,76386..76514, 76529..76672,76729..77006,77126..77255,77322..77413, 
77605..77654,77740..77816,77892..78014,78097..78237, 78428..78508,78567..78662,78724..78869,78929..79024, 79090..79237,79296..79439,79496..79789,79850..79989, 80071..80268,80328..80449,80622..80728,80799..80911, 80970..81078,81150..81218,81287..81415,81474..81607, 81685..81754,81822..81962,82034..82201,82268..82377, 82440..82602,82667..82780,82847..82954) /locus_tag="ANCCEY_07634" /product="myosin head" CDS join(71443..71572,73127..73197,73581..73721,73780..73936, 74004..74143,74208..74259,74706..74810,74872..74970, 75037..75140,75217..75355,75424..75542,75647..75714, 75795..75879,75939..76109,76173..76313,76386..76514, 76529..76672,76729..77006,77126..77255,77322..77413, 77605..77654,77740..77816,77892..78014,78097..78237, 78428..78508,78567..78662,78724..78869,78929..79024, 79090..79237,79296..79439,79496..79789,79850..79989, 80071..80268,80328..80449,80622..80728,80799..80911, 80970..81078,81150..81218,81287..81415,81474..81607, 81685..81754,81822..81962,82034..82201,82268..82377, 82440..82602,82667..82780,82847..82954) /locus_tag="ANCCEY_07634" /inference="protein motif:HMMPfam:IPR001609" /inference="protein motif:HMMPfam:IPR002928" /inference="protein motif:HMMPfam:IPR004009" /note="KEGG: phu:Phum_PHUM098460 0. 
myosin-9, putative K10352" /codon_start=1 /product="myosin head" /protein_id="EPB73264.1" /db_xref="InterPro:IPR001609" /db_xref="InterPro:IPR002928" /db_xref="InterPro:IPR004009" /translation="MSNSDFEQDPGFQYLGMSREARAASAARPFDSKKNVWVPDPEEG FIAAEIQSVQGDQVTVVTAKGNTVTVKKDEAQEMNPPKFDKTEDMANLTFLNEASVLA NLKDRYKDMMIYTYSGLFCVVINPYKRLPIYTESVIKFYMGKRRNEMPPHLFATSDEA YRNMVQDRENQSMLITGESGAGKTENTKKVISYFAIVGATQAAKGAKGEGTKGGTLEE QIVQTNPVLEAFGNAKTVRNNNSSRFGKFIRTHFSAQGKLAGGDIEHYLLEKSRVVRQ AAGERSYHIFYQIMSGHDPKLRDQLKLNNDIKYYHFCSQAELTIDGVNDKEEMGLTQE AFDIMGFEDEEVMDLYKSCAAILHMGEMKFKQRPREEQAEPDGDEDAQNVAHNLGVNH EEFLKALTKPRVRVGTEWVNKGQNLEQVHWAVAGLGKAIYARMFKWLIGRCNKTLDAK QIERRYFIGVLDIAGFEIFDFNSFEQLWINFVNERLQQFFNHHMFVLEQEEYKREGIQ WTFIDFGLDLQACIELIEKPLGLISMLDEECIVPKATDMTYVQKLNDQHLGKHPNFQK PKPPKGKQSEAHFAVVHYAGTVRYNATNFLEKNKVCPHLILTPDSIDSIQTDPLNDTA VALLKTHSHGCKLMLEIWADYQTQEEAAEAAKSGAGGGKKKGKSASFMTVSMIYRESL NNLMNMLYQTHPHFIRCIIPNEKKTSGMTMPMDSLSRTRRHNPVLLLQRSRNLNSFRF DRLGACTQPVDLQWCTGSYAVLAADQAKSSDDVKVASVAITDKLVTDGSLKDEEFKIG NTKVFFKAGILARLEDMRDEILRVIMTNFQSRVRWYLGQTDLRRRMQQQAGLLIIQRN VRSWCTLRTWEWFKLYGKVKPLLKAGKEAEEMEKLSDKIKSLEEAVAKGDESRKQLES QVAGLVEEKNQLFLNLEKEKANLQDAEERNQKLAALKADLDKQLAEVQYEQEIAEHKK HAQDLELSLKKAESEKQARDHNIRSLQDEMANQDEAVARLNKEKKHQEEVNRKLMEDL QAEEDRVNHMEKVRAKLEQQLDDLEDAMDREKRSRQDLEKAKRKVEGELKVAQENIDE ITKQKHDVEQNLKKKEAELHQLSTRLEEEQSLVAKLQRQIKELQARIAELEEELENER QSRAKADRSRSELQRELEEISERLEEQGGATAAQLEANKKREAELAKLRRDQEEANLN HETALASLRKKHHDAVAELTDQLEQLQKLKAKADKEKAQLQRELEELSASVDSEVRSR QDIEKQLKVVEVQYAEAQTKADEQSRQLNDFAALKNRLHNENGDLGRQLEDMENQLNS LHRLKAQLTSQLEETKRSYDEEARERQALAAQVKNFEHENDSLRDQLDTESEAKAELL RQISKQNAEIQQWKARFESEGLAKLDEIEEAKRKLQGKVQELTDANEMAFAKIGSLEK TRHKLMQDLDDAQARFDKIIDEWRKKHDDLAAELDAAQRDNRNLSTDLFRAKTAQDEL TEHLESVRRENKQLAQEVKDLADQLGEGGRSVHELQKMVRRLEVEKEELQKALDEAEA ALEAEEAKVLRAQVEVSQIRSEIEKRIQEKEEEFENTRKNHQRALESMQATLEAETKH KEEALRIKKKLEADINELEIALDHANRANADAQKTIKKYMETVRELQLQVEDEQRQKD EIREQFLNSEKRNAILQTEKEELSQVAEAAERARRNAETDCIELREHNNDLSAQLNGI 
TAVKRKLEGELQAMHAELDETLAELKNVDEMGKKAAADAARLAEELRQEQEHSMHVER IRKGLEVQIKEMQIRLDEAEAAALKGGKKIIAQLESRIRSLEQELDGEQRRHQETDKN WRKSERRVKEVEFQLEEDKKNQERLTELIDKLQAKLKVFKRQVEEAEEVAATNLGKYR QLQAQLDDAEERADVAENALSKMRNKIRASASMVPSGSGGLAQSASSAVIRSTSFARS QDF" assembly_gap 92061..92160 /estimated_length=unknown /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 94629..94745 /estimated_length=117 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 96027..97264 /estimated_length=1238 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 98976..100863 /estimated_length=1888 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 110695..111082 /estimated_length=388 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 113383..114998 /estimated_length=1616 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 116460..117219 /estimated_length=760 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 120402..120668 /estimated_length=267 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 133043..133142 /estimated_length=unknown /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 138705..139601 /estimated_length=897 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 141880..142752 /estimated_length=873 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 143386..147536 /estimated_length=4151 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 148961..154001 /estimated_length=5041 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(172607..175302) /locus_tag="ANCCEY_07635" mRNA complement(join(172607..172684,173622..173769, 174444..174538,174659..174762,175125..175302)) /locus_tag="ANCCEY_07635" /product="hypothetical protein" CDS complement(join(172607..172684,173622..173769, 174444..174538,174659..174762,175125..175302)) 
/locus_tag="ANCCEY_07635" /inference="protein motif:HMMPfam:IPR008380" /note="KEGG: dme:Dmel_CG32549 3.3e-12 CG32549 gene product from transcript CG32549-RD K01081" /codon_start=1 /product="hypothetical protein" /protein_id="EPB73265.1" /db_xref="InterPro:IPR008380" /translation="MASPPNATELLRAAASQNDFVAIRGTTGRKMASLPDFIITEITD GQNLACPDMELPINCVSQRSLLDFTLRGSLVETTTSSKAQTKFNTLKGLLRITDDMES EFGMMGSMLRCGWRQTHFAAQLKKYADLYTCNVYNLIEYSGTHYFNSPIQLLPHEEKI MRGFIDTKSINTFDERSSEEGETLSDISCCGCCLTGNETI" assembly_gap 175871..175991 /estimated_length=121 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 177063..178423 /estimated_length=1361 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 179286..179406 /estimated_length=121 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(180337..187725) /locus_tag="ANCCEY_07636" mRNA complement(join(180337..180423,185625..185741, 187111..187174,187664..187725)) /locus_tag="ANCCEY_07636" /product="hypothetical protein" CDS complement(join(180337..180423,185625..185741, 187111..187174,187664..187725)) /locus_tag="ANCCEY_07636" /inference="protein motif:HMMPfam:IPR008380" /note="KEGG: dme:Dmel_CG32549 2.9e-14 CG32549 gene product from transcript CG32549-RD K01081" /codon_start=1 /product="hypothetical protein" /protein_id="EPB73266.1" /db_xref="InterPro:IPR008380" /translation="MGLAGKDILYVGDHIFGDVLKSKKVGGWRTLLIVPELDNEMKIW SSQLMKFNGLLELNNSLSDFQTIQQSAIRKKILKEITLKSEHDERPREEGGRKEFGEV TLALEMR" assembly_gap 181361..184814 /estimated_length=3454 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 195111..196597 /estimated_length=1487 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 198459..199974 /estimated_length=1516 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 200783..201455 /estimated_length=673 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(202332..209717) /locus_tag="ANCCEY_07637" mRNA 
complement(join(202332..202400,202523..202663, 206766..206871,207853..208005,208075..208202, 208259..208344,209612..209717)) /locus_tag="ANCCEY_07637" /product="HAD superfamily hydrolase, 5'-nucleotidase" CDS complement(join(202332..202400,202523..202663, 206766..206871,207853..208005,208075..208202, 208259..208344,209612..209717)) /locus_tag="ANCCEY_07637" /inference="protein motif:HMMPfam:IPR008380" /note="KEGG: dme:Dmel_CG32549 9.0e-41 CG32549 gene product from transcript CG32549-RD K01081" /codon_start=1 /product="HAD superfamily hydrolase, 5'-nucleotidase" /protein_id="EPB73267.1" /db_xref="InterPro:IPR008380" /translation="MSRANSPDARGNNFIIRSTDVDGRTIRQQKARWLTVYNSPICEQ LAFDLALKRLIDIGYPKEIQICEYKREFVVRNAWFDKRLGNLLKTDEHCNILSAFRGF RKLQKKEIREYYPNKHIALEETRIFVLNTVFNVPETLLLATIVDYFEHQCGLDYSKLA NGLGYKRKDKQQIVLFSTIFEDCRSTIDWIHTQGCFKELITANLSKYIARDDRAVAMF QKLSKEGKKLFLLTNSSWHYTDLHIQKMKRFQEAVKHLAVVFNG" assembly_gap 203230..204202 /estimated_length=973 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 205160..205796 /estimated_length=637 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 209136..209235 /estimated_length=unknown /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(216517..223909) /locus_tag="ANCCEY_07638" mRNA complement(join(216517..216618,223622..223909)) /locus_tag="ANCCEY_07638" /product="hypothetical protein" CDS complement(join(216517..216618,223622..223909)) /locus_tag="ANCCEY_07638" /codon_start=1 /product="hypothetical protein" /protein_id="EPB73268.1" /translation="MEYVNTTKFGQADGLSRLMRKHQVESEDVVIAAVENDVCTLLKE CIRRLPVTVDDVESYTRTDAVLGKVISCVNTEKWPKANQKLAYFQNRCKTLSPNYFAN FGYYGEQSQRGDNFIIQSFGLDGRTII" assembly_gap 219239..219338 /estimated_length=unknown /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 219944..221047 /estimated_length=1104 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(223917..224639) /locus_tag="ANCCEY_07639" mRNA 
complement(223917..224639) /locus_tag="ANCCEY_07639" /product="hypothetical protein" CDS complement(223917..224639) /locus_tag="ANCCEY_07639" /note="KEGG: phu:Phum_PHUM531970 4.8e-39 enzymatic polyprotein, putative" /codon_start=1 /product="hypothetical protein" /protein_id="EPB73269.1" /translation="MENLLALFERIAEYGLKVRLEKCSFAKPEIRYLGFIVDKNGRRP NPEKIEAIKSMVMPKDVGQLRAFLGIITYYAAFMPAMKDLRGPLDALLKKKVKWEWTS KQQIAFEKLKKALSSELACLLLLLAHYDPRQKIVVTADACGYGIGCLISHRYEDGSEK PIAHASRSLTAAEKNYSQIEEEALGIVFAVKNFHKYVFGIKFLLLTDHKPLLPIFGDK KGVLVYSANRLKREMGYNSAWI" assembly_gap 229260..229359 /estimated_length=unknown /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 230835..231736 /estimated_length=902 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 237682..237781 /estimated_length=unknown /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 239947..241052 /estimated_length=1106 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 247971..248070 /estimated_length=unknown /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 250843..250942 /estimated_length=unknown /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 259025..259171 /estimated_length=147 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 262028..262500 /estimated_length=473 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 263143..263647 /estimated_length=505 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 265602..265701 /estimated_length=unknown /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 268396..285415 /locus_tag="ANCCEY_07640" mRNA join(268396..268475,268779..268887,269078..269170, 269223..269318,269375..269480,272392..272507, 272590..272669,273415..273523,273951..274044, 274234..274340,274556..274780,274861..275034, 275134..275358,275514..275628,276443..276621, 
277848..278004,278150..278241,278749..278852, 278908..278998,281539..281626,283254..283592, 285321..285415) /locus_tag="ANCCEY_07640" /product="hypothetical protein" CDS join(268396..268475,268779..268887,269078..269170, 269223..269318,269375..269480,272392..272507, 272590..272669,273415..273523,273951..274044, 274234..274340,274556..274780,274861..275034, 275134..275358,275514..275628,276443..276621, 277848..278004,278150..278241,278749..278852, 278908..278998,281539..281626,283254..283592, 285321..285415) /locus_tag="ANCCEY_07640" /inference="protein motif:HMMPfam:IPR019381" /codon_start=1 /product="hypothetical protein" /protein_id="EPB73270.1" /db_xref="InterPro:IPR019381" /translation="MEKGGVVPMRLFANWEMDRASSSVVQSALQQKARIIPRLPCSAS CYDICILVPSAMDIQYPVLGNKRSLRSNDIVVTPYHGRVDLDLDISFTIQYPHFVKRK GNTLQILIQRRRRYKNRPFPSSFKTIAVGMVNLTQLLQHGGLREIPLWSSEADDKQSS GVQSIGRLNLVTCQSQPLEIDMERDRVGKKQGGIENDLSEDDSDSDYEDVPGDSDINE PTSARNTARGAAGEVNLKTSRATRKNNIRQRFVTLLKKFKVPEEEGIPSRSASAAMVP TDKELQDLFEELENMSDSGPEMAADEISIGSNPRPGLRPFFTRSKEILPAIYDREGNI IKRHLFFKDAENGSDSEGEAEDLDWSSETEKERDKDEIKISQSQDFAKDSMMPTLTPS GEHMPTTNTSSPAVAMKPQLRVASTSSAPLVQSTSLGGISHSLTTGSISKKAEKDRTV AMSAEKNVGVSEQYARDGFFQLSTLLSAYPAASAGCWLCSYSDLPHLSTISVPTVNCP SGNAVKQAIGQIVNRILNLSVLFHNQFLSMILRAYVECLHHKSSSNWLHYLRFAIIPA PHSLIAKLIEGLDVVLDHLSRDMWERWAEISPAEKQSITEKMNAWLTTGGACLNLPIG EALLQMTERGQEADACRVFVPFLAEVRVGQSSEDEEATCMYSSPRSVEIEKEYVVSAR DREYSAGLSSSPPNSPHIRSDAHEMHVEYWLGRDPSNENVNVMGAQSLTPNSKKDLCK GSMKATFRTLVITRTCNQPLLLLSFVKEKRKEKMLQKLGMKKGQKSENENPPVQVAAV SRLLCSGASKHSDLTDEVDVSEDCSFESVLWLDVCSLDFSTGVLVTKPGIASEEEAFI EEMSLLPAPRRARVLKMAFLMFLKQILNLMELIELPVEEQYRIGPVAALTQLTMQVRR THFVIPVFSFTQGVDCATFSNFEEGRIAWLNLSTKRMQGHART" assembly_gap 270084..270242 /estimated_length=159 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 292259..292740 /estimated_length=482 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 294563..294662 /estimated_length=unknown /gap_type="within scaffold" 
/linkage_evidence="paired-ends" assembly_gap 298884..300531 /estimated_length=1648 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(303612..309981) /locus_tag="ANCCEY_07641" mRNA complement(join(303612..303677,303766..303837, 304123..304233,304302..304472,304552..304656, 304718..304860,308573..308729,308827..308928, 309007..309104,309168..309274,309959..309981)) /locus_tag="ANCCEY_07641" /product="WD domain, G-beta repeat protein" CDS complement(join(303612..303677,303766..303837, 304123..304233,304302..304472,304552..304656, 304718..304860,308573..308729,308827..308928, 309007..309104,309168..309274,309959..309981)) /locus_tag="ANCCEY_07641" /inference="protein motif:HMMPfam:IPR019781" /note="KEGG: edi:EDI_308810 0.00013 DNA topoisomerase K03164" /codon_start=1 /product="WD domain, G-beta repeat protein" /protein_id="EPB73271.1" /db_xref="InterPro:IPR019781" /translation="MKIVPTTSGKVVSCVESGQVNVWSDSGDSCADWEAGCGVKVMRG SNARNELLTGGLEHLIKTWDVETGKRMWSARNMPLDFLGLEVPIMCTDARYVDDSGVI VEATKLHEMRLYDPRAQRRPVKKIPFMDVPITAVSRCYKDNHVLAANSIGEMGLFDLR SKIHPVCKYKGQAGAIRSIDAHPTAPYVATCGIDRFVRVHDIDTKKLAHKIYCKTRLN RVLIRSELPSLLTVIEGNDEQEWMELKKEMNCDSDSVSASASDEDVSEDECTWKKLAV SDADQNSVEIRQRKHKKKAEVGVFEATDEDEEPASKRRKITKSKKRKEEVTNNEVTDN EFSYDEEVEPCKKAKIVQAVKRQDEIKQEVDSDEEPPKRRKKSGRKGKSS" gene complement(313978..334677) /locus_tag="ANCCEY_07642" mRNA complement(join(313978..314080,323922..324037, 324093..324222,324943..325077,325152..325318, 325376..325519,325926..325994,326271..326337, 326415..326566,326658..326756,326858..326953, 327008..327160,328835..328930,328992..329106, 330512..330651,330711..330840,330908..331053, 332299..332432,332590..332708,332803..332936, 332991..333094,333309..333373,333437..333557, 334298..334454,334588..334677)) /locus_tag="ANCCEY_07642" /product="hypothetical protein" CDS complement(join(313978..314080,323922..324037, 324093..324222,324943..325077,325152..325318, 325376..325519,325926..325994,326271..326337, 
326415..326566,326658..326756,326858..326953, 327008..327160,328835..328930,328992..329106, 330512..330651,330711..330840,330908..331053, 332299..332432,332590..332708,332803..332936, 332991..333094,333309..333373,333437..333557, 334298..334454,334588..334677)) /locus_tag="ANCCEY_07642" /codon_start=1 /product="hypothetical protein" /protein_id="EPB73272.1" /translation="MELPFEEEESCPDTQTSITCRLDGNLPTVELARSLGFEVNYVLP LKNELMKRPSRVPTPEFSTKYLDTDDLLFLLMTTLKGGTEPHARDYCIRWHCSKRWDD KELMQRFMWSHAKAVTDEHIRLSFLKCLYNSLDDEEFVFTAKGFFGKDDVQESIEKLE RSIRIVSISQLRSCSRYEEVIAIITRDVDFNSMDGDDLFIMVEYLLDAYSRVKKHNCA MDFASRVFHLLFSYKQLPLAQVSSLLNFIYNTEWNKVTEENSAKIGDVSVEYIQSLDP VKHDDSLPSLALDVLVKAHEKLGERKTCGRDKGAFLLFFMEQLYTCILNEKVMAVLQR EEYYWVWSNVSEEISQCLYCLFGRYSKRRRALEDHDCIVTSPELEKHSTMILELAMPH PLPQYDDKERLGHDVVDLLLNKFPSVLKYSVERGNVLDKFNVWMRDAARENATNRLQW PTTTGESYVQACIWYLMALHHYRQGNHDEIEKYSKLFLTSGYATLESRITAGVWAVLA YTSVYRIFQMDDDLLFLEWPWHVLPFRVSVLVDNKIGVVFFQLASTLYQIATRLSRYF LTLPSDDWRLRRAETLLKDLRSESLRLFEEALSKAHGEAGGICEYQWLGYFFVAKLQA KLNESDVVKVVDGLYEAACSCELSEFFYPIKINVKKQQNIEPVELHYQVHSTVWKYLC RTPNPSLKTLVSLLAYLRAMQSHKVNDIVTRVDLMDEIWNLCHRGFELVTDRFPHMKS YYRLAEMELSRGNVEVAYNHLTKHVFRRKKRDDSLFDSVVEITSQDIDRSGSFPYHVE RALQLTMSLAYQLKDASTIISIITTLVANMEARSEEFILKERQGALLIHAVSRLYILA MESSSPKLLRSELYRAWQVVSRCKALVVRTVETRLQTLIEHIFGSVANFVAEQSVIED NKKKQVRKRKLTSYDLGALQSHPVLLVPGAIPSPSEMSAGGNDPLKRLNLANECALLW LTPKLGFFEIFGKHGRATEAR" assembly_gap 316260..316359 /estimated_length=unknown /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 317954..320904 /estimated_length=2951 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 327767..328553 /estimated_length=787 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 339698..357708 /locus_tag="ANCCEY_07643" mRNA join(339698..339867,340043..340228,341094..341190, 341266..341435,342663..342780,346066..346134, 349818..349939,349998..350113,350855..350967, 351108..351256,351316..351440,352681..352786, 
353497..353572,353635..353706,353809..354153, 357221..357297,357531..357708) /locus_tag="ANCCEY_07643" /product="N2,N2-dimethylguanosine tRNA methyltransferase" CDS join(339698..339867,340043..340228,341094..341190, 341266..341435,342663..342780,346066..346134, 349818..349939,349998..350113,350855..350967, 351108..351256,351316..351440,352681..352786, 353497..353572,353635..353706,353809..354153, 357221..357297,357531..357708) /locus_tag="ANCCEY_07643" /inference="protein motif:HMMPfam:IPR002905" /inference="protein motif:HMMTigr:IPR002905" /note="KEGG: cel:ZC376.5 5.6e-135 trm-1; TRNA Methylase family member (trm-1); K00555 tRNA (guanine-N2-)-methyltransferase" /codon_start=1 /product="N2,N2-dimethylguanosine tRNA methyltransferase" /protein_id="EPB73273.1" /db_xref="InterPro:IPR002905" /translation="MYRIALRVTFAADCGIGSHRMCAGVTAAANIGDVPFLRGIRFLA NEKQPLQSSCQNLSCTPTALLVAVKPTLSATFKTQACENRTSTNVAIQQQYSVKKGMK FTHHLTPLLHRYLRFQLEVRKAKGGRILCMNIKYRFYNNMYQSTPLPYQQKIYTPDEA QQHLLARIVVAIARKYGSIIFFRENLRKKTKSGSTCKSKEIAHLGVVEVSVLRQFVRS RASNQSQTSGSGGDGEPVAKKQKLKAVCEDGPIRILDALSASGLRAFRFSQEVENVDY VLANDFSENAVESIKENITLNGVEGKVRANFGDAVVTMMEHRNIDKRFHAVDLDPYGS ASVFLDSAVQCVADRGILMVTCTDMAVLCGNTPETCYNKYGSTTVRLKCCHEMALRIL LRSIDSHANRYSRYIEPLLSISVDFYVRVFVRLHTGARRTKDSATKVGNVIACSGCHS MEVLPILKKVEDGKNVKYCTSVVRQSMMGTDNKCVHCGHPVHHAGPIYIGPIHDRKFV EGILTSLKETPEEERLGTHNRLMGVLTNVSEEIDVPLYYEHDQLFNVVKCSVPKAVSG SHCNPRALKTNAPTHFLWDICRSAVSQNSCFFYSLFAERERELSIQNVAINLLFFTNT LLQAKAFLLVLGHVSGHSNAISFSSATLLASADLRFGTLNLTSIHDFSLEGVPNAKEA NVTAERHDEKAPGHKILSEPIKVKTGDQDQKRKDLSIPHWPASTQKHRKLANDYGFLC CHTTLLLVIVACYDVKITRSFY" assembly_gap 344257..344356 /estimated_length=unknown /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 348730..349265 /estimated_length=536 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 352108..352383 /estimated_length=276 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 354724..354823 /estimated_length=unknown /gap_type="within 
scaffold" /linkage_evidence="paired-ends" assembly_gap 363090..363189 /estimated_length=unknown /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 365023..365122 /estimated_length=unknown /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 365880..366781 /estimated_length=902 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 368635..372265 /estimated_length=3631 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 372826..373038 /estimated_length=213 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 376307..376406 /estimated_length=unknown /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 377131..377230 /estimated_length=unknown /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 379241..386978 /locus_tag="ANCCEY_07644" mRNA join(379241..379315,379362..379448,379684..379796, 382267..382438,382660..382743,382804..382905, 383179..383298,383362..383484,386652..386801, 386853..386978) /locus_tag="ANCCEY_07644" /product="hypothetical protein" CDS join(379241..379315,379362..379448,379684..379796, 382267..382438,382660..382743,382804..382905, 383179..383298,383362..383484,386652..386801, 386853..386978) /locus_tag="ANCCEY_07644" /note="KEGG: edi:EDI_024060 2.2e-10 trichohyalin K13173" /codon_start=1 /product="hypothetical protein" /protein_id="EPB73274.1" /translation="MISAQSSSSLHWSADMAEQSSNSLQSQLKLARDMGYTEDVIFAA LQAQKKDEEGLYLPFESTNAMLDVLNHISVRANWSTVNGVAVDCNDDATQRLDSSSPR VVPRSSSFHVKSPSTSRNEELTRLITTFEKERQRDKDAAENHMTSLKEAELQKCEREK EQLSQTVVELQAVTDRLTLENLRNEHQAKDQKELAEHRKKQHDQQVAAMEESIRTLTE RCSSLSTELNEKDNQLREKVKRLQEIENRNSQVQPEALLEKILNEYQLKRIEEEKRKE ANETFNKKIAKYHEQLLASFPRLEEQRRRLEAEKERAVDESEQLRREIARLREKTCAE CCICLATKPCVLFLPCRHMIICDSCHAESNIAECPACRTRVGDSMKVFS" assembly_gap 379524..379623 /estimated_length=unknown /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 381515..382199 /estimated_length=685 
/gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 392640..394097 /estimated_length=1458 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 395272..396188 /estimated_length=917 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 399140..402542 /estimated_length=3403 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(402563..408890) /locus_tag="ANCCEY_07645" mRNA complement(join(402563..404554,405234..405650, 405726..405859,406127..406353,408063..408209, 408283..408890)) /locus_tag="ANCCEY_07645" /product="hypothetical protein" CDS complement(join(402563..404554,405234..405650, 405726..405859,406127..406353,408063..408209, 408283..408890)) /locus_tag="ANCCEY_07645" /inference="protein motif:HMMPfam:IPR019103" /note="KEGG: cdu:CD36_50510 1.3e-22 ATP-dependent RNA helicase, putative" /codon_start=1 /product="hypothetical protein" /protein_id="EPB73275.1" /db_xref="InterPro:IPR019103" /translation="MVATYSPDGPTPKIVSFSGSGDDSSQFSLWLRRLEDIMRIRASP MTSQQKANFLICYLEGVAREKVEELGEEDRSNYDTVVAHLKRFFEGPQHRYMARQSLS TCQQHPGESSATFANRLLNLVRAATTGQDPASQKERVLEEFVARLRPDIRYYVKLDNP ATFEQALAKAQMVEQLLAEATAERLISPAGPSRTIEVKSAAPQLLHLNEDGTMVEIMA VVMVFPANDLFLAKQTVTIVEESVIRLVSVLHLGNLLDRHDLAASHLGNDLVQVHRCM PIPPKNYRLLASNGTCYTKPKVELSLSNGPPLHMFIDPTTYVLSNEAPPGDWLPRGPS SETTSNTLSLWPSWFSPWSLYDIWVFVCCVSITTGLLKRRHGADDAPLAIAVPPLPWA SPPAREQDTVEVARVEIDTTNVWPPRASLSPINVLTIANNEQFFVAQIPVKVNGIQVL ALVDTGAAISITSKATAPLLGVFALADTDIPCAVGMAGVPVKIIGRARLRFEIGSFTL HQPIVIKTTTETPPTKFRPPRIPVKFQKELDEHINKLLRAGRIVESDTWTKRRPVVLW VVGTHSVKRALVRRVEPLAERGALKVTLDAYGWSSNPLEEDIKKRGRLMEDGYLLDIC VRLTKAPAAANPIYENISRMQVFENLETDSTASAILSHVYGAAPLGCPNRDEPQPMPD VEPRVDVLVNDNIVTFSAEQRKAVALGTSGFPIAAIQAAFGTGKTLVGAVIAAQLVDR DEIVLVTASTNAAVAQFAQTILSLSAYRHLRVLRYVSDAAVLENMAPTNVDMNKILIS LHDTYTNRLSPEAMELCNKFTIGRRILERYIENPDLALYLTDEEKEEYAIAERNVSRT LEKMIALMLTLRPPHILCITTASLMNTIGTPDGAFNAYRDKFTVLIGDEASQIPEPAL TAISNRLPNLRQIYIGDIHQLEPHAKCPRDSHAAVYGARSVMSVLCSARAVPVASLVR 
TFRAHPALNELPNRVAYDGTLVSGITAFARPMLIRAMEFPAPGIPFMLIDVDGQSTRA ENMSHFNPVEVETCVKLIELLKARGIAPEYICVITFYREQFRRVEQATLDQGIEISTV DSIQGREKEIVILLTTKTHFTPESADFLDEYRRMNVALSRCRQGQFILGHVPSLATVT FWRRVIDWATSLQAIVTPETLERYFHDV" assembly_gap 406441..407220 /estimated_length=780 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 410311..410762 /estimated_length=452 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 414116..414691 /estimated_length=576 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 417514..>420531 /locus_tag="ANCCEY_07646" mRNA join(417514..417670,419158..419254,419484..419654, 419721..419819,420289..420364,420442..>420531) /locus_tag="ANCCEY_07646" /product="prenyltransferase and squalene oxidase repeat-containing domain protein" CDS join(417514..417670,419158..419254,419484..419654, 419721..419819,420289..420364,420442..>420531) /locus_tag="ANCCEY_07646" /inference="protein motif:HMMPfam:IPR001330" /note="KEGG: cbr:CBG04611 4.9e-81 Cbr-tag-114; C. 
briggsae CBR-TAG-114 protein; K05954 protein farnesyltransferase subunit beta" /codon_start=1 /product="prenyltransferase and squalene oxidase repeat-containing domain protein" /protein_id="EPB73276.1" /db_xref="InterPro:IPR001330" /translation="MSYIQGLDASRTWMCFWGLHSLNILGAVSSHQQKAEIIAFLKAC QHPDGGYGGGPGQYAHLAPTYASVMALASLQMEEALESINLETLSRFLHRMKQPDGSF TMHDGGEADIRGTYCALSVAALCGIMTDALRDGAAEWIIKCQTYEGGFGGEPSAEAHG GYAYCAVASLVILDRYRLADSEMLLQWLAKRQMRFEGGFQGRTNKLVDGCYSFWQAAN FPLIEGEMAREV" assembly_gap 427383..427482 /estimated_length=unknown /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 430894..431334 /estimated_length=441 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 437898..437997 /estimated_length=unknown /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 439313..440650 /estimated_length=1338 /gap_type="within scaffold" /linkage_evidence="paired-ends" assembly_gap 442123..442572 /estimated_length=450 /gap_type="within scaffold" /linkage_evidence="paired-ends" CONTIG join(ASSL01028927.1:1..3553,gap(1913),ASSL01028928.1:1..3556, gap(535),ASSL01028929.1:1..3245,gap(1276),ASSL01028930.1:1..2985, gap(216),ASSL01028931.1:1..2976,gap(287),ASSL01028932.1:1..4287, gap(unk100),ASSL01028933.1:1..7418,gap(576),ASSL01028934.1:1..843, gap(261),ASSL01028935.1:1..749,gap(654),ASSL01028936.1:1..1076, gap(unk100),ASSL01028937.1:1..2736,gap(324),ASSL01028938.1:1..2641, gap(1751),ASSL01028939.1:1..1032,gap(1132),ASSL01028940.1:1..987, gap(3482),ASSL01028941.1:1..5305,gap(909),ASSL01028942.1:1..1683, gap(1118),ASSL01028943.1:1..32354,gap(unk100), ASSL01028944.1:1..2468,gap(117),ASSL01028945.1:1..1281,gap(1238), ASSL01028946.1:1..1711,gap(1888),ASSL01028947.1:1..9831,gap(388), ASSL01028948.1:1..2300,gap(1616),ASSL01028949.1:1..1461,gap(760), ASSL01028950.1:1..3182,gap(267),ASSL01028951.1:1..12374, gap(unk100),ASSL01028952.1:1..5562,gap(897),ASSL01028953.1:1..2278, 
gap(873),ASSL01028954.1:1..633,gap(4151),ASSL01028955.1:1..1424, gap(5041),ASSL01028956.1:1..21869,gap(121),ASSL01028957.1:1..1071, gap(1361),ASSL01028958.1:1..862,gap(121),ASSL01028959.1:1..1954, gap(3454),ASSL01028960.1:1..10296,gap(1487),ASSL01028961.1:1..1861, gap(1516),ASSL01028962.1:1..808,gap(673),ASSL01028963.1:1..1774, gap(973),ASSL01028964.1:1..957,gap(637),ASSL01028965.1:1..3339, gap(unk100),ASSL01028966.1:1..10003,gap(unk100), ASSL01028967.1:1..605,gap(1104),ASSL01028968.1:1..8212,gap(unk100), ASSL01028969.1:1..1475,gap(902),ASSL01028970.1:1..5945,gap(unk100), ASSL01028971.1:1..2165,gap(1106),ASSL01028972.1:1..6918, gap(unk100),ASSL01028973.1:1..2772,gap(unk100), ASSL01028974.1:1..8082,gap(147),ASSL01028975.1:1..2856,gap(473), ASSL01028976.1:1..642,gap(505),ASSL01028977.1:1..1954,gap(unk100), ASSL01028978.1:1..4382,gap(159),ASSL01028979.1:1..22016,gap(482), ASSL01028980.1:1..1822,gap(unk100),ASSL01028981.1:1..4221, gap(1648),ASSL01028982.1:1..15728,gap(unk100), ASSL01028983.1:1..1594,gap(2951),ASSL01028984.1:1..6862,gap(787), ASSL01028985.1:1..15703,gap(unk100),ASSL01028986.1:1..4373, gap(536),ASSL01028987.1:1..2842,gap(276),ASSL01028988.1:1..2340, gap(unk100),ASSL01028989.1:1..8266,gap(unk100), ASSL01028990.1:1..1833,gap(unk100),ASSL01028991.1:1..757,gap(902), ASSL01028992.1:1..1853,gap(3631),ASSL01028993.1:1..560,gap(213), ASSL01028994.1:1..3268,gap(unk100),ASSL01028995.1:1..724, gap(unk100),ASSL01028996.1:1..2293,gap(unk100), ASSL01028997.1:1..1891,gap(685),ASSL01028998.1:1..10440,gap(1458), ASSL01028999.1:1..1174,gap(917),ASSL01029000.1:1..2951,gap(3403), ASSL01029001.1:1..3898,gap(780),ASSL01029002.1:1..3090,gap(452), ASSL01029003.1:1..3353,gap(576),ASSL01029004.1:1..12691, gap(unk100),ASSL01029005.1:1..3411,gap(441),ASSL01029006.1:1..6563, gap(unk100),ASSL01029007.1:1..1315,gap(1338), ASSL01029008.1:1..1472,gap(450),ASSL01029009.1:1..1678) //