LOCUS DOYF01000103 16228 bp DNA linear ENV 10-SEP-2018 DEFINITION TPA_asm: Ruminococcus sp. isolate UBA9236 contig_3427, whole genome shotgun sequence. ACCESSION DOYF01000103 DOYF01000000 VERSION DOYF01000103.1 DBLINK BioProject: PRJNA417962 BioSample: SAMN08019450 Sequence Read Archive: SRR6486461 KEYWORDS WGS; Third Party Data; TPA; TPA:assembly. SOURCE Ruminococcus sp. (metagenome) ORGANISM Ruminococcus sp. Bacteria; Firmicutes; Clostridia; Clostridiales; Ruminococcaceae; Ruminococcus. REFERENCE 1 (bases 1 to 16228) AUTHORS Parks,D.H., Chuvochina,M., Waite,D.W., Rinke,C., Skarshewski,A., Chaumeil,P.A. and Hugenholtz,P. TITLE A standardized bacterial taxonomy based on genome phylogeny substantially revises the tree of life JOURNAL Nat. Biotechnol. (2018) In press PUBMED 30148503 REMARK Publication Status: Available-Online prior to print REFERENCE 2 (bases 1 to 16228) AUTHORS Parks,D.H. TITLE Direct Submission JOURNAL Submitted (04-APR-2018) School of Chemistry and Molecular Biosciences, University of Queensland, Chemistry Bld, Cooper Road, St Lucia, Brisbane, Queensland 4072, Australia COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: CLC de novo assembler v. 4.4.1 Expected Final Version :: yes Genome Coverage :: 6.11x Sequencing Technology :: Illumina HiSeq 2500 ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 05/03/2018 14:14:32 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.5 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 2,857 CDS (total) :: 2,834 Genes (coding) :: 2,415 CDS (coding) :: 2,415 Genes (RNA) :: 23 rRNAs :: 1, 1 (5S, 16S) complete rRNAs :: 1 (5S) partial rRNAs :: 1 (16S) tRNAs :: 18 ncRNAs :: 3 Pseudo Genes (total) :: 419 Pseudo Genes (ambiguous residues) :: 260 of 419 Pseudo Genes (frameshifted) :: 215 of 419 Pseudo Genes (incomplete) :: 48 of 419 Pseudo Genes (internal stop) :: 22 of 419 Pseudo Genes (multiple problems) :: 121 of 419 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..16228 /organism="Ruminococcus sp." /mol_type="genomic DNA" /isolate="UBA9236" /isolation_source="metagenome" /db_xref="taxon:41978" /environmental_sample /note="metagenomic; derived from metagenome: metagenome" gene complement(95..1297) /locus_tag="DEP65_06095" CDS complement(95..1297) /locus_tag="DEP65_06095" /inference="COORDINATES: protein motif:HMM:PF00251.18" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="HCB95231.1" /translation="MKNYMRIEFKISDENVFLYSIKNDMKTFECNISNNEINCNVYYD FNEKPLSLKSDIKRNDTVKVILMPFRIELWVNSVLKDEEWPAGNCLFTKEDEIKSNIS ISVSEHPYAKKIQPAVIGTFENANGWKPEENVFVGDCMPYVCDNRYHVLYLKDRHHHR SKWGNGAHQWEHISTHDFKTWEIHPMAIEITESYEGSICTGSWIKKDNTDYLFYTVRM ADNSPAPIRRSISKDGYHFDKDNDFSIILSKKYNAKSARDPKVILADDCLYHMFLTTS LEAEQKGCLAHLISKDLYKWEETENPIYISDSSEQPECPDYIKYNGYYYLIFSLRGRA HYMISQKPLDGWIMPENPIIPCDLVPKGAVWNDKIIFTGFKPINAYAGTMTFAAAVND ENGILIFE" gene complement(1297..3033) /locus_tag="DEP65_06100" CDS complement(1297..3033) /locus_tag="DEP65_06100" /inference="COORDINATES: protein motif:HMM:PF02782.14,HMM:PF13411.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="HCB95232.1" /translation="MFINDVVKRTGLTKKAIRFYEDKGLLSVQRQINGYRSYSEDNIL TLKKIKMLRSCGVSVSDIKLLFGNMITIEELLIKRKKEVENAYENYSSMFEDILNVFQ NYKKGQYDLDVQFNETTYNSFIPSDTLILGIDIGTTSISAVVIDIENKTNVETYTLDN AAGIKSSSPCFNEQNPRIIHDKVIKLAELIVTGYPGIKAIGVTGQMHGIVYIDEKGDA VSPLVTWQDKRADEMLENSCTYCEKIFEVTSKRIYTGFGFATHYYNLLNELVPSSAYS FCSIMDYVVMKLTNQYRPLIHVSVAASFGLFDTKTLSFDKESILKLGMDNIALPDVTD EFFIAGNYKKIPVSVAIGDNQASFIGAVENLHETVLVNIGTGSQISFMSDFCVTDDRL ELRPLFKDKYILCGSALCGGASYALLESFFRSYIKASNSDNSLQYDILNKLAYDAYKQ NKKPLAVNTLFSGKRSNPNIQGSILNITRENFTPGQLALGFITGICKELYDFMPINPE KKLIVASGNAVRKIPVMKQVIEDMFGLPVQISYNNEEASVGAALFSAIAANVVKNIYE ASAFISYREDIK" gene complement(3661..3912) /locus_tag="DEP65_06105" CDS complement(3661..3912) /locus_tag="DEP65_06105" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="HCB95233.1" /translation="MHSERVEVERKFSLAKRKCGIGLIVTRLKETTCHCLAMSVLLLN LRKISKVLFTKIIIMFQNTLKSDNPSLKIHSISKMAFIQ" gene complement(3899..4093) /locus_tag="DEP65_06110" /pseudo CDS complement(3899..4093) /locus_tag="DEP65_06110" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006907146.1" /note="incomplete; partial in the middle of a contig; missing stop; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="IS5/IS1182 family transposase" gene complement(4231..5232) /locus_tag="DEP65_06115" CDS complement(4231..5232) /locus_tag="DEP65_06115" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_004611682.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="HCB95234.1" /translation="MDDELRDNLIRSGRDARYDECAKKLLSDKRILAWILKNSAVEYE NCTIDEIVGYIEDTPEISSVAVDAGETAQSVKGSANEDIVLNEGKITYDLRFEALAPG TDGELIQLIINIEAQNAFKPNYPLLKRAVYYCSRMISAQKGMEFFNSEYNKIKKVYSI WVCTNPPETHKDTITRYRISEENIIGSVSEDRQHYDLMNIVMLCLGENQKSERNALGM LETLFSTDDNIKDKIEILDKTFDLKMTHELETEVSEMCNLSEGVYAKGLDKGLAQGMD KGMALGMDKGMALGMDRSLLESIRNIKQSLNISTEKAMDILKVPVDKRADFERQLGQ" gene 5496..6446 /locus_tag="DEP65_06120" CDS 5496..6446 /locus_tag="DEP65_06120" /inference="COORDINATES: protein motif:HMM:PF00271.29" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="HCB95235.1" /translation="MRFFCVVHLQGFEXTNYDRIIQQIKNTPQNEKWIIFVSSKSTWK TLSKNIAEQTGKKIRFISAETKKRSIWNEIIENSSFRGDILITTNVPDNGINITDERV RHIVIPFCAKSDFIQMLGRKRIVNTESVNVYAEIPSEAKLRSYIKETSHKTEVLHNIL SHTDNSGIIIRNLQNLWLNCDKNINKLFFIDNECRLCPNTAAYTKLISIQSFYEEVYQ NICDTDKYIKTISSWIPGELSPDITFLDSVSGKSNRTIDEFNDCYKNKLLEPNSIYDE FIRIYKSECSKIYSGDELKSKLNLKKAKNIRKSTINNAIL" assembly_gap 6748..6767 /estimated_length=20 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(6770..7141) /locus_tag="DEP65_06125" CDS complement(6770..7141) /locus_tag="DEP65_06125" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="HCB95236.1" /translation="MNSCALNNKQTRSFSFMNYIPEIEVTEGGTLVTGMDQLAAGDLL NVKLSGFKPDDEISASAFVAQYSGDRLKAVSMVDGSRDSSIAGNEIALSQQVAEDVDK IKVIYMNSLNYSSLCASYDIK" assembly_gap 7194..7214 /estimated_length=21 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(<7215..8436) /locus_tag="DEP65_06130" CDS complement(<7215..8436) /locus_tag="DEP65_06130" /inference="COORDINATES: protein motif:HMM:PF00657.20" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="HCB95237.1" /translation="MIFRAVLMTAHYRKVTAAINNSLEFITAESRTFNWPNAGFEFEF SGTSAEVYVDTAQPIDSTAYNMVYFNAAVLDENDEVISVNRMALKNGWNTIYTSSAAD KVKIMLVRSSEACRGTIRMSKIRTDAAPSAAAPRQKQIEFIGDSYTAGFGNSPELSEA TYYCAQNTDNWNSYTGFVARHYKADNTVIAYQGKGVCVNVGGDSTETMSQQFNYSDIV VPSKNMSTRETWNFMKYRPQVVVVWLGTNDNTGMSKAGIENPTQYFQDNYVKLLENIR AKYPFASIICCSRTNWGYPEQVTAAVEQMGGEAKRFYNLRLTSFKASSFGHPNVEEDK AIADELIAKIDSIRDVWHTVDIGSDETVSIIANYNTGVINVIGKTPEPGDQVAVYVLH HGETEFSADGAAYID" gene complement(8440..9066) /locus_tag="DEP65_06135" /pseudo CDS complement(8440..9066) /locus_tag="DEP65_06135" /inference="COORDINATES: protein motif:HMM:PF13408.4,HMM:PF14287.4" /note="too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" assembly_gap 8766..8813 /estimated_length=48 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene complement(8973..9737) /locus_tag="DEP65_06140" /pseudo CDS complement(8973..9737) /locus_tag="DEP65_06140" /inference="COORDINATES: protein motif:HMM:PF00239.19,HMM:PF07508.11" /note="too many ambiguous residues; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="hypothetical protein" gene complement(9796..10017) /locus_tag="DEP65_06145" CDS complement(9796..10017) /locus_tag="DEP65_06145" /inference="COORDINATES: protein motif:HMM:PF00239.19" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="HCB95238.1" /translation="MKQKQQYNKITALYCRLSRDDEFNGDSVSIQTQKAMLKHYADEN GFGNCQYFIDDGYSGTNYNRPDFQRLLXD" gene 10255..11694 /locus_tag="DEP65_06150" CDS 10255..11694 /locus_tag="DEP65_06150" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="HCB95239.1" /translation="MKICDFCSTAKILMDFIGESKNINQIDFMYELFKDFMESDEAKD FDFDNGLVCRWLNGTAKLSPKITAYYSALGNLEAMAIDIEENILPLFYDKDMAVTELY DLLMSDTTVSETKKQELADNYPYKDDADISNFISKLVFFGMERKFIKRDANTKKLIAS GALSPQTKDYIYSLVPKPCKHFCGRDNELEKLHTMLEDENKIFIQGIAGIGKSEFVKM YAQKYKKEYTNILYFSYGGSLKQMITDCDFADDSLTDGKNILLKKHNRFLRSLKEDTL IIIDNFNITASDDELFDVIMKYRCKILFTTRSRFKDYTYFELKEMPLESLLTLSEYFY ADTRSNTNVVENIINELHYHTLSVELAARLLTSGILEPCRLLTELQSTKSVLHTDDKI NIVKDGTSSKATYYEHIHKLLSLIGLSDKAVKIMRCMTLIPYDGINPRMLAKWIGLNN LNTINELIEYGFXTITEKSRSIHLYRKLP" gene 11793..12548 /locus_tag="DEP65_06155" CDS 11793..12548 /locus_tag="DEP65_06155" /inference="COORDINATES: protein motif:HMM:PF13424.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="HCB95240.1" /translation="MFKIAENTIDIANNDDTERYKLFLKDVFAYMEKYAYRSGMELII SELQSLVDTNEDKAILLDYKAAYEHICNKNHKKALQYEQRAVKLCEEIATVNPHLAAN IYGNIGGLYHTENQLDKAKYYMELAYQTLVDSGIDFTNDAVIQVCNYANLAASMGEPR KAIQALKRCANAVKEYNSENSSDYANLAWDIGCIYTQMRDKDTAVMYFKTALRIYTDL WANEPELLQMKLTELKNMAAVYGVNIKNLISAN" gene 12659..13243 /locus_tag="DEP65_06160" CDS 12659..13243 /locus_tag="DEP65_06160" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_018211855.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="HCB95241.1" /translation="MIKLNTEYLKQIFTKYDYIMTTAQLNLEKLYYRDIQQMLKKGMI EKVKRGYYHWIEDNDCKEVVIINKLFPDAILCMETALFYYSYSDRNPAEWHIAIDKNA SRQRTKIDYPFVRAYRIESDLLLIGETKXRIYDRDRTMCDVLKNMNKMDREIFNKSIQ GYVKDPQKNIPNLMRYAKELRVQKRVKDLIGVWL" gene 13243..14160 /locus_tag="DEP65_06165" CDS 13243..14160 /locus_tag="DEP65_06165" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_006574953.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="nucleotidyltransferase" /protein_id="HCB95242.1" /translation="MADIAASVLAKLKNKAKASGISYQQCLQLFMQEEFLRKLSKSGY DNFLVLKGGLFIYTITNFESRATIDVDFLLRGYSNSIDDVKGLICKIIDTPTGNNYIN MTAKGFEEISPQRKYHGVSTQIIGQIKNIRVPFNVDIGVGDVIVPKAEQRKINTQLPD FEAPFIKTYSLESTIAEKFDTILQRFELTSRMKDFHDIYYLARTFDFNVARLQKAIFE TLRRRGTPYDKDSFKRVISLADDIDMQKRWKYFLKNIKDDTLEFSVVIDEIQTFLEPV FEAIVNEDEWQKVWNFITKWDKKERSLES" gene 14157..>14406 /locus_tag="DEP65_06170" CDS 14157..>14406 /locus_tag="DEP65_06170" /inference="COORDINATES: protein motif:HMM:PF14130.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="HCB95243.1" /translation="MSKVLTVEQREQAGSDSYNRFEYQVHWIVCHIISKLQEDAECIV FCEFHDDMAEFSPNNQQYQFFQIKTKEDSSDWTIAEMSK" assembly_gap 14407..14501 /estimated_length=95 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene <14502..15242 /locus_tag="DEP65_06175" CDS <14502..15242 /locus_tag="DEP65_06175" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="HCB95244.1" /translation="SNNDFDKEVLLWQAVIEDGKKLQEENSVLYQKIKNRLKDEFTND MPSNFDSVFDAFIQNTFVHKSDLQLAAYENQTKGEFFSHLADKNIPTNTANLILQQLL NDVRKKSKEKIKVPISKKSLVEKKGIDVAQIGKKIDGNIKNSGNYNAFHDYLITQALS DNDICRIEAAKTLHDAKWLDVKDVRYQEIVIILRKTISTYCESFKANEFSGELKRLCI QELQKHNLLSDSLDKSLIEVLYYEQKFS" gene 15326..>16228 /locus_tag="DEP65_06180" CDS 15326..>16228 /locus_tag="DEP65_06180" /inference="COORDINATES: protein motif:HMM:PF13476.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="HCB95245.1" /translation="MIRKLIVISQSESRSLEVPFEKGLNIILGGNKTGKSSIIKSIFT TLGCECKRVEADWKKLVSTYLLFFKYGERQFCIVRQDKKFQIFENINNDFSCIIETNA FHEYSNCLMDILEIKMPCISKDGKQFNITPPLLFRFQYIDQDEGWSKIADSFKNVAYI KDWKANTNKYVCGYLDDSYYALQAQKAEHILEKDDKKKELNYNQSFVSRITSTLTQIE NIESVEEVTTDIESLLAKAEELRKMQFSYNAEMTVLENDIYINQHKLHIVEHNLIETK KDIEYAMTQEDELICPFCGTIYSNG" BASE COUNT 5015 a 3111 c 2899 g 5003 t ORIGIN 1 cacctgcata gcaggtggtt tatttttatt tgtgattaca aaccaaagtt gtcctaaggg 61 ctcaggacaa ctttggcatc attataacta aaaatcattc aaatattaaa attccgtttt 121 catcatttac tgcagccgca aatgtcattg ttccggcata agcgtttatt ggtttgaaac 181 ctgtaaatat aatcttgtca ttccatacgg cacctttagg aaccaaatca catggaatga 241 tagggttttc aggcattatc caaccgtcta aaggtttttg ggatatcatg taatgcgctc 301 tgcctcttaa actgaaaatt aaataatagt aaccgttgta ttttatataa tcaggacatt 361 ctggctgttc tgaactgtct gaaatataaa tcggattttc tgtttcttcc catttatata 421 agtcttttga aatcaggtgt gcaagacaac ctttttgttc agcttcaagg cttgtggtta 481 aaaacatatg atacaagcag tcatccgcta atatcacttt tggatctctg gcgctttttg 541 cgttatattt ctttgacaat atgattgaaa aatcattatc cttatcaaag tgatatccat 601 ctttagaaat gcttctgcgt atcggagccg gagagttatc agccattctc actgtataga 661 aaagatagtc ggtattatct ttttttatcc atgacccggt acaaatcgag ccctcgtacg 721 actcggttat ttcgattgcc ataggatgta tttcccatgt tttaaaatca tgcgtggaaa 781 tatgttccca ttgatgcgct ccgttccccc acttgcttct gtgatgatgt cggtctttta 841 aatataggac atgatatctg ttatcacaca cataaggcat acaatcacca acaaaaacgt 901 tttcctccgg tttccagccg ttcgcatttt caaatgttcc tataacggca ggttgaattt 961 tttttgcata tggatgttcg gaaacagaaa tgctgatatt gctttttatc tcatcttctt 1021 ttgtgaacag acaatttccg gcaggccatt cttcatcctt gagtacggaa ttaacccaaa 1081 gctcaatgcg aaacggcata agtataactt tgacggtatc atttcttttg atgtcagatt 1141 tcaaagacaa cggtttttca ttaaaatcat aatatacatt gcaatttatt tcattatttg 1201 aaatattaca ttcaaatgtt ttcatatcat ttttaatcga atacaaaaaa acattttcgt 1261 cagatatttt aaactctatt ctcatataat ttttcatcac tttatatcct ccctataact 1321 tataaatgca gatgcttcat aaatattttt tacaacattt gcggctattg ctgaaaataa 1381 tgccgcaccc acagacgcct cttcattgtt gtatgatatc tgaacgggca atccgaacat 1441 atcctcaatc acttgcttca taaccggaat ttttctcaca gcattgccgg atgcaacaat 1501 taactttttt tcgggattta tcggcatgaa atcatatagc tctttgcaaa ttccggttat 1561 aaaacccaaa gcaagctgac cgggtgtaaa gttttctctt gtaatattaa gtatagagcc 1621 ttgtatattc ggattgcttc gtttgccgga aaataatgtg ttgacagcaa gcggtttttt 1681 attctgttta tatgcatcat aagccaactt atttaaaata tcatattgca aagaattgtc 1741 cgaatttgat gcttttatgt aacttctaaa aaaactttca agcaatgcat aagatgctcc 1801 tccgcaaagc gcagacccgc ataaaatata cttatcttta aaaagcgggc gaagttccaa 1861 tcggtcatcc gttacacaga aatcagacat aaaagaaatc tgacttcccg ttcctatatt 1921 aaccaataca gtttcatgca gattttcgac agctcctata aagcttgctt gattatctcc 1981 gatggcaacc gataccggaa tctttttgta atttcctgcg ataaaaaatt catctgtaac 2041 atccggtaaa gctatattat ccattccaag ctttaaaatt gactctttat caaatgacaa 2101 ggtttttgtg tcgaaaagcc caaaacttgc cgcaaccgaa acatgtatca aaggacggta 2161 ttgatttgta agcttcatta caacgtaatc cattatgctg caaaagctgt atgcgcttga 2221 cggcacaagc tcatttaaca gattataata atgagtggca aatccaaatc ccgtataaat 2281 tcttttgctt gtaacttcaa aaattttttc gcaatatgta cagctgtttt caagcatttc 2341 gtcagcgcgc ttatcctgcc aggtaaccag aggagatacg gcatcccctt tttcatcaat 2401 gtacacaatc ccatgcatct gaccggtaac tccaatggct ttaatgccgg gataacccgt 2461 aacaatcagt tctgcgagtt taataacctt atcatgaata atccttggat tctgttcgtt 2521 aaaacaaggg cttgacgatt ttatgcctgc agcgttatct aatgtatacg tttctacatt 2581 tgttttgttt tctatatcaa taacaactgc acttattgag gttgttccta tatctatccc 2641 taatattaat gtatcggatg gtataaaaga attatatgtt gtttcattaa attgtacatc 2701 taaatcatac tgtccttttt tataattttg aaaaacattt aaaatatctt caaacatact 2761 tgaatagttt tcatacgcat tctcaacctc cttttttctt ttaataagca attcttcaat 2821 tgtaatcata ttaccaaaca gaagctttat atccgaaacc gaaactccgc atgaacgcag 2881 catttttatt ttttttaaag tcaaaatatt atcttcggaa tacgaacgat agccatttat 2941 ttgtctttga acagatagca gtcctttgtc ttcataaaaa cgaatggctt ttttagtcag 3001 ccctgttctt ttcacaacat cattgataaa catatcatac aatcgccttt ctgtaagtag 3061 aaatcttata ttatatatta taaaccttca ccttgggtga atgtcaatac tttttaaaat 3121 atttttaatt atattgcagt tttcattact ttaggcaccc ttatgtcaaa ctataactaa 3181 aaatttcaac tcatttctgt catatcaata tccctgcaaa agtcttgcat cttccatata 3241 acatcaattc ttttgccagg ataaaccaat accttatcaa tgagcatatc agccaatgct 3301 tgtgttaagg tgttttcctt tttcacatct tttgcgattt gaataaattt ttctcttgtt 3361 tagttatcgt tttttgtttg ctcaagttca ctacagctac ggtcatattg taattcataa 3421 tcactcatat ctgcatcaca gttttctttc agcatttgat attcgtcgag tgtaattttc 3481 cgtgtcagaa gttgctcata tagatgctgc ttttctttcc ggcaattctc tatctgtttt 3541 tcaagttcaa ccgtctgctc tgtctttgta ttgaagatta tctaactcgg ttataccatt 3601 aatatttaat atgatttcgg cttgctttga aattatatca aataatacct tatgtaatgc 3661 ctactgaata aatgccattt ttgaaatgct gtgaattttc agcgagggat tgtcactttt 3721 cagcgtgttt tgaaacatta ttataatttt cgtaaaaaga accttgctga tttttcgtag 3781 gttcagcagt aatacggaca tggcaaggca gtggcaagtc gtttctttca gtcttgtcac 3841 aataagtcca attccgcatt tgcgttttgc aaggctgaat ttgcgttcaa cctctacacg 3901 ctccgagtgc aagacgaagt ggctttgcaa cattcccctt tcgattagtg aaaagctgtg 3961 cataacgctg ttctatttca tcccatggaa tcctttcggc tttctttacc caacgattat 4021 ctggattaag tttcattccg ataggtgtat taaaatctgt gaaactgatt tgcttgcttt 4081 taaatttata catatttgta tctcccataa aaacaaaagt gcaaggtttt ttgccgtttt 4141 cggcattttt ccttgcactt attataccat aaatagcctg aaatgtcagt agttttgctg 4201 ttttttattt attcagtagt cattaaaata ttattgtcca agctgccttt caaagtcggc 4261 acgcttgtcg acaggcacct ttagtatatc cattgccttc tcggttgaaa tgttaaggct 4321 ctgcttgata ttgcgaattg attcgagaag agaacgatcc ataccgagag ccataccttt 4381 atccataccg agagccatac ctttatccat gccttgcgcc aagcctttgt caaggccctt 4441 tgcatatacg ccttcactta agttacacat ttcagatacc tccgtttcca attcatgcgt 4501 cattttcaaa tcaaatgtct tatccagaat ctctatcttg tcttttatgt tatcatcggt 4561 tgaaaacagc gtttcgagca ttcccagtgc gtttctttcg gatttttgat tttcacccaa 4621 gcacagcata actatattca tcaagtcata gtgttgtctg tcttcgctca cactgccgat 4681 tatgttttcc tctgatattc tgtacctcgt tatcgtgtcc ttgtgcgtct ccggcggatt 4741 tgtacaaacc catatcgagt ataccttctt tattttatta tattcggaat tgaaaaattc 4801 catccctttt tgcgccgaaa tcattcggct gcaatagtat accgccctct ttaacaacgg 4861 atagttcggc ttaaaggcat tttgcgcctc gatgtttatt atcagttgaa taagctcgcc 4921 gtctgtgccc ggtgccagcg cctcaaagcg taaatcatat gtgattttac cctcatttaa 4981 cactatatcc tcgttagccg agccctttac ggattgcgcc gtctctcctg catccaccgc 5041 caccgagctt atctcaggtg tgtcctctat atagccgact atctcatcaa tagtacaatt 5101 ctcgtactcg acagcactgt tcttcaaaat ccacgcaaga atacgcttgt cggagagcag 5161 cttctttgca cattcatcat atcgcgcatc cctgccgctc ctaatcagat tatctctaag 5221 ctcatcatcc atcggactca ccttctccca cacttatatt ttaccacatt tatattataa 5281 aagcaagatt tttttaatat atcgtattta taaataaatt gcaatataaa atgtccacaa 5341 atattagggt cagagaatac tgcgccgggc tttatgatta aaaatctgac aacattttga 5401 caacgttttt gctgtcggat gagattaaat aacctcacat gagctaacgt ataaacataa 5461 agaaaaccgc ataaacacgg gattttccca agtttatgcg gttcttttgc gtggtacacc 5521 ttcagggatt cgaanntacg aattatgacc ggattataca gcaaataaaa aacacgcctc 5581 aaaacgaaaa atggattatt tttgtttcat ctaaaagcac ctggaaaacc ctttcaaaaa 5641 atatagcgga gcaaaccggc aaaaaaatac gattcatttc tgctgaaacc aaaaaacgca 5701 gcatatggaa tgaaataatt gaaaattcct ctttccgagg tgacattctg attacaacca 5761 atgttccgga taacggcata aatattacag atgaaagagt cagacacata gttataccat 5821 tttgcgcaaa atccgatttt atacaaatgc tcggcagaaa gagaatcgta aacacagaat 5881 ctgtaaatgt atatgcagaa atacctagcg aggcaaagct acgaagctac atcaaagaaa 5941 catcacacaa aacggaagta ttacataaca tactgtccca tactgacaat tccggaataa 6001 taataagaaa tctgcaaaat ctttggctta actgcgataa aaatattaac aagctctttt 6061 ttatagacaa tgaatgtcgc ctatgtccaa acactgctgc atatacaaag cttataagca 6121 tccaatcttt ttatgaagag gtttatcaaa atatatgtga cactgataaa tatattaaaa 6181 ccatatcttc gtggattccg ggcgagttgt cacccgatat cacatttctc gacagcgtgt 6241 ccggcaaaag caatcggacc atagacgaat ttaatgattg ctataaaaac aaacttctcg 6301 agccaaacag catttatgac gaatttattc gcatctacaa atcagaatgc agtaaaatat 6361 actcgggtga tgaattaaag tcaaagctca acttaaaaaa ggcaaaaaat attcgaaaat 6421 caaccattaa caacgcgatt ttataagctc ctttgttatg acaataatag ccttttgtaa 6481 aaaccagaaa agtcagaatt gttattttct gccttaaaaa attagagcag tacaaacgat 6541 ttatgcacga aagcactgac agaaaagacg aacccaaact cacacttgaa catatttctg 6601 cacttgaaga aattatcaag attgtcgagg aacacgagta gtccataaag gtaaacgttc 6661 tcggacatat gcctgagata tattcaaata cagactgagc cgtatcaccc ccacagaggc 6721 aatgacgcgg ctcagtctga aacgctcnnn nnnnnnnnnn nnnnnnnaat tatttaatat 6781 cgtaggatgc gcagagcgaa gaatagttca aggagttcat atagatcact ttaatcttgt 6841 ctacatcctc tgctacctgc tgtgacagcg caatctcgtt tcccgctatc gaagaatcgc 6901 ggctgccgtc caccatgctg acagccttta aacggtctcc gctgtactgt gcaacgaatg 6961 ccgaggctga tatctcatca tccggcttaa agccgctcag tttaacatta agcaagtctc 7021 ctgccgcaag ctgatccata ccggtcacta aagtgccgcc ttcggtaacc tctatttccg 7081 gaatgtagtt cataaacgaa aagcttcttg tctgcttatt gttcagtgca cacgaattca 7141 tcgcaagagt gtattcgccc gcaagcttat ttacagtaaa ctcaaaggag tagnnnnnnn 7201 nnnnnnnnnn nnnnggtcta tatatgctgc gccatcagca ctgaattctg tctcgccgtg 7261 gtgaagcaca tacaccgcaa cctgatcgcc cggctctggc gttttgccga taacattgat 7321 aactccggta ttgtaatttg caattattga cactgtttca tccgagccga tatcaaccgt 7381 gtgccacaca tctctgatgg agtcaatctt ggcaatcagc tcgtctgcta tcgccttatc 7441 ctcttcaaca ttggggtgtc cgaagctgct tgccttaaag ctcgtcaaac gcagattgta 7501 gaatcgcttt gcttctccgc ccatttgctc aactgcggcg gtaacctgct ccggatagcc 7561 ccagttggtt cttgaacagc agattattga tgcaaacgga tacttcgccc gaatattctc 7621 cagaagcttt acataattat cctgaaaata ctgtgtcgga ttctctatcc ccgcctttga 7681 catacccgtg ttgtcgtttg taccgagcca tacaaccaca acctgaggcc ggtacttcat 7741 gaagttccac gtctcacggg tggacatatt tttcgaaggc accacaatat ccgaataatt 7801 aaactgctgt gacatagttt cggtgctgtc accgcccaca ttgacgcaca cacccttgcc 7861 ttgatacgca ataacggtgt tgtcagcttt gtaatgacgc gcaacaaagc ctgtgtacga 7921 attccagtta tcggtgttct gtgcacaata gtatgtggcc tccgaaagct cgggcgagtt 7981 gccgaagccg gcagtgtagc tgtcacctat aaattcaatc tgcttctgcc tcggtgcagc 8041 tgcggagggc gcggcgtctg ttcgtatctt gctcatgcgg atagttccgc gacacgcctc 8101 ggacgagcga acaagcatga tttttacctt atcggcagca gatgaggtgt atatcgtatt 8161 ccagccgttt ttcaaagcca ttctgtttac cgagataacc tcgtcatttt catcaagcac 8221 cgcagcatta aagtatacca tattgtacgc tgttgaatca atcggctgtg cagtatccac 8281 atatacctct gcggaggttc ccgaaaattc aaactcaaaa cccgcattcg gccagttaaa 8341 cgtgcggctc tccgccgtaa taaattcaag tgagttgttt attgcggcgg taactttcct 8401 gtaatgggcg gtcatgagaa ctgccctaaa aatcatatgt tatgaaccga taccgacatt 8461 cccgataaat cgatagtaga tctcaatcgg cattactttg ttaccgtcct cgtctatctg 8521 ttcatggata acgattttct caatcaaggt attcagtaaa accgcgtcaa gctcggttat 8581 cgttgtgtat tgttcgagta aatccgcaaa ttcctgtacg gatttgctct gctttgtata 8641 atgagtgagc tttatttgaa gctcattgta tcgctctttt agctgcgctt cctctttttc 8701 aatgtttgcg gacatcgcga aataccgttc ctcggaaatg cgtttaaaaa ccttatcttc 8761 gtaaannnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnncatcgtg 8821 tatttggctg tcagaacgat taataaaccg ctctatgtat aatgacctat cattgcttga 8881 aagcgcgata tggcggttta tatcttccaa tacaatggcc tttaaatcat cagcatttat 8941 tgaatgaggt gtgcagctgc ctctaccatg ccttacatac ctgttgcatt tatatttcgg 9001 agtatgatta acgctgtgtt cctttttgta tacaattgtt tttccgcact ctccgcatac 9061 aagcaaaccc ttaaagatat tatcgggatt ggcttcgtta aaatgcttct taacgctgat 9121 acgcttctgt nnnnnnnntc tcggctgaca agcggctcat gtgtattcgg aacggttatc 9181 cactcgcttt ccggtcttac aataacgcgc ttatccttaa atgattttga cgtgcgccgc 9241 tggctgacca tatcgcccat ataaacagga ttttgaagaa taacctgtac gccgcgcttt 9301 atccaatcat acggaaactg tatgccctcc gttttctgta gaacgccgtt tttatccctt 9361 gtatacgctc cgggagttgg tatatgctcc tgtttaagag tatatgcaat ctcggtgcag 9421 gtcttaccct ccaatgccat acggtacatg cgctgtacaa taggagcgtg ttcatcggga 9481 atgagcttgt gcttatcctc cggcgatttc atatagccgt atggagcgcg tctgcccgtg 9541 tattcgcctt tcagagcctt tgttctgtac gccgaacgaa cttttttgga aatatccttt 9601 gcataccatt cgtttatgat gttcttgaac ggagcaaatt cgttctcacc gttatcactg 9661 tcaataccat cattgactgc gataaatcta acattatgtt ttgggaacaa cacctcggtg 9721 taatatcccg tctgcaaata ctctctgcct aatcgggata aatcctttac tatgaccgtt 9781 ccgaccttgc ctgcttcaat ctnngaggag ccgctgaaaa tcgggacggt tgtagttagt 9841 gcctgagtat ccgtcgtcta tgaagtattg gcagttaccg aaaccgtttt catctgcgta 9901 atgttttagc attgcctttt gtgtctgtat gctcacgctg tcgccgttaa attcatcatc 9961 tctgctcagt ctgcaataaa gagccgttat tttattatac tgctgtttct gtttcatttt 10021 ttattccttt ccgaaacagc aattcaagat attttgtata ttactatttt accgcaagat 10081 gagataaaat gcaatagaaa atgtcgaatt attttacaaa atattaataa ttgctgttct 10141 catttctata tatttttata acgatttgtg ttatactggt agtataatat aaatcatgat 10201 attttacatt gggtggaatt tgaccgtaaa ttgacaaaat cggaggtgtg gatcgtgaag 10261 atatgtgatt tctgttctac cgcaaagatt cttatggact ttatcggtga gagcaaaaat 10321 ataaaccaaa ttgattttat gtatgagctt tttaaagatt ttatggaaag cgatgaagcc 10381 aaagatttcg attttgacaa cggacttgta tgccgttggc tgaatggtac tgcaaaacta 10441 agccctaaaa tcacagcgta ttattctgcg cttggcaatc ttgaagctat ggctattgat 10501 attgaagaaa atatattgcc tttgttttat gataaagata tggctgttac ggaactgtat 10561 gatctgctga tgagtgatac aactgtatct gaaacaaaaa agcaagagct tgcagacaat 10621 tatccataca aagatgatgc tgatatatca aacttcatta gtaagcttgt cttttttgga 10681 atggaaagga aattcataaa aagagatgct aacacaaaga agctgatcgc ttccggagcg 10741 ctttctcctc agacaaagga ttacatatac agtcttgtcc ccaaaccctg taagcatttc 10801 tgcggccgcg ataacgagct tgaaaaactg cataccatgt tagaagatga aaacaagatc 10861 tttattcagg gcattgccgg tataggtaaa agtgagtttg ttaaaatgta tgctcaaaaa 10921 tacaaaaagg aatacacgaa cattctgtat ttcagctatg gcggcagttt aaagcaaatg 10981 attacagact gtgatttcgc cgatgatagc cttacggacg gcaagaatat tctcttaaaa 11041 aagcataacc gctttttgag gagcttaaaa gaagacacct taataattat cgacaatttt 11101 aatatcaccg catcagacga tgaattattt gacgtgataa tgaaatatcg atgcaaaata 11161 ttgtttacca cgcgaagtcg ttttaaggat tatacatact ttgagctaaa agaaatgccg 11221 cttgaaagcc ttttaacgtt atcagaatat ttctatgcag atacacgaag taatactaat 11281 gtggttgaaa acattatcaa tgaattgcac taccatacgt tgtcggtaga acttgcagca 11341 cgcttattga cttctgggat cttggagcca tgcagactgt taacggagct tcaaagcaca 11401 aagagcgtac ttcacacaga tgataaaatc aacatcgtaa aggatggtac aagctcaaaa 11461 gcgacctatt atgagcatat ccataagctg ttgtccttga tagggttatc ggataaagct 11521 gtcaaaatta tgcgctgtat gacgctgata ccgtatgatg gaataaatcc gagaatgttg 11581 gcgaaatgga taggattgaa taatcttaac actataaatg agcttataga atacggcttt 11641 nnaacgatta cagaaaaatc acgctccatc cacttataca ggaaattgcc atagacgata 11701 caaaaccggg catatcaagc tgtataaact taattgaagc cgtaaggata ctgtgcctat 11761 atcacggttt agatttaccg tatcacacgc tcttgtttaa aatagctgag aacacaatag 11821 atattgcgaa taatgacgat acagagcggt ataagctgtt tcttaaagat gtgtttgcat 11881 atatggagaa atatgcttac cgttcgggaa tggagctaat tatatcagaa ttgcaatcgc 11941 ttgttgatac gaacgaggac aaggctatac tgcttgatta taaagctgcg tacgaacata 12001 tatgcaataa aaatcataaa aaggctttgc aatatgagca gcgggcggtc aagctatgtg 12061 aggaaatagc gacggtcaat ccgcatttag cggcgaatat ttacggtaat atcggcggtt 12121 tgtatcacac tgaaaatcag cttgataaag ctaaatatta tatggagctt gcgtatcaaa 12181 cgcttgttga cagcggtatt gattttacta atgacgcggt tattcaggtt tgcaactacg 12241 ctaatttagc tgccagtatg ggagaaccga gaaaagcaat acaggcgtta aaacgctgtg 12301 cgaacgcggt taaggaatac aattcggaaa acagcagcga ctacgccaat ctggcgtggg 12361 atataggctg tatctataca caaatgcgtg ataaagacac tgccgtaatg tattttaaaa 12421 ctgcactaag gatatatacc gatttatggg caaacgagcc ggagctattg caaatgaagc 12481 tgactgagct taaaaatatg gctgcggtat acggagtaaa cattaaaaat ttaatatcag 12541 caaattagta acacaaaagt ttataaagtt attgacagta taacagatat ttgatataat 12601 gcattaaatc acaccacaat ttaaaaatca aggtgtgatt taatacagag gtgtgactat 12661 gattaaacta aatacagaat atcttaaaca gatattcaca aagtatgatt atattatgac 12721 gacagctcaa ttgaatttgg aaaaactgta ctacagagat attcagcaaa tgctaaaaaa 12781 aggcatgatt gaaaaggtaa aaagaggtta ctaccattgg attgaagata atgattgcaa 12841 agaagttgtt attataaaca aactattccc agacgcaatc ctctgcatgg aaacagcatt 12901 gttctattac agctatagtg acaggaaccc tgccgaatgg catatagcaa tagacaaaaa 12961 tgcatcaaga cagcgcacga aaatagacta tccgtttgtc agggcttatc gaatcgaatc 13021 cgatttgtta ctcataggtg aaacaaaann tcgtatttat gaccgcgatc gcacaatgtg 13081 cgatgttctt aaaaacatga ataaaatgga tagggaaatt tttaataaat ccattcaagg 13141 ttatgtaaag gatcctcaaa aaaatattcc caaccttatg cgttatgcaa aagaattgcg 13201 agttcaaaaa cgcgtcaaag atttgattgg agtgtggtta taatggcaga tatagcagca 13261 tccgttcttg caaagcttaa aaataaagca aaagcatccg gtattagtta tcaacagtgc 13321 ttacagctgt ttatgcagga ggaatttttg cgaaaacttt caaaatcagg atatgataat 13381 tttctggttt taaaaggcgg cttatttatc tataccatta caaattttga aagcagagcc 13441 actattgacg ttgattttct gctccgtgga tactctaatt ccatagacga tgttaagggt 13501 ttaatttgca aaattattga cactcccacg ggcaacaact atatcaatat gactgcaaaa 13561 ggatttgagg agatttctcc gcaaagaaag tatcatggcg tcagtactca gattatagga 13621 cagataaaaa atatacgagt gccttttaat gtggatatag gtgttggaga tgttattgta 13681 ccaaaggcag aacaacgcaa aatcaataca cagctcccag attttgaggc accttttatc 13741 aaaacctact ctctggaaag caccattgcg gaaaagttcg acactattct gcaacgcttt 13801 gagctgacca gccgaatgaa agattttcat gacatctatt atctggcaag gacttttgat 13861 tttaatgtag cacgattaca gaaggcaatt tttgaaacct tgcggcggcg cggaacgccc 13921 tatgataagg atagcttcaa acgagttata tcccttgcag atgatataga tatgcagaaa 13981 cggtggaagt atttcttgaa gaatatcaag gatgatacac ttgaattttc tgttgtaatt 14041 gatgagatac agactttcct tgagcctgtt tttgaggcga tagtgaatga agatgaatgg 14101 caaaaagtat ggaactttat taccaaatgg gataaaaaag aaaggagcct tgaatcatga 14161 gcaaagtact tactgttgaa cagcgcgaac aagcaggttc cgactcgtat aatagattcg 14221 agtatcaagt tcactggatt gtatgccata taataagtaa acttcaagag gatgcggagt 14281 gtattgtttt ttgtgaattt cacgatgata tggcagaatt ttctcccaat aatcagcagt 14341 atcagttttt tcaaataaag acaaaagaag attcttctga ctggactatt gcggaaatgt 14401 ctaaaannnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 14461 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn ntcaaataat gattttgata 14521 aagaagtttt gttatggcag gcagttattg aagacggcaa aaagttgcaa gaagaaaaca 14581 gtgtactata tcaaaaaatc aaaaacagat taaaagatga atttaccaat gatatgccaa 14641 gcaattttga ttctgtattt gatgcgttta tacaaaatac ttttgttcat aaatctgatt 14701 tgcaattagc ggcatatgaa aatcaaacta agggagaatt tttcagtcac ttagctgata 14761 agaatattcc aacaaataca gctaatctta ttcttcaaca attgctaaat gatgtaagaa 14821 aaaagagtaa agagaaaatt aaagtaccta ttagtaagaa aagcttagtt gagaaaaagg 14881 gtatcgatgt cgctcaaata ggcaagaaaa ttgatggtaa tataaaaaat agcggaaatt 14941 acaatgcatt tcacgattac ttaattacac aagcattatc agacaatgat atttgtcgta 15001 tagaagcagc taaaacactc cacgatgcaa aatggcttga tgtaaaagat gttagatatc 15061 aagaaattgt aattattctg cgaaaaacaa tttctacata ttgtgaatca tttaaggcaa 15121 atgagtttag tggcgaattg aaaagattat gcattcaaga attgcaaaag cataatcttc 15181 tgtcggattc tcttgataaa tctttaattg aggtgttata ttatgagcaa aaatttagtt 15241 gaaacaatta aacaggagca aaattacaaa attttaatca gaaatttgcg aaagcaacag 15301 gaggaaaaag acaatgaaaa ggctaatgat aagaaaactg atagtcatca gtcaaagtga 15361 gtctcgttcg ttagaggttc cttttgaaaa aggacttaat attattttgg gcggaaataa 15421 aactggaaag tcttctatta tcaaaagcat atttactact ttgggatgtg aatgcaaaag 15481 ggttgaggcg gattggaaaa aattagtttc tacctattta ttgttcttta aatatggaga 15541 aagacaattc tgtattgttc gccaagataa aaagtttcaa atttttgaaa atataaacaa 15601 tgatttttct tgcatcattg aaacaaatgc ttttcatgaa tacagcaatt gcttaatgga 15661 tattttagaa attaagatgc cttgcatatc taaagacgga aaacaattca atataacacc 15721 accattgctt tttagatttc aatatattga ccaagacgaa gggtggagca agatagctga 15781 ttcttttaag aatgttgctt atattaaaga ttggaaagca aatacaaata aatatgtgtg 15841 tggatacctt gatgattcct actatgcact tcaagcacaa aaagcagaac atatacttga 15901 aaaagatgat aagaaaaaag aattgaatta taatcaaagc tttgtctccc gtatcacatc 15961 tacactaacg caaattgaga atattgaatc cgttgaagag gtcacaacag atattgaatc 16021 tttgcttgct aaggcggaag aattaaggaa aatgcaattt tcttataatg cagagatgac 16081 agttttggaa aatgatatat atatcaatca gcataaacta cacattgttg aacacaacct 16141 aatagagaca aaaaaagata ttgagtatgc aatgactcaa gaagatgagc tgatttgtcc 16201 attttgtggt actatttatt caaacgga //