LOCUS AQIB01011244 20241 bp DNA linear PRI 17-MAY-2013 DEFINITION Chlorocebus sabaeus isolate 1994-021 Contig0.17267, whole genome shotgun sequence. ACCESSION AQIB01011244 AQIB01000000 VERSION AQIB01011244.1 DBLINK BioProject: PRJNA168621 BioSample: SAMN01760484 KEYWORDS WGS. SOURCE Chlorocebus sabaeus (Cercopithecus sabaeus) ORGANISM Chlorocebus sabaeus Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Cercopithecidae; Cercopithecinae; Chlorocebus. REFERENCE 1 (bases 1 to 20241) AUTHORS Warren,W. and Wilson,R.K. CONSRTM International Chlorocebus aethiops sabeus Genome Analysis Consortium TITLE Direct Submission JOURNAL Submitted (17-APR-2013) Washington University School of Medicine, The Genome Institute, 4444 Forest Park, St. Louis, MO 63108, USA REFERENCE 2 (bases 1 to 20241) AUTHORS Warren,W. and Wilson,R.K. CONSRTM International Chlorocebus aethiops sabeus Genome Analysis Consortium TITLE Direct Submission JOURNAL Submitted (13-MAR-2014) Washington University School of Medicine, The Genome Institute, 4444 Forest Park, St. Louis, MO 63108, USA COMMENT Chlorocebus aethiops sabeus (vervet) Sequence Assembly Release Notes The vervet DNA for shotgun sequencing, and for BAC libraries, is derived from an adult male vervet monkey (Chlorocebus aethiops sabeus; animal id 1994-021) within the vervet research colony housed a the Wake Forest Primate Facility to create the BAC library CHORI-252. A total of 362,969 BAC end sequences have been generated from this library. A total of 143 CHORI-252 BACs (approx. 18Mb) have been finished and submitted to Genbank. Of these 29 were finished and submitted for the MHC region. Whole genome sequences were generated on the Roche 454 Titanium instrument at these coverage levels (vervet genome size of approx 2.9Gb): fragment- 10X, 3kbp- 8X, and 8kbp- 1X. Total sequence genome coverage on the Illumina HiSeq instrument was 95x (45x fragments, 45x 3kb and 5x 8kb). Two independent assemblies were built with the appropriate sequence data, using the ALLPATHS (Broad Institute) and Newbler (Roche) assemblers. Based on superior contig and scaffold contiguity the ALLPATHS assembly was chosen as the reference. The unique sequences from the Newbler assembly were then merged into the ALLPATHS assembly using graph accordance methods (Yao et. al. 2011. Oct. 23 Bioinformatics). Post assembly we integrated 170 finished BACs. These 170 BACs (including the MHC region) were merged into the 1.0 assembly. The top scaffold that each BAC mapped to was identified by MEGABLAST (-e 1e-20 --W 200 --p 98). Contigs of the top scaffold that the BAC mapped to were identified by BLASTN (-W150 --F F). A Perl script was used to create a new contig for each BAC, extend the contig if the 5' and 3' overlapping contigs were longer than the BAC and adjust flanking gaps accordingly. We then sorted scaffolds by decreasing length, assigned new sequence identifiers to contigs and scaffolds, and extended 20-bp and 50-bp gaps to 100-bp as per NCBI's guideline. In the final assembly, referred to as Chlorocebus_sabeus 6.0.3, there were 162,907 contigs with an N50 contig length of 88 kb. There were 2205 supercontigs with the N50 supercontig length of 45 Mb. A total of 2.73 Gb of sequence was assembled in contigs. Including estimated gap sizes, over 2.74Gb were ordered and oriented along chromosomes, 27.6Mb along the CAE*_random chromosomes, with only 18.34 Mb remaining unlocalized. After organizing Chlorocebus_sabeus 6.0.3 into chromosomal AGP files, we labeled this first vervet release as 1.0. ******************************************* Chlorocebus aethiops sabeus Sequence and Assembly Credits DNA source - Dr. Jay Kaplan, Wake Forest Primate Facility, Wake Forest, NC. Genome Sequence - The Genome Institute, Washington University School of Medicine, St Louis, MO and Department of Human Genetics, McGill University, Montreal, Canada. Sequence Assembly - The Genome Institute, Washington University School of Medicine, St Louis, MO. BAC library - Dr. Pieter DeJong, CHORI, Oakland,CA. Assembly curation - Jessica Wasserscheid, Nikoleta Juretic, Dr. Ken Dewar, McGill University, Montreal, QC Canada. LaDeana Hillier, The Genome Institute, Washington University School of Medicine, St Louis, MO. FISH Mapping Data - Mariano Rocchi, Department of Biology, University of Bari, Bari, Italy. cDNA data - RNA sources was Dr. Nelson Freimer, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, CA, USA Funding for the sequence characterization of the vervet genome is being provided by NHGRI. Author List: Nelson Freimer, George Weinstock, Richard K. Wilson, Wesley C. Warren ******************************************* Chromosome Lengths: column 1 = chromosome column 2 = chromosome length (including estimated gap sizes) CAE1 126035930 CAE2 90373283 CAE3 92142175 CAE4 91010382 CAE5 75399963 CAE6 50890351 CAE7 135778131 CAE8 139301422 CAE9 125710982 CAE10 128595539 CAE11 128539186 CAE12 108555830 CAE13 98384682 CAE14 107702431 CAE15 91754291 CAE16 75148670 CAE17 71996105 CAE18 72318688 CAE19 33263144 CAE20 130588469 CAE21 127223203 CAE22 101219884 CAE23 82825804 CAE24 84932903 CAE25 85787240 CAE26 58131712 CAE27 48547382 CAE28 21531802 CAE29 24206276 CAEX 130038232 CAEY 6181219 ******************************************* Chlorocebus_sabeus 6.0.3 assembly statistics: *** Contiguity: Contig *** Total contig number: 162907 Total contig bases: 2734267806 bp Average contig length: 16784 bp Maximum contig length: 1051246 bp N50 contig length: 88741 bp N50 contig number: 7870 *** Contiguity: Supercontig *** Total supercontig number: 2205 Average supercontig length: 1240031 bp Maximum supercontig length: 126332868 bp N50 supercontig length: 45002363 bp N50 supercontig number: 19 Scaffolds > 1M: 147 Scaffold 250K--1M: 47 Scaffold 100K--250K: 34 Scaffold 10--100K: 473 Scaffold 5--10K: 235 Scaffold 2--5K: 434 Scaffold 0--2K: 835 ##Genome-Assembly-Data-START## Assembly Method :: ALLPATHS and Newbler v. 13-Feb-2013 Assembly Name :: Chlorocebus_sabeus 1.1 Genome Coverage :: 95x Sequencing Technology :: 454 Titanium; Illumina HiSeq; ABI ##Genome-Assembly-Data-END## FEATURES Location/Qualifiers source 1..20241 /organism="Chlorocebus sabaeus" /mol_type="genomic DNA" /submitter_seqid="Contig0.17267" /isolate="1994-021" /db_xref="taxon:60711" /sex="male" /dev_stage="adult" /geo_loc_name="USA: NC, Wake Forest University" /note="housed at the Wake Forest Primate Facility as part of the Vervet Research Colony" BASE COUNT 6080 a 4360 c 4236 g 5564 t ORIGIN 1 tttcttcaag tcatcttaac taaaagtcta tatatacttg gctccacaga ccatttaaca 61 tttattactg ggccaaatat caaagtaatt tacgatgaac aggttaaatt tgtaaattta 121 ataattaagg ccaggcacgg cggctcacac ttgttatccc cagcactttg ggaggcagag 181 gtaagaggat cttctgaggc taggagttca aaaccagctt gggcaacaaa gtgagactgt 241 ctctacaagc atttttttaa aaattagcca ggcatagtgg cacatgcctg tagtaccagc 301 tactcaggaa gctaaggtgg gaggattgct cgagcccagg aggtcaagac tacagtgagt 361 catgatcacg ccactgcact ccagcctggg cgacaaagca agacttcacc tcacaaataa 421 ataagatttt ataagattat atacagtgaa aaaataatag actttcagag ccagactgag 481 ttggttttaa gccatgattt tgtccactta tgagctgtgt gagcttggta aagtttactt 541 accactctag gcctcaattt tctcatttgc aagtaaggga taacgccatc tacttcacaa 601 gaatgttgta aagattcgat gagttaagtt tatgaaatgc ctagttaagt ctcaagccta 661 tgatagaagg ccaagaaatc ttaacctcat cctcctgact ctgcaatcgc catctgaggg 721 taaatcttta tagctaacca attcctggac acatcattat ttttgcaacc atattaaata 781 ccttttactc ttaaaaagaa aaaataagac ttgtaaaaat gttagcagga aagagagaca 841 aaccattttt tcaaatgttg cctcccacaa atcaattcca gatggatggt agatctaaag 901 gtcaagggta aaacaataaa ccctgtagag aaaaacgtaa gagaatatct ttgagacctt 961 caagtttgca gaataaactg gacacaaaat gcactaacca taaactaaaa taaattgata 1021 aagtgaacta cactaagaac ttctgctcat caaaaggtat aattaagaaa gtaaaaagat 1081 ataatacata tccaacaaag gatctgtatc cagaacaaat cttgaactgc tgtaaagcat 1141 taagaaagat ggacgataaa accacaagat ataatttctt ttccactgga atgactacaa 1201 ttgaaaagaa agtaacaagc attggcaagg atgtggaaaa aatggaacac tcatacattt 1261 gctggtggga atggtgtagc tgctgtgaaa aacagtttgg cagatcctca aagtgttaca 1321 cacaaagtta tcaaatgatc cagcaattcc actccaggta tacaaacaac caagcaaaat 1381 gaaaatgtat cttcatgcaa aaacctgtac acaaatttca tagcagcatt attcataaca 1441 tccctaacgt agaaacagcc caaatgtccc ccaactgata aatgaataaa taagatgtgg 1501 tatatccata agaaaggatg aagtactaat gcacgctatg acatggaaaa atcttgaaaa 1561 cattaacact aagtgagaga agttagtcac agaagactac atattgattc cattaatatg 1621 aaatgtcata atagacaaat ccatagagac agaaagcaga ttagttgtta tgaggggtag 1681 tgggtggaag aatgggggaa tggctactaa tacatacagg tttctttccg gagtaataaa 1741 cagtttctgg aattaggcag tggtgatggt tgtacaactc tacgaacata ttaaaaagcc 1801 accaaattgt atactttaaa agtctgaact ttctggtatg tagcttttat cacaataaag 1861 ttgcttaaaa cagaaaaaag acaacccaat agagaaattg ggaaagattt tagcaaacac 1921 taaacaaaat aggatactga agtaatcaag aaatatgaaa atgtgctcaa cttcaatggt 1981 catcaaggaa atgcaaatta aaaccacaag gctactacac acctatcaga atgaccgaaa 2041 ttaaaaagat gaaaaatatt aagtgtgagt gaggatgtag gacatttaga aacctcaatt 2101 actgctggtt agaatgcaaa ctagtaaatg acttggcaat atctattaat attgaaaata 2161 tgcacacctt ttgaccgtat attccagtct taagtatata cccaacagaa atgggtatac 2221 atatgttctc taaaagaaat acactatgct gttcatagta ctactattta taaaagtccc 2281 aaactgaaaa ctacaaaaat gcccatcaac aggttaataa atattggcat attatctcaa 2341 tggcaccaaa tacattaatg acatgaaagg tctacaataa tacacatgaa taaatttcac 2401 aagcaatatc tgagcaaaag aagccagaca caaatgagta tgcagcatgt ggttcccctt 2461 atacaaagaa caacagccca aactaaccta tgctgttaga aatcaggata gtattcatcc 2521 cgtggaggtg ggggttgtgc aggatgtcac tggaaaggga taagaggctg gttacagtaa 2581 gtgtgttgtt tgtgaaaatt cacaaagcta tatacttgtg tgtactcttt taatataaag 2641 tcttaagaga tgctccctgt atcaaaacaa gtatattgag gttgctcttt ttccaatgcc 2701 cttttttcca attcttctta gggccacatg atttactttg ctcaattaag aaacataata 2761 aagcagctga aaatactaaa gagcaattat gcttacaaaa taataattat aaaaagatat 2821 attaccaaaa catggacagg tagaagaaat tatgaacaca ctaactggca gttgtaaggt 2881 tcaaacttct caggttagct tacaaaataa aaatcacagc agtgtgaaag aaatgcttct 2941 ctcctaacac tctagaaaaa cagctgcaat aaacatgata tttgttagac agtttttacc 3001 ctcaacaagc aaaaatgaga agtttatgat gaaactgcac agtaaagggg catctttatt 3061 acataaaaag aaatttgcat caatgattag aagctggatt ttttaatcac caactttaca 3121 gaaataatcc ctaaagtggt gtcattcaac tacagtggat gatttcagta ggttctgaaa 3181 tcaatttagt gtgtcacaaa agacaagtat caaaacacag tatcataata gaaaatttgt 3241 caaaaattca tgatataata aaggcctata ttcttcatga aatgtagtat gtactagttg 3301 caatggaaaa tacatttttt gctttgagct acaataaaaa taaagttagg aaaagctgtc 3361 ctacaagata ccattaacac ttttgctcca tcaagttcca ggttaaaaaa aaagaaaaaa 3421 aaaaggcagt taacttcttg acttttccct ccacttctga tcccagatca ttaaaagttt 3481 taccagattt agcctaaact tttcttttac cacacagacc taactattgc tttccagtga 3541 agcatccatc tcttcttcca ggattaagag tgttcaaata ctactactta tgctaggttt 3601 ggatcatttc acattaacgt ctttctctca attcttcact tgggggaggg aggaatggca 3661 gagtaagagg gaagaagtgt ttgggtggac ttgggcgtca ggcggcattc tcccaactgc 3721 attggtctat tgacttacat actctcagct gaggtataca gaccttctac cttaaagtaa 3781 caaagaaagg aaaccacaag gcaacttgca gcatacagga taccaccaga ctacatattt 3841 cactggatga tatacaaaga cacacttatc cacgtttctg gctgatatgc aaattatctt 3901 ccttaagttt tctaacaatg aaaatgttat cataaaatca gcttactaga atgtataact 3961 gtctcctttt tattactgca gcatatattc cttttctaca cacaaaatgt agataagccc 4021 cactgaatga ctatctatac catcatctcc tctaacaccc tccctcctgt actgctttta 4081 aaaacagtac agatgttaaa caagactaga aaataaattc aatccaagca atacttaaat 4141 gatgacccac ctcaactata aacataaatc ctcatgcact atatgtgtgt gcaggtgttc 4201 aaaatgatgg gaaaagatca gggaatatga tataataaaa agaaatctta aatcctccgg 4261 agttataata ggcatttttt tagtagggaa gggtgaggga actatagtag tccccgttat 4321 ccgcagggga tatgttccag gaccccccat ggatgcttga aactgcagag taccaaaccc 4381 tacatatact gtactatatt ttttcccata tgcaaatact tatgataaac tttaatttat 4441 aagctaggca caataagaga ttaacaacaa taactaataa tagagcaatc taccacttat 4501 gaactgttta tttctggaat tttccatctg atgtttttgg acctcagtta accaaaggta 4561 actgaaacca cagaaagtca aactgtggat aaggagagag tactgttttc tatgtctgca 4621 aatgggtaag tttccttatt tgggtcttga cacaaatgtg tggtaaataa ctttgcagta 4681 gactgggaga gttcaagcct cacctgctaa aacattgtct tcccacattc ttcacaaaca 4741 aagcaaagtg catgttctcc tgacactaac cctaacataa tttttaacat gcaagcagtt 4801 taaaataaaa ttgtaatggc aactaaaagg aaataaaaac tcattattgc ataatggttt 4861 tccagaaaag atctgtacat tagaaaaata acattagaag aaattgaact ctgctctaaa 4921 agaagcaaaa actctgacaa cttcttaacg cattttatat tcttcaaata acttagggat 4981 ttgtttccta gggaaagaaa acttgctgtg tgaccttggg caaatcagat taacttctct 5041 gtgcttcttt tctttttttg aaacggagtt tccctcttgt tgcccaggat ggagtgcagt 5101 ggcgcaatct cagctcacag caacctctgc ctcctgggtt caagtgattc tcctgcctca 5161 gcctcccaag tagctgggat tacagcaccc accaacatgc ctggctaagt ttctgtattt 5221 cttagcagag acgaggtttc accatgttgg ccaggctgga ctcaaattcc tgacctcagg 5281 tgatcaccac ctcggcctcc caaagtgctg ggattacagg tctgcacttc ttttctttaa 5341 ctgcaaaaat gaggaggcta aggaggtaat ctgtaagcct gaatgaccca ccacgaaata 5401 agtctaactg ttgctaagct atctataggt tctacaatga agaaaatcaa atacagcaag 5461 atcattttaa ttacacagat tctttcagcc attttaatta atgtgtctag taacactgta 5521 gcaatgcaat gaaaaggaca ctggcactcc ccgtttaaaa ttaaaaaata atttccttaa 5581 aaatgaggat aggaggaaat taacaggcac catttgtgtg tgtgcctaga tgactaaaat 5641 taatgggtca gatctgacct caaacttatg ttttaaacag cagcgaagta caggttaggt 5701 acacagacaa aaagtaacgg ttgcttaatt aaataaaaat actaaaaatg ttgacaacta 5761 cagtagtata ttttagctag tattgccaca atactggttc tacacaataa tgcaaatgct 5821 agtaataaag gcgtcaccaa acatttgaat ccttattata ttccacattc taagttaagt 5881 gctttacatg aaatatctca tttaattcac atgataatca aggaggtagt gttaagaaag 5941 agatttaata atgctcaatt gttttcatta ttttcataca cagtatctaa cgtgtttttc 6001 agtggaaaag gtggaagtgg ctctgctatt atccacactt ttcagataca aagactgaag 6061 cttttcaaag catttcaaag ggttgctcaa ggaaatagaa tgaagtgata gagctggcac 6121 tagaatccag ttcccaaagg tggtctctgg tggttgacta tgataatttc tcaatgtact 6181 catgtccaca aagataacaa gattctttcc ttctgtcttt attttttttt agagacaggg 6241 tctcacttta ctgtccaggc tgatctcgaa ctcctggctt caagcaatcc gctgcctcag 6301 cctcccaaag tgtggggatt acagatgaga gtcactgtac taggtgcgtc ctcctttgtt 6361 tccctccctc ccttcattcc ctctttcctt tcactccttc cctctctctt tctttctttt 6421 tctttctcaa gctctcaata ttttgctaag gctactcttg aactcctgag ctcaagcaat 6481 cctcccacct cagcttcctg agtcctggga ctacaggcac atgccacaaa gcctggattt 6541 tacttttaat taactaatta gggcaaggat gtggctcaca gctatagtac cagcactttg 6601 ggaggctaaa gcaagaggat catttgagcc caggagtttg agaccttgtc ccttcaaaaa 6661 aatcagccag atgtggtggt acgcacctgt agtcccaact acttggaaga ccagggaggg 6721 agaattgctt gagcctgtga agtcgaggcc gcagtgagct gcgactgtga cactgcactc 6781 cagcctgggc aacagagcaa aaccgggtct ctaatttttt tttttttttt tcccctgggg 6841 ggatagagtc tcactctgtc gcccaggctg gagttcagtg gtaccatctc agctcactgc 6901 aacctccacc tccagcgttc aagcgattct cctgcctcag cctcccaagt agctgggact 6961 acaggaatgc gccaacacac ctgtctaagt tttgtatttt tagtagagag ggggtttcac 7021 catgttggcc agactggtct tgaactcctg acctcagcct cccaaagtgc tgggattaca 7081 ggcatgagcc accacaccca gccttaaatt ttttttaaaa aattaacaaa ttaattaatt 7141 ttagagacgc tgctctgtta cccaggctgg agtgcagtgg catgaggata gctcgctgca 7201 gcctcaaagt tttgggtcaa gtaacaattt tgcttttatt gagaccggat ttcactatgt 7261 tttcagggtg gtcttaaact cttgggcccc cacctcacct cagtctcctg attggagtag 7321 ctgagattac agatgcacac cactgtgcct ggctcaagct tccttatttc tgacaatgtg 7381 cttgagccat ctctcattgg tgcttaaaaa agagggagaa caaaacccca gaataattat 7441 cttcatgcta cagtaaattt atttgcaatg ccaacaggga aatgtacagt ctctcaaagt 7501 aactacttag gaccaggtgt ggtggctcat acctgtaata ccaacacttt gagaagccaa 7561 tacgagagga tcgcttgagc ccaggagttc aagaccagtc tgggcaacat agaaagatcg 7621 ctctacaaaa aaatttaaaa gttggctggg catggcagca tatgcctgta agctactcag 7681 gaggctgaga tgaaaggatt acttgagccc aggagttcaa ggttgccatg ggccatgacc 7741 acaccacagc actccagcca tagcgacaga gtgagatcct acagagagag accctgtttc 7801 aaaaaaagaa agaaaataaa aaagtcacag tttctaaagc acatatcaaa ctttgtattt 7861 cctagtttac attgatattt tattagtgaa tcaatcaata tatataatcc tttcagtaga 7921 atattaaagt aagcaaaatg aacacactga tgttaatgat tattctaagc tcaattttta 7981 taaaatttgt agaggaatta ttaaaaaccc ctgctagtat ctgacccaga aggcagaaga 8041 atcagtgttg tacactctac tacttactga aaccataatt aactactaac agcagatctc 8101 ccccagtgag aatcctgagg atgatagtac aggcattaaa acacacctca tcacataaat 8161 atgaaagcat ataaggccgg gcgcggtgtc tcatgcctgt aatcccagca ctttgtgagg 8221 ctgaggtggg tggattgctt gaactcagga gttcgagagc agcctgggca acatggcaaa 8281 acctcatctc taccaaaaat acaaaaaatt agctgggcat ggtggtgtgt gcctgtagtc 8341 acagctactg gggaggctga gatgggggag gattacttga gcctgggagg tggaggctgc 8401 agtcagtgga gactgtgcta ctaaattcca gcctagagtg agaccctgtc tcaaaaaaaa 8461 aaaaggtggg aggttggggg tgcagtggct cccacctgta atccctacac tttgggaggc 8521 tgaggtaggc agatgacttg aggtctgaag ttctagacca gcgtggccaa cagggtgaaa 8581 gcccatctct actaaaaaca caaaaattag ttgggggtgg tggtgggcac ctgtaatccc 8641 agctacttgg gaggctgagg cagaagaatc gcttgaactg gagtggtgga ggttgcaatg 8701 agctgagatc gcgccactgc atttcagcct gggtgacaga gcaagactct gtctccaaaa 8761 aaaaaaaaag tgtatataaa caacatgaaa ctactaattt gttcaaggcc tattaatatt 8821 agaaaaagtt gatgttcaat gatgtaattc ctaaaagtat cccaaggttt atatacattg 8881 tattccatgg ttttaataaa cggagatatc ctgattccta catttgaaag caattttaaa 8941 gaatatttat taaacaactt ctgggccttt tattagaaaa gattaatttt gggctgggca 9001 cagtggttca cgcctataat tccagcactt tgggaggccg aggcctatgg atcagctgag 9061 gtcagcaatt caacaccagc ctgatcaata tggtgaaagc ccgtctctac taaaaataca 9121 aaaattagcc gggcatggtg ccgcgggcac ctgtaatccc agccactaga gagaccgagg 9181 caggagaatt gcttggactt ggaaggcgga ggttgcagtg agccgagatt gtgccactgc 9241 actccagcct tggcaacaga gccagactct gtctcaaaaa aaaaaaaaaa aaagagatca 9301 attttagggg cagggtcaga ctcttcaagt ttcacaatta taactttata tatagaaaac 9361 aacactaaat aaacctactg ttctcttttt acacattgtt tccagaatga gtgagaaagg 9421 agccagagtg agatttctgt gttagggcat gaggtcgggc gtggtggctc acacctgtaa 9481 tcccagcact ttgggaggtg gaggcaggtg gatcacttga ggtcaggagt tcgagaacag 9541 cctggccaac atggtgaaac cccaactcta ctaaaaatac aaaaattagc tggacgtgat 9601 ggcaggtgcc tgtaatccca gctacttggg aggcccgagg caggagaact agggaggttg 9661 cagtgagccg agatcgcgcc attgcactcc agcctgggca acaagagtga aactccgtct 9721 caaaacaaca acaaaaagat ggcatgagtt tcaaagtttg aaaaagacga tgttagaaga 9781 ataaagaaga aaacaatgac tattgagtat cagaaattaa ctttttcaaa gtaagtacta 9841 aactgaaatc ccatcgcaac aggattgcat gtaggttagt taggcaccac ttaaagagag 9901 cagttccttt gggagacctc atctctgaca aaatagaaaa aaatatttaa aaagagcagt 9961 tccatacgat agtaggtatg gtctaggcat tgcattttct atagaagtaa ctccacttgc 10021 agtcacttgc ttctctaaga acacctaaat gctgcccagt ctgcagcaaa tggctcagaa 10081 gataaaaacc tgttattaca ctggtgacac attacatgtc aatgaccctt ccatcacctc 10141 gaaatctctc ccagaagaaa gcatgagaaa gaatgaaaat gctattttcc atgtctccac 10201 actggtgtga atacttgctt cagaaagtat gccctcagat atcccctatc tcagagtttg 10261 taagtgtgga aaagaatatg aatattttaa tttagaagca ggtcctataa caggcatctg 10321 tgaggtcact ccaaaatttt aattttgaaa atggtcttgt tgggttcact ggataaataa 10381 tggaaaatgt aggcaaaatt ttgatattaa acatagttca catgtaggga gtaacttaca 10441 ataacttcaa aaactgagaa gccctaacat ttttcccgcc ttacattagg attggaggac 10501 caaaataaaa gtgccattca tagtgagcac aaatctgacc cactgtataa gagaatctga 10561 aatgttattt ttttttttta aagacttcta cacaaaatgt taaatcctcc actaggagtc 10621 aatttctttt ttaaaagtaa ttttgcatat ctatgctttg agtgactata caagtattta 10681 atgatcactt cagaaatcct cgaaaataca ttaaagttaa aaaaaaaaat aggtcaccat 10741 ccactaataa ttttttagac aacctggttt attatctttc agctttattt tcctgagcac 10801 tcaaatgtac ctgcacatac acagggtgtt tgaaacacaa ttttgagcga atgtatagca 10861 ctctatctgt ggttaggccc tgatttgttt aataatttct cagctgttag atattttggt 10921 tgtaatgtta acctcttaag ataacttttg gcaatatagt ccttgaatac gtcatatcat 10981 tcaaaatctt tgtgaaacat ggctaagggg cgatgttagg cccacgaaaa aaaggggact 11041 aagaaaaata cttcaacagt atttgccttt agatggtggg attacaggta attcaggttc 11101 tttgctgtcc tttattacca tatccaaatg ttctctaatg gacacatgtt gataataaga 11161 aaacaatgta gttatttagt aacatgaaaa atcaacattt ataaacagac agctaccatt 11221 aaaattcaat acagtaatcg gggaaaagga tgtatgttat gggaaaacaa aacataggat 11281 ttaagtttgt aaatatgtcc atataacaaa agaagactag aaaaccaaat gttcaacacc 11341 aggaggtact ccttaaccaa tctatggtac aacagactac catgagctaa ctgaaagcat 11401 gaggcagatc ttgacttgct actacagaaa gactgctaag ataggtcatc aactgtgaag 11461 actgaggtgt gcaacagcat ggatcatatg attccaagat tccaaggtaa caaaaagaaa 11521 caaagagaga aggatgtgag aagagactct ttcttttgaa acaaagtctc actctttttg 11581 cccaggctgg agtgcaatgg catgatctca gctcactgca acctctgcct cccaggttca 11641 agtgattctc ctgcctcagc ctctcaagta gctaggatta caggtgtgca ccgccatgcc 11701 tggctaattt tttatatttt taatagagat ggggtttcac catgttggtc aggctgttct 11761 cgaactactg acctcaggtg atctgcccac ctcggccttc caaagcgggg attacaggcg 11821 tgagccactg cgcccggcca aagagacttt tttataagca taaaatatat ttgaaaggct 11881 atgtaagaaa ctgtccacag tggttacctc tggagtgggg ataagaagtt gggtacacaa 11941 tatgatccca atcatgaaaa tatgcactca tacataaaga atacataaaa aggaactaca 12001 ctgctcatag cggaacttct gaatccattt caaggctgtc ccctaaagat ttttagatgg 12061 atattgaaaa ataatgaaaa caatggaatg ttcaatctat ctgaattaca gggactcaca 12121 aaatcaaaag acagcagtgg atataaatgt tattataatt ggtatttcaa aacccaatca 12181 tataagtttt tactacaacc aagaaatggc cagggaattg gttattcaag aaaactaact 12241 tctgattaaa tcactctaga acttgttgaa tatgggtgat tttattcaaa attttcctgc 12301 cacagttact ccagtttaca agcttggtaa tctaataatg aaatatatca ctgtccatct 12361 aggcgataac aaatgaaagg aagacttttt gtatgttaag atttaacaca gcacttttcc 12421 ctctaaaaac aaagataagg aaaccaatta cagaatcaag ggcagagggt cttaggaacg 12481 cagacacaaa attctcctaa actagatcat tactatttta aagctctctt taataaaagt 12541 ctaactaaat aagaattact tcatcatgtg caacattaat gctccagatg cagattggta 12601 actaaggttg tccctagtgt aaaaatcaga tttatagatt aataacacat ctgcactact 12661 gtatgaacgc tgatttattt gactgcactt acataaagaa aatacaaact tgttttatgt 12721 tggaatgtag tcattaagtt ggaatccttt tgatgtgtat ggctttcttt caaacactca 12781 actttttttt tgcttattaa cattcatgtg ttttactgag tttccttgag tgcatgtatt 12841 ataaaaacat cacgaatccc caaggattca tgatcacaat cggacactgt gagaaacagc 12901 tggtaatact ggataacacc gcagggtaga gaactcaaaa tcagatcaga taacacaaag 12961 aaaagcacag ctttcacctc acatttggga actgacaatt ctccttccat ttcctctctc 13021 agtgtggctc ttgggggttt ggttagctga aggggatagg aagataatac ggtttcttta 13081 taggctccct tttcctttct gggggaaaaa atcacccttt actaccaaga gtatggctag 13141 aggagggaat tttagtgtaa tggaaaggca atagatacga taatcctgaa gatcagaaag 13201 atgatctgtc tcagccctat cacctagaca gctgggcaat ttaagaatgt tgtctcacct 13261 aaaacatggg acccagggca ggaaatgaag gtagcaccag gaagtgcatt ctcttacttt 13321 cagtgaaaac gttcttgcct gtgaggagct aatattgagt gacactatta cacccttaaa 13381 tggcgctaca cgctgcggcc actctgttga cagttcttca cgttaacact cacacaggtg 13441 tcacagcaca caaccatcaa cgctacttct aaagttgccc ggttccatga gcaaaagacc 13501 acatgctttg taccacccag gcccaggccg ttcataggta ggggcttagc tgacgccttt 13561 tgaggtatta tgtttctctg cctcctggaa gttaagctag ggttgttttc ctttatatac 13621 tcagatttta aaaaaaaaaa cattcctctg cactacctct ttacctctgt ggatctaaac 13681 ctcaatctaa accacaccct cagcagtcgg aacagaacgg aagttcccac tccaggaaat 13741 cttgagaagg aggtttggga gggtcaggag tggcacggga aaggagaagt cttgtcgaaa 13801 tcaagtctgt ttaagtccgt cgaacctact tccaggtcat ttcgatctcc gtctccttcc 13861 ccgctggcct ttcaagtcac tcagccggcg ccgaggtgat ttaacacctg ccgccgcagg 13921 ccccgcctcc ggactgcggg tcaggatccc tctcctagga aagcgcaaag gctgccttca 13981 cagtcactac tttggaaacg ttccctcccc gcccttccta actttggaag ccaaaaacga 14041 gggtgggtgg aagagagaga aggccgtgcg gaaaaagagc aaggttgtcc ccccttccca 14101 gggccgcccc ccggccgagt cgccggcagc tgcctccagc tccagcccgt cttcccacag 14161 ccgcgaaccc cgtgcacagg gcgctccgcg gaggcgggat gccagcggcc tccggaacca 14221 gcctcggcca aggcagcgcc cgcgacagcc tctccttcgc gggatcgggc ctggctgccc 14281 cgtgaaggag cccatggggg cctccgcctc cccgccgccg gccccacctc cagcccgcag 14341 ctccgcggcc gccgtcggag cccttacccg tagaacgtgg tagccttcgg tgcccccgcc 14401 tgggatctcg acgctctgcg aggagcccat ggcggcgggc gatctgtgtg gccgagccgg 14461 gatccgcgct gctcccgccc cccgcgcacc gcccgctctg ctcctccgcg tagcacttgg 14521 gacgtggcac tcgctgccgc cggagagccg ggccgcacag cccgccggga gatcgtctcc 14581 gctgcggccc gggccgctgc gacgccgcag acagcgccac ctgccgcgag ccactggaag 14641 gtgcccgagc gcgcgcgggc cgggagggtg gaaagggagg gccggagaag ggctgggagg 14701 tgccgcgtcg tgctcgcgag agaacagaaa tgcgcttccc cctgctttgg cgccttgcta 14761 cggggccctg gattgggcca gatttatgca gacgcgttcc cgctcgcctg aaagaacagg 14821 ttgcaggtcc ccctggtggc cagaggcagg gaaagagctg ctgataaaca ggagcgaact 14881 caacacacag caagctctct gttctccggt acaggtattt tctgtacagg tattttccca 14941 gactgtacag ctaaggggaa tagtggcccc ctgtggggcc actgcacagt atgagcagag 15001 catactgcat ctgctcagct aaattgtata aaaggaaacg ctccctttat aaaacaatag 15061 tctcacagag tcacctacca acttcttttt tggtttgggt tttttttttc tgcctccgta 15121 attgtgagaa taagattaga ggattggtca caatcagagc gatgccagcg tgggaagcac 15181 ggttggattt ccctggcatt ccaccacctg atgctgctcg gttccccgtc ccacctcctt 15241 caggtagcca ggtcgtattc atgcttattc cgttgtcgta ttcctttgct gcttttgccc 15301 cttacatgta ctccctctga aacccccagt tgccttgtgg tcagttggac tagtggtgcg 15361 tcaaccaggt attatgtgga ttcctatgtg atgcaatgca aagtatgtgc tatcacccat 15421 gatgtattct agtccaaggt ggttaacttg actctatggc atcttgagtt tgatgtttac 15481 aggatgtaag gggaattgag gagcaagtaa aatgacacca cagggaatca ggcaaataca 15541 ggaagtgggc cttccttcag gacagctggc cagttctctt tattgagtca gagtcctatg 15601 aaagggccta tgccagatta aaaggaacac gaagaatata accaagagac acactgtaag 15661 ggcttgagtt gggggccaac ttggagaaat aaacgtaaag aacatttttg gttcatttgg 15721 agagatttga atatgtactg agtattagaa gagtagttat tcacatggaa acgtggaaat 15781 tattgtcttt ttaaaaagct ggttacaaga tgggatgtgt ctgtagacaa tatgaaaaac 15841 atgtcctcaa aatacagatt aatacgtatg tatgcatatg aagatgtggg aggagaatat 15901 gaaccaaggg ttacagtggt gatctctggg tggtgacaga gatgttttat tttttgtttt 15961 ggctggttgc atgttctttg atgttcatga agtacttttg taataagaaa atattattta 16021 aaatttaaaa ataattttct ttgctatcaa caagtgaata gcaggttttt agaaggccta 16081 ataaatccat aaatccataa atgaaatctc cagttattag gctcctcaag gacagggtct 16141 gtagtggagc tgtctttatg ttcttgatat ttaccagagt gcctagcagc tcattcaaca 16201 atttaataaa tttagtgcaa gagaataaat gtaagaacta gatctaatta taatgttctc 16261 gttctgcttg gcagttataa ccctctttat gatggaagct gttgccatgt tcagtggtca 16321 aaatttttgg accagttgtc tcccatatgt ctgatggtag ccatggcttg tagaacccaa 16381 gctgtggatc attggtggtc cgtggggaag tctagccttc cttagagctg tagtgtatgt 16441 ttcagcctca tgagtggctg cttgctattc tccacacaag ccatttactt tcttaccttc 16501 ttgattttat tcatgttatt ctctacttct gaaatgcctt tccttatatt ttccacttac 16561 aaaaatacta actttgaaaa aatcccaatc ctttaagtcc catctcaaat cttacttatt 16621 tatccttaag gtcagcgtga ctctctcctc gccatgaaac agttccttca tttctgtaac 16681 atcactgtca tcatcacagg tctgtcttct accatttatg gtccattcat tcaaagcaag 16741 gatcatatgt ttttcttttc cttcctttcc ttccttcctt ccttccttcc ttccttcctt 16801 ccttccctcc ctccctccct cntccttcct tccttccttc cttccttcct tccttccttc 16861 cttccttcct tccggtttca ctctgtctcc caggctgcaa tgcaggagtg tgatcacagc 16921 tcactgcagc ctccacctcc tgggctcaga atcatcccta cctcagtctt ccgagtagct 16981 aagactacag gcacgtgcta ccacacccag ctaatttttt tgattttttg catagacaat 17041 gtctcactat gctgccaggc ttatttcgaa ctcctgggct cacatgattc tcccacctca 17101 gtctgccaaa gtgctgggat tataggtgtg agccaacacg cccagccaat catgtgtttt 17161 catctttaga tccccagttc ctggttgagt attgggaaca tggtagtact gtttattgat 17221 ttgaattgaa tgtgtgtaat gaatagcttc aaagcctttt ctttttcaga atcagttaat 17281 aagttacctc tgtggttact atttgtgtct catcaaaatc tgtgggtctt ttttcttttt 17341 tttcctaatt cccagggtga tggtaaaatc tcatgttatt tctttagcag cagttggtgg 17401 ggggtgtggg ggtgaggtat agggcccacc tggtttggga ctgaaaaaaa tgtgagaatg 17461 attgccccat ttttcctggt gccagttgtt ttactctttt agttggagaa tacgctccaa 17521 cagaaggcaa agaataacat ctcaaggatg ttgccaaagc taatataaat aatgtaacat 17581 gatggaggga acacgtggga gattattgtc aagtcagcct caattggagt aagaatctgc 17641 tgcaggcagg ttggtatttt gaaacatcag aggcagcaga aaagcgaagt ccaacagaag 17701 caaatgtcta caggctaaaa aaccataagt aatctattct tccggaagaa gtatctagta 17761 cttagtttta tgtgtgttct gagttccatt aacattaatt ctgagtcctt cctttccttc 17821 ctttcctccc ttcccctccc ctcccctccc ccccttcctt ccttccttcc ttctgacaga 17881 gtctcgctct gttgcccagg ctggagtgca gtggtgcaat ttcaacttgc tgcaacctcc 17941 acctcccagg ttgaagcaat tcttgtgtct cagcgtccca agaagctagg attacaggca 18001 tgcgccacca tgcccaggtg attttcatat atttagtaga gacagggatt tgctgtgttg 18061 gccaggcttg tctcaaactc ctggcctcaa gcaatctgcc catctcggct tccgaaagtg 18121 ttgggattac aggtgtgagc caccacacct ggccaattct gagttcttgt aaataagtag 18181 ttcatgatgg agggatcatt tttatcatac acattgaata tcccttatct gaaatgcttg 18241 agaccagaag tgttttggat tttggatttt ggaacattac tggttgagaa tttctaatcc 18301 aaaaatccaa aatccaaagt gctccaatga gcatttcctt tgagtgttat gtcagtactc 18361 aaaaagtttc cgattggagc actttggatt tttagattag tgatactcag cttggtgttg 18421 gctagtgcag ccttatcttt ttgccttcag actaccctga gaaagtttgg ttcctaccat 18481 atttggataa ggatcacatt ggatagggca gacggggatc tcctaaccta gcctgttttg 18541 ctgagagagc aaaccacatg agaccacatg agagaccatg ttacatgcca tgcacgctca 18601 gtcacagctg actggactag aggggaaccc tgttatacag gcagtcagca ctcacactga 18661 atgaggtggc ctttagaaaa ggttctcccc ccacaagtgg atggtagtga ctagctacac 18721 caatcaggtt ttcattttgg ggacttggaa ctgggaaata caaggagaga ctccatcagt 18781 gtatatttat ttgcatatta gaatgtaagc cccgttaggg catgggcttt gcctgagtgt 18841 ctactactgt gctctttgta ctaacaaaga gctggcacac aatagttact caataaatgg 18901 ctgttgaatg tgtgaatgat catgttagta gttatgttga tggcagtggt gacgcgtatg 18961 gagtggccat tgcaaggacg ccaactgtag tgggggaagt gtggccaggg ctgtgcattt 19021 gttggcccgt gggggccagg aacagatgat cccagtggga gctcccatgc cttaccgagt 19081 tggtgggtgg gagcccacca actcctcggt gcagctgcag ctgcccagtt atggctccag 19141 acccaggcat ccctatgctc tcaggggccc aggaagcctc tttcccccac aagctcagaa 19201 gtgcctgctc ccactccctg gcctctccac atccccagcg cccactctgg tgtggaacac 19261 agttgtggcc gagccccggt actgttgcaa cccaaccagg tgtgtgtatg ttcggggcag 19321 tactgagaca ccagccctct gctacctcgg ctcccttcag actttgggca ccagtgagca 19381 tgggaggaag gctggggtgc tgaggatggc ttgacatggg cctgcagaca gcctgggcac 19441 cacaaatggc ctgttgatgg tggcaggagg cagacaggtt cctaggtggg aagaggcagg 19501 tccccggtga aaccccatct tcaggccagg cctaaaacct gggggcctgg ctgccagttc 19561 caggtggagt tcatgaccca gagtgagaac ttccgtgatg ccttttggcc agttggatgg 19621 tgacttttcc atgaaccaat cagcatgtac ttcctccctt ttcagctcat gaaaacccca 19681 gactcagcca gactcagaga ctcatcagga ttacctgctt gtggatagga gctaccccct 19741 ttgggtctcc tctctgttga gagctgtttg gttgcccaat aaagctcttc tctgccttga 19801 tcaccctcca gttgcccatg caacctcatt cttcctggat atgggacaag aactcgggac 19861 ccaccaaatg gcaggagtga aaggagctgt aacacttccc tggatggctc cctgagctat 19921 ggtgggagct aaaggggctg caacactata gccctcccac cctccgcagg ctccagacag 19981 ctgccccatg tgatgggaag cagcagtggg tctgggctag cccaggagcc gtgggccaga 20041 gcgtggcagc aggactgaaa gagctgtaac acaaatgggc tgaaacatga cccccaaaac 20101 acgccccccc actcaccatg ctacaggcat cgagaaggag agaagagctg cagcccttct 20161 ggatgcccag acctctgggc tccctgaatc agggctgtga cacactgtaa caccttcttt 20221 ggagccctgc agttcctggc a //