LOCUS VBCO01000119 17413 bp DNA linear ENV 29-MAY-2019 DEFINITION Betaproteobacteria bacterium isolate BP_19 14_0903_12_30cm_scaffold_12260, whole genome shotgun sequence. ACCESSION VBCO01000119 VBCO01000000 VERSION VBCO01000119.1 DBLINK BioProject: PRJNA449266 BioSample: SAMN11380346 KEYWORDS WGS. SOURCE Betaproteobacteria bacterium (soil metagenome) ORGANISM Betaproteobacteria bacterium Bacteria; Proteobacteria; Betaproteobacteria. REFERENCE 1 (bases 1 to 17413) AUTHORS Diamond,S., Andeer,P.F., Li,Z., Crits-Christoph,A., Burstein,D., Anantharaman,K., Lane,K.R., Thomas,B.C., Pan,C., Northen,T.R. and Banfield,J.F. TITLE Mediterranean grassland soil C-N compound turnover is dependent on rainfall and depth, and is mediated by genomically divergent microorganisms JOURNAL Nat Microbiol (2019) In press PUBMED 31110364 REMARK Publication Status: Available-Online prior to print REFERENCE 2 (bases 1 to 17413) AUTHORS Diamond,S. and Banfield,J.F. TITLE Direct Submission JOURNAL Submitted (01-MAY-2019) Earth and Planetary Science, Jill Banfield's Lab at Berkeley, University of California, Berkeley, CA 94720, USA COMMENT The annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP). Information about PGAP can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: IDBA_UD v. 1.1.1 Genome Representation :: Full Expected Final Version :: Yes Genome Coverage :: 10x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 05/15/2019 19:24:30 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline (PGAP) Annotation Method :: Best-placed reference protein set; GeneMarkS-2+ Annotation Software revision :: 4.8 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 3,427 CDSs (total) :: 3,389 Genes (coding) :: 3,303 CDSs (with protein) :: 3,303 Genes (RNA) :: 38 tRNAs :: 36 ncRNAs :: 2 Pseudo Genes (total) :: 86 CDSs (without protein) :: 86 Pseudo Genes (ambiguous residues) :: 8 of 86 Pseudo Genes (frameshifted) :: 51 of 86 Pseudo Genes (incomplete) :: 27 of 86 Pseudo Genes (internal stop) :: 5 of 86 Pseudo Genes (multiple problems) :: 5 of 86 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..17413 /organism="Betaproteobacteria bacterium" /mol_type="genomic DNA" /isolate="BP_19" /isolation_source="temperate grassland biome" /db_xref="taxon:1891241" /environmental_sample /geo_loc_name="USA: Angelo Coast Range Reserve, CA" /lat_lon="39.74 N 123.63 W" /collection_date="2014-09-03" /metagenome_source="soil metagenome" /note="metagenomic" gene <1..2120 /locus_tag="E6H56_07695" CDS <1..2120 /locus_tag="E6H56_07695" /inference="COORDINATES: protein motif:HMM:TIGR03623.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=3 /transl_table=11 /product="hypothetical protein" /protein_id="TMH41425.1" /translation="PRLPDALQRFLPEPKRPKLVVAYAFDILPPQTKEFLDGLGTEIA FCRPDPRFATVSRAAFDSAKHEIEAAARWARARLEAGGKRIGVVIPSLREKRKEVVRV FSRVMRPGGEKTAMPFNVSLGTPLQEFPIVNAALTVLRFSQEEISFEEASRIVRSPFI GGAESELGARMKLEARLRDKLGATVALPKLIAFLEKKTVLRESLERVFGMRETGLFSQ KTPGEWARHFSAVLEAAGFPGERSLDSDEFQAQAKWHEVLGELSKLDRVSKEISFSGA FQILKKICADTSFQPDSPEAPIQVLEIRDSTYVEFDHLWVTGLTDEAWPLKSSPNPFL PLAQQRKAGIPEASAETSLALDRRITDGWKQAAGEVVFSCFTKEQDRDVLSSPLIADV PLKPIEVPAFPRFRDEIFKLKKLETLQDRVAPAVREKQVRGGTRVLSDQAACPFRAFA RHRLHAEELEQPVEGLDASARGKLVHELMKHLWGFLKDSSSLQGNLDSAIEQAAAAAV KELKLEGRLAELERSRMARIAREWLEVEKSRPEFSVVGVEDKRKISFAGLEFDARIDR MDKLSSGGHAIIDYKTGGNITPRRWDPPRPDEPQLAIYAVAVKEEVTAVAFAKVRPGE MRFMGYSRDDKAIPKVQKANAWQPLLRDWKVEAERLGQSFAGGEAGVDPKKDLMTCRY CGLETFCRVYEKINVLAEEEFEE" gene 2126..5128 /locus_tag="E6H56_07700" CDS 2126..5128 /locus_tag="E6H56_07700" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011380080.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DNA helicase UvrD" /protein_id="TMH41436.1" /translation="MKDAAQRERALDVRHSFIVQAPAGSGKTELLVRRFLKLLEVVKK PEEVLAITFTKKAAAEMRKRVLERLPNSAEIAHRLRIQTIDAFCTALTRQVPVLARFG AQPEIIEDAKPLYKEAAARVFEEFSPATERLLAHLDNNIPLATEQLAKMLGSRDRWLR KTGAVPTRAELEATLVSERNRLLKRAQALYPKASEALARGYLTQKGEWTKRSPPPPEL VRIPGLREALFALCNMPPAEYDDRQWEALEAILALLKPAVAHLKVLFGERGQADFTEF AHGALEALGSVDDPSDLLLSLDQKISHVLVDEFQDTSLSQFELLTKLTSGWQEGDGRT LFLVGDPMQSIYRFREAEVSLFLQAKHSGLGSVKLEPIELSTNRRSQEGLVKWFNESF PRVLPAQEDQTSGAVPYLPASPHEPALPGAAVTWHCGYDREEEAKRVVAIVREASGSK AILVRNRAHLDEIVPALKEAGVRFRALDIEQLGEKQVVQDLYALTRALLHLGDRIAWL ACLRAPWCGLTLADLLLLSSGRSEEGGSEYTSGGALIFDKMRDVTHLSADGQKRVDRA RSVLAPLVKNRLRGSLRERVEGAWLALGGPACVESETDLEDAEIFLDELERLEEAGEV DLAALEDKIDRRLYAQPDVKATKDAVEIMTIHRAKGLEFDTVIVPGLDRLPRSGPKPL LVWKSLLPAGLLLAPIDETGASEDPTYQYVRELDKEADDIEAGRLFYVAATRAKQRLH LLACAKADEDLRPKEPSRRSLLAKIWWQAREHFGDAPADAIAEPERMPIHDVLHRLPA GFALPKAPTSAKWTAPDEGRQEEEIEFSWAGETARHVGTIVHRWLQRIADDELRTWDA TRIDALRSRFAKELERRGIPPRDLKASTEQVSVALKNAISDERGRWVLGPHPEARSEH RIRVTSAAGASTYIVDRIFRTAEDVRWIVDYKTSSHEGKDLEGFLDREQERYRTQLER YARALAAGDRSMLGLYFPLLAGWRQWPQ" gene 5128..5991 /locus_tag="E6H56_07705" CDS 5128..5991 /locus_tag="E6H56_07705" /inference="COORDINATES: protein motif:HMM:PF00892.18" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DMT family transporter" /protein_id="TMH41426.1" /translation="MELSTALALLASAFLGTAVVIANVGLRYLDPARGALVSIPSTTL LFGVLALFLFRGEGWNAAAFAIFAAVGLIFPALVTFLNFASNRLAGPTIAGTISSTTP LFAALGAILFLGEPLSSAATAGTAAIVLGVIALTARGSGPPRSWAAWVILLPLAGAAI RGGAQAAVKGGLMLWPDPFIAALVGYCVSSVTIFAANRAFVPRPNAPLARRGVFWFIA VGVCNGLGVLAMYAALNSGRVSVVSPLVATYPLVTLMFSAIFLREERFGPRVLLGVAL TVAGVVVLARA" gene complement(6002..7177) /locus_tag="E6H56_07710" CDS complement(6002..7177) /locus_tag="E6H56_07710" /inference="COORDINATES: protein motif:HMM:PF13458.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="ABC transporter substrate-binding protein" /protein_id="TMH41427.1" /translation="MRFSIRHHLSLVALACVPFLASRGGRQMEEGINLLLKERNYAFA GRKVDIIFADTAGQPALAKTKTQELVEREKVHAIIGPLATFEALAIDEYMLSSKTPLI TPTSAAQNDLAQKKQSDYVIHVYGTAAQPMYALGDYAAKKLELKRIAMIADDFTYGHE GAAGFHRVFEDGGGRVVQKLWPPLNVPDYGSFIGQLKTNVDGIYAGFAGSNPLRFLRT YKEYGLKLPLFGNPTFVDEGILKNMGDEALGVYSASWYTVDRDTPDNRRFVESIQREY KVTPGFYTAGTYTAGLWLEEAMKRVKGRFEDKPAFIRALHEVKLDHGPMGPTRLGEYG KPILNIYIRRVERKGGQLVNTTIATYPEVSQFWTYDPKQFIAGPQYSRDIPVAKYLE" gene 7284..8252 /locus_tag="E6H56_07715" CDS 7284..8252 /locus_tag="E6H56_07715" /inference="COORDINATES: protein motif:HMM:PF02625.14,HMM:PF04945.11" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="YHS domain-containing protein" /protein_id="TMH41428.1" /translation="MRPDLLKLAARLAEREERFAIVTVVRREPPSSARVGDVALVSER GEYHGWTGGGCTRSSVLLEAMRAIADGEPRLLSLSPEPEGGRRPGVVFLPMTCESGGT VEIYVEPVLPAARLLLFGSSPAVRVLARIAHAVGYRVDVIDPEADESAFPDAKVQRSL VADALPRGAHVLVATMGDGDIEAIEAALARSPAYLGVIASPKRFAQLRDALLARGVAR EAIERIAAPAGLDLGARTPEEIAVSIIAQIVERRRRAAKTGQTENVPVREAVDPVCGM SVKVAGARHTAEALGVTYYFCCAGCRTKFLADSARYLAAGAGARGS" gene 8481..9350 /locus_tag="E6H56_07720" CDS 8481..9350 /locus_tag="E6H56_07720" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012845244.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="xanthine dehydrogenase family protein subunit M" /protein_id="TMH41429.1" /translation="MIPAAFDYHRADTLDEALKLLKKHGDDAKVLSGGMSLLPMLKLR LASFAHLVDINRVPGLDYIKEEKGTLRIGAMTRQAALERSDVIKSKYPILADAVPLIA DPLVRNRGTIGGNVANGDPGNDQPAIMIALGATFIVRGAKERSVAANQFYKGLYDTAL ARSEILTEIRIPVPPPKSGGAYTKLKRKTGDFAVAAAAVQLTLGKYGAVERAGIALTN AGLMPIEAVDAAKYLVGKMPDEKTIAEAAKMAAAKSAPSADRRGSVEYKKEMARVLTA RALHKAVQRAGGS" gene 9351..9821 /locus_tag="E6H56_07725" CDS 9351..9821 /locus_tag="E6H56_07725" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013076572.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="(2Fe-2S)-binding protein" /protein_id="TMH41430.1" /translation="MAMHKISVTINGAVREAEVESRLLLVHLIREVFRMTGTHVGCDT SHCGACTVVLDGNPVKSCTVLAVQADGSKITTVEGLEQGGKLHPVQEGFTEKHGLQCG FCTPGMLMTTSALLERNKNPSEQQIREAISGNLCRCTGYVNIVKAVQYAAEKMR" gene 9846..12215 /locus_tag="E6H56_07730" CDS 9846..12215 /locus_tag="E6H56_07730" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_018050717.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="carbon-monoxide dehydrogenase large subunit" /protein_id="TMH41431.1" /translation="MSEPRTDAKICGMGHSMKRKEDPRFIRGQGRYIDDHVLPGMLYM DIVRSPHAHAKIKKINTAKALKIPGVLAVIDGPTLAKYNLHWMPTISGDTQMVLPIDE VMYQAQEVCAVIATERYIAADGVDAVEVEYEPLPVMVDPHKSLAPGAPVLRKDKEGKK DNLAFHWQAGDKADAERALAASDVVISENVYLPRIHVASMETCGCIADFDTAQGKLTV YMTTQAPHAVRTVLALVAGHVGLSEERIRVVSPDIGGGFGGKVPVYPGYVVAIAASVV IGKPVKWIEDRMENLQADSFARDYHMKVELGAKRDGTMTSLKIKTVADHGYSNASANP SKFPAGLFSICTGSYDLKHAFCEVDAAYTHKPPGGVAYRCSFRVTEAVHAIERATDAL AQKLNMDPAELRMKNFIKPEQFPYKSVLGWEYDSGNYGAALKKAMDIIGYDALRREQA EKRKKGELMGIGISSFTEIVGAGPSRDFDILGIKMFDSAEIRVHPTGKAIARFGTKSQ GQGHETTYAQIVAEELGIPAAHVQVEEGDTDTAPYGLGTYASRSTPTSGAAAALASRK IRDKARKIAAHLLEVSEQDLEWEPGKFFVKGAPQMAKTIQECAFAAYTNHPQGMEAGL EAVHYYDPPNLTFPFGSYICVVDIDRGTGDVKVRRFVAVDDCGNIINPMIVDGQIHGG LTQGIGPALYEEISYDDEGNISGGSFLDYYVPTALETPKWETDKTITPSPHHPLGAKG VGESATVGAPPAIANAVVDALRHLGVTHLDIPITPEKVWTILKNKGVAE" gene 12426..14102 /locus_tag="E6H56_07735" CDS 12426..14102 /locus_tag="E6H56_07735" /inference="COORDINATES: protein motif:HMM:PF00512.23,HMM:PF02518.24,HMM:PF08448.8, HMM:PF13185.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="GAF domain-containing protein" /protein_id="TMH41432.1" /translation="MGVETRAAAFDPLSAGEAQLRVRLELVSRLGSLAAGARNLGELF RALHEETERVMDATVFLFALYDEASETVQVVRQMDRGVEHSGGSFPLGKGFTSEVIRT GAPRLVRRWSAEGPPVRLLYGTEAGDLVTPQSGVVVPILSGDRVLGVLSAQSYRPEAY EEADLLSLSAIAAQAGITIQRLRVTEQMAVEHERHASELEAVLATMNDALLIVDARGA IVRLNRAARELLCLDSASLVFGQPLEQQRLERWPVTAREIAAALVPVIDALRSGASHA GMEIELGSGQLRVLSLGASVLRSPKGVPQGGVIVFRDITGQRELERLREDIFAMAWHD MKTPITVIRGHAELLLRRLSCGERDPKVLESDAALIVEHTDHLSKLLTTLFDISSLEA GLLSISPRPTDLGALTRDVTRGMRATARRNIEVLAHQGVVGRCDEQRIRQVLTNLLSN ALKYSPEGSTVTVSLTADERSVTVRVSDEGIGLDDIELAQLFRRGYRAEPARKLAGAE LGLYFCNGVVTAHGGRMWAESFGHGRGSTFCFTLPLREEHGADRPVEDRA" gene 14099..>14980 /locus_tag="E6H56_07740" CDS 14099..>14980 /locus_tag="E6H56_07740" /inference="COORDINATES: protein motif:HMM:PF00082.20" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="TMH41437.1" /translation="MRRRRLLGIGCALVALALALGSAVSPARAQLLRPPPVILPPPPV ILPPPPVILPPPPASIQEKIDPALLALMAADPQKLLPVIVEMQQPLPPFIGAPNVGRA LEALDLLRFNGVPVAALSLIDAAAGFANAAGINALSLVPTVAFIHHDATVGPRRSAEP PAADPPDQLSRAYARVVKAHRVWRQGITGSGVTVAILDSGVAADADLVEPENRLLASV NFADQRLTSDPGGHGTHIAGIVAGNGSRSGGEFVGIAPRANIVDVRVLGNTGSGRISS VIRGIEWVLARRTVYNIR" assembly_gap 14981..15202 /estimated_length=222 /gap_type="within scaffold" /linkage_evidence="paired-ends" gene 15203..15414 /locus_tag="E6H56_07745" /pseudo CDS 15203..15414 /locus_tag="E6H56_07745" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013787492.1" /note="incomplete; partial in the middle of a contig; missing N-terminus and C-terminus; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=3 /transl_table=11 /product="serine protease" gene complement(15384..15932) /locus_tag="E6H56_07750" CDS complement(15384..15932) /locus_tag="E6H56_07750" /inference="COORDINATES: protein motif:HMM:PF00805.20" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="pentapeptide repeat-containing protein" /protein_id="TMH41433.1" /translation="MKVSQATLSQATLSQAMLSQAMLSQATLSQAMLSQATLSQAKLS QAMLFQTRVSQVSRNQGMPPSVGSFQWSGEPKRTGYSARAKPSAGRSPSLAQVKPTTG WSGSGRNVPGRNAPGRGALGTSAPSQSGTSTSSGSVVSRSAVMVAAYASSSPLPSAIG SGSPLPLSSPYSCVVPTRSALT" gene 16111..16423 /locus_tag="E6H56_07755" /pseudo CDS 16111..16423 /locus_tag="E6H56_07755" /inference="COORDINATES: similar to AA sequence:RefSeq:NP_952344.1" /note="frameshifted; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="response regulator" gene 16501..17211 /locus_tag="E6H56_07760" CDS 16501..17211 /locus_tag="E6H56_07760" /inference="COORDINATES: protein motif:HMM:PF03992.14" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="TMH41434.1" /translation="MTFHQAALVPALLCLSAVNSIAQTPPAASPAVPADVSPLYVVTY VEARPTAREEAAALLKSYREASRSSSGNLRSVVVRSVVRPGQFVVAAAWKDKAAWDAH MAAAGTKEFREKLNALRNAPADDRFHNALSVGPMEVASGSVYGVTHVDVIPPQKDNAI VALKVLGEANRAAAGNVRFEIVQQTNRPNHFTVFEIWRSREAFDANGMSAHQREFRDK LVGMAGALYDERLYEILN" gene 17222..>17413 /locus_tag="E6H56_07765" CDS 17222..>17413 /locus_tag="E6H56_07765" /inference="COORDINATES: ab initio prediction:GeneMarkS-2+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS-2+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="TMH41435.1" /translation="MKRTAIAIAFLAAPALAFAQWNVPAESQRCPSKWGAGDQRGSSN HMKNPETVLRGARLIKTGEV" BASE COUNT 2943 a 5738 c 5686 g 2824 t ORIGIN 1 tcccgcggct gccggatgcg ctgcagcggt tcctgccaga gccgaagcgg ccgaagctcg 61 ttgtcgccta tgcgttcgac atcctaccgc cgcagacgaa ggagttcctc gacgggctcg 121 ggacggagat cgctttttgc aggccggatc cccggttcgc caccgtttcg cgagcggctt 181 tcgattccgc gaagcacgag atcgaggccg ccgcgcggtg ggcgagggcg cgcctggaag 241 cgggcggcaa aaggatcggg gttgtcattc cctccttgcg ggaaaagaga aaagaggtgg 301 ttcgcgtttt ctcgcgagtc atgcggcctg gcggcgagaa gacagctatg cctttcaacg 361 tttcgctcgg tactccgctt caagaatttc ccatagtgaa tgccgcgcta acggtgttgc 421 gcttttcaca agaagaaatc tcgttcgaag aggcgagtcg gatcgtccgc tcgccgttca 481 tcggcggggc tgagagcgag ctcggtgcgc gcatgaagct cgaggcgcgg ctgcgcgaca 541 agctcggcgc cacggtcgca ctgccgaagc tgatcgcttt tctcgagaag aagaccgtgc 601 ttcgcgagag tctcgagcgg gtgttcggga tgcgcgagac cgggctgttc tcgcagaaga 661 cgccgggcga gtgggcgcgg catttctcgg cggtgctcga ggcggccggc ttccccggcg 721 agcgctcgct tgactcggac gaattccagg cgcaggccaa gtggcacgag gtgctcggcg 781 agctttcaaa attggacagg gtttcaaagg aaatatcttt ttcgggagcg ttccagatcc 841 tgaagaagat ctgtgccgac acgtcatttc agcctgacag ccccgaagcg ccgatccagg 901 tgctcgaaat acgggattca acctacgtcg agttcgacca cctctgggtg accggcctca 961 ccgacgaggc ctggccgctg aagagctcgc ccaatccctt cctgccgctc gcgcagcagc 1021 ggaaggccgg catcccggag gcgagcgccg agacctcgct cgcgctcgac cggcgcatca 1081 ccgacggctg gaagcaggcc gcgggcgagg tggtgttctc ctgcttcacg aaagagcagg 1141 accgcgacgt tctgtcgagt ccgctgatcg ccgacgttcc tctgaaaccg atcgaagtcc 1201 ccgccttccc gagattccgg gacgagatat tcaaattgaa gaagctggaa actcttcaag 1261 atcgcgtagc gcccgccgtc agggagaaac aggtccgcgg cggcacgcgc gttctctccg 1321 accaggcagc gtgcccgttc cgcgccttcg cgcgccaccg cctgcacgcc gaggagctcg 1381 agcagcccgt cgaaggcctc gacgcctcgg cgcgcggcaa gctggtgcac gagctgatga 1441 agcacctgtg gggtttcctg aaggattcct cttccctcca gggaaatctc gattccgcga 1501 tcgagcaggc cgccgccgcc gcggtgaagg agctgaagct cgagggtcga ctcgccgagc 1561 tcgagcgctc gcgtatggcg cgcatcgcgc gcgagtggct cgaggtcgag aaaagccggc 1621 cagaattttc cgtggtgggc gtcgaagaca agagaaagat cagcttcgcc gggctcgagt 1681 tcgacgcgcg catcgaccgc atggacaagc tctccagcgg cggccacgcg atcatcgact 1741 acaagaccgg tggcaatatc acgccgcggc gttgggatcc gccgcgcccg gacgagccgc 1801 agctcgcgat ctacgcggtg gcggtgaagg aggaggtgac ggcagtcgcc tttgccaagg 1861 tacgtcccgg agaaatgcgc ttcatgggct actcgcgcga cgacaaggcg atccccaagg 1921 tacagaaggc gaacgcgtgg cagccgctgc tgcgcgactg gaaggtggag gccgagcggc 1981 tcggccagtc gttcgcgggc ggcgaggcgg gcgtcgaccc gaagaaggac ctgatgacct 2041 gccgctactg cggcctggag acgttctgcc gcgtctacga gaagatcaac gtgctggccg 2101 aagaggaatt cgaggagtga gcgagatgaa ggatgccgcc cagcgcgaac gcgcgctcga 2161 cgtgcgccac tcgttcatcg tccaggcgcc cgccggctcc gggaagaccg agctgctggt 2221 gcgccgcttc ctgaaactgc tggaggtcgt caaaaagccc gaggaagttc tcgccatcac 2281 cttcaccaag aaagcggcgg ccgagatgcg caagcgcgtg ctggagcgcc tgccgaattc 2341 ggcggagatc gcccaccgcc tgcgcatcca gaccatcgac gcgttctgca ccgcgctcac 2401 gcgccaggtg ccggtgctcg cccgcttcgg cgcccagccc gagatcatcg aggacgcgaa 2461 gccgctctac aaggaggccg cggcgcgcgt cttcgaggaa ttcagtcccg cgaccgagcg 2521 gctgctcgcg cacctcgaca acaacatccc cctcgccacc gagcagctcg ccaagatgct 2581 cgggagccgc gaccgctggc tgcgcaagac cggcgccgtg ccgacccgcg ccgaactcga 2641 agccacgctg gtttccgaac gaaaccgcct gctgaagcgc gcccaggcgc tctatcccaa 2701 ggcttcggag gcgctcgcac gcggctacct gacgcagaag ggcgagtgga ccaaacgctc 2761 acccccgcca cccgagctgg tccgcatccc gggcctgcgc gaggcgctct tcgccctctg 2821 caacatgccg ccggcggaat acgacgaccg gcaatgggag gcgctggaag cgatcctcgc 2881 gcttctcaag cccgccgtcg cgcacctcaa ggtgctgttc ggcgagcgcg gccaggctga 2941 cttcaccgag ttcgctcacg gcgcgctcga ggcgctcggc tcggtcgacg acccgagcga 3001 cctgctgctc tcgctcgacc agaagatctc gcacgtcctg gtcgacgagt tccaggacac 3061 ctcgctctcg caattcgagc tcctcaccaa gctgacttcc ggctggcagg agggtgacgg 3121 ccgcacgctg ttcctcgtcg gcgacccgat gcagtcgatc taccgcttcc gcgaagccga 3181 agtgtcgctc ttcttgcagg ccaagcactc cgggctcggg tcggtgaagc tcgagccgat 3241 cgagctcagc accaaccgcc gctcgcagga aggcctcgtg aagtggttca acgagtcgtt 3301 cccgcgcgtg ctccccgcgc aggaagacca gacatcgggc gccgtgccgt accttcccgc 3361 ttcgccgcac gagccagcgc ttcccggcgc cgccgtgacc tggcactgcg gctacgaccg 3421 cgaggaggaa gcgaagaggg tcgttgctat cgtccgggag gcttcgggca gcaaggcgat 3481 cctcgtccgc aaccgcgcgc acctcgacga gatcgtgccg gcgctcaagg aggcgggcgt 3541 gcgcttccgc gccctcgaca tcgagcagct cggcgagaag caggtcgtgc aggacctcta 3601 cgcgctgacg cgcgcgctcc tgcacctcgg cgaccgcatc gcctggctcg cgtgcctgcg 3661 cgcgccctgg tgcggcctca ccctcgccga tctcctgctc ctctcgtccg gccggagcga 3721 ggaaggcggc agcgagtaca ccagcggcgg cgcgctcatc ttcgacaaga tgcgcgatgt 3781 cacccacctc tcggccgatg gccagaagcg cgtcgaccgc gcccgctcgg tcctcgcgcc 3841 gctggtgaag aaccggctgc gcggctccct gcgcgagcgc gtcgagggcg cctggctcgc 3901 gctcggcggc cctgcgtgcg tcgaaagcga gaccgatctc gaggacgccg agatcttcct 3961 cgacgagctc gagcgcctgg aagaggcggg cgaggtcgac ctcgcggcgc tcgaggacaa 4021 gatcgaccgc cgcctctacg cccagcccga cgtaaaggcg acgaaggacg cggtcgagat 4081 catgaccatc caccgcgcca agggcctcga gttcgacacc gtcatcgtcc ccggcctcga 4141 ccgcctgcct cgctccggcc cgaagccgct cctcgtctgg aagtcgctcc ttcccgccgg 4201 actgctcctc gccccgatcg acgagaccgg cgccagcgaa gacccgacct accaatatgt 4261 acgggagctc gacaaggaag ccgacgatat cgaggccggc cgcctcttct acgttgccgc 4321 cacccgcgcc aagcagcgcc tgcacctcct cgcctgcgcg aaggccgacg aagacctcag 4381 gccaaaggaa ccctcgcggc gctcgctcct cgcgaagatc tggtggcaag cgcgcgagca 4441 cttcggcgac gcgcccgccg atgccatcgc cgagcctgag cgcatgccga tccatgatgt 4501 gctgcatcgc cttccggccg gcttcgcttt gccgaaggcg cctacttcag cgaagtggac 4561 ggcgccggac gagggacggc aggaagagga gatcgagttc tcatgggccg gagaaacggc 4621 gcgtcatgtc ggaacgatcg tgcatcgctg gctgcagcgc atcgccgatg atgagttgcg 4681 cacgtgggac gccacgcgca tcgacgcgct gaggagccgc tttgcaaagg aactcgagcg 4741 gcgtggcatt ccgccgcgtg acctgaaggc ttccaccgag caagtcagcg ttgccctgaa 4801 gaacgcgatc tcagacgagc gcggacgctg ggttctcggt ccgcatcccg aagcacgcag 4861 cgagcatcgc atccgcgtga caagcgctgc cggggcgagc acctacatcg tggatcgcat 4921 cttccgcacg gcggaggatg tgcggtggat cgttgactac aagaccagca gccacgaggg 4981 caaggacctc gaggggtttc tcgaccgcga gcaggagcgc tatcgcactc agttggaacg 5041 ctacgccagg gcgctcgcgg cgggcgaccg ttcgatgctc ggactgtatt tcccattgct 5101 cgccggctgg aggcagtggc cgcagtgatg gaactcagca ccgcgctcgc gctcctcgcc 5161 tcggcgtttc tgggaacagc agtagtgatc gcgaacgtcg gtcttcgcta cctcgacccc 5221 gcgcgcgggg cgctcgtcag cattccctcg acgacgctgc ttttcggcgt actcgcgcta 5281 ttcctctttc gtggcgaagg ctggaacgcg gcggcgttcg cgatcttcgc cgcggtgggg 5341 ctgatctttc cggcgctggt gacttttctc aacttcgcgt cgaaccgctt ggcggggccg 5401 accatcgccg ggacgatctc gagcaccacg ccgctcttcg cggcgctggg cgcgatactt 5461 tttctcggag agccgctctc gagcgccgcg actgcgggaa ccgcggcgat cgtgctgggg 5521 gtgatcgcgc tcaccgcgcg cggctcggga cccccgcgca gctgggcggc gtgggtgatc 5581 ctgctgccgc tcgcgggcgc cgcgatccgc ggaggggcgc aggcggcggt gaaagggggt 5641 ctgatgctct ggcccgatcc cttcatcgct gcgctggtcg gttactgcgt gtcgtccgtg 5701 accatttttg cggcgaaccg cgccttcgta ccgcgcccca acgccccgct cgcccggcgc 5761 ggcgtcttct ggttcatcgc ggtcggagtc tgcaacggcc tcggggtgct ggcgatgtac 5821 gcggcgctca acagcgggcg cgtgagcgtg gtctcgccgc tggtcgcgac ctacccgctt 5881 gtcacactca tgttctcggc gatttttctg cgtgaggagc ggttcggccc gcgcgtgctg 5941 ctcggggtcg cgctgacggt ggccggcgtc gtggtgctcg cgcgcgcctg agcgccccgc 6001 gttattcgag gtacttcgcc accgggatgt cgcgcgagta ctgcggtccc gcgatgaact 6061 gcttcggatc gtaggtccag aactggctca cctcgggata ggtggcgatg gtggtgttga 6121 cgagctggcc gcccttgcgc tcgaccctgc ggatgtagat gttgaggatg ggcttgccgt 6181 actcgccgag gcgggtcggt cccatcggcc cgtggtcgag cttcacttca tgcagcgctc 6241 ggatgaacgc gggcttgtcc tcgaacctgc ccttgacgcg tttcatcgcc tcctcgagcc 6301 acaggcccgc ggtgtaggtg ccggcggtgt agaagccggg cgtcaccttg tactcgcgct 6361 ggatcgattc gacaaagcgc ctgttgtcag gcgtgtcgcg gtccaccgtg taccagctcg 6421 ccgagtacac gccgagcgcc tcgtcgccca tattcttgag gatgccttcg tcgacgaaag 6481 tcgggttgcc gaagagcggc agcttcagcc cgtactcctt gtaggtgcgc aggaaacgca 6541 aggggttgct gcccgcgaag ccggcgtaga tgccgtccac gttggtcttg agctggccga 6601 tgaaggagcc atagtcgggc acgttgagcg gcggccagag cttctgtacg acacggccgc 6661 caccgtcctc gaacacgcgg tggaagcccg cggcgccctc gtggccgtag gtgaagtcgt 6721 ccgcgatcat cgcgatgcgc ttaagctcga gcttcttcgc ggcgtagtcg ccgagcgcgt 6781 acatcggctg ggctgcggtt ccgtatacgt ggatgacgta gtcgctctgc ttcttctgcg 6841 cgaggtcgtt ctgtgccgcc gaggtcggcg tgatcagcgg cgttttcgag gacagcatgt 6901 attcgtcgat ggcgagcgcc tcgaaggtgg cgagcgggcc gatgatcgcg tggaccttct 6961 cgcgctcgac gagctcctgc gttttcgttt tcgcgagagc gggctgcccg gccgtgtcgg 7021 cgaagatgat gtcgaccttg cgtcctgcga aggcgtagtt gcgctccttc aacagcaggt 7081 tgatgccttc ctccatctgc cggccgcccc tcgaagcgag gaacgggacg caggcgagcg 7141 ctacgagcga cagatgatgc cgaatcgaga aacgcatgac tcctccaggg cgtgaaaagc 7201 ggatttgttc tggtcgacct tatcctacca gcatcggtcg cgccggaggc ttacaatgac 7261 cgtttccccg acaccgtcag cctgtgcgcc cggatctcct caaactcgcg gcccgtctcg 7321 ccgagcgcga agagcgcttc gcaatcgtga ccgtcgtgcg gcgcgagccg cccagctcgg 7381 cccgcgttgg cgatgtcgcg ctggtgagcg agcgcggcga gtaccacggc tggacgggcg 7441 gtggctgcac gcgctcgagc gtactgctcg aggcgatgcg ggcgatcgcc gacggcgagc 7501 cgcgcttgct gagcctttcg cccgagcccg agggcgggcg gcgtcccggc gtggttttcc 7561 tacccatgac ctgcgaaagc ggaggtaccg tggaaatcta cgtcgagccg gtgctgccgg 7621 cggcgcgcct gctgctattc ggaagctcgc ccgcggtgcg ggtgctcgcg cgcatcgccc 7681 atgccgtggg ctaccgcgtc gacgtcatcg acccggaagc cgacgagagc gcattcccgg 7741 acgccaaggt gcagcgctct ctcgtggcgg atgcgctgcc gcgcggcgcg cacgtcctgg 7801 tggcgacgat gggcgacggc gacatcgaag cgatcgaggc ggcgcttgcg cgctcgcccg 7861 cttacctcgg cgtgatcgcg agccccaagc gcttcgcgca gcttcgcgat gccttgctcg 7921 cccgcggcgt cgcgcgcgaa gccatcgagc gcatcgccgc gcccgcgggg ctggacctgg 7981 gcgcgcgcac gcccgaggag atcgccgtga gcatcatcgc gcagatcgtc gagcgccggc 8041 gtcgcgcggc aaaaacgggg cagacggaaa acgtaccggt gcgcgaggcg gtcgatccgg 8101 tgtgcggcat gagcgtcaag gtcgcgggcg cgcgccacac cgcggaggcg cttggcgtca 8161 cctactattt ctgctgcgcg ggctgccgca cgaaatttct cgccgactcc gcgcgctacc 8221 tcgccgcggg cgccggggcg cgcggctcgt gaatgccgct cgtcgacgga atcgacaccg 8281 atataaggaa atgctgataa acggggaggg cttgctccgc ctggtggcct gatgtatcgt 8341 cggcagcaca agaataatat cccgtcggca tacgtttacg cgaaggacga agggaatgcg 8401 gcgcggcgat ccagccgcgc ctcggttctg ttttcggtcc gtggaggcta aagcgcgcaa 8461 cgcgcggaaa ggggagaccg atgattcctg cggcatttga ttatcaccgt gctgacacac 8521 tcgacgaggc cctgaagctc ctgaagaagc acggcgacga cgcgaaggtg ctctcgggag 8581 gcatgagcct actgcccatg ctcaagctgc gcctcgcctc cttcgcccat ctcgtcgaca 8641 tcaaccgcgt tcccggcctg gactacatca aggaagagaa ggggactctg cgcatcggag 8701 cgatgacgcg ccaggccgcg ctcgagcgct ccgacgtgat caagagcaaa tacccgatcc 8761 tcgccgacgc ggtgccgctg atcgccgatc cgctggtgcg caaccgcggg acgataggcg 8821 gcaacgtggc gaacggcgat cccgggaacg accagccggc gatcatgatc gcgctgggcg 8881 ccaccttcat cgtgcgcgga gccaaggagc gcagtgtcgc ggcgaaccag ttctacaagg 8941 ggctgtacga caccgccctc gcgcgcagcg agatcctcac cgagatccgg attcctgtgc 9001 cgccgcccaa gagcggcggc gcgtacacca agctgaagcg caagaccggc gacttcgccg 9061 tcgccgcagc cgcggtgcag ttgacgctgg gcaagtacgg cgcggtcgag agggccggca 9121 tcgcgctcac caacgccggc ctcatgccga tcgaggcggt cgacgcggcg aagtatctcg 9181 tcggcaagat gcccgacgaa aagaccatcg cggaagcggc gaagatggca gcggcgaaga 9241 gcgcgccttc cgccgaccgg cgcggctcgg tcgagtacaa gaaggagatg gcgcgcgttc 9301 tgacggcgcg cgcactgcac aaggcggtcc agcgcgcggg aggaagctaa atggccatgc 9361 acaaaatcag cgtcaccatc aacggcgcgg tgcgcgaagc cgaggtcgag tcgcgcctcc 9421 tgctcgtgca cctcatccgc gaggtgttcc gcatgacggg cacccacgtc ggctgcgata 9481 cctcgcactg cggcgcgtgc accgtcgtcc tcgacggtaa tccggtgaaa tcgtgcaccg 9541 tgctcgcggt gcaggccgac ggcagcaaga tcaccacggt cgagggcctg gaacagggcg 9601 gcaagctcca tccggtgcag gagggcttca ccgagaagca cggcctgcaa tgcggcttct 9661 gcacgccggg catgctcatg accacgagcg cgctgctcga gcgcaacaag aacccgagcg 9721 agcagcagat tcgcgaagca atttcgggga atctctgccg ctgcaccggt tacgtgaaca 9781 tcgtgaaggc cgtgcaatac gcggccgaga aaatgcgcta gctgaaccgg aatcaggaga 9841 cagcaatgag cgaacccaga accgacgcca agatctgcgg catgggccat tccatgaagc 9901 gcaaggagga cccgcgcttc atccgcggcc agggccgcta catcgacgat cacgtgctgc 9961 ccggaatgct ctacatggac atcgtgagga gcccccacgc gcacgcgaag atcaagaaga 10021 tcaacactgc aaaggcgctc aagatccccg gcgtgctcgc ggtgatcgac gggccgacgc 10081 tcgccaagta caacctccac tggatgccga cgatttccgg cgacacgcag atggtgctgc 10141 cgatcgacga ggtcatgtac caggcgcagg aagtctgcgc ggtgatcgcc accgagcgct 10201 acatcgcggc cgacggcgtc gatgcggtgg aagtcgagta cgagcctctt cccgtgatgg 10261 tcgatccgca caagtcgctc gcccccggag cgccggtcct gcgcaaggac aaggagggaa 10321 agaaggacaa cctcgctttc cattggcagg cgggcgacaa ggcggatgcc gagcgcgcgc 10381 tcgccgcatc cgacgtcgtg ataagcgaga acgtctatct tccgcgcatc cacgtcgcct 10441 cgatggagac ctgcggctgc atcgccgact tcgatacggc gcagggcaag ctcactgtat 10501 acatgaccac gcaggcgccg cacgcggtgc gcacggtgct cgcgctggtc gcgggccacg 10561 tcgggctgtc cgaggagaga atccgcgtcg tgtcccccga catcggcggc ggcttcggcg 10621 gcaaggtgcc cgtgtacccg ggctacgtgg tcgcgatcgc ggcctcggtc gtgatcggca 10681 agccggtgaa gtggatcgag gaccgcatgg agaacctcca ggccgactcc ttcgcgcgcg 10741 actaccacat gaaagtcgag ctcggcgcga agcgggacgg caccatgact tcgctcaaga 10801 tcaagaccgt ggccgaccac ggttactcca atgcatcggc gaacccctcc aaattcccgg 10861 cggggctgtt ctcgatctgc accggctcct acgacttgaa gcacgctttc tgcgaagtcg 10921 atgcggccta tacccacaag ccgccggggg gcgttgccta ccgctgctcc ttccgcgtca 10981 ccgaagccgt gcacgcgatc gagcgcgcca ccgatgcgct cgcgcagaag ctcaacatgg 11041 atccggccga gctgcgcatg aagaacttca tcaagccgga gcagttcccg tacaaatcgg 11101 tgctcggctg ggagtacgac agtggcaact acggcgcggc gctgaagaaa gcgatggaca 11161 tcatcggcta cgatgcgctg cgccgcgagc aggccgagaa gcgcaagaaa ggagagctga 11221 tggggatcgg catctcgagc ttcaccgaga tcgtcggagc cggcccctcg cgggacttcg 11281 acatcctcgg catcaagatg ttcgactcgg ccgagatccg cgtgcatccg acgggcaaag 11341 ccatcgcgcg cttcggcacc aagagccagg gacagggcca cgagacgacc tacgcgcaga 11401 tcgtcgccga ggagctcggc attcccgccg cgcacgtgca ggtggaagag ggcgacaccg 11461 acaccgcgcc ctacggcctg ggcacctacg cgagccgctc gacgccgacc tcgggcgcag 11521 ccgccgcgct tgcctcgcgc aagatccgcg acaaggcgag gaaaatcgcg gcgcaccttc 11581 tcgaagtgag cgagcaggat ctcgaatggg agcccggcaa gttcttcgtc aagggcgcgc 11641 cccagatggc gaagaccatc caggagtgcg cgttcgcggc gtacaccaac catccgcagg 11701 ggatggaggc gggcctggag gccgtgcact actacgatcc gccgaatctc acctttccct 11761 tcggcagcta catctgcgtg gtggacatcg accgcggcac gggcgatgtg aaagtcaggc 11821 gcttcgtcgc ggtggacgac tgcggcaaca tcatcaaccc gatgatcgtc gacggccaga 11881 tccacggcgg gctcacccag gggatcgggc cggcgctcta cgaggagatc agctacgacg 11941 acgagggcaa catctccggc ggaagctttc tcgattacta cgtgccgact gcgctcgaga 12001 ccccgaagtg ggagaccgac aagaccatca cgccttcgcc gcaccatccg ctcggcgcga 12061 aaggcgtggg cgaatcggcg accgtaggcg caccgcccgc gatcgcgaat gcggtcgtcg 12121 acgcgctacg tcacttgggc gtcacgcacc tcgacatccc gatcacgccg gaaaaggtct 12181 ggacgatcct gaagaacaaa ggtgtcgcgg agtagcgcct gacgcagcaa gccgcgcgtg 12241 gggcgccgac gttgtgcctc ggggaataac ccttatgaat cagcacggag caactgcgcc 12301 ccatccctct tctagtccga atggcacatc ggatcgacga cgttcgtaaa tataatttag 12361 gtaaatagac ttagggggaa gtgtgccctt tccctgtttg ttcctagctt gggaggtgaa 12421 cccccttggg cgtcgagaca cgcgctgccg cctttgatcc gctgagtgca ggtgaggcgc 12481 agcttcgcgt gcgactcgag cttgtgagcc ggctcggctc gctcgccgcc ggcgctcgca 12541 acctcgggga gttgttccgc gctctccatg aagagaccga gcgcgtgatg gatgcgacgg 12601 tctttctctt cgccctctac gacgaggcga gcgagacggt gcaggtcgtc cgccagatgg 12661 accgcggcgt cgagcactcc ggcggttcgt ttcccctggg caagggtttc acgagcgagg 12721 tgattcgaac gggtgcgcct cggctcgtcc gccgatggtc cgccgaggga ccaccggtcc 12781 gcctcctcta cggcaccgag gcgggggact tggtcacacc ccagtcgggc gtcgtggtcc 12841 cgatcctatc cggcgatcgg gtgctcggtg tgctctctgc gcagagctat cgacccgaag 12901 cctacgaaga agcggatctc ctcagcctga gcgccatcgc tgcgcaagcg ggcatcacca 12961 tccagcgcct gcgcgtgacg gagcagatgg ccgtggagca cgaacggcac gcctcggagc 13021 tggaggcggt cctcgcgacc atgaacgacg ccctcctcat cgtcgacgcg cggggcgcca 13081 tcgtgcgatt gaaccgggcg gcccgggagc tcctctgcct ggacagcgcg agcctcgtgt 13141 tcggccagcc gctcgagcag cagcgcctcg agcggtggcc ggtaacggcg cgtgaaatag 13201 ccgcggccct cgttcccgtc atcgatgccc tgcgctcggg cgcgagccac gccggaatgg 13261 agatcgagct cggctcgggc cagctgcgcg tcctcagcct gggcgcatcg gtgctgcgct 13321 cgccgaaagg cgtcccgcaa ggcggcgtca tcgtgttccg cgacatcacg gggcagcgcg 13381 agctcgagcg gctgcgggaa gacatcttcg cgatggcctg gcacgacatg aagacgccga 13441 ttacggtgat caggggccat gcggagcttc tcctccggag actctcttgc ggcgagcgcg 13501 atcccaaggt cctcgagtcg gatgccgcgc tgatcgtcga gcatacggac catctctcga 13561 agctcctcac gaccctcttc gacatctcct ccctcgaggc gggtctcctc tcgatttcgc 13621 cgcggccgac cgatctcggc gcccttaccc gcgacgtgac gcgaggcatg cgtgcgaccg 13681 cccgccgcaa catcgaggtg ctcgcccacc agggcgtcgt gggcaggtgc gacgagcaac 13741 ggatccggca ggtgctcacg aatctactct ccaacgccct gaagtactcg cccgagggct 13801 cgaccgtgac ggtgtccttg accgccgacg agcgcagcgt gaccgtgcgc gtgagcgacg 13861 agggaatcgg cctggatgac atcgagctcg cgcagctctt ccgccgcggg taccgggcag 13921 aaccggcgcg aaagctggcc ggagccgagc tcggcctcta cttctgcaac ggtgtcgtca 13981 cggcgcacgg cggccgtatg tgggcggagt ctttcggcca cggccggggc agcacgttct 14041 gcttcaccct gccgttgcgc gaggagcacg gtgccgatcg gcccgtggag gatcgcgcat 14101 gaggcgccgc cggctgctcg gcatcggctg cgcgctcgtc gcgctcgcgc ttgcgttagg 14161 gtctgcggtg tccccggcgc gcgcgcaatt gctgcgcccg ccgcctgtga tcctgccccc 14221 gccgcctgtg atcctgcccc cgccgcctgt gatcctgccc ccacctccgg cgagcatcca 14281 ggaaaagatt gaccctgcgt tgctcgcctt gatggcggcc gatccgcaga agcttctgcc 14341 ggtcatcgtg gagatgcagc aaccgttgcc accgttcatt ggcgcgccga acgtcggtcg 14401 cgccctggag gccctcgatc tccttcgctt caatggcgtt ccggtcgcgg cgctgtccct 14461 catcgacgca gccgcggggt ttgcgaacgc cgccggcatc aacgcgttga gcctagttcc 14521 gacggtggca ttcatccatc acgatgcgac ggtgggcccg cggcgcagcg ccgagccgcc 14581 ggcggccgat cctccggatc aactctcaag ggcttacgcg cgcgtggtga aagcccatcg 14641 agtctggcgg caggggataa ccgggagtgg tgtgacggtc gcgatcctcg attcaggcgt 14701 cgcggcggat gcggatctcg tcgagcccga gaaccgtctc ctcgcctcgg tcaacttcgc 14761 ggatcaacgc ctcacgagcg acccgggcgg gcatgggaca catatcgcgg ggatcgtcgc 14821 tgggaatggc agccgctccg gcggcgaatt cgtcggcatt gctccgcgag ccaacatcgt 14881 cgacgtccgg gtgctcggca acactgggag cgggcggatc tcctcggtga tacgggggat 14941 cgagtgggtc ctcgcgcgcc gcaccgtcta caacatccgc nnnnnnnnnn nnnnnnnnnn 15001 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 15061 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 15121 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 15181 nnnnnnnnnn nnnnnnnnnn nnggcggcgc ctcgtctccc tgcgcgtgcc gggaagcgcc 15241 ctggatacgc tgtttcccga ccgcgtcgtg gtggcgcaga acggatcgac ctacttccgc 15301 ctcaccggta cgtcgatggc gaccggcgtc gtgtcgggcg ccgccgcgct ccttctgcag 15361 cgctggccgc acctcactcc caatcaggtc aaggcgctcc ttgtcgggac gacgcagctg 15421 tacggggagg acagcggcaa agggctcccc gatccgatgg cggagggcag ggggctgctc 15481 gatgcgtacg cggccaccat aaccgcgctg cgtgatacga cggagcccga tgacgtgctc 15541 gtgccgctct ggctgggcgc gcttgtgccg agcgcgcccc tgcctggcgc gttcctgcct 15601 ggcacgttcc tgcctgaccc gctccaaccc gtggtcggtt tcacctgcgc caatgacggg 15661 ctccggcccg cggacggctt cgcccgcgcg ctgtacccgg tcctctttgg ttcgccgctc 15721 cactggaagg atccgacgct cggcggcatt ccctggttcc tgctcacttg ggacactctg 15781 gtctggaata gcattgcttg ggataacttt gcctgggaca gtgtcgcctg ggacagcatt 15841 gcctgggaca gcgtcgcctg ggatagcatc gcctgggata gcatcgcctg ggatagcgtc 15901 gcctgggata gcgtcgcctg ggacaccttc acgctcgact gagcctacga gtttccgggt 15961 acctgcctcg attcgcgcgc cgatcccagc gaagcagggc tagggcggcg accttgcgcg 16021 tttcgcactg caacgccggt tttcagcatc ctgtggattg tccggggcgc tttggataga 16081 atccgcattt cgccttcctt tcccgcggac atgacgcgcg ttctcttcat cgacgacaac 16141 gacgacttca ggacgctagc gctgcggtgc gcagcagcgc gcccggccgg ccgacgtgat 16201 cgtcaccgac atcttcatgc cggaaaagga aggcatcgag acgatccacg aactgcgcag 16261 ggagtttccc gaggtgaaga tcatcgccgt gaccggcctc gagccgctga ggcattacga 16321 cgtgttcgag gtcgcgcgcc aagtcggcgc cgtgaagacg ctcaagaagc cgttcaagtt 16381 cgaagacctg atcgccgcgg tgcgcgagct gaccgggggc tgacggggct gaacgcgacg 16441 gccttccatc ccgggtgtac gatacccggc tgctgctgga ctgatcaggg agaagaaaac 16501 atgacattcc atcaggccgc gctcgtcccg gccctgctgt gcctctccgc cgtgaactcc 16561 atcgcccaaa cgcctcccgc cgcctctccc gccgttcccg ccgacgtgag cccgctctac 16621 gtcgtgacct acgtggaggc gaggccgacc gcccgcgagg aagccgccgc gctcctgaag 16681 tcctatcgcg aagcgagccg atcatcgtcc ggaaacctgc gctcggtggt ggtgcggagc 16741 gtcgtccggc cgggacagtt cgtggtcgcc gccgcctgga aggacaaggc cgcctgggac 16801 gcccatatgg cggccgcggg cacgaaagag tttcgcgaaa agctcaacgc gctgcgcaac 16861 gcgccggccg acgaccgctt ccacaacgcg ctctcggtcg gtccgatgga agttgcctcc 16921 ggctcggtct acggcgtgac ccacgtcgac gtgatcccgc cgcagaaaga caacgccatc 16981 gtcgcgctca aggtcctggg ggaggcaaat cgcgccgcgg ccggcaacgt gcgcttcgag 17041 atcgtgcagc agaccaaccg tccgaaccac ttcaccgtgt tcgagatctg gcgctcgcgc 17101 gaggccttcg acgcgaacgg catgtccgcg caccagcgcg agttccgcga caagctcgtc 17161 gggatggccg gcgcgctcta tgacgaacgc ttatacgaaa ttctcaacta aggggactca 17221 catgaaacgc accgccatcg caattgcctt ccttgccgct cccgcgctcg catttgcgca 17281 atggaacgta cccgccgaaa gccagcgctg cccctcgaaa tggggcgcag gtgatcagcg 17341 cggatcgagc aatcatatga aaaacccgga gacggtactg cgcggcgcgc ggctcatcaa 17401 gacgggcgag gtg //