LOCUS PAIW01000034 11618 bp DNA linear ENV 13-JUL-2018 DEFINITION Chloroflexi bacterium isolate NAT218 69392, whole genome shotgun sequence. ACCESSION PAIW01000034 PAIW01000000 VERSION PAIW01000034.1 DBLINK BioProject: PRJNA391943 BioSample: SAMN07618454 KEYWORDS WGS. SOURCE Chloroflexi bacterium (marine metagenome) ORGANISM Chloroflexi bacterium Bacteria; Chloroflexi. REFERENCE 1 (bases 1 to 11618) AUTHORS Tully,B.J., Graham,E.D. and Heidelberg,J.F. TITLE The reconstruction of 2,631 draft metagenome-assembled genomes from the global oceans JOURNAL Sci Data 5, 170203 (2018) PUBMED 29337314 REMARK Publication Status: Online-Only REFERENCE 2 (bases 1 to 11618) AUTHORS Tully,B.J., Graham,E.D. and Heidelberg,J.F. TITLE Direct Submission JOURNAL Submitted (11-SEP-2017) Center for Dark Energy Biosphere Investigations (C-DEBI), University of Southern California, 3616 Trousdale Parkway, Los Angeles, CA 90089, USA COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Date :: 30-MAR-2017 Assembly Method :: Megahit v. 1.0.3, minimus2 v. 2.0.8 Genome Representation :: Full Expected Final Version :: Yes Genome Coverage :: Not Applicable Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 10/03/2017 23:53:42 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS+ Annotation Software revision :: 4.2 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 2,409 CDS (total) :: 2,385 Genes (coding) :: 2,300 CDS (coding) :: 2,300 Genes (RNA) :: 24 rRNAs :: 1 (5S) complete rRNAs :: 1 (5S) partial rRNAs :: tRNAs :: 22 ncRNAs :: 1 Pseudo Genes (total) :: 85 Pseudo Genes (ambiguous residues) :: 53 of 85 Pseudo Genes (frameshifted) :: 12 of 85 Pseudo Genes (incomplete) :: 15 of 85 Pseudo Genes (internal stop) :: 11 of 85 Pseudo Genes (multiple problems) :: 6 of 85 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..11618 /organism="Chloroflexi bacterium" /mol_type="genomic DNA" /isolate="NAT218" /isolation_source="marine water sample" /db_xref="taxon:2026724" /environmental_sample /geo_loc_name="Atlantic Ocean: North Atlantic Ocean" /collection_date="2012" /collected_by="Tara Oceans Consortium" /note="metagenomic; derived from metagenome: marine metagenome" gene 7..198 /locus_tag="CL709_02665" /pseudo CDS 7..198 /locus_tag="CL709_02665" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_020384386.1" /note="incomplete; partial in the middle of a contig; missing start; Derived by automated computational analysis using gene prediction method: Protein Homology." /pseudo /codon_start=1 /transl_table=11 /product="Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase GatCAB subunit A" gene complement(401..982) /locus_tag="CL709_02670" CDS complement(401..982) /locus_tag="CL709_02670" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="MAX18752.1" /translation="MAVGTFGFQASTTFVMGRVTFEALENSGDAVGSFSTIESWKTGT SHLGQEIQRTLTGTQVTVQPVALVDLRIVPQAIAVQPPQPFTFEIRVDPNGQTLTGVQ VLLEFKPNSLQAVTIVTGTSSPLGNPLANRFDNSVCTLDIAAGTLGSGTNTQFALAVV NMSGGASGVMTPVVFFAASLRDSAVSIEGDQIQ" gene 1059..2144 /locus_tag="CL709_02675" CDS 1059..2144 /locus_tag="CL709_02675" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_014133820.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="S-(hydroxymethyl)mycothiol dehydrogenase" /protein_id="MAX18753.1" /translation="MPHKVRGVIAAAKGVKVKVGAVFVPDPGPGEALVRVQACGVCHT DLHYREGAINDEFPFLLGHEASGIVEQVGDGVNNVEPGDFVIIAWRAPCGTCRSCVRG HPWYCFASKNAQQPMTLADGTALSPALGIGAFAELTLVDAIQAVKVNRAAGPEIAGLV GCGIMAGLGAAINTGGVSRGDSVAVFGCGGVGDAAIAGSHLAGAHTIIAVDIDDRKLG WAKDFGATHVINSSETDPVKAIRELTGGNGVNVAIEAVGNPVTYEQAFYSRDHAGTVV LVGVPSPDMKIELPLLEIFGRGGALKSSWYGDCLPSRDFSLYIDLYLQGRLPLDKFVS ETIALDDVEEAFAKMERGEVLRSVVKF" gene complement(2164..3723) /locus_tag="CL709_02680" CDS complement(2164..3723) /locus_tag="CL709_02680" /inference="COORDINATES: protein motif:HMM:PF13450.4" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="MAX18754.1" /translation="MDQNQYDAIIIGAGHNGLVTAGYLAKDGLSVLVLERLDKVGGAC TTDEIFPGFSGPMCAYICYMLQGXVIDDLRLRDFGFEILPLGXAGQGSRGLHPFPDGA YLHGPGIETSFDVAQQLKEFSEHDARSYFDWLSFWEEAAGILLPYFLTEPPTLSQVMD DVRGTRREEVLEKMLTSSMMDMVDEFFDDDRVRASFLGIPESDPSATGSVMSNAYFKT TLLTRDRDRGIPKGSMGAVTQAMADAARSFGAEIRVGTAVQDVIVENGEAKGVRLASG EEIRSFIVVSNADPKRTFTTLVXLEGSDESLARKVENWKNQAGCVKFLAALKEPPDFS RYLGNGYDRDAIVNVNIAPSTEYFQQAWDDCKAGKITDNPLMHVQMPSVMDPSLTPKG GVMLSNWVLYYPPELKDGMTWDXARNTVGERIIDVMTEYAPNFRESLIDWTVQTPIDI EERVGITDGNIRHGDVIPQQMLTNRFSYRTPIQNFYLCGAGTHPGGEVTGAPGHNAAQ AILKDLARIAV" gene 4221..4430 /locus_tag="CL709_02685" CDS 4221..4430 /locus_tag="CL709_02685" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="MAX18755.1" /translation="MAESRTQTFVTQDSILLWXINHWKPHIRGLIAALDTTPEYVTIL FLRSSRAVNDFVNQLPQINHASSDG" gene complement(4416..6035) /locus_tag="CL709_02690" CDS complement(4416..6035) /locus_tag="CL709_02690" /inference="COORDINATES: protein motif:HMM:PF07969.9" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="MAX18756.1" /translation="MNIIVTNGRVVTMDAXGSVAXAIAIRNGXIVXIGSTDEIKAYRR XDTTEFDXAGRTVLPGFXEPHNHMVLFGVSLLXVDAXTPPNRTINDVIDRLRSRASET EEGQWITGXGYDDTGLAXMRHPTRDDLDKASTKHPIVIXHSSGHLLAANSLALSIGNV TADTPXPAGGRIGRKPGTSDPDGVLYEGPAQVLVTQHIPAYTKQDVRDGFIRAQDEFS RQGVTTIHDASVGRXRGVDXLDTYESARTDGLLKLRVNMFLQWQLLEETDFXFEPGXG DDWIRVAGXKIVSDGSIQGLTGALRXPXHCXXNEKGWLXYEQDELNAMVLALHRRGYQ IATHANGDAAXDSILDAYENALRIEPRSDHRYRIEHXQVCHPEHIERMRXLGVIPDFF ANHVYYWGDRHRERFLGPDRVNLLDPVGSTIRAGLLPILHSDCPVTPVSPLFCIQSAV HRVTSSGEILNASERVSVEDGVSTMTRNAARAAFQENSLGSLEVGKLGDLVILEQDPF RVAPGEIGQIEVAATVVGGRLMYGKDDLSIG" gene complement(6086..7186) /locus_tag="CL709_02695" CDS complement(6086..7186) /locus_tag="CL709_02695" /inference="COORDINATES: ab initio prediction:GeneMarkS+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="MAX18757.1" /translation="MPSKLPDWITYPAEDWTEITPTQAGLDAARWKHFIANTSVKGAE WEGEDHAGNRWGTVFIRGGYRVHVWGDGDYRFQTASMGKAFTWAALGLAVDRALVDPN EFIWRDWTGEGMLSHPHKYLDRGHHAXLTWNLLGRRTDGXHWXGFPVTNGYFWRQXSS SQGTGTVADPVPXWADWTGDPFYDNFSHAEPGSVGIXSSGGQWRLLQALTALWNXDIK RVLDEELFGXMGIRPXDWDWIXGXDVHEXPXFYPXMPGYGDFXDPPYEINGXPVRGGG GWXXISANTLAKFAHLIATXGQWNGKQLXSPEYVIGHGGGNGSGVAGESRFYTGFAIV SXDGIDFRDPFLPSEFXXGPVDLSPTXQRXXD" gene 7349..9091 /locus_tag="CL709_02700" CDS 7349..9091 /locus_tag="CL709_02700" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_020384193.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="MAX18758.1" /translation="MSKPVLXIDTPVAPPTWALLERQLLKAMSDACVQFFDHYFDERG YLLXXPRWGGDDGPDDAAENILNWTMLHALGGSETVLRLYKKGWNGHLLQYTEAKTVE VPMGREGMYYKEXPVMFDWFHHSEWLSAFYLQGLSXPSDLTYRNRLKRYTGFYNGDDP QADNWDADHKXIRSMFNGSRGPLMXKATGLDWAGDPIXIQGRFRPGHGERDYHQMLVH FQXYNDVVGDHPMNMSATTLAXSAYAVTGETRYRDWVLXYVDAWVERTEANGGLIPTN IGLDGSIGGETEGKWYGGVYGWGFSVYDPAIDAIAHRPSFQNRAHYGFGNAXLXTGDX RYADXWGXMIDIVNSNAVXENGRTVXPNSFGDQGWYNFTPEKFAHGALEVYYWSMDEX DAMRXLNDDWVKYLNGLNPXYPVEALXXDFDELHARLEEMWADEATADTRMSDDPNTI NPAITXTXTKLMXGGLPTGXVGGPLHCRLRYFDPESRRAGIPEDVAALVSRLSNDEVT VSLVNXNXTEPKTVIVQGGAYAEHQIKNARXHXNIVSVDSPHFTVXLGPGAGGELTLK MXRYXNQPTFSFPW" gene 9138..10838 /gene="ggt" /locus_tag="CL709_02705" CDS 9138..10838 /gene="ggt" /locus_tag="CL709_02705" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019460955.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="gamma-glutamyltransferase" /protein_id="MAX18759.1" /translation="MSHSWKDNAGSPFXTYKQGXVASRGMVSSNHPMASAAGLXMLAM GGNAIDAAIATAFALTVVEPMMIGIFGAGFINFYNGNTGXFVNVDNYAXAPKSATADM FGTVSDDWPDYMETVDRANDVGYRAVGVPGALMGWCHAEEKHGXLGLETIXQPAXRXA EHGFPASQYLVDIIGTTQSDLAKFPASAEIFLPGGSPPTVGXLIVRTDYARTLKAIAH KGPDILYHGXIGXMVVDXMAANGGLVTGDDLETYQIQLREPVRQSYREYEIVSTAPTS SGGTHIIQVLNLLEGFDVRESGFGTVENVHLIAEALKISFADRFEYMGDPAFVEVPVD ALISKKYATIRRQEINPSHAEDFRPGNPSAYVSESRDTTHLTVADDEGNVVSMTQTIH AAFGSKVTVPGTGIILNNTMNIFDPHPGNANSIEPGKRMLSSMSPTIVLKDDKPFMAL GTPGATRIFPSVLQAIMNVIDHGMTLQEAVEAPRIWSQGQALEVEGGISPDVRVGLEA LGHNVQVVAKVAGGMNGIMYDPASGLIHGAACWRADGTPVGMSGGPARPSGANAMFTV " BASE COUNT 2626 a 3209 c 3034 g 2538 t ORIGIN 1 accgcatttc aaaagtacac tgcgccctat aacttcaacg gcgctccgac gctctcggtg 61 ccttgcggcc tcaacaacga cggtctgcct ttgagcctcc agatcgtcgg taaacacctc 121 tcagaagcac tattgtgcca ggtcgggcac gcttacgaag gcgcgacgga ttggcacgac 181 cttcacccgc cagtctagaa agccaggcaa taacagatta tctgacggaa attgccaaat 241 attctcaacc ttgagaatcc ttcacaagaa gtggccttag ccgagcagca gatcactaga 301 caacaaccgc aaatccaccc gcaagaacca agaacccgat tggtgtaagg ttcggaactc 361 caggggcatt ttgtatggtt acttctattc ctatgagatt tcactgtatc tgatcacctt 421 caatgctcac cgccgagtca cgcagcgatg ccgcgaagaa cactaccggc gtcatgactc 481 cagatgcccc accgctcata ttcacaacgg ccaatgcgaa ttgggtgttt gtgccgctac 541 cgagtgtgcc tgcagcaata tccaatgtgc aaacagaatt gtcgaatcta ttggccaagg 601 ggttgcccaa tggcgacgag gtgcccgtaa ctatggtgac agcctggagc gagttcggtt 661 tgaattcgag gagcacttgg actccagtca acgtttggcc gttagggtcg actctgatct 721 cgaaagtgaa cggctgaggt ggttgaaccg cgatcgcctg cgggacgatt cgaaggtcca 781 ccaatgcaac gggttgaacc gtgacttgag ttccggtcag cgtcctttga atttcctgtc 841 ccaggtggct tgtgcccgtc ttccaggatt cgattgtcga gaaagacccc acagcatcgc 901 cagagttttc gagtgcctcg aaagttacac ggcccatcac gaaagtcgta gacgcctgga 961 acccaaacgt tccaacagcc aaaatcgccg ttgattcagt tcgatcgatg tggaagaatc 1021 agatcactta catcattgca atcgcatagg agattttcat gccccacaaa gtccgaggag 1081 tgatagcagc cgccaaaggc gtcaaagtca aagttggagc cgtcttcgtg cccgaccccg 1141 gcccaggcga agcccttgtg cgggtgcaag cgtgcggcgt gtgtcacacc gaccttcatt 1201 atcgagaagg cgcgataaac gacgagtttc cattcttact cggacatgaa gcgtcgggga 1261 ttgtcgagca ggtcggagac ggcgtgaaca acgtcgaacc gggcgacttc gtaatcatcg 1321 cctggcgagc gccctgcgga acttgccggt cttgtgttcg cgggcatccc tggtactgct 1381 tcgccagcaa gaacgcccag caacccatga cgctggcaga cggcaccgcc ttgtctcctg 1441 cactcgggat cggcgccttc gcggagctta ctctagtgga cgcgattcaa gcagtcaaag 1501 ttaaccgagc cgcaggcccc gagattgcag gtcttgtcgg atgcggaata atggccggat 1561 taggagccgc catcaacacc ggcggggtgt cgcggggcga ttcggtggct gtttttggct 1621 gtggaggcgt gggggatgcc gccatagccg gctcccatct ggcgggcgcc cacacgatta 1681 tcgctgtcga catcgacgac cggaaacttg gatgggcgaa agattttggg gcgacccacg 1741 ttatcaactc atctgagacc gaccctgtaa aggctatccg cgagctgact ggcggcaacg 1801 gcgtcaacgt tgcgatagag gcggtcggaa atccggtcac gtacgaacag gccttttaca 1861 gccgggatca cgcaggcacg gtcgttcttg tgggagttcc aagcccagac atgaagatcg 1921 agttgccttt gctagaaatc ttcggcagag gcggcgcgct caaatcctcg tggtatggag 1981 attgcctacc tagccgagat ttttcgttat acattgacct ttatcttcag ggcaggctcc 2041 ctctggacaa gttcgtgtca gagaccatcg cgctcgacga tgtggaggag gctttcgcca 2101 agatggagcg aggcgaagtc ctcaggtcag tcgtcaaatt ctaaaggtca aaatctaacg 2161 acactacaca gcaatccgag ccaaatcctt taggatagct tgagcggcgt tgtggccagg 2221 ggcgccggtt acttcgcccc ctgggtgggt acctgcgcca cacaggtaaa agttctgaat 2281 cggcgtgcgg tatgaaaatc tgttggtcag catctgctgt ggaattacgt cgccgtggcg 2341 gatgttcccg tcggtgattc cgacccgctc ctcgatgtct atcggagtct gcactgtcca 2401 gtcgatcagg ctttcacgga agttgggggc gtactcggtc atcacgtcga ttatgcgctc 2461 gcccaccgtg tttctagcnt cgtcccaagt catgccgtcc ttcaattccg gcggatagta 2521 cagcacccag ttggagagca taacgccgcc cttcggagtc agcgaaggat ccataacaga 2581 cggcatctga acgtgcatca gcggattgtc agtaatcttc cctgctttgc aatcgtccca 2641 cgcctgctga aaatattcgg ttgacggcgc gatgttcacg ttaacgatgg cgtcgcgatc 2701 gtatccgttg cccaagtatc gggagaagtc cggaggctct ttcaaggccg cgaggaattt 2761 cacgcaccct gcctggttct tccagttctc gaccttgcgg gcgagagatt cgtcggagcc 2821 ctcaagnttg accagcgtcg tgaatgtgcg tttcggatca gcgtttgaca caactatgaa 2881 actgcggatc tcctcgccac ttgccagtcg aacaccctta gcttcgccgt tctcgactat 2941 cacatcttga accgcggttc cgaccctgat ctccgctcca aaacttcggg ccgcgtcagc 3001 catcgcctgt gtcactgcgc ccatgctccc cttcggaatt cctcggtcac ggtcgcgggt 3061 caacaaagtg gtcttgaagt aagcgttgga cataacgctg cccgtcgcgc tggggtccga 3121 ctcaggaata ccgagaaaac tggcgcgcac gcggtcgtcg tcgaaaaatt catcgaccat 3181 atccatcatg ctggatgtga gcatcttctc aagaacttct tctcggcgcg tcccgcgaac 3241 gtcatccatc acttgagaca gcgtcggggg ctctgtcaaa aaatatggca gaaggattcc 3301 cgcggcttcc tcccagaatg acagccagtc gaaataactc cgcgcgtcgt gttctgaaaa 3361 ttccttgagt tgttgggcca cgtcaaacga tgtttcgatg cctggcccgt gaagataggc 3421 gccgtcagga aagggatgca gtcctcttga accctgacct gctnctccca ggggcaatat 3481 ctcgaaaccg aaatctctta gccgcaggtc gtcgataacn ttgccttgaa gcatgtagca 3541 aatatacgca cacatggggc cgctgaatcc tgggaatatc tcgtcggttg tgcatgcgcc 3601 gccgacctta tccaaccttt ccagcaccag gacagatagt ccgtccttag caaggtatcc 3661 agcggttacc aggccgttgt ggcctgctcc gatgattatg gcatcgtact ggttctggtc 3721 cataaggagc ctccaagaaa cgagtagtca gaattcaggt gtcaggttac tttgggagtg 3781 tatgccatcg tcttaattga atcaacgtcg atgtatcttc nctcttcgta catggattag 3841 cattcaaaac gatgacaacc acgaaataca gacgaatcac catctacggt tacagcggtt 3901 cagaaaaatc gagattagcc cacatcatcg gcgctcggct caacctaaat gtgattgagt 3961 tagacgctct tttaacatgt aaattattgg gacgatacgc tgttggataa gtttcgtgcc 4021 aanctcgaac gganaacaga atcgtctccg attgggtggg tcancgcagg naactatttt 4081 cgcgtcaaag acctgctcat ggatcaggct gacgttgcct ttctggctgc gcctcccgtt 4141 tcatattgtg tactggaggc tcttacggca cacgatccgt gatttgttca ctacaaaaac 4201 cgatatggga aggcaacact ttggctgaat cgcggacgca aaccttcgtg acccaagatt 4261 cgattctgtt gtggngnatc aatcactgga aaccgcatat acgcggtttg atcgccgctc 4321 tggatacgac accagaatat gtaacgatat tgtttttgcg atcttctcgt gcagtcaacg 4381 atttcgtcaa tcaacttcct caaattaacc acgcctcatc cgatggataa atcgtctttc 4441 ccgtacatca gtcttccgcc cacgaccgtc gcggcgactt cgatttgccc aatctcaccc 4501 ggcgcgacac ggaaggggtc ttgctcaaga atcactaagt cgcccagctt gccgacctcc 4561 aacgagccta gcgaattttc ctggaatgcg gcgcgagcgg cgtttcgagt cattgtcgaa 4621 acgccatcct ccactgagac tcgctcagaa gcgttgagga tttctccact gctcgtcaca 4681 cgatgtaccg cgctctgaat acaaaacaat gggctgacag gtgtgaccgg acagtcggag 4741 tgaagaatcg gcaagaggcc ggcgcgaatc gtcgagccaa cagggtccaa caggttgacc 4801 cgatcagggc caagaaatcg ctcgcgatga cggtctcccc agtagtagac gtggttggcg 4861 aaaaaatcag ggattacgcc caacnttcgc atccgctcga tgtgttctgg atgacanacc 4921 tgncagtgtt cgatccggta tctgtgatct gatcgaggtt cgatccgcaa tgcgttttca 4981 taggcgtcaa gaatcgagtc tanngcagcg tcgccattgg cgtgagtggc gatctggtag 5041 cctcgtcggt ggagngccag aaccatcgca ttcagttcat cctgctcata nataagccaa 5101 cccttctcgt tnttntcaca gtgntagggn tctctcaacg cncccgtgag nccttgaatc 5161 gagccgtcag acacgatctt ncagcccgcn acgcgaatcc aatcatcccc tgnnccaggc 5221 tcgaatncga agtcngtttc ctcnagcaat tgccactgga ggaacatatt naccctcagc 5281 ttcaacagtc catcggttct tgctgactca taagtgtcca gcanatcaac gccgcgagnt 5341 cggcccacag aggcgtcgtg aatngtagtg actccctgac gcgaaaattc gtcctgcgcc 5401 cgaatgaanc cgtcccgaac atcctgcttc gtgtaagcgg gaatatgttg tgtgaccaac 5461 acctgngccg gcccttcata cagcactcca tcggggtcag aagttccagg ctttcggcct 5521 atccgacctc ccgccggntc nggagtatcg gcggtcacgt tgccgatgct nagagccagg 5581 ctgttggcgg caagcagatg tccngagctg tgcnaaatga cgattggatg tttggtggac 5641 gctttatcca gatcgtctcg tgtcggatgg cgcatntccg ccaggccggt gtcatcgtaa 5701 ccacngccag taatccactg gccttcctcc gtctcggagg ctctcgagcg gaggcgatca 5761 attacgtcat tgatcgtccg gttgggtgga gtnctagcat ccacgcngag caatgaaacn 5821 ccaaacaata ccatgtgatt atgaggctcn atgaagccng gaagcacngt cctncccgct 5881 ncgtcgaact ccgttgtgtc cgntcgacga taggctttga tctcatcggt ggagccgatg 5941 tncacgatnt tgccgtttcg tatggcaatc gcntcggcga cagagccggn agcgtccatn 6001 gtcaccactc gaccgttggt cacaattatg ttcatctaaa ttggatacct ctcacnaaaa 6061 tcatgggcng atngcatcta naccgtcaat ctcngnatcg ttgccntgtc ggagacaaat 6121 caacgggncc nannaagaat tctgagggca gaaacgggtc tctgaaatcg atcccatcgg 6181 nactgacgat ggcgaatcct gtatagaatc gggattcgcc tgctactccg ctgccgttgc 6241 ctcccccatg accgatcacg tactcgggac tcangagctg tttcccattc cactggcctn 6301 nggtggcgat caggtgggcg aattttgcaa gcgtgttggc gctgatganc ncccanccac 6361 cnccgccacg aacgggntgc ccgttgatct cgtanggcgg gtccangaag tcaccgtanc 6421 ctggcatatn nggataaaan ctcggnttct cgtgaacatc cntgcctgna atccagtccc 6481 aatcatncgg gcgtatnccc atntnnccga atagctcctc atccagaacg cgcttgatgt 6541 cttngttcca cagngccgtc agagcctgaa gcaaacgcca ttggcctccc gaactntaaa 6601 tnccgacact ncccggctcg gcatgggaga agttgtcgta aaatgggtct cccgtccagt 6661 cngcccaacn cggcacgggg tcggcgacgg ttccggttcc ttgcgacgac ganctctgnc 6721 gccaaaagta tccgttcgtg accggaaatc cccnccagtg gannccgtct gtccttcgnc 6781 ccaaaaggtt ccacgtgagn ttggcgtggt ggcccctatc aagatacttg tgagggtgag 6841 acagcatacc ttcgccggtc caatctcgcc agatgaattc gttggggtcc accaatgcgc 6901 gatcgacagc cagtcccaac gcggcccacg tgaacgcctt gcccattgac gcagtctgaa 6961 acctgtaatc tccgtctccc catacgtgga cccgatatcc gcctcgtatg aatacagtgc 7021 cccatctgtt cccggcgtga tcttcgcctt cccattcggc gcccttcacg ctcgtgttcg 7081 cgatgaaatg cttccatcgc gctgcatcca acccagcttg cgtgggcgtt atttcggtcc 7141 aatcttccgc cggataggtg atccaatctg gaagtttcga gggcatcnng gcgcctttct 7201 gaagagaaga atgctttcgt caattaggga tgagaccagc agtatacatc aacagntttc 7261 cagntcggat ncgattgaca catgcgctgg ctggtcggta tcttacncag gcaattgtga 7321 atatccttcc ntgaggcgaa cgcgagttat gagcaaacct gtattgcnca tcgacacacc 7381 cgtagctcct ccgacatggg cnctgctgga acggcagcta ttgaaagcta tgtcagacgc 7441 ctgcgtccag tttttcgacc actatttcga cgagcgagga taccttctgt gnntgcctcg 7501 atggggaggc gacgacggcc ctgatgatgc cgcngagaat atcctgaatt ggacnatgct 7561 gcatgcgttg ggnggatccg aaaccgtgtt gaggctctac aaaaagggtt ggaacggcca 7621 cttgctccag tacaccgaag ccaaaaccgt tgaggtnccg atgggtcgag aggggatgta 7681 ttacaaagag ttncccgtca tgttcgactg gttccatcat agcgagtggc tttcngcgtt 7741 ctaccttcag ggcttgtcgg anccctcnga tctnacgtac cgnaaccggc tnaaacgtta 7801 cacaggtttc tacaacggcg acgatccgca ggccgacaac tgggacgccg atcacaagat 7861 natccgctcg atgttcaacg gtagtcgagg ccctctgatg agnaaggcca ccggccttga 7921 ttgggcnggc gacccaatcg anattcaagg ccggttccgt cccggtcacg gagagcgcga 7981 ctaccatcag atgctcgtcc acttccagga ntacaacgac gtcgtgggcg accatccaat 8041 gaatatgagc gcaacaactc tcgccntgag cgcatacgcg gtgactggcg agacccgata 8101 ccgcgactgg gttctggant atgtngacgc ctgggtcgag cgnaccgagg ccaatggagg 8161 cctgatcccc accaacatcg gcctggacgg gagcatcggc ggagaaaccg aaggcaaatg 8221 gtacggagga gtgtacggct ggggattctc agtgtatgat ccggcgatcg acgccatcgc 8281 ccaccgtccg tcgttccaga accgcgccca ctacggattc ggcaacgcgn tcttgntgac 8341 tggcgaccnc cgttacgccg atntntgggg ncnaatgatc gacatcgtga attcgaacgc 8401 cgtgcnagaa aatgggcgaa ccgtctancc gaactccttc ggngaccagg gatggtacaa 8461 tttcacccct gaaaaattcg cncacggtgc tctngaggta tactactggt cgatggatga 8521 acangacgcn atgcgantnc tcaacgatga ttgggtcaag tatctgaacg gcctcaaccc 8581 nnattaccct gtcgaggcgt tgnatcnaga tttcgacgaa ctccacgccc gnttggagga 8641 gatgtgggcg gacgaggcna cagcagacac ccgaatgtcc gacgacccga acaccatcaa 8701 tcccgccatc acngngacgn tgaccaagct gatgntaggc ggcctcccna cagggagngt 8761 nggaggncca ttgcactgcc gacttcgtta tttcgaccct gagagcaggc gcgccggaat 8821 accngaagat gtcgccgcgc tggtctcncg attaagtaac gatgaggtaa ccgtttcttt 8881 ggtgaacntc aaccanacgg agccaaaaac ngtaattgtg caaggcggcg cntacgcaga 8941 gcaccagatc aagaatgccc gatnncacgn taatatcgta tcggtggata gccctcactt 9001 cacngtccnc cttggtccgg gcgcaggcgg cgagttgacg ctgaagatga ngcgttacnc 9061 gaaccaaccn acgttctcnt tcccntggta atgtaacaac cgcgacacaa nccagnagtt 9121 cgcaaggaga atcagcaatg tcacactctt ggaaagacaa cgccggcagc ccctttgana 9181 catacaaaca gggagntgtg gcgtcccgag gcatggtgtc gtccaaccat ccgatggcgt 9241 ctgcggccgg actcnaaatg ctggcgatgg gcggcaatgc tatcgacgca gccatcgcca 9301 cngcgttcgc attgacagtc gttgaaccca tgatgatcgg catattcggc gctggattca 9361 ttaatttcta caacggcaat acgggcnaat tcgtcaacgt cgacaactat gccantgctc 9421 caaagtcngc gactgcngat atgttcggna cagtctccga tgattggccc gactatatgg 9481 aaacagtgga cagagccaac gacgtaggat accgcgccgt tggcgttccg ggcgcgctna 9541 tgggatggtg tcacgcngag gagaaacacg gcnggctggg nctngagacc atcatncaac 9601 ccgccatncg gtangctgaa cacggttttc ctgccagcca gtatctggtc gacatcatcg 9661 gcacaaccca gtctgacctn gccaaattcc ctgccagcgc ngagattttc ctgcctggcg 9721 gttcccctcc aacngtcgga nctttgatcg tacggactga ttacgcccgc actctgaagg 9781 ccatcgccca caaaggnccc gacattctct accacggagn gatcggagan atggtggttg 9841 acganatggc tgccaatggc ggnctcgtca caggggacga cttggaaact taccagattc 9901 aacttcgcga gcctgtgcgt cagagttacc gtgaatacga aatcgtatcc accgctccga 9961 caagctccgg gggaacgcac atcatccaag tgttgaattt gctggaaggc ttcgacgtgc 10021 gggagagcgg attcggaaca gtcgagaatg tccacctcat cgccgaagct ctgaaaattt 10081 ccttcgccga ccgcttcgag tacatgggcg atcccgcgtt cgtagaagtg cctgtagatg 10141 cgttgatttc caaaaaatac gcgacgatcc gtcgccagga aatcaatccg tcgcacgctg 10201 aagattttcg tcctgggaac ccgtccgcat acgtgtcaga gtcgagagac accacacacc 10261 tgactgtcgc cgatgacgag ggcaacgtcg tctcgatgac tcagacgatc cacgcggcct 10321 ttggttccaa agtgacggtt ccaggaacgg gaataatttt gaataacacg atgaacattt 10381 ttgatccaca tcccggcaac gccaactcca ttgaacctgg caagcggatg ctgagcagca 10441 tgtcacctac catcgtgctg aaagacgaca agccgttcat ggctttgggg acgcccggag 10501 caactcggat attcccctcc gtgcttcaag cgataatgaa cgtaatcgac cacggcatga 10561 cgctgcaaga ggcggtggag gctccaagaa tctggtctca aggacaggcg ctggaggtcg 10621 agggcggaat ttcgccagat gtgcgagttg gactcgaagc gttaggacac aacgttcagg 10681 tagtcgcaaa agtcgccgga gggatgaacg gcatcatgta cgacccggca tctggcctca 10741 tccacggcgc ggcctgttgg cgggctgacg gaacccctgt gggtatgagc ggcgggccgg 10801 cgaggcccag cggcgcgaac gcaatgttca ccgtatgaat atttggttgt cgacgaagtg 10861 ggaaattgcg gatcggattg aatttgccga cctattcgtc gcggttcttc gtgcggcgtt 10921 gtcttagaat aggtcacgag acgatatcaa cattgtcacg aaccttgctt caaggactgt 10981 caggtcaggg agtctggaat ctcctctgaa aaatggtcag gttattcaaa caacttggcc 11041 gatttcgcgt gcacacattt agatcatcga cccattattg ggcgctctcc acgatgatat 11101 actccttcaa gcgatctcct tcatgaatcg cgaatatgcg agggtatagc aattgtattt 11161 ccctaattgt cactggtcta tgaaatcttg tttactgtcg agtaatttgg cccaacgtca 11221 ccgggcaggt tttagatcaa cgatgtctga gcacaacgat tcatggaact gtctacgaac 11281 aatatttcgc acaaagcgca attcgccttg caatttccgg atgctcaggt ccatcgtcga 11341 gaggcaaatt catttgcatc gtccgatagc tgccattcgt ttttggtgaa taataattca 11401 ttatttttga cacgaaatcc agacctttgg cgctataatg cgcccaatca aaaagttcca 11461 aggagaccgc cgcgcatgaa gctcgttttg aaagctgacc atctgatcga ttccaccggc 11521 gcagacccga tttccaatgc cgccttggtc atcgaaaatg gccgaatctc acaaatcacc 11581 acccaagaca aactccacat aggcgagaac gaagaagc //