LOCUS SCVU01000703 10688 bp DNA linear ENV 25-FEB-2019 DEFINITION Planctomycetes bacterium isolate GW928_bin.9 GW928_bin.9-39031, whole genome shotgun sequence. ACCESSION SCVU01000703 SCVU01000000 VERSION SCVU01000703.1 DBLINK BioProject: PRJNA514088 BioSample: SAMN10720252 KEYWORDS WGS. SOURCE Planctomycetes bacterium (groundwater metagenome) ORGANISM Planctomycetes bacterium Bacteria; Planctomycetes. REFERENCE 1 (bases 1 to 10688) AUTHORS Tian,R., Ning,D., He,Z., Zhang,P., Spencer,S.J., Gao,S., Shi,W., Wu,L., Zhang,Y., Yang,Y., Adams,B.G., Rocha,A.M., Detienne,B.L., Lowe,K.A., Joyner,D.C., Klingeman,D.M., Arkin,A.P., Fields,M.W., Hazen,T.C., Stahl,D.A., Alm,E.J. and Zhou,J. TITLE Small and mighty: adaptation of superphylum Patescibacteria to groundwater environment drives their genome simplicity JOURNAL Microbiome 8 (1), 51 (2020) PUBMED 32252814 REMARK Publication Status: Online-Only REFERENCE 2 (bases 1 to 10688) AUTHORS Tian,R., Ning,D., He,Z., Zhang,P., Shi,W., Wu,L., Zhang,Y., Yang,Y., Arkin,A., Matthew,F., Hazen,T., Stalh,D., Alm,E. and Zhou,J. TITLE Direct Submission JOURNAL Submitted (10-JAN-2019) Department of Microbiology and Plant Biology, University of Oklahoma, 101 David L Boren Blvd 2030, Norman, OK 73019, USA COMMENT Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Date :: MAY-2018 Assembly Method :: IDBA-UD v. 1.1.1 Genome Representation :: Full Expected Final Version :: Yes Genome Coverage :: Not Applicable Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 01/19/2019 01:25:26 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline Annotation Method :: Best-placed reference protein set; GeneMarkS-2+ Annotation Software revision :: 4.7 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 3,706 CDSs (total) :: 3,673 Genes (coding) :: 3,626 CDSs (with protein) :: 3,626 Genes (RNA) :: 33 tRNAs :: 31 ncRNAs :: 2 Pseudo Genes (total) :: 47 CDSs (without protein) :: 47 Pseudo Genes (ambiguous residues) :: 0 of 47 Pseudo Genes (frameshifted) :: 0 of 47 Pseudo Genes (incomplete) :: 47 of 47 Pseudo Genes (internal stop) :: 4 of 47 Pseudo Genes (multiple problems) :: 4 of 47 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..10688 /organism="Planctomycetes bacterium" /mol_type="genomic DNA" /submitter_seqid="GW928_bin.9-39031" /isolate="GW928_bin.9" /isolation_source="groundwater" /db_xref="taxon:2026780" /environmental_sample /geo_loc_name="USA: Tennessee" /lat_lon="35.987421 N 84.264657 W" /collection_date="2018-05" /metagenome_source="groundwater metagenome" /note="metagenomic" gene complement(<1..223) /locus_tag="EPO68_15910" CDS complement(<1..223) /locus_tag="EPO68_15910" /inference="COORDINATES: ab initio prediction:GeneMarkS-2+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS-2+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="TAJ07933.1" /translation="MQAAEVRTIVQRALGRVLAERGIEQAPTGAGTAAPWVHVEVAPA STRPSARCTIVRTSAACIGPQSLSEFAARR" gene complement(287..778) /locus_tag="EPO68_15915" CDS complement(287..778) /locus_tag="EPO68_15915" /inference="COORDINATES: ab initio prediction:GeneMarkS-2+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS-2+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="TAJ07934.1" /translation="MVGLKGALHPGLLDEKFWSERLQEEAAVPLRMLVLPSGSSSEGM EAFELMVSGRDGERLHAVLVRRAQHDDPPAIGARRALRLVPGHAEHHPIDLADCEAEL YFEPAPHKRLEERVLDTLRMLRAARRVEGVAGARALAAGHSELPPPDEFLIAECLLNR GWI" gene 1156..2976 /locus_tag="EPO68_15920" CDS 1156..2976 /locus_tag="EPO68_15920" /inference="COORDINATES: protein motif:HMM:PF01979.18" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="TAJ07935.1" /translation="MTSRAFLRLALGASAFLGVVLPQPSLGLLAQEPAHETAQDDKKQ DEKQPDEKKGEKKDEKPEADAKSAKKPEKKDDKKDKDALKRPKLAAPTGEVTLVHAKR VLTRPGEELADADVLVRDGWIVGVGKGLTAPEGARVLEADVVCAGFLDPWSNAGVEPT SAFALDGTADSQTVEALDPYEDTHLRRQALRGGVTSARVQIAGRAPIGGVGAIVRLDF AIDVASTPKPEPVDAKADAAKAADKHAAAGDKAAADKAGDKKDEAKKDDAKSDEKKAD EGKDAKKDDAKKDAKAEPAAPPRAAPKLEPPYGLSVLNPAASLAASIGIVRQGAGDPF DRIGEVDRLGSALDAGRRLNEQRVGYAEDLAKWEKEIADKEKQLEKDFKKAKKDREKE QKDAEEKGKEFKDKAYKEDRRPTPPVASPEDEVLARVASGEMPLVVEAHRASEIRALL DVLRNHPRVRLVLAGATEAAPLAHELAERSAAVIVFPLPLGTGRMPGWTEHDLGLAGA LQSAGVRVLIGSGGSREGARDLALLAGLAVGHGLDRDAAFAALTLAAAETFDVADRVG SVEVGKQADLLLLQGDPLSPATRVQYVLVGGKVVLEPEAR" gene 3038..4168 /locus_tag="EPO68_15925" CDS 3038..4168 /locus_tag="EPO68_15925" /inference="COORDINATES: protein motif:HMM:PF07969.9" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="TAJ07936.1" /translation="MLLPALLTTLIATAATPANRAAPATPAGPLAIRAHKAYVGAGPA IEDCVVLIDDGRIREVGRGVRVPDGTPLVEHAGVLTAGLVACGSYDGIAGGTTDVTRA YLPEARLADAFAPAHPDFERLLTAGITTVVLVPRPENVIGGVTAVVKTSGGTIVKRDA QLAVALSTNALLFNREPTSYAQALVQLDALLARPQGTLARAKAGELPVLLQVDPEHEV ARALAFAKKHGLKGALYGAERAGEMAAEIRESGLGVVFHPLGLGSDKRARRSVVALSE AGVPFAFALEAPDHSPEDLRLSAALCVRAGLPVEAGLQALTAGAARIAGVADKLGRVE RGLAADLVLWSGEPLDLASRATHVYVDGVLVHSAPSAAPAKD" gene 4183..5424 /locus_tag="EPO68_15930" CDS 4183..5424 /locus_tag="EPO68_15930" /inference="COORDINATES: protein motif:HMM:PF01979.18" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="TAJ07937.1" /translation="MNDPTNNSRAIRSWLGHGALAACVLSSLALAGDKSWLIKAHKIH TAAGDTIEDGFLQVTDGRITAVAGVSMGGGGDMLEVAAVTPGLIDLSVRIDLGFDSVE QSTETPASLDVENGIDLWDPRFERELKSGVTTVVVSPPDRAVIGGWSLALKTGGPPTL EKRRVAPRVALRAAIGDEPSDGNRPAFGQPDSFYNRRPTTRMGVEWVLRKSFYDALAA RKDAALATDETRAFEEVLAGKAPLFIAASTTQDIRTACFLQREFKLPRLIIDAAAEAW KDPQMVVQSGAAVVLPPFSFGGRGGVDGAFHAWNTAALLHERGLTIALSGHNSIERGD RLAVQPGLAMHGGLPFDAALAAVTINPARMAGIDGRVGSIATGKDADLVLWDGRPFEP SSRIVGVLLDGELIVDPRTQK" gene 5472..7001 /locus_tag="EPO68_15935" CDS 5472..7001 /locus_tag="EPO68_15935" /inference="COORDINATES: protein motif:HMM:PF01979.18" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="TAJ07938.1" /translation="MQTTPSHFFARSVLATSFALLTGGLALGADKLVIEAGRVITRAG HEIKDGVIVIENGRITKVGKQGEVDKPWDATVIGGPSFVAFPGFVEAQTSRGLDRPNE NIDVAPFLDIRDSIDPINFAYEDYLRHGVTTLNVQQGGNCVIGGRGMIVRPVGMTVDE MAVRPDYALKLSASPKTGKSAATQSMALRRAFDELKKRLEKQVQDKKDGKARDKREAM FQGRELEGEKAKGRAMDGKGWRVDGLELVPRGELDEKDLQLLELVEGKRDAFVWCAAA MDVRKALDLAKQNGFLERTTLVLAGDCWKAADVVAASGRPAILVPPMLHVELDPLTGK EVETSVPKAFAEKGVKFAIASEGFGPTTPSQQVLWYQAALAVGQGLPRDKAIAAITTV PAEILGLADRVGSLEAGKDGNVLLLDGEPLSIRTHVEFVIIEGKTAYDRSKDIRAKHL LEGQEPRGTSPAGVDGASDAKPGSKQDALKKDEAAEKKAAEEQRKADEKKHYESSEER G" gene 7001..9949 /locus_tag="EPO68_15940" CDS 7001..9949 /locus_tag="EPO68_15940" /inference="COORDINATES: protein motif:HMM:PF07969.9" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007024011.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="TAJ07939.1" /translation="MISLLTLLPLALASGAPAQADGGRDLVALRAGTVLVVKDGAVLE GGATVLVRGSKIAAVGKDLLIPPDATVVDYGPDAVICPGFVLADSSLLGSTGSTRTAE PALSAVQAFDPYARYDWALAAGITSAYVTPARMRLIAGQGAIVKLAGGPDVDRFVRTP VSVHAALSAEARGAPGYWDPPVPPTSDVGLGVAQRQLPRSLMGASVALRELLVAARNH APAADFGPYAALELGALVQQRVAWLIAATDAAEIRAALALSADFGLNLVINDATYAGE VADEIAARGAAVVYKPPFNPDISLLNRGKSADARWPESDVAARLARAGVRLAVAPGSG ASPRDLRFAAFLAMRGGLDSALALRAITLTPAELYGVADRVGSIEPGKDADLVVLNGL PLDMGSSVVSTWSDGATVWSPTASADDAGMAAQPVVIQVDELHLGDGHVLRPGQILLQ NGKIAEIGERVARPRGAVVVRGVAAMPGIVDALGYLGLEGSARSVGVDYKLSRLLGPG DVADRRVARAGVTTVGLSVRNPSPAGVPIVAYKPAANDFDSLVLREPAGVRLRWSDRN RSRSGSEVKELLDKLGKYQQKWAEYDKALASWKPSGAKADDAKPAEAKADEKKEGDDK AAEEAKADGAEEKKDEKADADKKDEKKDDKKKKDEPEPADPVTGVWEGEFALPAPLSG KAKLRLQLLNENGVVRGSLRCDAVSERLVQLQGTWAAAESGAKELRLSGAGSQGPVQF AAAIAKAKMAAVLGAGAWQADVDAEQKSKEYPVAVRPKAKKPKDEPEAGAASAEKGKP VPPRIDERLEPLRQAALGKAGVIVDVDRADEIAACVDAFAAAGLKPVLYGADEAWKLK DKLAGRVSGVLIDLPFRRNEPEGSISFVNRYAELQSAGIPVAFHSAAEEGAADLLTRA MYTVSQGMSASGALRALTSDAARMLGLSDRVGMLKPGLDADVLLLDASPLSEPARVLR AWVNGSEVLP" gene 9946..>10688 /locus_tag="EPO68_15945" CDS 9946..>10688 /locus_tag="EPO68_15945" /inference="COORDINATES: ab initio prediction:GeneMarkS-2+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS-2+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="TAJ07940.1" /translation="MSALGRTLAGAALVACASPLLSHALAAGEPIIGTPRGLAAPARA DGRSGEPGGPGLALRAAKALTCARNGPLVVDDAVVLVRDGKIESVRPAREAEVPAGYE RLDLGPLWVMPGMVDLHFHSAGSFDINDMVYLANPELRVAASVVPRNPNLDRDVAGGV TTVLYIPGSGTNIGGQGVLLKTGLDRFEEMAVRNPGSLKVAQWGNPERWIMGVGKTFE GWTIRNTFQRGRMYAQAWKDFEAGKGPEP" BASE COUNT 1606 a 3792 c 3930 g 1360 t ORIGIN 1 ctcgccgcgc tgcgaactcg gatagactct gcggcccgat gcaggccgcc gaagtccgaa 61 cgatcgtgca gcgcgcgctc ggccgcgtgc tcgccggcgc gacctcgacg tgcacccacg 121 gcgcggcggt acccgcaccg gtcggcgcct gctcgatgcc gcgctcggcg agcacgcggc 181 cgagcgcgcg ctgcacgatc gttcggactt cggcggcctg catcgggccg cagagtctat 241 ccgagttcgc agcgcggcga gcgcagccgc tcgcggtccg ccggcgctag atccagccgc 301 gattcagcag gcactcggcg atcaggaact cgtcgggcgg gggcagctcg ctgtgccccg 361 cggccagcgc gcgcgcgccg gcgacgccct cgacgcgccg ggcggcgcgc agcatccgca 421 gcgtgtcgag cacgcgctcc tcgaggcgct tgtgcggcgc gggctcgaag tacagctcgg 481 cctcgcagtc ggcgagatcg atcgggtggt gttccgcgtg gccgggcacg aggcgcagcg 541 cgcggcgcgc gccgatcgcc ggcggatcgt cgtgctgtgc gcgccggacg agcaccgcgt 601 gcaggcgctc gccgtcgcgg ccgctcacca tcagctcgaa ggcctccatg ccttcgctcg 661 agctgcccga cgggaggacg agcatccgca gcggcaccgc cgcttcttcc tgcagccgct 721 cactccagaa cttttcgtcc aggaggccgg gatgcaatgc ccccttgagc cctaccatga 781 tcggggcctg agccgagcgc ctcgcagggg tcgaacgccg ggggtcagat tgcggcgggg 841 cggtcgccgg gtttgacctc cgctcaatga atccgtttcc gcgtccttac acacgcggcg 901 tgagcgagtt gccgtcgtgc gcagcgagaa cggccgtccg gaaccggggg cgaacgctgt 961 aacatgaccg taagccaaag ccaattcata ggttagaatc gggggatttc tggccagcgc 1021 gcgcggacgt ccctatcatc acgcgccctg gatggacccc ggcgtccagc cctgggagct 1081 cgccctccgg aagcgcactc ggtcgagtcg cggaggactg agtcagagag agcaaccccc 1141 tgaccccgct tgcccatgac ttcccgcgct ttcctccggc tggccctcgg cgccagcgcc 1201 ttcctgggtg tcgtgctgcc gcagccgtcg ctgggtttgc tcgcgcagga gcccgcgcac 1261 gaaaccgcgc aggacgacaa gaagcaggac gagaagcagc cggacgaaaa gaagggggag 1321 aagaaggacg agaagcccga ggccgacgcg aagtccgcga agaagccgga gaagaaggac 1381 gacaagaagg acaaggacgc gctgaagcgg ccgaagctcg cggcgccgac cggtgaagtc 1441 acgctcgtcc acgccaagcg cgtgttgacg cggcccggag aagagctcgc ggacgccgac 1501 gtgctcgtgc gcgacggctg gatcgtcggc gtcggcaagg ggctgaccgc gccggagggc 1561 gcgcgcgtgc tcgaggccga cgtcgtctgt gcgggcttcc tcgatccgtg gtcgaacgcc 1621 ggcgtcgaac cgacgagcgc gttcgcgctc gacgggaccg ccgacagcca gaccgtcgag 1681 gcgctcgatc cctacgagga cacgcatctg cgccgccagg cgctgcgcgg cggcgtcacg 1741 agcgcgcgcg tgcagatcgc cggccgtgcg ccgatcggcg gcgtcggagc gatcgtgcgg 1801 ctcgatttcg cgatcgacgt cgcgtcgacg ccgaagccgg agcccgtgga cgcgaaagcg 1861 gacgctgcga aggccgccga caagcacgct gccgcgggcg acaaggccgc cgcggacaag 1921 gccggcgaca agaaggacga ggccaagaag gacgacgcca agtcggacga gaagaaggcc 1981 gacgaaggca aggacgcgaa gaaggacgac gccaagaagg acgcgaaggc cgagccggcc 2041 gcgccgccgc gcgccgcgcc gaagctcgag ccgccgtacg gcctgtcggt gctcaatccc 2101 gccgcgtcgc tcgcggcctc gatcggcatc gtgcgccagg gcgcgggcga tccgttcgat 2161 cgcatcggcg aggtcgatcg cctcggctcc gcgctcgacg ccggtcgccg gctgaacgaa 2221 cagcgcgtcg gctacgccga ggatctggcg aagtgggaga aggagatcgc cgacaaggag 2281 aagcagctcg agaaggactt caagaaggcg aagaaggacc gcgagaagga acagaaggac 2341 gcggaggaga agggcaagga gttcaaggac aaggcgtaca aggaagaccg ccgtccgacg 2401 ccgcccgtcg cgagccccga agacgaagtg ctggcgcgcg tcgcgagcgg tgagatgccg 2461 ctcgtcgtcg aggcgcaccg cgcgagcgag atccgcgcgc tgctcgacgt gctgcgcaac 2521 catccgcgcg tgcgcctcgt gctcgccggc gcgaccgagg ccgcgccgct cgcgcacgaa 2581 ctcgccgaac gctcggccgc ggtgatcgtg ttcccgctgc cgctcggcac cggtcgcatg 2641 ccgggctgga ccgagcacga tctcggcctc gccggcgcgc tgcagagcgc gggggtgcgc 2701 gtgctgatcg gctcgggcgg ctcgcgcgaa ggcgcgcgcg acctcgcgct gctcgcgggc 2761 ctcgccgtcg gccacggtct cgaccgcgac gcggcgttcg ccgcgctgac gctcgccgcc 2821 gccgagacgt tcgacgtcgc cgaccgcgtc ggcagcgtcg aggtcggcaa gcaggccgac 2881 ctcctgttgc tgcagggcga tccgctgtct cccgcgacgc gcgtgcagta cgtgctcgtc 2941 ggcggcaagg tcgtgctcga gcccgaggcc cgctgaggcc tcgccgcgtc gtcccccgct 3001 catcgctcgc ctcgtatcca tcgaccgcag gatcccgatg ctgctccccg cactgctcac 3061 gaccctgatc gccaccgccg cgacgcccgc gaaccgcgcg gcgccggcga ctcccgccgg 3121 tccgctcgcg atccgcgcgc acaaggccta cgtcggcgcc gggccggcga tcgaggactg 3181 cgtcgtgctg atcgacgacg gccgcatccg cgaggtcggc cgcggcgtgc gcgtgccgga 3241 cgggacgccg ctggtcgagc acgccggcgt gctgaccgcc gggctcgtcg cgtgcggcag 3301 ctacgacggc atcgcgggcg gcacgacgga cgtcacgcgc gcgtacctgc ccgaggcgcg 3361 gctcgccgac gcgttcgcgc cggcgcaccc ggacttcgag cgactgctca ccgccggcat 3421 caccacggtc gtgctcgtgc cgcgtcctga gaacgtgatc ggcggcgtga ccgcggtcgt 3481 gaagacgagc ggcggcacga tcgtcaagcg cgacgcgcag ctcgccgtcg cgctgtcgac 3541 gaacgcgctg ttgttcaatc gcgagccgac cagctacgcg caggcgctcg tgcagctcga 3601 cgcgctgctc gcgcggccgc agggcacgct cgcgcgcgcg aaggcgggcg agctgccggt 3661 gctgctgcaa gtcgatcccg agcacgaggt cgcgcgcgcg ctcgcgttcg cgaagaagca 3721 cggcctcaag ggcgcgctgt acggcgccga gcgcgcgggc gagatggcgg ccgagatccg 3781 cgagagcggt ctcggcgtcg tgttccaccc gctcggtctc ggcagcgaca agcgcgcgcg 3841 gcgctcggtc gtcgcgctgt cggaagccgg tgtgccgttc gcgttcgcgc tcgaggcgcc 3901 cgaccactcg cccgaggacc tgcgcctttc cgccgcgctg tgcgtgcgcg cgggcctgcc 3961 ggtcgaagcc ggcctgcagg cgctgaccgc gggcgcggcg cgcatcgcgg gcgtcgccga 4021 caagctcggc cgcgtcgagc gcggtctggc cgcggacctc gtgctgtgga gcggcgagcc 4081 gctcgacctc gcgagccgcg cgacgcacgt ctacgtcgac ggcgtgctcg tgcacagcgc 4141 gccctcggcc gcacccgcga aggactgacg cgagagccat ccatgaacga ccccacgaac 4201 aactcccgcg cgatccgctc gtggctcggc cacggcgcgc tcgccgcctg cgtgctctcg 4261 agcctcgcgc tcgccggcga caagagctgg ctgatcaagg cgcacaagat ccacacggcc 4321 gccggcgaca cgatcgagga cggcttcttg caggtcaccg acggccgcat caccgcggtg 4381 gcgggcgtct cgatgggcgg cggcggcgac atgctcgagg tcgcggccgt cacgcccggt 4441 ctgatcgact tgtcggtgcg catcgacctc ggcttcgact cggtcgagca gtcgacggag 4501 acgccggcgt cgctcgacgt cgagaacggc atcgacctgt gggacccgcg cttcgagcgc 4561 gagctcaaga gcggcgtcac gaccgtcgtc gtcagcccgc ccgaccgcgc cgtgatcggc 4621 ggctggtcgc tcgcgctgaa gaccggcggc ccgccgaccc tcgagaagcg ccgcgtcgcg 4681 ccgcgcgtcg cgctgcgcgc cgcgatcggg gacgagccga gcgacggcaa tcgcccggcc 4741 ttcggccagc ccgacagctt ctacaaccgc cgcccgacga cccgcatggg cgtcgaatgg 4801 gtgctgcgca agtcgttcta cgacgcgctc gccgcgcgca aggacgccgc gctcgcgacg 4861 gacgagacgc gcgcgttcga ggaagtgctc gccggcaagg cgccgctgtt catcgcggcc 4921 tcgaccacgc aggacatccg caccgcgtgc ttcctgcagc gcgagttcaa gctgccgcgc 4981 ctgatcatcg acgcggccgc ggaggcctgg aaggatccgc agatggtcgt gcagagcggc 5041 gcggccgtcg tgctgccgcc gttctcgttc ggcggccgcg gcggcgtcga cggcgcgttc 5101 cacgcgtgga acaccgcggc gctgttgcac gagcgcgggc tcacgatcgc gctctccggt 5161 cacaacagca tcgagcgcgg cgaccgactc gccgtgcagc ccggcctcgc gatgcacggc 5221 ggactgccgt tcgacgcggc gctcgcggcc gtgacgatca atcccgcgcg catggccggc 5281 atcgacggcc gcgtcggcag catcgcgacc ggcaaggacg ccgacctcgt gctgtgggac 5341 ggccgcccgt tcgagccgtc ctcgcgcatc gtcggcgtgc tgctcgacgg cgagctcatc 5401 gtcgatccgc gcacgcagaa gtgatctcgc ctctcgccac tcgcccaccg cacaccctca 5461 cccccgatcc catgcaaacc acgccttccc acttcttcgc gcgctccgtg ctcgcgactt 5521 cgttcgcgct gctcaccggc ggcctcgcgc tgggcgccga caagctcgtc atcgaggcag 5581 gtcgcgtcat cacgcgcgcc ggtcacgaga tcaaggacgg cgtgatcgtg atcgagaacg 5641 gtcgcatcac gaaggtcggc aagcagggcg aggtcgacaa gccctgggac gcgaccgtga 5701 tcggcggacc gagcttcgtc gcgttccccg gcttcgtcga ggcacagacc tcgcgcggcc 5761 tggaccgccc gaacgagaac atcgacgtcg cgccgttcct cgacatccgc gactcgatcg 5821 acccgatcaa cttcgcatac gaggactacc tgcgccacgg cgtgacgacg ctcaacgtgc 5881 agcagggcgg caactgcgtg atcggcgggc gcggcatgat cgtgcggccg gtcggcatga 5941 cggtcgacga gatggcggtg cggcccgact acgcgctcaa gctctcggcc tcgccgaaga 6001 ccggcaagag cgcggcgacg cagtcgatgg cgctgcgccg cgcgttcgac gagctgaaga 6061 agcgcctcga gaagcaggtg caggacaaga aggacggcaa ggcccgcgac aagcgcgagg 6121 cgatgttcca gggccgcgag ctcgagggcg agaaggccaa gggccgcgcg atggacggca 6181 agggctggcg cgtcgacggg ctcgagctcg tgccgcgcgg cgagctcgac gagaaggatc 6241 tgcagctgct cgagctcgtc gagggcaagc gcgacgcgtt cgtctggtgc gccgccgcga 6301 tggacgtgcg caaggcgctc gacctcgcga agcagaacgg cttcctcgag cgcacgacgc 6361 tcgtgctcgc cggcgactgc tggaaggccg ccgacgtcgt cgccgcttcc gggcgtccgg 6421 cgatcctcgt cccgccgatg ctgcacgtcg agctcgaccc gctgaccggc aaggaagtcg 6481 agacctcggt gccgaaggcg ttcgccgaga agggcgtcaa gttcgcgatc gcgtcggagg 6541 gtttcggccc gaccacgccg agccagcagg tgctgtggta ccaggccgcg ctcgcggtcg 6601 gacagggtct gccgcgcgac aaggcgatcg ccgcgatcac gacggttccc gccgagatcc 6661 tcggcctcgc cgatcgcgtc ggctcgctcg aggcgggcaa ggacggcaac gtgctgctgc 6721 tcgacggcga gccgctgtcg atccgcacgc acgtcgagtt cgtgatcatc gaaggcaaga 6781 ccgcctacga ccgcagcaag gacatccgcg ccaagcacct gctcgagggc caggagccgc 6841 gcggcacgag cccggccggc gtcgacggcg cgtcggacgc gaagcccggt tcgaagcagg 6901 acgcgctgaa gaaggacgag gccgccgaga agaaggcggc cgaggagcag cgcaaggcag 6961 acgaaaagaa gcactacgag tcgagcgagg agcgcggctg atgatctctc tgttgaccct 7021 gctgccgctc gcgctcgcga gcggcgcgcc cgcgcaggcc gacggcggcc gcgacctcgt 7081 cgcgctgcgc gcgggcaccg tgctcgtcgt caaggacggc gccgtgctcg agggcggcgc 7141 gaccgtgctc gtgcgcggct cgaagatcgc cgcggtcggc aaggatctgc tgatcccgcc 7201 cgatgcgacg gtcgtcgact acggccccga cgcggtgatc tgcccgggct tcgtgctcgc 7261 cgactcgagc ctgctcggct cgaccggctc gacgcgcacc gccgagcccg cgctgtcggc 7321 cgtgcaggcc ttcgacccgt acgcgcgcta cgactgggcg ctcgccgccg gaatcacgag 7381 cgcctacgtg acgccggcgc gcatgcgcct gatcgcgggc cagggcgcga tcgtcaagct 7441 cgccggaggt ccggacgtcg atcgcttcgt gcgcacgccg gtctccgtgc acgcggcgct 7501 cagcgccgag gcgcgcggcg cgcccggcta ctgggatccg ccggtgccgc cgaccagcga 7561 cgtcggtctc ggcgtcgcgc agcggcagtt gccgcgctcg ttgatgggcg cgagcgtcgc 7621 gctgcgcgag ctgctcgtcg cggcgcgcaa tcacgcgccg gccgccgatt tcggcccgta 7681 cgcggcgctc gagctcggcg cgctcgtgca acagcgcgtg gcgtggctga tcgccgcgac 7741 cgacgcggcg gagatccgcg cggcgctcgc gctgtccgcc gacttcggcc tgaacctcgt 7801 gatcaacgac gcgacgtacg cgggcgaggt cgccgacgag atcgcggcgc gcggcgcggc 7861 ggtcgtgtac aagccgccgt tcaatcccga catctcgctg ctcaatcgcg gcaagtcggc 7921 cgatgcgcgc tggccggaat ccgacgtcgc cgcgcgcctg gcgcgcgccg gcgtgcgtct 7981 cgcggtcgca ccggggtccg gcgcgagccc gcgcgacctg cgcttcgcgg cgttcctcgc 8041 gatgcgcggc ggcctcgatt ccgcgctcgc gttgcgcgcg atcacgctga cgccggccga 8101 gctctacggc gtcgccgatc gcgtcggcag catcgagccc ggcaaggatg ccgacctcgt 8161 cgtgctgaac ggcctgccgc tcgacatggg ctcgagcgtg gtctcgacgt ggtccgacgg 8221 cgcgaccgtc tggagcccga cggcgagcgc cgacgacgcg gggatggccg cgcaaccggt 8281 cgtgatccag gtcgacgaac tgcacctcgg cgacggccac gtgctgcgcc cgggccagat 8341 cctgctgcag aacggcaaga tcgcggagat cggcgagcgc gtcgcgcgtc cgcgcggcgc 8401 ggtcgtcgtg cgcggcgtcg cggcgatgcc gggcatcgtc gacgcgctcg gctacctcgg 8461 cctcgagggc tcggcgcgca gcgtcggcgt cgattacaag ctgtcgcggt tgctcggccc 8521 cggtgacgtc gccgatcgac gcgtcgcgcg cgccggcgtc acgacggtcg gcctgtccgt 8581 gcgcaatccg tcgccggccg gcgtgccgat cgtcgcgtac aagcccgcgg ccaacgactt 8641 cgactcgctc gtgctgcgcg agccggcggg cgtgcgactg cgctggtcgg atcgcaaccg 8701 ctcgcgctcc ggcagcgaag tgaaggagct gctcgacaag ctcggcaagt accagcagaa 8761 gtgggccgag tacgacaagg cgctcgcgag ctggaagccg agcggcgcga aggccgacga 8821 cgcgaagccg gccgaggcga aggccgacga gaagaaggaa ggcgacgaca aggccgccga 8881 agaggcgaag gccgacggcg ccgaggagaa gaaggacgag aaggccgacg ccgacaagaa 8941 ggacgagaag aaggacgaca agaagaagaa ggacgagccc gagcccgccg acccggtgac 9001 gggcgtgtgg gagggcgagt tcgcgctgcc cgcgccgctg tcgggcaagg ccaagctgcg 9061 gctgcagctg ttgaacgaga acggcgtcgt gcgcggctcg ctgcgctgcg acgcggtgtc 9121 cgagcggctc gtgcagctgc agggcacgtg ggccgcggcg gagtcgggcg cgaaggaact 9181 gcgcctgtcc ggcgccggct cgcagggccc ggtccagttc gccgccgcga tcgcgaaggc 9241 gaagatggcc gcggtgctcg gcgccggcgc gtggcaggcc gacgtcgacg ccgagcagaa 9301 gtcgaaggag tacccggtcg ccgtgcggcc gaaggcgaag aagccgaagg acgagcccga 9361 ggccggcgcg gcgtccgcgg agaagggcaa gcccgttccg ccgcgcatcg acgagcggct 9421 cgagccgctg cggcaggccg cgctcggcaa ggccggcgtg atcgtcgacg tcgatcgcgc 9481 cgacgagatc gcagcgtgtg tcgacgcgtt cgcggcggcc ggcctgaagc cggtgctgta 9541 cggcgccgac gaggcgtgga agctcaagga caagctcgcg ggccgcgtgt cgggcgtgct 9601 gatcgacttg ccgttccggc gcaacgagcc cgagggcagc atcagcttcg tcaaccgcta 9661 cgccgagctg cagtcggccg gcatcccggt cgcgttccac tccgcggccg aggaaggcgc 9721 ggcggatctg ctgacgcgcg cgatgtacac ggtctcgcag ggcatgagcg cgagcggtgc 9781 gctgcgcgcg ctgacctcgg atgccgcgcg gatgctcgga ttgtccgacc gcgtcggcat 9841 gttgaaacca gggctcgacg ccgacgtcct gttgctcgac gcgagcccgc tgtccgaacc 9901 cgcgcgcgtt ctgcgcgcgt gggtcaacgg aagcgaggtg ctgccgtgag cgcgctcggt 9961 cgaacgctgg ccggcgccgc gctcgtcgcg tgcgcgagcc cgctgctctc gcacgcgctc 10021 gcggccggcg agccgatcat cggcacgccg cgcggcctcg ccgcgcccgc gcgcgccgac 10081 ggtcgctcgg gcgaacccgg cggcccgggg ctcgcgctgc gcgccgcgaa ggcgttgacg 10141 tgcgcgcgca acggcccgtt ggtcgtcgac gacgcggtcg tgctcgtgcg cgacggcaag 10201 atcgagtcgg tgcgcccggc gcgcgaagcc gaggttccgg ccggctacga gcggctcgat 10261 ctcggtccgc tgtgggtgat gccggggatg gtcgacctgc acttccactc ggcgggctcg 10321 ttcgacatca acgacatggt gtacctcgcg aacccggagc tgcgggtcgc ggcgagcgtg 10381 gtcccgcgca atccgaacct cgatcgcgac gtcgccggcg gcgtgacgac cgtgctgtac 10441 atccccggct cgggcacgaa catcggcggg cagggcgtgc tgttgaagac cggcctcgat 10501 cgcttcgagg agatggcggt gcgcaacccg ggctcgctca aggtcgcgca gtggggcaat 10561 cccgagcgct ggatcatggg cgtcggcaag accttcgaag gctggacgat ccgcaacacg 10621 ttccagcgcg gccgcatgta tgcgcaggcg tggaaggact tcgaggcggg caaggggccc 10681 gagcccga //