LOCUS JAAYJW010000483 11829 bp DNA linear ENV 18-APR-2020 DEFINITION Candidate division WS1 bacterium isolate AS10tlH2TH_342 321848_AS10, whole genome shotgun sequence. ACCESSION JAAYJW010000483 JAAYJW010000000 VERSION JAAYJW010000483.1 DBLINK BioProject: PRJNA602310 BioSample: SAMN13893989 Sequence Read Archive: SRR3166102, SRR2016847, SRR3166092, SRR2016852 KEYWORDS WGS. SOURCE candidate division WS1 bacterium (anaerobic digester metagenome) ORGANISM candidate division WS1 bacterium Bacteria; candidate division WS1. REFERENCE 1 (bases 1 to 11829) AUTHORS Campanaro,S., Treu,L., Rodriguez-R,L.M., Kovalovszki,A., Ziels,R.M., Maus,I., Zhu,X., Kougias,P.G., Basile,A., Luo,G., Schluter,A., Konstantinidis,K.T. and Angelidaki,I. TITLE New insights from the biogas microbiome by comprehensive genome-resolved metagenomics of nearly 1600 species originating from multiple anaerobic digesters JOURNAL Biotechnol Biofuels 13, 25 (2020) PUBMED 32123542 REMARK Publication Status: Online-Only REFERENCE 2 (bases 1 to 11829) AUTHORS Campanaro,S. TITLE Direct Submission JOURNAL Submitted (28-JAN-2020) Department of Environmental Engineering, Technical University of Denmark, Bygningstorvet, Bygning 115, Kgs. Lyngby 2800, Denmark COMMENT The annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP). Information about PGAP can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Date :: JUN-2018 Assembly Method :: Megahit v. 1.1.1 Genome Representation :: Full Expected Final Version :: No Genome Coverage :: 312.7031892135x Sequencing Technology :: NextSeq 500 ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 04/13/2020 17:14:56 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline (PGAP) Annotation Method :: Best-placed reference protein set; GeneMarkS-2+ Annotation Software revision :: 4.11 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 4,129 CDSs (total) :: 4,085 Genes (coding) :: 4,059 CDSs (with protein) :: 4,059 Genes (RNA) :: 44 rRNAs :: 1 (5S) partial rRNAs :: 1 (5S) tRNAs :: 39 ncRNAs :: 4 Pseudo Genes (total) :: 26 CDSs (without protein) :: 26 Pseudo Genes (ambiguous residues) :: 0 of 26 Pseudo Genes (frameshifted) :: 5 of 26 Pseudo Genes (incomplete) :: 21 of 26 Pseudo Genes (internal stop) :: 0 of 26 CRISPR Arrays :: 1 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..11829 /organism="candidate division WS1 bacterium" /mol_type="genomic DNA" /submitter_seqid="321848_AS10" /isolate="AS10tlH2TH_342" /isolation_source="anaerobic digestion of organic wastes under variable temperature conditions and feedstocks" /db_xref="taxon:2099669" /environmental_sample /metagenome_source="anaerobic digester metagenome" /note="metagenomic" gene complement(213..875) /locus_tag="GX131_19380" CDS complement(213..875) /locus_tag="GX131_19380" /inference="COORDINATES: protein motif:HMM:NF024863.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="SGNH/GDSL hydrolase family protein" /protein_id="NLO07989.1" /translation="MMTLLTRPWLCVAGEMQAEFAPIEDNPELPRVLLIGDSISVGYT LPVREMLAGEANVHRACENCGATSRGVERVDAWIADGPWDVIHFNFGLHDLKIMEDGN HQVPIEQYEENLEQIVAKLQQTGAEVIWASTTPVPEGVGGPKRDPEDVTRYNGVAEAV MHAHRIPVDDLYEFALPQLEKIQMPKNVHFTPEGSKVLARQVVAAIRGALMPLRTAGG TP" gene complement(917..1999) /locus_tag="GX131_19385" CDS complement(917..1999) /locus_tag="GX131_19385" /inference="COORDINATES: ab initio prediction:GeneMarkS-2+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS-2+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NLO07990.1" /translation="MPSSLTPRERCERAVRFEDVDHVPLTIYESKIPQAEAERKLRND GLCIVQRGPNVFSVRHPNVLDSESHTYVENGRQLTRTVTRTTAGDLTTVSEPAGFTTW HHEKMFKGPDDYAALLAYASDVEVVPSYEPFIRMSEMKGGDAVMRGAVGLEPMQLIIH HWMGVEVFSVEWMERQDEIMKLHDALVEERRKTYPVLAASPCEHFNYGGNVTPEIIGP RRFRDYYLPHYEECCEVLHTAGKLVGCHFDANTKIIADAIAETSLDYIEAFTPAPDTD MTLAEARAVWPDKALWINFPSSVWLRSDEQMQEVTLDLLQQAAPGNGLLIGVTEDVPD GRLFPGLQAINRVILEHGRLPIQPKS" gene 2136..3377 /locus_tag="GX131_19390" CDS 2136..3377 /locus_tag="GX131_19390" /inference="COORDINATES: protein motif:HMM:NF013227.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NLO07991.1" /translation="MSKLAIDGGTPVRTDPWPARLQIDDREINAVMELMTAAKAGGAF DRYGGIHVDEYEVKFAEKIGTRWATGMSAGTAAVHSALAALRLEPGTEVITAPITDQG AVMPVVAQGYIPVFADASPESMNMSPEGIRAKLTARTGAIICGHIAGIPCQMDEIMAI AAEHDLYVIEDCAQAHGATYGGKTVGSIGHLGAYSLMSGKHTVAGGQGGMVTTDDEEL YWNAKRFADRGKPFNSEVAGNLFLGLNYRMTELDAVIGKVQLEKMDDIANRRRAFVYK LEEEIADLQAFNVCWYPEKSDPSWWFLLIRIYPEKLTCDKATAVEAIQAEGIPVGMNY GAIAPKAHWLQEQSAFGRDSHFPWDCVWPDGDWEWALDVPNAWVADENHMRMGVHECC DEQTALDTAEALRKVEAAYMT" gene 3749..5230 /locus_tag="GX131_19395" CDS 3749..5230 /locus_tag="GX131_19395" /inference="COORDINATES: protein motif:HMM:NF019339.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="tetratricopeptide repeat protein" /protein_id="NLO07992.1" /translation="MSEHSEDPIEAAKALLQEGKLAEVVRLLQPDVARHQDDSVAWSL LGAAYFELEQWEAAEEAARHTLRLTPNSPRQWSNLGTILRKQGRHDEAEDAQHRALEL DRKYRRAETELAKIRKERKTPSEDDGWRVRTPDGDFIGPLSKTELTELLTLHQVESGW SARRGAGRIVTVREAIGDEAFDEAVRARAASASTEPEMDRQPESQKELPSAAGPIKAK LLVSALVAVIVVLGASLVWLLVTGTRHESAMIAERPATEVAVARSPEPEKPVNSPPSD SLTQSASREPTQAPIAEEPEPSSRVTTSYPQSARQEHPEAERPHQSTDDPAKAVPPGR QAVASSSSVRTEALDAIGDLETAVEVGVNYRDYSNYLIEAKRELRRLQSRVPAGALWW TEIELAMYDYEFAGDAWDWKFSGSGVRNFVYEGSPEFQLAYRKYGQSGLEPWLSRSEA DPGIPSAGIPPTPSRWMLWVDGIVQTAWSSASSHLARASVTTR" gene 5414..6448 /locus_tag="GX131_19400" CDS 5414..6448 /locus_tag="GX131_19400" /inference="COORDINATES: protein motif:HMM:NF013567.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="Gfo/Idh/MocA family oxidoreductase" /protein_id="NLO07993.1" /translation="MSEDRKARLAFIGAGGFATNSLYPNLHKVPEIDLVAICDLDEDK ARRNARNFGAREVYTDADKMLDEQQPDGVFVIGPAPMHYKVAPVVLKRGIPVYVEKPS ANTSPEAKELAELAEANNTWGQCGFMKRFAYVYTMAKQIMAREEFGALNMLTIHFGQG PYPQIWGIDSARRSMLIGQLCHIFDLTRFLGGDVATVQALYREVTPTQFAYLVNLTFK SGAIGQLDLNGLETKTGFRDIREWVKLVGFESFIECDGMQSLTWQPPKEWIDFPEHTG RFTYNYDPAWTGISNSRANFGYLGEVRHFALRCIGEVEGGPDLWDSYEALRLGEAIYE ATETGGIVEL" gene 6583..7299 /locus_tag="GX131_19405" CDS 6583..7299 /locus_tag="GX131_19405" /inference="COORDINATES: protein motif:HMM:NF019231.1,HMM:TIGR04294.1" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_018196858.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF1559 domain-containing protein" /protein_id="NLO07994.1" /translation="MRRGFTLIELLVVIAIIAILAAILFPVFARAREKARQSACLNNV KQISLGILMYAQDYDERMPMLYMNRRPSSSDWYGIMHMLDPYIKNRNVHDCPSASHTS NLTSYGGNRSYGYHREIIVANGSRKMALIQRPAEIVMMGDVCHDQNNNCTLNSPASGP FKCDPDGTNCQVCGGTHNSLYAEPLGSHSSRYDRPDFNFLERHNGTGNAGFCDGHAKA MKHSQLYNGGDPHPHFDWGA" gene 7394..9175 /locus_tag="GX131_19410" CDS 7394..9175 /locus_tag="GX131_19410" /inference="COORDINATES: protein motif:HMM:NF018190.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NLO07995.1" /translation="MKRALIAVAFASICTLAAAQVEHAGPPAKKLLEYGWDVPKPSYV AEHIREMEERPFDGVLMRISGLGSVFDPTPRTREQYAAELDALERIEWDRFTDNFLMT YAASKMDWFSDEHWAIIVDNIRLLASGAEAGGCVGLCFDAEPYGENPWHYPTQPGADQ HSFADFQAKVRQRGAQFIDAVEAEMSEPVVHTFYLTTLAGFRNAAMAETAEQRDEILR DYSYGLYAAFINGILDAIDPGTVLTDGNEPAYYYHDSRQYLDVYHYIKQGALGAIAPE NHGKYRGQVQVAQALYVDHLMGLRARKVEGHYLTPEEQAQWFEHNTYWALKTSDRYVW CYSEKMNWWTDTDVPAGLEQAIINARTTHDAGQPMEIDADALWQKAHQRMQEEIEADL LRRDATIAKLSRALTPVIDGARDDEAWQYATELEPFVATFSVERPDLATTLAHVAYDD EALYVAIRCMEPQMEAMEVIGGSRDDNVWMGDSVDLFLQREQGGRFYHIIVNPANVIW DAVHEGEAGDRSWDPDYQSATYRGDEFWNVEIALPWATMGWEAPTRGDTLRANICRQR RAVSELSSWSQVVTGFVEPGSFGTWKF" gene complement(9288..10076) /locus_tag="GX131_19415" CDS complement(9288..10076) /locus_tag="GX131_19415" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007416826.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="LmbE family protein" /protein_id="NLO07996.1" /translation="MSENKVALAVGAHPDDVEFLMAGTLAMLGDAGYELHIMTVGNGN CGTAEYTHEEIIRIRGREARNAAALIGATYHHGLVNDIEIYYEDELLRRVTARIRQIK PNIVLTQSPQDYMEDHTNTSRLTVTACFTRGMRNWQTIPREVPTFQDVYVYHAQPLGN LDPMRNPIIPSLLVDITEKMDVKAAMLREHKSQKNWLDISQGKDAYIDAMRDMAEQVA DMSESTVKYAEGWRQHLHVGHSAHDGDPLSEVLGSKVECIRQGG" gene 10357..11664 /locus_tag="GX131_19420" CDS 10357..11664 /locus_tag="GX131_19420" /inference="COORDINATES: ab initio prediction:GeneMarkS-2+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS-2+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="NLO07997.1" /translation="MQIGFSSANITPSYGMEVPGGMSKRFTQGVHDELQATAAVFDDG EQAVALVGVDSLSIKHSVVERAREIIFDAIGIVPEHVMCGASHTHAGGPAADIFISES DPEYLTHMSRQIATAVIDANRRKQELQVGVRVGEVKGVAHPRRWLMKDGSQRSHPRAD QANMDRPQGELDESCNVVGVAGEDGIPRGCVVNFTCHGTTGWGAGFASADWIGYFRQC LKRVFGDDFGVVFLNGACGDVTQVDNTQDPDYVQSGPIAARRVGGSVAGETLKQLVQM RFVDSMAVGGKRVQIELEPRVPTEEQIAWAREHAESTEPNPARWATDSIWSREWLNLA KMVETEAVVPCELQAIRIGDTAFASNPGEFFCSLGMGIKRGSPFGNTFVVELANGSIG YVPSEDAYEGGYESQMAPSSKLVAGSGEKIVAETIELLKGLHR" BASE COUNT 2274 a 3661 c 3772 g 2122 t ORIGIN 1 ctcaatcgct gcggcgcgaa gttcgcccct gcggtcggcc ggacctgctt agtgagtggc 61 gctcggctcg ggagtgagcg ccggttccag cgtcagatga accttcagcg agaaactcgg 121 gatcgtcccg ccatccctca tacagctcct cagagcctgt ctctgctccc atggcttgtg 181 ctcctacccg ctccttttga tgcccaggta actcacggcg tcccgccagc agttcgcaac 241 ggcatgagcg caccgcggat ggccgccacc acctggcgcg ccagcacctt cgagccctcg 301 ggcgtgaagt gcacgttctt cggcatctgt atcttttcaa gctgcgggag cgcgaactcg 361 tagaggtcat ctaccgggat gcggtgggcg tgcattaccg cctcggccac gccattgtat 421 cgggtgacgt cctcgggatc gcgcttcggc ccaccaacac cctcgggcac cggggtggtt 481 gaggcccata tcacctcagc acccgtctgc tgaagtttcg ccacgatctg ctcaaggttt 541 tcctcatact gctcgatcgg cacctggtga ttgccgtcct ccatgatctt cagatcgtgc 601 aggccgaagt tgaagtggat cacgtcccac ggtccgtccg caatccacgc atcaacccgc 661 tccactccgc ggctcgttgc tccgcagttc tcgcaggccc gatgcacatt cgcctcgccg 721 gcgagcatct cgcgcaccgg cagcgtgtaa ccgacggaga ttgaatcccc tatgagcagc 781 acccgcggca gttccgggtt gtcctcaatc ggggcgaact ccgcctgcat ctcgcccgcc 841 acgcacagcc acggccttgt cagcagcgtc atcatgagcg ttccctccat ctgcattctc 901 cgttgcgcgg taggagctaa gacttcggct ggatcggcag gcggccgtgt tcgaggatga 961 cccggttgat tgcctgcagc cccgggaaga ggcggccgtc gggcacgtcc tcggtgacgc 1021 cgatcagcag gccgtttccc ggcgccgcct gctgcagcag gtccagcgtg acctcctgca 1081 tctgctcatc gctgcgcagc cacacggaag atgggaagtt gatccagagc gccttgtcgg 1141 gccacacggc ccgcgcctca gcgagcgtca tgtcggtatc cggggcaggc gtgaaggcct 1201 cgatgtagtc gagcgaggtc tcggcgatgg catctgctat gatcttcgta ttggcgtcga 1261 agtgacagcc cacgagtttc ccggcggtat gcagtacttc gcagcactcc tcgtagtgcg 1321 gcaggtagta gtcccggaag cgcctcggac cgatgatctc gggcgttaca ttgccgccgt 1381 agttgaagtg ctcgcacggg gaggccgcaa gcaccgggta ggtcttccgc cgctcctcca 1441 cgagcgcgtc gtggagcttc attatctcat cctgccgctc catccactcc acagagaaga 1501 cctccacgcc catccagtgg tggatgataa gctgcatcgg ctcaaggccc accgcgccgc 1561 gcatcaccgc atcgccgccc ttcatctcgc tcatgcggat gaatggctca tagctcggca 1621 ccacctcgac atccgaggcg taggcaagca gcgcggcata atcatcgggg cccttgaaca 1681 tcttctcgtg gtgccaggtg gtgaagcccg cgggttcgct gacggtggtc agatcgcccg 1741 cggtggtgcg ggtaaccgtg cgcgtcaact ggcgcccgtt ctcgacgtag gtgtggctct 1801 cgctgtcgag cacgttcggg tgccgcacac tgaagacgtt cggaccgcgc tggacgatgc 1861 agaggccatc gttgcgcagc ttgcgctctg cctcggcctg cgggatcttg ctctcgtaga 1921 tcgtaagcgg aacgtggtca acgtcctcga agcgcactgc tcgttcgcag cgctcgcgtg 1981 gggtcagcga tgatggcatg aagcgctcct ctcttgaaat gtatctcagg attcgccaat 2041 tgcaccatct ggcccttcat tgaagggaaa gtgagcgcag cgacgaatac gccgctgccg 2101 cacagccttg acgccacgaa gcgagggttt ttgcgatgag caagcttgcg attgacggtg 2161 gcacacctgt ccgcaccgat ccgtggcccg cgcggctgca gatcgacgac cgtgagatca 2221 acgcggtgat ggaattgatg accgccgcga aggccggggg cgcattcgac cgctacggcg 2281 gcatccacgt ggatgaatac gaagtgaagt tcgccgagaa gatcggcacc cgctgggcca 2341 ccggcatgag cgccgggacc gctgcggtcc actccgccct cgcggcactg cgccttgagc 2401 cgggcactga ggtcatcact gcgccgatca ccgatcaggg cgccgtcatg cccgtagtcg 2461 cacagggcta catccccgtc ttcgccgatg cctcgcccga gagcatgaac atgagccctg 2521 agggcattcg cgcgaagctg accgcgcgca ccggcgcgat catctgcggc cacatcgccg 2581 gcatcccctg ccagatggac gagatcatgg cgatcgcggc cgagcacgat ctatacgtga 2641 ttgaggactg cgcgcaggcg catggagcca cgtacggcgg gaaaacggtc ggctccatcg 2701 gccacctcgg cgcctattcg ctgatgagcg gcaagcacac tgtcgccggc ggacagggcg 2761 ggatggtcac cacagatgac gaggagcttt actggaacgc caagcgtttc gccgaccggg 2821 gcaagccatt caactcggag gtcgcgggca acctcttcct cggcctcaac taccgcatga 2881 cagagcttga cgcggttatc ggcaaggtgc agcttgagaa gatggatgat atcgcgaacc 2941 gccgccgggc gttcgtatat aagctcgagg aggagatcgc ggatctgcag gcattcaacg 3001 tctgctggta tccggagaag tccgacccct cgtggtggtt cctgctgatc cgcatctacc 3061 cggaaaagct gacctgcgac aaggcgacgg cggtcgaggc gattcaggct gagggcatcc 3121 cggtggggat gaactacggc gccatcgccc cgaaagcgca ctggctgcag gagcagagcg 3181 cattcggccg cgatagccac ttcccgtggg attgcgtctg gcccgacgga gactgggaat 3241 gggcgctgga tgtgccgaac gcatgggtgg cggatgagaa ccacatgcgc atgggcgtcc 3301 atgagtgctg cgacgagcag acagccctcg acaccgcgga ggcactgcgg aaggtcgagg 3361 cggcgtatat gacttagcgg tcgccaattg gtcattcgtt cttccgagac gcgttgccaa 3421 ggtgcctgtc cgaagcctga cgcagcatga aggaggatcg cggggatgag acgagaagtg 3481 acgtgggagc aactgggttg tcctcggtct cctaggatca tgccagttgt gggcatcggt 3541 tcagtcgaga tattggccga cgacatacaa ctggctgagg aagcgggagg aagagtggcg 3601 tttcctctgg acgagaagca gcctgcgaaa ggtggtcttc ctgcacgctt gggacggccg 3661 gcgcgaacgc ttgagtagca caatcccgcc ctgacgctcg gagatattgg cgcaggggca 3721 acatgacttc ggcactgggg gcaatcctgt gtcagagcat agcgaagatc ccatagaagc 3781 ggcgaaggcc ctgttgcagg agggcaaact cgcagaggtt gtacgcctcc tgcagccaga 3841 tgttgcgcgg caccaagacg attcggtcgc ttggtcacta ttgggagcag cttactttga 3901 actggagcaa tgggaagcgg ccgaagaggc ggccaggcat acgcttcggc tgacgccgaa 3961 cagtccacgc caatggagca acttgggcac catcctgcgc aagcaggggc gccacgatga 4021 ggctgaagat gcccaacacc gggcgcttga actcgatcga aagtaccggc gcgcggagac 4081 ggaactggcc aagatccgca aggagcggaa aactccgtct gaagatgacg gttggagggt 4141 tcggacgcca gatggtgatt tcatcggccc tctgtcaaag acagaattga cggaactgct 4201 cacactgcat caagtggagt ccggatggag cgcacgtcgg ggcgcggggc gcatagtgac 4261 cgtcagagaa gccatcggcg acgaggcctt cgatgaagct gtccgcgccc gggcggcttc 4321 cgcgtcgacc gagccagaaa tggaccggca accagaatcg cagaaagagt tgccttcggc 4381 ggcaggtcct atcaaggcga aactgcttgt gagcgcccta gtcgcagtca tcgttgtttt 4441 gggcgcgagc ttagtctggc tcctcgtaac cggcacgcgg cacgaatcgg caatgattgc 4501 ggaacgcccc gccaccgaag tggctgtcgc acggtcccca gaacctgaaa aacccgtcaa 4561 ttcacccccc agtgattccc tcactcaatc cgcctctcgg gaaccaactc aggccccgat 4621 tgcggaagag cccgaaccct caagccgcgt aacgacttct tatccgcaat ccgcacggca 4681 ggagcaccca gaagcggagc gaccccatca atctacagat gaccctgcta aagcggtgcc 4741 gccgggacgg caagccgttg cttcctcctc gtctgtgcgt acggaagcgc ttgacgccat 4801 aggtgatctg gagacagccg tagaggttgg ggtgaactat cgagattaca gcaactacct 4861 catcgaggca aaacgcgaac tgaggcgtct acagagcagg gtgcccgcgg gagcattgtg 4921 gtggaccgag atcgaactcg ctatgtacga ctacgaattc gctggcgacg cctgggactg 4981 gaagttctct ggtagcggcg taaggaactt tgtctacgag gggagtcccg agttccaact 5041 ggcgtatagg aagtatggac aatcggggct cgagccgtgg ctcagccgca gcgaggccga 5101 tcctggcatt ccatcggccg gtatcccccc tacgccttca cgctggatgc tctgggtaga 5161 tggcattgtc cagacggcat ggtcatcggc gtcttctcac cttgcgcgcg cgagcgtaac 5221 gaccagatga tcacgcatat tgccgcgtat ctggcgtagc gatgctatca cgcgtgctga 5281 tcccgctttt ctggcgggat cagcaccgtt acgaagctct tcccgcgccg tgcaggactc 5341 cgccgcttcc cggggaaacc tcgcgcacgt ccgtcacctg acgcgcctgc agcttatcag 5401 ggaggccaac gaaatgtccg aggaccgcaa ggctcgactg gcgtttatcg gcgctggtgg 5461 gttcgccacg aattccctct accccaacct tcacaaagtc cccgagatcg atctcgtcgc 5521 gatctgcgac ctcgacgagg ataaggcgcg gcgcaatgcc cgcaacttcg gcgcccgcga 5581 ggtctacacc gacgccgaca agatgctcga tgagcagcag cctgatggcg tcttcgtcat 5641 cgggcccgcg ccaatgcact acaaggtggc ccccgttgtc ctgaagcgcg gcatccccgt 5701 gtacgtggag aaaccctccg ccaacacctc gcccgaggcg aaggagcttg ccgagcttgc 5761 cgaggccaac aacacgtggg ggcagtgcgg cttcatgaag cgcttcgcct acgtctacac 5821 catggcgaag cagatcatgg cgcgcgagga gttcggcgcc ctcaacatgc tcaccatcca 5881 cttcggccag gggccatacc cgcagatctg ggggattgac tcggcccgcc ggagcatgct 5941 catcggccaa ctttgccaca tcttcgacct gacgcgcttc ctcggcggcg acgtggcgac 6001 cgtacaggca ctctatcgtg aggtcacgcc aacgcagttc gcctacctcg tgaacctcac 6061 gttcaagtcc ggcgctatcg gccagcttga cctcaacggc ctcgagacga agaccggctt 6121 ccgcgacatt cgtgagtggg tcaagctggt tggcttcgag tccttcattg agtgcgacgg 6181 gatgcagtcg ctcacgtggc agccgccgaa ggaatggatc gacttccccg agcacaccgg 6241 tcgcttcacg tataactacg atccggcatg gacgggcatc tccaactcgc gcgcgaactt 6301 cggctacctc ggtgaagtgc ggcatttcgc tctgcgctgc ataggtgagg tcgagggcgg 6361 gccggatctg tgggacagct acgaggccct gcgcctcgga gaggctatct acgaagcaac 6421 cgagacgggc gggatcgtgg agctgtgaat gcccggaggc gctcgcagac ggggtaagac 6481 gcgagcgaga taagacccag cggtagaggg aagggggaat gaggggcgta agccagaggc 6541 cggatgaaaa gcgatcgact tcgtgcagtg gaggtcagag aaatgagacg tggctttacc 6601 ctcatcgaat tgctggtcgt gatagcgatc atcgccatcc tggcggcgat cctgttcccg 6661 gtcttcgcac gcgcgcgcga gaaggcgcgt caatccgcct gtctgaacaa cgtaaagcag 6721 atttccctcg ggattttgat gtacgcacag gactatgacg agagaatgcc gatgctctac 6781 atgaacaggc ggccctcgtc aagcgactgg tacggcataa tgcacatgct ggatccatac 6841 atcaagaatc gaaacgtgca cgactgtccc agcgccagcc acacatcgaa cctcacgagc 6901 tatggcggta accgcagcta cggctaccac agggagatca ttgtagccaa cgggtcgcgg 6961 aagatggcgc tcatccagcg tccggccgaa atcgttatga tgggcgacgt gtgccacgac 7021 cagaacaaca actgtacgct caatagtccg gcatccgggc cattcaaatg cgaccctgac 7081 ggcacgaact gccaggtctg tggcggaaca cacaactcac tctatgccga gccgctgggc 7141 tcgcactcgt cgagatatga ccgccccgac ttcaactttt tggagcgcca caacggcacc 7201 ggcaatgcgg ggttctgcga tggacacgcg aaggcgatga agcatagcca gctttacaat 7261 ggcggcgacc cgcacccgca tttcgactgg ggtgcgtaaa cgccccatca tcgaacagta 7321 gcggtccgac ggaaccgccg aacggggtgc cgcctttgtg ggatcccgtc cggcgcacag 7381 ggaggactct ctcatgaaac gcgctttgat cgccgtcgcg tttgcttcaa tctgcacgct 7441 tgcggccgct caggtcgagc acgccgggcc gccggcgaag aagctgcttg agtacggctg 7501 ggatgttccg aaacccagct atgtcgccga gcacattcgc gagatggagg agcggccgtt 7561 cgatggcgtc ctgatgcgaa tcagcggtct cggcagcgtc ttcgacccca cgccgcggac 7621 ccgcgagcag tatgctgccg agcttgacgc gctcgaacgc atcgagtggg accggttcac 7681 cgacaatttc ctgatgacat acgctgcctc gaagatggac tggttctccg atgagcactg 7741 ggccatcata gtggacaaca tccgcctgct cgcgagtggc gccgaagcgg gcggctgcgt 7801 cggcctctgc ttcgatgccg agccctacgg cgaaaacccg tggcactacc ccacccaacc 7861 gggcgcggac cagcacagct tcgccgactt ccaggcgaag gttaggcagc gcggtgcgca 7921 gttcattgac gcggttgagg cggagatgag cgagccggtg gtgcatacat tctacctcac 7981 cactctggct ggcttccgta acgccgccat ggccgagacc gcagagcagc gcgatgagat 8041 cctccgcgac tactcctacg gtctctacgc ggcgttcatc aacggcatcc tcgacgcgat 8101 agaccccggc acggtgctca ccgacggcaa cgagcctgcc tactactacc atgattcgcg 8161 gcagtacctc gatgtctatc actacatcaa gcagggcgcg ctcggggcga ttgcgccgga 8221 gaatcatggc aagtaccggg ggcaggtgca ggtggcgcag gcgctctacg tggaccacct 8281 gatgggcctg agggcgcgca aggtcgaggg gcactacctc acccccgagg agcaggcgca 8341 gtggttcgag cacaacacct actgggcgct gaagacctcg gaccgctacg tgtggtgcta 8401 ctcggagaag atgaactggt ggaccgacac cgatgtgccc gccgggcttg agcaggcgat 8461 aatcaacgcg cgcaccacgc acgatgccgg ccagccgatg gagattgacg ccgatgcgct 8521 ctggcagaaa gcgcaccaga ggatgcagga ggagatagag gcggacctgc tgcgccgcga 8581 cgccacgatc gcaaagctct cgcgagcact gaccccggtg attgacggtg cccgcgatga 8641 tgaggcgtgg cagtatgcga cggaactgga gcctttcgta gcgacattct cggttgagcg 8701 ccccgatctc gccacgacgc tggcgcatgt ggcatatgac gatgaggcgc tctacgttgc 8761 gatccgctgc atggagccgc agatggaggc gatggaggtc atcggcggca gccgcgatga 8821 caacgtctgg atgggcgaca gcgtggatct gttcctccag cgcgagcagg gaggtcgctt 8881 ctatcacatc atcgtcaacc ccgcgaacgt catctgggac gcagtccatg agggcgaagc 8941 gggcgatcga tcgtgggacc ctgactacca gagcgcgacc tatcgcggcg acgagttctg 9001 gaacgtggag atcgcgctgc catgggcgac gatgggctgg gaggcaccga cgcgaggcga 9061 cactttgcgc gcgaacatct gccgccagcg ccgcgcggtg agcgaactga gcagctggtc 9121 gcaggtggtg acgggcttcg tggagccggg gagcttcggg acatggaagt tctgatcgta 9181 agcccggggg atacggcacc ggtaatacgt ccggaggggc cgctcttagc gtcccctccg 9241 gtaacgacca agcgggaatg tgatcccgcg ccctccaaac gactggacta gccgccctgg 9301 cgaatgcatt ccactttgct gccaagaacc tcactgagcg gatcaccgtc gtgggcggaa 9361 tgaccaacat gcaggtgctg gcgccagccc tcggcgtact tcacggtcga ttcgctcatg 9421 tcggctacct gctcagccat gtcgcgcatc gcgtcgatgt aggcgtcctt gccctgtgaa 9481 atatccagcc agttcttctg cgacttgtgc tcgcggagca tcgcggcctt cacgtccatc 9541 ttctccgtga tgtccaccag cagcgacggg atgatcgggt tgcgcatcgg gtcgaggttc 9601 cccagcggct gtgcgtggta gacgtagacg tcctggaatg tgggcacctc gcgagggatc 9661 gtctgccagt tgcgcatccc gcgtgtgaag caggcggtga ccgtcaggcg actggtgttc 9721 gtgtggtcct ccatgtaatc ctgcggcgac tgcgtcagca cgatgttcgg cttgatctgc 9781 cggatgcgag cggtgacgcg gcgcaggagt tcatcttcat agtatatctc gatgtcattc 9841 accaggccgt ggtgataggt ggcgccgatc agcgccgcgg cattgcgcgc ttctcggccg 9901 cggatgcgga tgatctcctc atgcgtgtat tcggcggtgc cgcagttgcc gttgccgacg 9961 gtcatgatgt gaagctcata gcccgcatcg ccgagcatgg ccagcgtccc ggccatcagg 10021 aactcaacgt catcggggtg cgcgccgacc gcgagagcga ctttgttctc ggacatggct 10081 atcgtctcca ggggagttcg ttggatgcaa cggaggggct attcgcgcga gggcgagcga 10141 agtccttctc cggtttcgcg agcggagacg gcaacggcat ttgacaggat acggcgctac 10201 gcgcctacag gatcgtcgct tcgcgacgag caggcacggc acaggcagcg gcaaacactg 10261 tttgaacgtc cacgtcagtg aaggttcccg cagactgatg acgaatcatc cgaccaccgt 10321 ctcgcggccg ttccaaccga aaggagcccc accaccatgc agatcggttt ctcgtcagcc 10381 aacatcaccc ccagctacgg gatggaagtc ccgggcggga tgagcaagcg cttcacccag 10441 ggcgtgcatg acgagcttca ggccaccgcc gccgtcttcg atgacggcga gcaggccgtg 10501 gcgctcgtcg gcgttgacag cctctccatc aagcactcag tcgtcgagcg cgcccgcgag 10561 atcatcttcg acgcgatcgg catcgtgccc gagcacgtca tgtgcggcgc ctcgcacact 10621 cacgccggag gcccggcggc cgacatcttc atctccgaga gcgacccgga atacctgacg 10681 cacatgtcca ggcagatcgc gacggcggtc attgacgcga accgccgcaa gcaggagctg 10741 caggtcggcg tgcgcgtcgg cgaggtgaag ggcgtcgcgc atccgcgccg gtggctcatg 10801 aaggatggca gccagcgctc tcatccgcgc gcagatcagg cgaacatgga ccgcccgcag 10861 ggcgaactcg atgagtcgtg caacgtcgtc ggcgtcgcgg gcgaggacgg catcccgcgc 10921 ggctgcgtgg tcaacttcac ctgccacggc accaccggct ggggcgcggg cttcgcctcc 10981 gccgactgga tcggctactt ccgccagtgt ctcaagcgcg tcttcggcga tgacttcggc 11041 gtggtcttcc tcaacggcgc ctgcggcgac gtgacgcagg tggacaacac gcaggacccc 11101 gactacgtgc agtccggccc gatcgccgcc cgccgcgtcg gcggctcggt cgcgggcgag 11161 acgctcaagc agcttgtgca gatgcgcttc gtagacagca tggccgtcgg cggcaagcgc 11221 gtgcagatcg agcttgagcc gcgcgtgcct accgaggagc agatcgcgtg ggcacgcgag 11281 cacgccgaga gcacagagcc caacccggcg cgctgggcga ccgactcgat ctggtcgcgc 11341 gagtggctga atttggcgaa gatggtggag accgaggcgg tcgtaccatg tgagctgcag 11401 gcgatccgca tcggcgacac ggcgttcgcc agcaacccgg gcgagttctt ctgctcgctg 11461 gggatgggca tcaagcgcgg cagcccgttc ggcaacacct tcgtggtgga gcttgccaac 11521 ggctcgatcg gctacgtgcc gagcgaggac gcgtatgagg gcggctacga gtcgcagatg 11581 gccccatcga gcaagctcgt cgccggctcg ggcgagaaga tcgtcgcgga gacgattgag 11641 ctgctgaagg ggctgcatcg gtagaaggcc gcggaccgcg acggcgaggg ttgcgtcgtt 11701 gtcgtccgga ctgtgtatgt caattgtcag gtgaagttct tcatgcagcc ttgcggtttc 11761 ggttgttctg atggagtcgt tcgtcgtagg tcgtgccgtt gcgccacatt gagcagagta 11821 ccttgaccc //