LOCUS VBCZ01000047 13703 bp DNA linear ENV 29-MAY-2019 DEFINITION Betaproteobacteria bacterium isolate BP_30 14_0903_13_30cm_scaffold_478173, whole genome shotgun sequence. ACCESSION VBCZ01000047 VBCZ01000000 VERSION VBCZ01000047.1 DBLINK BioProject: PRJNA449266 BioSample: SAMN11380357 KEYWORDS WGS. SOURCE Betaproteobacteria bacterium (soil metagenome) ORGANISM Betaproteobacteria bacterium Bacteria; Proteobacteria; Betaproteobacteria. REFERENCE 1 (bases 1 to 13703) AUTHORS Diamond,S., Andeer,P.F., Li,Z., Crits-Christoph,A., Burstein,D., Anantharaman,K., Lane,K.R., Thomas,B.C., Pan,C., Northen,T.R. and Banfield,J.F. TITLE Mediterranean grassland soil C-N compound turnover is dependent on rainfall and depth, and is mediated by genomically divergent microorganisms JOURNAL Nat Microbiol (2019) In press PUBMED 31110364 REMARK Publication Status: Available-Online prior to print REFERENCE 2 (bases 1 to 13703) AUTHORS Diamond,S. and Banfield,J.F. TITLE Direct Submission JOURNAL Submitted (01-MAY-2019) Earth and Planetary Science, Jill Banfield's Lab at Berkeley, University of California, Berkeley, CA 94720, USA COMMENT The annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP). Information about PGAP can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ ##Genome-Assembly-Data-START## Assembly Method :: IDBA_UD v. 1.1.1 Genome Representation :: Full Expected Final Version :: Yes Genome Coverage :: 7x Sequencing Technology :: Illumina HiSeq ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 05/15/2019 20:08:26 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline (PGAP) Annotation Method :: Best-placed reference protein set; GeneMarkS-2+ Annotation Software revision :: 4.8 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 3,848 CDSs (total) :: 3,807 Genes (coding) :: 3,453 CDSs (with protein) :: 3,453 Genes (RNA) :: 41 rRNAs :: 1 (5S) complete rRNAs :: 1 (5S) tRNAs :: 37 ncRNAs :: 3 Pseudo Genes (total) :: 354 CDSs (without protein) :: 354 Pseudo Genes (ambiguous residues) :: 56 of 354 Pseudo Genes (frameshifted) :: 241 of 354 Pseudo Genes (incomplete) :: 55 of 354 Pseudo Genes (internal stop) :: 43 of 354 Pseudo Genes (multiple problems) :: 38 of 354 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..13703 /organism="Betaproteobacteria bacterium" /mol_type="genomic DNA" /isolate="BP_30" /isolation_source="temperate grassland biome" /db_xref="taxon:1891241" /environmental_sample /geo_loc_name="USA: Angelo Coast Range Reserve, CA" /lat_lon="39.74 N 123.63 W" /collection_date="2014-09-03" /metagenome_source="soil metagenome" /note="metagenomic" gene <1..563 /locus_tag="E6H67_02360" CDS <1..563 /locus_tag="E6H67_02360" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_007874117.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=3 /transl_table=11 /product="low-specificity L-threonine aldolase" /protein_id="TMH07954.1" /translation="RRGLKLHLDGARLWNAAVKQGIEPGEIAGDFDSVSVCLSKGLGA PVGSVLCGSRDFIAAARRWRKMLGGGMRQAGILAAAGRYAIAHNVARLAEDHALAQAL AAGLSRYSALSVSMPQTNMVFVDVPQAIAASFAKHLATHDVLVTGSTKQRWVTHLDVG PADVERALAAVDSFFMRAAHEEGAAA" gene 560..865 /locus_tag="E6H67_02365" CDS 560..865 /locus_tag="E6H67_02365" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_011913574.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="DUF962 domain-containing protein" /protein_id="TMH07955.1" /translation="MSKDFRTFRDFYPFYLSEHAHPACRRLHFVGSTLVLACIVAAIL TRNAWWLLGAVFAGYGFAWVGHFVFEKNRPATFKYPVYSLIGDWVMFRDMLTGRIRW" gene 939..1015 /locus_tag="E6H67_02370" tRNA 939..1015 /locus_tag="E6H67_02370" /product="tRNA-Pro" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:973..975,aa:Pro,seq:tgg) gene 1044..1120 /locus_tag="E6H67_02375" tRNA 1044..1120 /locus_tag="E6H67_02375" /product="tRNA-Arg" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:1078..1080,aa:Arg,seq:tct) gene complement(1168..2481) /locus_tag="E6H67_02380" CDS complement(1168..2481) /locus_tag="E6H67_02380" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009553550.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="decarboxylase" /protein_id="TMH07956.1" /translation="MFESVQRRAAVFTRDGLKVRRLVGSCKKLLSERGEAAGVSLAKT TLDLYAELDKKEQARFFSVLLTEFSPDPKRVLAAAQAYAEQPSAATLSRLSTIAEPPR QELLRRLNRAPGGTATILRMRERLLEMRRDERELDAVDWDLRHLLSSWFNPGFLQIVR VDWRTPAYLLEQIILHEAVHEIQGWDDLRQRLEADSRCFAVFHPALPDEPLIFVEVAL VDRMSDAVASLLGVKSTSSDPARATTAVFYSISNCQPGLRGVSLGNFLIKHVVDVLSQ EFPRLKVFCTLSPIPGFAAWLGALLKEPDAGRHNSLGKALKLVAQELGADAAKVASDP KNAVDRLAPLREPLMQLCAAYLLQGGDGNEPAQDPVARFHLNNGARLERINWLADVSK KGLRESLGLMVNYVYVPRAIEINHEKFVRGEIVASRQVRALLLSD" gene 2767..2842 /locus_tag="E6H67_02385" tRNA 2767..2842 /locus_tag="E6H67_02385" /product="tRNA-His" /inference="COORDINATES: profile:tRNAscan-SE:1.23" /anticodon=(pos:2800..2802,aa:His,seq:gtg) gene complement(2967..3785) /locus_tag="E6H67_02390" CDS complement(2967..3785) /locus_tag="E6H67_02390" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_012431897.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="class I SAM-dependent methyltransferase" /protein_id="TMH07957.1" /translation="MSSTDTDKVFAGSVPKLYETYLVPLIFEPYAADLVTRLASRSLT RVLEIAAGTGVVTRALASILPESVAIVATDLNPPMLGQASATGTKRPVEWRQADAMQL PFEDRTFDAVVCQFGVMFFPDKSKAFSEASRVLRPGGVFIFNVWDRIEENEFADSVTT ALESLFPKDPPRFLARTPHGYYDRPTIERDLANGGFTASPQIATVTARSRAKSARVPA IAFCQGTPLRNEIEARDASRLGEATDVAAEAVAQRFGKGAVDGKIQAHIVTIEN" gene complement(3920..4531) /locus_tag="E6H67_02395" CDS complement(3920..4531) /locus_tag="E6H67_02395" /inference="COORDINATES: protein motif:HMM:PF08238.10" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sel1 repeat family protein" /protein_id="TMH07958.1" /translation="MKANVRNLLAALVVATALGPPAVAGSLDDGIAAYHEKEYAKAAE LWQPLAEKGDPAAQYLLGTLYMAGKGVKQNDATAFMWFQRAANQGNASAQYNVGASYA EGTGVPKSDVDAAKWFQRAANQGMVFAQLNLGLLYAAGNGVPQDNVEAFKWLELAFFA LPTGGPRSDVARAMTDVAAKMTREQIDDAKRRERGWKAQPEVK" gene complement(4547..5509) /gene="panE" /locus_tag="E6H67_02400" CDS complement(4547..5509) /gene="panE" /locus_tag="E6H67_02400" /EC_number="1.1.1.169" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_015447130.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="2-dehydropantoate 2-reductase" /protein_id="TMH07959.1" /translation="MRILVVGAGAIGGYFGGRLLQSGRDVTFLVRPRRAAELADAGLI IKSPRGDVTLRQPATILSDNIREPFDLVLLSCKAYDLNGAMDSVFPAIGPETMILPLL NGMRHLDMLDERFGRSHVLGGLCVIAATLNEQHAVIHLNNIHTLSFGERDRCLSDRTR TVASTMADAGFDARLSEDIVQDMWEKWVFLAALAGVTCLMRAPIGDIVASPGGNDLTL SFLEECRAIAEDGGHAPRAAFLDQTRATLTAKGSSFTASMLRDMEGNARIEADHIVGD LLRRRNPADANRSELSLLATAYTHMKAYEARRASALASAPRAGN" gene complement(5598..7148) /locus_tag="E6H67_02405" CDS complement(5598..7148) /locus_tag="E6H67_02405" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_020722310.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="MFS transporter" /protein_id="TMH07960.1" /translation="MTGFSAPPSDAGVIRAAPCALPCTPRTAPWILAATILASSMAFI DGTVVNVALPALQRDLGASLIDVQWVIEAYSLFLAALLLVGGAAGDRFGRRRVFLLGV AVFTLSSIGCGLASTVRELVLARGLQGIGGALLVPGSLAIISASFAERDRGKAIGTWS GATAITAALGPVLGGWLIDHLSWRAVFFINLPLAIAIIAIALRHVPESCNAEMRGKLD WPGALLVTLGLGGVVYALIESSNAGWSHPRILASLLLGLSALAGFIAVEMHRAAPMMP PQLFRSKTFAGANLLTFLLYAALGGSLFFVPLNLIQVQSYSTTAAGAALLPLVLLLAL LSRWSGGLIDRYGAKAPLVIGPVVAACGFALFAVPGVGGSYWITFFPAPLSATVMSAV ETGFSGAASGINNAASRVAALIAIALFGLIMASIFNRSLQTQLERAALAPCVTEAVEK QHNKLAAIELPACADARDAALAKHAIAEAFVSGFRWIMLISAALALASAASAWALIEN TSHRARAT" gene 7802..8344 /locus_tag="E6H67_02410" CDS 7802..8344 /locus_tag="E6H67_02410" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009205300.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="response regulator" /protein_id="TMH07961.1" /translation="MTNLLIVDDEANVLNALRRMLVNPAAAPVLPDLQLKTFTLPIEA LEHVSSHHVDLVISDYRMPVMDGVSFLTRVKELQPDTARIILSACTDMEGIVRAINEA GIFRFVSKPWSDVELKTIVAQVLAHRELLVENRRLADEVRCQSGLISRQQLELARLES ESPGITRVRWTEDGGVLLED" gene 8420..9880 /locus_tag="E6H67_02415" CDS 8420..9880 /locus_tag="E6H67_02415" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_009205298.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="GAF domain-containing protein" /protein_id="TMH07962.1" /translation="MSASPLGPGADKVGDYAWIYALYELGRTAASGAGPQVQQDILEH IVSGLDAQSGSIALLVDGTEDLLEVAAGIDLPPGVLGSRLARGVGLFGHVMATGEPFL INGDVAETGLPLRINERRDRLTHSAMCWPLLVAERIIGAVAVNRAPGHPKYTIHDLDR GQAMTSLLALVIANHRMHVERENQILELSTLNATMQRINEMLEEAQDQLVQSEKMASI GQIAAGVAHEINNPIGYVLSNFGTLDSYMASLFGLLEAYVEFERPLPAPLPTPLDRAR ALREGIDLNFLRSDVVALLAESRDGLLRVKRIVQDLKDFSRGGVEEEWEVVNLHDALD RTLNIVRNEVKYKARIETNYGSLPDIECIPSRLHQVFLNLIVDAGHAIEENGKITIST GTGTTANEIWIRFEDTGCGIPKEHLNRIFEPFFTTKPVGQGTGLGLSVSYSIVCGHGG SIDVESEVGRGTRFTIRLPVRQLRTTVMEAEAALAS" gene 10221..10703 /locus_tag="E6H67_02420" CDS 10221..10703 /locus_tag="E6H67_02420" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_004180656.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="sodium:proline symporter" /protein_id="TMH07963.1" /translation="MIDVRAAVYAGVAAGIVSTLAQIALWLIFSDAFPAILFRDARLT AAIVMGRGVLAPPTTFDLPVMLVATLVHFALSMLYGLILSWLMSRLATPLSTIVGAAF GLILYAANMYGFTVVFPWFEAARDWITLASHLVFGVVAAPVYKALSRRRGVHAHDAGG " gene complement(10730..12202) /locus_tag="E6H67_02425" CDS complement(10730..12202) /locus_tag="E6H67_02425" /inference="COORDINATES: similar to AA sequence:RefSeq:YP_002823460.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /codon_start=1 /transl_table=11 /product="FAD-dependent oxidoreductase" /protein_id="TMH07964.1" /translation="MNLPGERECCWMAGAPETGYPAFRKSASSDVAIVGGGIVGLTTA YVLAQAGVSVAVLEARKIGRQVTGRSTAKVTSQHALIYNYLIEKSGFETARLYADANQ TAVRRLCDWVRTESIACNLERKDAYAYTSRPARRAAIEAEAEAARCLGLEAEALASAP LPFATVGALRFRNQAQFNPVRYLIGLAAAVTTHGGRIFENTRVTNVKSGKRWRVSAGG HHLDAEKVVLATNLPIAGPIQFDKLTQPRSHVAMAFRASPAGMIDGMFIDVDRPTHSL RMGADREGPLLIVLGAKFKTGQEGDVARRFRQLEKWVRDNIPAGDAAWRWVNEDYDSP DRIPYAGVVGPKAPGLYVATGFNAWGISNGTAAGMTIADQIQGRENPWAKLYDPTRKS PKGFNRGGDTKSLVRKIEQIPPGEGGVIKRGKEKIAVWKSIGGTPHALSASCTHMGCT VTWNNADLTWDCPCHGSIFSCDGRVIHGPATEALSRKKLA" gene complement(12227..13552) /locus_tag="E6H67_02430" CDS complement(12227..13552) /locus_tag="E6H67_02430" /inference="COORDINATES: ab initio prediction:GeneMarkS-2+" /note="Derived by automated computational analysis using gene prediction method: GeneMarkS-2+." /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="TMH07965.1" /translation="METIESYLDRKYRELTLLNSCLASPLEVQKPASIGIAIDRKFRL VAAIKAELSLQSWAFTETAWSSTRRARSGPFEFSYGYQRADLHIHGPPIYPALWSPST NIFQHTIYTGSGMSAMAALLTALSRQRGSIQVLVPQGCYSETRELIESFGGLFRILPL EAWRSPSRPREGITRVALLDSSIPTGFFGFLRMPPHDIDLVIFDTTCLWRSSARIQRV VNWAMRSMLPLALVRSHAKLDCLGIEYGRLGSVAVAAPLRGILSGRLGWITDLTRQTG DSVRLLGAAPIPANFPPFTGNSQFERCSVARIAAIIRNNRRMARLLSAELGPKVVSAF QHGLYLTLTPSGNLSIASAKQRAAVLCERLLSSALPVSHAGSFGFDFVAIEWFADSSS RRNVVRVAASDIPLALIDQIAEELAREWIGLELTMKRSSHAIRNSNTFA" BASE COUNT 2629 a 4216 c 4187 g 2671 t ORIGIN 1 acaggcgcgg tctcaagctg catctcgacg gcgcacgctt gtggaacgcg gcggtgaagc 61 agggcatcga gccgggcgag atcgccggag atttcgattc ggtctccgtc tgcctttcca 121 agggactggg cgcgccggtc ggttcggtct tgtgcggcag ccgtgacttc atcgccgccg 181 cgcggcgatg gcgaaaaatg ctcggcgggg gcatgcgcca ggcaggcatt ctcgccgccg 241 ccggacgtta cgctatcgcg cacaatgtcg cgcggctcgc cgaagaccac gcgctcgccc 301 aggcgcttgc ggcaggactg tcccgatatt cggcgctgtc ggtttcgatg cctcaaacca 361 acatggtttt tgtggatgtg ccgcaagcga ttgccgcgag ctttgccaag cacctcgcca 421 cgcacgacgt cctcgtaaca ggctcgacga agcaacgctg ggtcacccac ctcgacgtcg 481 gacctgccga tgtcgagcgc gctcttgcgg cggtcgacag ttttttcatg agggcggcgc 541 acgaagaagg agcggcggca tgagcaagga cttccgcacg ttccgcgatt tctatccttt 601 ctatctctcg gagcatgcgc atccggcctg ccgtcgcctg cacttcgtcg gctcgacgct 661 cgtgctcgcg tgcatcgtcg cggccatcct tactcgcaac gcgtggtggc tgctcggcgc 721 agtgtttgcg ggctacggct tcgcctgggt cggacatttc gtcttcgaga aaaatcgccc 781 tgcgaccttc aagtatcctg tttacagcct gatcggggac tgggtgatgt tcagggacat 841 gctcaccgga cgcatccgct ggtagtgtcg aatgcagtaa agtcatgcgc ttctcgcgtt 901 gacccgaact cctccgccga gggataatcc gctcccttcg gggtgtagcg cagcctggta 961 gcgcgcttgc tttgggagca agatgtcggg agttcaaatc tctccacccc gaccaccggc 1021 aggttcggtt tcggcacgtc aaggcgcccg tagctcattc ggatagagca ccggccttct 1081 aagccggggg taacaggttc gagtcctgtc gggcgcgcca aaacagcccg acagggcaac 1141 gactcgttat ccagtcaggc agtcgcgtca atcgctcagc aacagtgcgc gcacctgccg 1201 cgaggcgacg atctcgccgc gcacgaactt ctcgtggttg atctcgattg cccgaggcac 1261 gtacacgtag ttgaccatca gtcccagcga ctcgcgcaac cccttcttgg agacgtccgc 1321 cagccagttg atgcgctcca gccgggcgcc gttgttgaga tgaaacctgg ccaccggatc 1381 ctgcgccggc tcgttgccgt cgccgccctg cagcaggtag gcggcgcaca actgcatcaa 1441 cggttcccgc agcggcgcca ggcggtcgac ggcgttcttg gggtcgctgg cgactttggc 1501 agcgtcggcc cccagctctt gcgccaccaa ctttagcgct tttcccagcg aattgtgccg 1561 cccggcgtcc ggctccttga gcagggctcc gagccaggcg gcgaacccgg gaatgggtga 1621 cagcgtgcag aacaccttga ggcgcgggaa ctcctgcgac agcacgtcca cgacgtgctt 1681 gatgaggaag tttccgagag acactccgcg caggcccggt tgacagttgc tgatcgagta 1741 aaaaaccgcg gtggtcgcac gggccggatc gctcgacgtg gactttacac cgagcagcga 1801 tgcaactgcg tcggacatcc ggtccaccag cgccacctcg acgaaaatca gcggctcgtc 1861 agggagcgcc ggatggaaga cggcaaagca acggctgtct gcttccagcc gctggcggag 1921 gtcgtcccag ccctgaatct cgtgcaccgc ctcgtgaaga atgatctgct cgagcaggta 1981 agcgggcgtg cgccagtcca cgcgcacgat ctgcaggaag ccgggattga accaggacga 2041 aagcaggtgg cggaggtccc aatcgaccgc gtcgagctcg cgctcgtcac gcctcatctc 2101 gagcagccgt tcgcgcatgc gcaggattgt cgcggtgcct ccaggcgccc ggttcaaccg 2161 ccggagcagc tcctgacgcg gcggctcggc gatggtactc agccgcgaca gggtggctgc 2221 cgacggttgc tcagcgtagg cttgggcagc tgcgagcacc cgctttggat cgggcgagaa 2281 ctccgtgagc agcaccgaga agaatcgcgc ctgttccttc ttgtcgagct cggcgtacaa 2341 atcgagcgtg gtcttcgcga gagatacacc cgccgcctcg ccgcgctcgg acagtaactt 2401 cttgcagctg cctacgagtc gccgcacctt gaggccgtcg cgcgtaaata cagccgctcg 2461 tctctgaact gattcgaaca tgcgtgaata atcaggaaca ggaaaggaag gggttgctcc 2521 cagtatagag ccacggtatc cagggagcta gccacgccga gaaataagcg agaggccacc 2581 cacgagcgcg ggttttgttg agcacgcctc gtgaaccaca caacttgagc aagcattatt 2641 ccggttggtt gagagggctc gatcggccaa ctccttttgg cggatatctc gtccttggcc 2701 gatcgccata ccctgctaga atatgcatgc ggccccgtcg gagcgcagcc ggggcggctc 2761 caaatggtgg ctgtagctca gctggtagag tcccggattg tgattccggt tgtcgtgggt 2821 tcgaatccca tcagccaccc cattcaactg caagagaatt tcgtcccctt catcatccgc 2881 aatccggacg tcgtgtgtcc ggttttgtgc ctagtcgatt tcttcatctt gtggtttggc 2941 gaggccgctg ttgcctgcgt gatggatcag ttttcgatgg tgactatgtg cgcctggatc 3001 ttgccgtcca cggccccttt cccgaaccgc tgagcgaccg cttccgccgc gacatcggtt 3061 gcctcgccga ggcgcgaggc atctcgcgcc tcgatctcgt tccgtagtgg tgtcccctga 3121 caaaacgcga tcgcgggaac ccgggccgat tttgcacgac tgcgcgccgt gaccgtggcg 3181 atttggggtg acgcggtgaa cccgccgttt gcaaggtctc gctcgatggt cggacgatcg 3241 tagtagccgt ggggtgtgcg agcgaggaag cgaggcgggt cctttggaaa cagcgattcg 3301 agcgccgtcg tcacgctgtc agcgaattcg ttttcctcga tccgatccca tacgttgaaa 3361 atgaaaacgc ctcccggcct gagcactcga ctcgcctccg aaaacgcttt cgacttatcc 3421 ggaaagaaca tgactccaaa ctggcataca acggcgtcaa acgttcgatc ttcgaacggc 3481 agttgcatgg cgtctgcctg acgccactcc accggccgct tcgtcccggt cgcggaggct 3541 tgccccaaca tcggcggatt cagatcggtg gcgacgatgg cgacgctttc ggggagtatg 3601 gatgccagag cacgagtcac gacacctgtg ccggcggcaa tctcgagtac gcgagtaaga 3661 gacctggaag ctagccgggt gaccaagtcc gcagcgtaag gctcgaagat cagcggaacg 3721 agatacgttt cgtaaagctt cgggactgag cccgcgaaaa ctttgtcggt atcggtactg 3781 ctcacgatgt gcgcctgtga atcgatccca ttgcacgctg acggtgaatg cgcgatgaat 3841 tgtttgcgcg agaatgccgc ctaaacccgg tccttagtcg atccgcccct gatccaggcc 3901 acgaagcgaa tgcgccgcgt catttcactt ccggctgcgc tttccacccg cgctcgcgcc 3961 gcttcgcgtc atctatctgc tcccgggtca tcttcgcggc gacatccgtc atggcacggg 4021 cgacgtcgct gcgaggcccg cccgtaggca acgcaaaaaa ggcgagctcc agccacttga 4081 aggcttcgac gttgtcctga ggaacaccgt tgcctgcggc atagagaagc ccgaggttca 4141 gttgcgcgaa taccatgccc tggttggccg cgcgctgaaa ccatttggcc gcatcgacat 4201 cgctcttcgg gacccccgtg ccctcggcat agctcgcgcc gacgttgtac tgcgcggagg 4261 cgttgccctg attcgcggcg cgctggaacc acatgaacgc tgtcgcgtcg ttctgtttga 4321 cgcccttgcc cgccatgtac aaggtgccga gaagatattg ggcggcggga tctccctttt 4381 ccgcgagcgg ctgccacaac tcggccgcct tggcgtactc tttttcgtga taggcggcga 4441 tgccgtcgtc gagcgacccc gccacggcgg gtgggccgag agcagttgcg acgacgagtg 4501 cagccagcag attgcgaacg tttgctttca ttgagtgtgt cctaggctaa tttcccgcgc 4561 gcggtgcaga cgcgagcgcg ctcgcgcgcc gtgcttcgta ggccttcatg tgagtatagg 4621 cagtcgcgag tagcgatagt tcgctgcggt tcgcgtctgc agggttccga cgacgaagaa 4681 ggtcacctac gatatggtcc gcttcgattc gggcgttgcc ttccatatca cggagcatcg 4741 aggcggtgaa agacgagccc ttcgcggtca gcgtcgcgcg cgtctgatcc aggaacgcgg 4801 ctcgcggcgc gtgcccgccg tcttcggcaa tggcgcggca ttcttcaaga aaagacagtg 4861 tcagatcgtt gccccccggg gatgccacga tatctccgat cggcgcgcgc atgagacagg 4921 tgaccccggc caacgcggcc aggaataccc acttctccca catatcctgc acgatatcct 4981 cgctgaggcg ggcgtcaaaa ccagcatcag ccatggtgct ggcgacggta cgagttcgat 5041 cggacaggca ccggtcgcgt tcgccaaacg aaagcgtgtg tatgttgttg aggtgaatga 5101 cggcatgctg ctcgtttagc gtcgctgcga tgacacatag gccgccaaga acgtgggagc 5161 ggccgaatcg ctcgtcgagc atgtcgagat gcctcatgcc gttgaggaga ggcaggatca 5221 tcgtttctgg cccgatggcc ggaaagacgg agtccattgc gccgttaagg tcgtaggcct 5281 tgcagctcag caagacgaga tcgaaaggct cgcggatgtt atccgagagt atcgtcgcgg 5341 gttgacggag cgtgacatcc ccacgtggac tcttgatgat cagacccgca tcggccagtt 5401 cggcagcgcg tcggggccgg accaaaaacg tgacatcccg acccgattgc agcagccggc 5461 caccaaaata gccgccaatc gcaccggcgc ccacgacgag aatgcgcata tttcgacttc 5521 ctcctgtttc ggatgcggag cgacttcccc gagcggcttg cacgtcgaca agcgccaggg 5581 tatcaagttc acggtcgtca cgttgcgcga gctcggtgac tcgtgttttc tatcagggcc 5641 caggcactcg ctgcgctcgc cagggcgagc gcagccgaaa tcagcatgat ccatcgaaac 5701 cccgatacga atgcttccgc gatcgcgtgt tttgccagtg ccgcgtcacg ggcgtccgcg 5761 cacgcgggca gttcgatagc ggcgagcttg ttgtgctgct tttccaccgc ttcggtcacg 5821 catggcgcca acgctgcgcg ctcaagctgg gtttgcaggc tgcggttgaa gatcgacgcc 5881 atgatcagac cgaacaatgc aatggcgatc agtgcggcaa cgcgcgaagc cgcattgttg 5941 atccccgagg cggcaccgga aaaaccggtt tccactgcgc tcatgacggt ggcggagagg 6001 ggcgcaggga agaacgtgat ccagtagctt ccacccacgc cggggacggc aaatagggca 6061 aagccacagg ccgccaccac cgggccgatg acgagcggcg ccttcgcgcc gtaacgatcg 6121 atcaacccgc ccgaccaccg cgacagcagc gccagcaaaa gaacgagcgg cagcagcgca 6181 gcgcctgcag cggtcgtgga atagctctgt acctggatga gattcagcgg cacgaaaaag 6241 aggcttccgc cgagcgccgc gtagagcagg aacgtgagca gattggcgcc ggcaaacgtt 6301 ttcgagcgga acaactgtgg cggcatcatc ggcgccgcgc ggtgcatttc aaccgcgatg 6361 aagcccgcca gcgccgacaa gcccagcagc agtgaagcga ggattcgtgg atgagaccaa 6421 cctgcgttgg acgactcgat cagagcgtag acaacgccac cgagcccgag cgtcaccaac 6481 aacgcgcccg gccagtcgag cttcccccgc atctcggcgt tgcagctttc gggcacgtgc 6541 cgcagagcga tggcgatgat cgcgatggcc aagggaagat tgatgaagaa aacggcgcgc 6601 cacgatagat gatcgatcaa ccagccgcca agcaccgggc ccaacgccgc agtgatggct 6661 gtggcacctg accaggtgcc gatggccttg ccgcggtcac gctcggcaaa cgacgcactg 6721 atgatggcga ggctgcccgg gaccaataac gccccgccga taccttgaag gccgcgggca 6781 aggacgagtt cgcgcactgt gcttgcgagt ccacacccga tggaggagag ggtgaacacg 6841 gccaccccaa gcaagaaaac gcgacggcgc ccgaaacggt cgcctgctgc tccgccgacc 6901 agaagcaacg cagcaaggaa cagtgagtag gcttcgatca cccactgcac gtcgatgagg 6961 cttgcgccga ggtctcgctg cagcgccgga agcgctacgt tgaccaccgt tccgtcgata 7021 aacgccatgc tggaagcaag gatcgttgcc gccaggatcc atggcgccgt tcttggtgtg 7081 cacggcaagg cgcagggtgc ggcgcgaatg actccagcgt cggatggagg cgccgagaat 7141 ccagtcatcg cgcgtcaccg ggcagcatat gcagacgatg ttcgaacgat acgcgcaccc 7201 gacgtgcggg cgactggaca cgacaccgcc gcgctggcag tggacctgtt cggagctcat 7261 gcgcaaggtg gcagacggca tgcccgttta cggattgtgg gtgcgaatcc gggcaagcgc 7321 gccaattttt caatgggtca ccgaatgacc gatcgtgtgt ccttcgaacg cggtagcgag 7381 attcacgctt tgcttcattg cattggcacc ctttcgcgat cctcgaattc cacgcgatcc 7441 agcgtcagcg cccatgagat cccgatcacg acaatacaga acaactgcga tcgcgaagtt 7501 cattcgcgga ggtttggtca agcatcctgg gagtatctgc tcagcgacat atgcagcttg 7561 caaatagtgt gcgctgcgat tcggttgtcg ttgcgccaac gatattgacg ccgccaattt 7621 acgtcacatt tggccgcgcg gagcaactgt tgtgctgcgt gacggctcgc tactcgcttt 7681 ccccaccttt ggcacgccac ttgctgatgg atgtcaggag caggccatat atttcccgac 7741 ggccgcctga cgagcctggg cgcacgggac gtctcccaca tctcatttcc aaggggccct 7801 catgaccaat cttctcatcg tagatgatga ggccaacgtt ctcaacgcac tgaggcggat 7861 gttagtcaac cccgcggcgg ccccggtgct gcccgatctg cagctgaaga cgttcacgtt 7921 accgatcgaa gcgctcgaac acgtgagcag ccatcacgtc gatctcgtga tatccgatta 7981 ccgcatgccg gtaatggacg gtgtgtcgtt tctgacgcgc gtgaaggagc tacagcccga 8041 tacggcacgg atcattctga gcgcgtgcac cgacatggag ggaatcgttc gtgccatcaa 8101 cgaggcaggc atcttccggt tcgtaagcaa gccgtggtcg gacgtagagc tgaaaacaat 8161 tgtcgcgcag gtgctcgcgc atcgcgaact gctcgtcgag aaccgccggc ttgccgacga 8221 agtgcgctgc cagagcggtc tcatctcgcg gcagcaactg gagcttgctc ggctcgaatc 8281 cgaaagtcca ggcatcacgc gtgtacgctg gaccgaagat ggcggcgtcc tgctcgagga 8341 ctaacgtggg ccttcccttc cactgccccg gacgcggccg catcgggtcc tttgtacggc 8401 cctcatctcc tgggctggga tgagcgcgtc tcctctcggc cccggcgccg acaaagtcgg 8461 ggactatgcc tggatttacg cgctatatga actgggccgg accgctgcca gcggtgcggg 8521 cccgcaagtg cagcaggaca tcctcgagca catcgtctcc gggctcgatg cacaaagcgg 8581 atcgattgcg cttctcgtcg acgggaccga agacttgctc gaagtcgccg ccggcatcga 8641 cctccctccc ggtgtgcttg gcagccgcct tgcccgtggc gtcggcctgt tcggtcacgt 8701 catggcgact ggcgaaccct tcctgatcaa cggcgacgtt gccgaaaccg gattgccgct 8761 gcggataaac gagcggcggg atcggctgac gcactccgcg atgtgctggc cgctgcttgt 8821 cgccgagcga atcatcggtg cggtcgcggt aaacagggct cccgggcatc ccaaatacac 8881 gatccatgac ctcgatcgtg gacaggcgat gactagcttg ctcgcactgg tcatcgccaa 8941 tcaccggatg catgtcgagc gtgagaacca gattctcgag ctgtcgacgc tcaacgcgac 9001 gatgcaacgc attaacgaga tgctcgagga agcgcaagat cagctggtcc agtcggagaa 9061 gatggcgtcg atcggccaga tcgcggcggg agtcgcccac gaaatcaaca atccgatcgg 9121 ctatgtgctt tctaacttcg gcacgctgga ttcgtacatg gcgagcctgt tcgggcttct 9181 cgaagcctac gtcgagttcg aacggccact gcccgcgccg ctgccgacgc cactcgatcg 9241 cgcacgcgct ctgcgcgagg gcattgattt gaattttctg cgcagcgacg tcgtcgcgct 9301 gctcgccgag tcgcgcgatg gccttctgcg cgtcaagcgc atcgtgcagg acctcaagga 9361 cttctcccgc ggcggtgtcg aagaggaatg ggaggttgtg aaccttcacg acgccctcga 9421 ccgcacactc aacatcgtgc gcaacgaagt caaatacaag gctcgcatcg agacaaacta 9481 cggcagtttg cccgatatcg agtgcatacc ttcgcggctc caccaggtgt tcctcaacct 9541 gatcgtcgac gccggccacg cgatcgagga aaacggaaag atcacgattt cgaccggcac 9601 cggcacaacc gccaacgaga tctggatccg tttcgaggac accggctgcg gcattcccaa 9661 agaacacctc aaccgcatct tcgagccgtt cttcaccaca aagccggtcg gtcaggggac 9721 cggactcggg ctgtcggtgt cctactcgat cgtttgcggg cacgggggat cgatcgatgt 9781 cgagagcgag gtcggtcgcg gaacgcggtt caccattcgt ctaccggtgc gacaactgcg 9841 cacgacggtt atggaagccg aagcggcgct ggccagctag cgtcgtgaat caaaagttcg 9901 cagcatattt cttattgtca ttcccgcgca agcgggaatc cagcggcttt gaaataaggg 9961 aaaccggcga cattcaacga cgctgaagcc tgccctcgag tgcttgaatc gggggcgggg 10021 acgacggctc tggaatatca ggcgaatttc tgacttggga cgcttgacgc gctcttcgta 10081 ggaagtgtgc cgccagtcga gaaaagccgc atggacgccg tcgatgaccc cgtttacgat 10141 agctcacgtc atcttgccat gcttcgcgat gcgttctccg gttgcgggat cgatggcttc 10201 gagtgtgact ggcatctcac atgatcgatg ttcgtgctgc cgtctacgcc ggcgtcgccg 10261 ccggcatcgt gtcgacgctc gcgcagatcg cgctgtggct gatattttcc gacgcgttcc 10321 ccgcgatcct gttcagggac gcgcgcctga ccgccgcgat cgtcatgggc cgcggtgtgc 10381 tggcaccgcc cacgaccttc gacttgccgg tgatgctggt cgcgacgctc gtacacttcg 10441 cgctgtcgat gctctatggc ctgatcctgt cgtggttgat gtcgcgtctc gctactccgc 10501 tttccacgat cgtcggcgcg gcgtttggcc tgattcttta cgcagccaat atgtacggtt 10561 tcaccgtggt atttccttgg ttcgaagcgg ctcgcgactg gataaccctc gcatcacatc 10621 ttgtttttgg cgtggtcgcg gcgcccgtct acaaggcgct ttcgcggcgg cggggcgttc 10681 atgcccatga tgcaggcgga tgagcggaat tcggccgccg caatgaccgc taagccagtt 10741 tcttgcgcga taacgcttcg gttgcagggc cgtggatgac ccgcccatcg caggaaaata 10801 tcgacccgtg acacgggcaa tcccaggtga gatccgcgtt gttccaggtg accgtgcagc 10861 ccatgtgcgt gcatgacgcg gagagggcat ggggcgtgcc gccgatggac ttccacactg 10921 caattttttc ttttccccgc ttgatgacgc caccctcgcc aggtggaatc tgttcgatct 10981 tgcgcaccag cgatttcgta tcgccgcccc gattgaagcc tttgggcgat ttgcgcgtcg 11041 gatcgtaaag tttcgcccag gggttttccc gcccttgtat ctgatccgcg atcgtcattc 11101 cggccgcggt cccgttgctg ataccccagg cgttgaatcc ggtcgcgacg taaagacctg 11161 gcgctttcgg gccaaccacg cccgcatagg gaatgcgatc cggcgagtcg taatcctcgt 11221 tgacccatcg ccacgcggca tcgcctgccg ggatgttgtc gcgaacccac ttttcaagct 11281 gacggaacct tctggccaca tcgccttcct gccctgtctt gaactttgcg ccgagcacga 11341 tgagcagcgg cccctctcta tctgctccca ttcgcaagga atgcgtaggc cgatcgacgt 11401 cgatgaacat tccgtcgatc attcccgcgg gtgacgctcg aaacgccatc gccacgtgag 11461 accgcggctg cgtcaactta tcgaactgga tcggaccggc gatgggcagg ttggtggcaa 11521 gcacgacctt ttcagcatca agatggtggc cgccggcgct gacgcgccag cgcttgccgg 11581 atttcacatt cgtcacccgc gtgttctcga agatgcggcc gccatgagtc gtcaccgcag 11641 cggcgagccc gatgagatat cgcaccggat tgaactgcgc ctgattgcga aaccgcaacg 11701 ctccgaccgt cgcgaatgga agcggcgccg aggcaagcgc ctccgcttcc agcccaagac 11761 aacgagcagc ttcggcttcg gcctcgatcg ccgcgcgcct cgcgggtcgg ctcgtatacg 11821 cgtatgcgtc ctttcgttcc agattgcatg cgatcgattc ggtcctgacc cagtcgcata 11881 gtcggcgcac ggcggtctgg ttagcatctg catagagtcg ggcggtttcg aagccggact 11941 tctcgatcag atagttgtag ataagcgcgt gctggctcgt cactttcgcc gtcgagcggc 12001 ctgttacctg acgtccgatc tttcttgctt cgagtaccgc gacagatacg ccggcctgcg 12061 cgagaacata cgccgtcgtc agtccgacaa tcccgccgcc gacgatggct acgtccgagc 12121 tggccgattt cctgaaagcc ggatatcccg tttccggagc gcccgccatc caacagcact 12181 cacgttcccc gggtagattc atgaatgggt atcgtatccg agcccgtcag gcgaacgtgt 12241 tcgagtttcg gattgcgtgc gacgagcgct tcattgtcag ttcgaggcct atccattccc 12301 tagcgagttc ctcggcgatc tggtcgatga gagccaacgg gatatcggat gcagcgacac 12361 gaacaacatt gcgacgcgag gacgaatcgg cgaaccactc gatggcaacg aaatcgaatc 12421 cgaaggagcc cgcgtgtgaa acgggaagcg cggacgataa gagcctctcg cacaggacgg 12481 cggcccgctg ctttgcgctt gctatggaca agttgccgct gggcgtcaac gtcagataaa 12541 ggccgtgctg aaatgccgaa acaaccttcg ggcccaattc agccgaaagc agacgagcca 12601 tccttctgtt gtttcggatg atcgctgcga tgcgcgcgac actgcagcgc tcgaattggg 12661 aattgcctgt aaagggcgga aaattcgctg gaatcggcgc ggcaccgagc agtcgaaccg 12721 aatcgccggt ttggcgcgtg agatcggtta tccatcccag gcgccccgac agaatgcccc 12781 gaaggggcgc ggcgaccgcc accgatccga gtcggccgta ttcgatgccc agacaatcga 12841 gcttagcgtg actgcgcaca agcgccagcg gcagcattga acgcatggcc cagttcacca 12901 ctcgctggat tcgcgccgag cttcgccaaa ggcacgtcgt gtcgaaaatc acaagatcga 12961 tgtcatgggg cggcattcga agaaatccaa aaaagccggt ggggatgctg gagtccagca 13021 gcgcgacgcg cgtgatgccc tccctcggtc tgctagggct gcgccaggct tccaaaggca 13081 gaatgcgaaa gagtccaccg aagctttcaa tgagctcgcg ggtctcgcta tagcatccct 13141 gcggaacaag aacctggatg gaacctctct gcctcgacag cgccgtcagc agtgcagcca 13201 tcgcgctcat tccggagccg gtgtagatcg tgtgctggaa gatatttgtg ctcggtgacc 13261 agagcgccgg gtagattggt gggccgtgaa tatgcaggtc agctctctgg tagccgtagc 13321 tgaactcgaa cggccctgat cgtgctcgcc gagtcgagga ccacgcggtt tcggtgaacg 13381 cccagctttg taaggagagc tcggccttga tggcagccac taggcgaaat ttccggtcga 13441 tcgcgattcc gatcgacgcc ggcttttgga cttcgagcgg agacgcgaga caactattga 13501 ggagcgttag ctctcgatat ttccggtcga gataggattc aatcgtttcc acgcgatcct 13561 gcacagatgg gggcttctgt gccccaagcg ggttccttgt tgcggatgct tgccgggatg 13621 acacgaccat gaatggcatg tctattcttc ttcaacagcc tgtcagggcg tcgaagtcga 13681 aaaggctcaa ttaccgcatc gtg //